RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14859
         (1097 letters)



>gnl|CDD|187699 cd09275, RNase_HI_RT_DIRS1, DIRS1 family of RNase HI in long-term
           repeat retroelements.  Ribonuclease H (RNase H) enzymes
           are divided into two major families, Type 1 and Type 2,
           based on amino acid sequence similarities and
           biochemical properties. RNase H is an endonuclease that
           cleaves the RNA strand of an RNA/DNA hybrid in a
           sequence non-specific manner in the presence of divalent
           cations. RNase H is widely present in various organisms,
           including bacteria, archaea and eukaryotes.  RNase HI
           has also been observed as adjunct domains to the reverse
           transcriptase gene in retroviruses, in long-term repeat
           (LTR)-bearing retrotransposons and non-LTR
           retrotransposons. RNase HI in LTR retrotransposons
           perform degradation of the original RNA template,
           generation of a polypurine tract (the primer for
           plus-strand DNA synthesis), and final removal of RNA
           primers from newly synthesized minus and plus strands.
           The catalytic residues for RNase H enzymatic activity,
           three aspartatic acids and one glutamatic acid residue
           (DEDD), are unvaried across all RNase H domains.
           Phylogenetic patterns of RNase HI of LTR retroelements
           is classified into five major families, Ty3/Gypsy,
           Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate
           retroviruses. The structural features of DIRS1-group
           elements are different from typical LTR elements. RNase
           H inhibitors have been explored as an anti-HIV drug
           target because RNase H inactivation inhibits reverse
           transcription.
          Length = 120

 Score =  117 bits (296), Expect = 6e-31
 Identities = 51/116 (43%), Positives = 71/116 (61%)

Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
           + TDAS  GWG+ +  S+  GLWS E++N HIN  E+ AV  AL      L +  V+V+S
Sbjct: 2   LFTDASLSGWGAVLSGSWAQGLWSAEERNKHINFLELLAVLLALQHWGARLSNRKVLVRS 61

Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
           DN T V+Y+ RQGGT+S  LL+   ++ L  ++  I + A+ IPG  N  AD LSR
Sbjct: 62  DNTTAVAYINRQGGTRSPELLALARQLVLWCEERNIWLRARHIPGVLNVAADRLSR 117


>gnl|CDD|239684 cd03714, RT_DIRS1, RT_DIRS1: Reverse transcriptases (RTs) occurring
           in the DIRS1 group of retransposons. Members of the
           subfamily include the Dictyostelium DIRS-1, Volvox
           carteri kangaroo, and Panagrellus redivivus PAT
           elements. These elements differ from LTR and
           conventional non-LTR retrotransposons. They contain
           split direct repeat (SDR) termini, and have been
           proposed to integrate via double-stranded closed-circle
           DNA intermediates assisted by an encoded recombinase
           which is similar to gamma-site-specific integrase.
          Length = 119

 Score = 90.1 bits (224), Expect = 2e-21
 Identities = 41/117 (35%), Positives = 66/117 (56%), Gaps = 1/117 (0%)

Query: 551 IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG 610
           +DL  AYFH+PI    +  L  ++ G+      LPFGL+ AP+ F  +   + + LR  G
Sbjct: 1   VDLKDAYFHIPILPRSRDLLGFAWQGETYQFKALPFGLSLAPRVFTKVVEALLAPLRLLG 60

Query: 611 MRVVVYLDDFLLVNQDPRILEIQGK-LAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
           +R+  YLDD L++    +  E   + L  ++L +LG+ +NL+KS L P   + FLG+
Sbjct: 61  VRIFSYLDDLLIIASSIKTSEAVLRHLRATLLANLGFTLNLEKSKLGPTQRITFLGL 117


>gnl|CDD|239685 cd03715, RT_ZFREV_like, RT_ZFREV_like: A subfamily of reverse
           transcriptases (RTs) found in sequences similar to the
           intact endogenous retrovirus ZFERV from zebrafish and to
           Moloney murine leukemia virus RT.  An RT gene is usually
           indicative of a mobile element such as a retrotransposon
           or retrovirus. RTs occur in a variety of mobile
           elements, including retrotransposons, retroviruses,
           group II introns, bacterial msDNAs, hepadnaviruses, and
           caulimoviruses. These elements can be divided into two
           major groups. One group contains retroviruses and DNA
           viruses whose propagation involves an RNA intermediate.
           They are grouped together with transposable elements
           containing long terminal repeats (LTRs). The other
           group, also called poly(A)-type retrotransposons,
           contain fungal mitochondrial introns and transposable
           elements that lack LTRs. Phylogenetic analysis suggests
           that  ZFERV belongs to a distinct group of retroviruses.
          Length = 210

 Score = 78.5 bits (194), Expect = 3e-16
 Identities = 55/194 (28%), Positives = 91/194 (46%), Gaps = 7/194 (3%)

Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFR 537
           HIQE+LE G+L      + + + +  V K  G   R V +L+ +NQ + P   ++ N + 
Sbjct: 20  HIQELLEAGILVP--CQSPWNTPILPVKKPGGNDYRMVQDLRLVNQAVLPIHPAVPNPYT 77

Query: 538 IPSFLQ-KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF- 595
           + S L  K  +   +DL+ A+F +P+    Q   A  + G     T LP G   +P  F 
Sbjct: 78  LLSLLPPKHQWYTVLDLANAFFSLPLAPDSQPLFAFEWEGQQYTFTRLPQGFKNSPTLFH 137

Query: 596 -ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
            A   +     L   G  ++ Y+DD LL               ++ LG LG+ V+ +K+ 
Sbjct: 138 EALARDLAPFPLEHEGTILLQYVDDLLLAADSEEDCLKGTDALLTHLGELGYKVSPKKAQ 197

Query: 655 LSPAPVLQFLGIMW 668
           +  A V +FLG++W
Sbjct: 198 ICRAEV-KFLGVVW 210


>gnl|CDD|238825 cd01647, RT_LTR, RT_LTR: Reverse transcriptases (RTs) from
           retrotransposons and retroviruses which have long
           terminal repeats (LTRs) in their DNA copies but not in
           their RNA template. RT catalyzes DNA replication from an
           RNA template, and is responsible for the replication of
           retroelements. An RT gene is usually indicative of a
           mobile element such as a retrotransposon or retrovirus.
           RTs are present in a variety of mobile elements,
           including retrotransposons, retroviruses, group II
           introns, bacterial msDNAs, hepadnaviruses, and
           Caulimoviruses.
          Length = 177

 Score = 72.6 bits (179), Expect = 2e-14
 Identities = 34/135 (25%), Positives = 54/135 (40%), Gaps = 4/135 (2%)

Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD 546
           G+++   S     S + +V K +G  R  ++ + LN+     ++ L     +   L    
Sbjct: 1   GIIEPSSSPYA--SPVVVVKKKDGKLRLCVDYRKLNKVTIKDRYPLPTIDELLEELAGAK 58

Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
               +DL   Y  +P+    +   A      +   T +PFGL  AP  F  L N +   L
Sbjct: 59  VFSKLDLRSGYHQIPLAEESRPKTAFRTPFGLYEYTRMPFGLKNAPATFQRLMNKILGDL 118

Query: 607 RSRGMRVVVYLDDFL 621
                 V VYLDD L
Sbjct: 119 LGDF--VEVYLDDIL 131


>gnl|CDD|215698 pfam00078, RVT_1, Reverse transcriptase (RNA-dependent DNA
           polymerase).  A reverse transcriptase gene is usually
           indicative of a mobile element such as a retrotransposon
           or retrovirus. Reverse transcriptases occur in a variety
           of mobile elements, including retrotransposons,
           retroviruses, group II introns, bacterial msDNAs,
           hepadnaviruses, and caulimoviruses.
          Length = 194

 Score = 69.3 bits (170), Expect = 3e-13
 Identities = 41/192 (21%), Positives = 75/192 (39%), Gaps = 30/192 (15%)

Query: 505 VPKGNGGT-RPVLN----------LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
           +PK   G  RP+ +           K L    SP +          + L+   + + +DL
Sbjct: 1   IPKKGKGKYRPISSVDYKILNKATKKRLEPIFSPPQPGFRPGRSTHNLLKGSKWFLKLDL 60

Query: 554 SQAYFHVPI--------------KTTHQRFLALSYNGD---VLAMTCLPFGLATAPQAFA 596
            +A+  +P               +    R  ++  NG+         LP G   +P  F 
Sbjct: 61  KKAFDSIPHDPLDRPLTAFGLPQRFIRWRTFSVLVNGNPGGRYEWRGLPQGSVLSPLLFN 120

Query: 597 SLSNWVASLLRSR--GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
              N +   LR R  G+  + Y DD L+ ++ P  L+   +  +  L  LG  +N +K+ 
Sbjct: 121 LFMNELLRPLRKRFPGLTYLRYADDILIFSKSPEELQEILEEVLEFLKELGLKLNPEKTK 180

Query: 655 LSPAPVLQFLGI 666
           ++ +  ++FLG 
Sbjct: 181 ITHSDEVKFLGY 192


>gnl|CDD|187698 cd09274, RNase_HI_RT_Ty3, Ty3/Gypsy family of RNase HI in long-term
           repeat retroelements.  Ribonuclease H (RNase H) enzymes
           are divided into two major families, Type 1 and Type 2,
           based on amino acid sequence similarities and
           biochemical properties. RNase H is an endonuclease that
           cleaves the RNA strand of an RNA/DNA hybrid in a
           sequence non-specific manner in the presence of divalent
           cations. RNase H is widely present in various organisms,
           including bacteria, archaea and eukaryotes.  RNase HI
           has also been observed as adjunct domains to the reverse
           transcriptase gene in retroviruses, in long-term repeat
           (LTR)-bearing retrotransposons and non-LTR
           retrotransposons. RNase HI in LTR retrotransposons
           perform degradation of the original RNA template,
           generation of a polypurine tract (the primer for
           plus-strand DNA synthesis), and final removal of RNA
           primers from newly synthesized minus and plus strands.
           The catalytic residues for RNase H enzymatic activity,
           three aspartatic acids and one glutamatic acid residue
           (DEDD), are unvaried across all RNase H domains.
           Phylogenetic patterns of RNase HI of LTR retroelements
           is classified into five major families, Ty3/Gypsy,
           Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate
           retroviruses. Ty3/Gypsy family widely distributed among
           the genomes of plants, fungi and animals. RNase H
           inhibitors have been explored as an anti-HIV drug target
           because RNase H inactivation inhibits reverse
           transcription.
          Length = 121

 Score = 57.1 bits (139), Expect = 8e-10
 Identities = 38/126 (30%), Positives = 62/126 (49%), Gaps = 17/126 (13%)

Query: 773 FISTDASDLGWG---SQVDS------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
            + TDASD G G   SQ+D       +F S   +  ++N+   +KE+ A+  AL      
Sbjct: 1   VLETDASDFGIGAVLSQLDDGKERPIAFASRKLTPAERNYSTYEKELLAIVFALKKFRHY 60

Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
           L      V +D++++  YL  +   K L+    + +  LL Q++   I  ++ PG  N V
Sbjct: 61  LLGRKFTVYTDHKSLK-YLLTK---KDLN--PRLARWLLLLQEFDFEI--EYRPGKENVV 112

Query: 884 ADSLSR 889
           AD+LSR
Sbjct: 113 ADALSR 118


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 51.5 bits (124), Expect = 2e-06
 Identities = 35/157 (22%), Positives = 51/157 (32%), Gaps = 4/157 (2%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
             +P V+ P                   P +A     AA  P+    A    + +  A P
Sbjct: 642 APAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
            P+ +  P        AA   Q A  AS P  AA    PLP   +P        +   A 
Sbjct: 702 APAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP--PEPDDPPDP--AGAPAQ 757

Query: 169 RGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVD 205
                 P P +      P SP S ++E   ++   +D
Sbjct: 758 PPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMD 794



 Score = 48.8 bits (117), Expect = 1e-05
 Identities = 29/155 (18%), Positives = 41/155 (26%), Gaps = 2/155 (1%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
                V+ P  + G            A P         A   +    A    + +  A P
Sbjct: 652 HHPKHVAVPDASDGGDGWPAKA--GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
               +  P   PP  +    A   +     P       P      P+        A  A+
Sbjct: 710 PAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769

Query: 169 RGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEV 203
                 P PPSE          S D ED  + +EV
Sbjct: 770 PAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEV 804



 Score = 45.0 bits (107), Expect = 2e-04
 Identities = 38/203 (18%), Positives = 56/203 (27%), Gaps = 25/203 (12%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
           A  +         PA      A      +   ++APA    A AP  AA +      A P
Sbjct: 602 APASSGPPEEAARPAAPAAPAAPAAPAPAGA-AAAPAEASAAPAPGVAAPEHHPKHVAVP 660

Query: 99  GSSASV--------LAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPF 150
            +S            AAP          +P A + A  AQPA   +  P A         
Sbjct: 661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA------ 714

Query: 151 FCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPA 210
                        A +  +       P   +  P+PL P   D  D +         P  
Sbjct: 715 ----------DDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764

Query: 211 LFSFAPSTKEREPSIPDPDSELA 233
             + AP+         + +    
Sbjct: 765 APAAAPAAAPPPSPPSEEEEMAE 787



 Score = 40.7 bits (96), Expect = 0.005
 Identities = 30/115 (26%), Positives = 45/115 (39%), Gaps = 3/115 (2%)

Query: 30  ERCLAQLVQAITARKS-STLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAH 88
           ER L   ++ +  R   +    +PA + P  A+     +P  ++A      A AP  AA 
Sbjct: 371 ERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAP 429

Query: 89  QPSATVQAAPGSSASVLAAPLPSS-SGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           QP+     AP   +    AP   + S  P  +P AQ A   A      + P  A 
Sbjct: 430 QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAP 484



 Score = 37.7 bits (88), Expect = 0.037
 Identities = 21/102 (20%), Positives = 30/102 (29%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
           A    A  A  ++      A   P  A   +   P  +         S P  AA      
Sbjct: 408 AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467

Query: 94  VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
              A     +   AP P ++  P  +P A +A      A  A
Sbjct: 468 PAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509



 Score = 36.9 bits (86), Expect = 0.076
 Identities = 19/93 (20%), Positives = 31/93 (33%), Gaps = 4/93 (4%)

Query: 51  SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
           +PA +    A+     +    +  A P  A AP  A      +      +  +    P  
Sbjct: 405 APAAAPAPAAAAP--AAAAAPAPAAAPQPAPAP--APAPAPPSPAGNAPAGGAPSPPPAA 460

Query: 111 SSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
           + S QP  +P A      A   +  + P  AA 
Sbjct: 461 APSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493



 Score = 36.5 bits (85), Expect = 0.092
 Identities = 28/140 (20%), Positives = 45/140 (32%), Gaps = 13/140 (9%)

Query: 57  PGMASGVSDQSPPLS-SAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ 115
               +G    + P + +A      A A    A   +    AAP  + +   AP P S   
Sbjct: 387 VAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAG 446

Query: 116 PFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQP 175
              +  A S    A P++  +  P+AA      P              A  A+    A P
Sbjct: 447 NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP------------APAPPAAPAPAAAP 494

Query: 176 PPPSESVTPIPLSPVSSDQE 195
             P+    P      ++ +E
Sbjct: 495 AAPAAPAAPAGADDAATLRE 514



 Score = 36.1 bits (84), Expect = 0.13
 Identities = 20/108 (18%), Positives = 32/108 (29%), Gaps = 17/108 (15%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAP--------------VQAAHQPSATV 94
             + A +    A   +    P ++    P  A AP                     + + 
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464

Query: 95  QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           Q AP  +A+      P+ +       PA +AA  A  A  A      A
Sbjct: 465 QPAPAPAAAPEPTAAPAPAPPA---APAPAAAPAAPAAPAAPAGADDA 509



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 24/165 (14%), Positives = 42/165 (25%), Gaps = 32/165 (19%)

Query: 73  APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
            P+        +    +    +  A G+ A   AAP  +++  P  +P   +AA     A
Sbjct: 364 LPSASDDERGLLARLERLERRLGVAGGAGAPAAAAP-SAAAAAPAAAPAPAAAA--PAAA 420

Query: 133 STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSS 192
           +  +   +                          A     A  PP      P   +P   
Sbjct: 421 AAPAPAAAPQP-----------------------APAPAPAPAPPSPAGNAPAGGAPSPP 457

Query: 193 DQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDSELASQGV 237
                S +        P        T    P+ P   +  A+   
Sbjct: 458 PAAAPSAQPAPAPAAAP------EPTAAPAPAPPAAPAPAAAPAA 496


>gnl|CDD|187690 cd06222, RNase_H, RNase H is an endonuclease that cleaves the RNA
           strand of an RNA/DNA hybrid in a sequence non-specific
           manner.  Ribonuclease H (RNase H) enzymes are divided
           into two major families, Type 1 and Type 2, based on
           amino acid sequence similarities and biochemical
           properties. RNase H is an endonuclease that cleaves the
           RNA strand of an RNA/DNA hybrid in a sequence
           non-specific manner in the presence of divalent cations.
           RNase H is widely present in various organisms,
           including bacteria, archaea and eukaryotes. Most
           prokaryotic and eukaryotic genomes contain multiple
           RNase H genes. Despite the lack of amino acid sequence
           homology, Type 1 and type 2 RNase H share a main-chain
           fold and steric configurations of the four acidic
           active-site residues and have the same catalytic
           mechanism and functions in cells. RNase H is involved in
           DNA replication, repair and transcription. One of the
           important functions of RNase H is to remove Okazaki
           fragments during DNA replication. RNase H inhibitors
           have been explored as an anti-HIV drug target because
           RNase H inactivation inhibits reverse transcription.
          Length = 123

 Score = 45.0 bits (107), Expect = 1e-05
 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 805 INKKEMFAVHQAL--SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
            N+ E+ A+ +AL  +L+L L +  +    +D++ VV  +         + L   + + L
Sbjct: 39  NNEAELLALLEALELALDLGLKKLIIE---TDSKYVVDLINSWSKGWKKNNLLLWDILLL 95

Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSR 889
           LS+   I    + +P   N VAD L++
Sbjct: 96  LSKFIDIRF--EHVPREGNEVADRLAK 120


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 48.4 bits (115), Expect = 3e-05
 Identities = 38/210 (18%), Positives = 57/210 (27%), Gaps = 12/210 (5%)

Query: 39   AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
            A  A  +     +P     G A+      P      A P   + P   A  P   +    
Sbjct: 2728 ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787

Query: 99   GSSASVLAAPLPS----SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP--FFC 152
             +S S     LPS    +     +  PA +    A PA     P SA     P P     
Sbjct: 2788 VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847

Query: 153  DPSYYGHYLQSAMKASRGQVAQPPPPSESVTP------IPLSPVSSDQEDFSEEDEVVDC 206
                 G  +       R   ++ P    +         +    VS   E F+   +  + 
Sbjct: 2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907

Query: 207  NPPALFSFAPSTKEREPSIPDPDSELASQG 236
             P       P  + + P  P P        
Sbjct: 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 46.1 bits (109), Expect = 1e-04
 Identities = 24/137 (17%), Positives = 40/137 (29%)

Query: 51   SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
              A   P   +     SP L +APA P   + P                   +      P
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775

Query: 111  SSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRG 170
            ++     ++ PA ++   ++ +  +   P+        P    P             S  
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835

Query: 171  QVAQPPPPSESVTPIPL 187
              A PPPP      +PL
Sbjct: 2836 PTAPPPPPGPPPPSLPL 2852



 Score = 42.6 bits (100), Expect = 0.001
 Identities = 37/187 (19%), Positives = 49/187 (26%), Gaps = 45/187 (24%)

Query: 51   SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQA-------------------AHQPS 91
            +PA + P  AS      PP S+ P  P     P                      +  P+
Sbjct: 2813 APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA 2872

Query: 92   ATVQAAPGSSASVLAAPLPSSSGQPFISPP--------------AQSAAFLAQPASTASL 137
            A   A        LA P  S S + F  PP               Q       P      
Sbjct: 2873 AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932

Query: 138  PPSAAHHLYPLPFFCDPS------------YYGHYLQSAMKASRGQVAQPPPPSESVTPI 185
            PP       PL    DP+            + G  +   +   R +V QP P  E+    
Sbjct: 2933 PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASS 2992

Query: 186  PLSPVSS 192
                   
Sbjct: 2993 TPPLTGH 2999



 Score = 35.7 bits (82), Expect = 0.18
 Identities = 42/180 (23%), Positives = 61/180 (33%), Gaps = 23/180 (12%)

Query: 51   SPAVSDPGMASGVSDQSPPL-SSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL 109
             P    P         S P  +       QAS+P Q   + +A  +   GS  S+   P 
Sbjct: 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAA--RPTVGSLTSLADPPP 2703

Query: 110  PSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASR 169
            P  + +P  +P A  +A    P   A+   S A    P P        G     A     
Sbjct: 2704 PPPTPEP--APHALVSATPLPPGPAAARQASPALPAAPAP---PAVPAG----PATPGGP 2754

Query: 170  GQVAQPPPPSESVTPI-PLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDP 228
             + A+PP  +    P  P +P +      +          PA+ S + S  E  PS  DP
Sbjct: 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLT---------RPAVASLSESR-ESLPSPWDP 2804



 Score = 32.6 bits (74), Expect = 1.7
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 57   PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
            P +    SD +P ++ A +  + A +   A   P +TV AA  +   VLA  + + S  P
Sbjct: 1115 PLLVQFFSDHAPRIARAVSEAINAGSAAVATASPESTVDAAVRAHG-VLADAVAALS--P 1171

Query: 117  FISPPAQSAAFLAQPASTAS 136
             +  PA   AFL   A +A+
Sbjct: 1172 AVRDPACPLAFLVALADSAA 1191



 Score = 32.2 bits (73), Expect = 2.2
 Identities = 23/117 (19%), Positives = 34/117 (29%), Gaps = 6/117 (5%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
               R+S+    +P         G  DQ+ P +  PA+    +     A  P       P
Sbjct: 381 PTRKRRSARHAATPF----ARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLP 436

Query: 99  GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
            +       P P    QP    PA   A      +T     +      P P   D +
Sbjct: 437 SAEPGSDDGPAPPPERQP--PAPATEPAPDDPDDATRKALDALRERRPPEPPGADLA 491


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 47.4 bits (112), Expect = 5e-05
 Identities = 41/239 (17%), Positives = 66/239 (27%), Gaps = 20/239 (8%)

Query: 53  AVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAA----HQPSATVQAAPG-SSASVLAA 107
            +++P   +  +  +    +AP  PV  + PV +      QP+   Q  PG  +   + A
Sbjct: 315 PITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIA 374

Query: 108 PLPSS--SGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAM 165
           P P        +  P  Q    L QP        + A           P Y     Q A 
Sbjct: 375 PAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQ----PYYAPAPEQPAQ 430

Query: 166 KASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSI 225
           +       + P    +          + Q  +  E               P        +
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQP------QPV 484

Query: 226 PDPDSELASQGVVCQKLGSPAWNRVRYTESQRAL---QAGGVFQPLAIPSEFPHSNSST 281
                      V   K   P        E +RA    Q    +QP+  P + P    S+
Sbjct: 485 EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSS 543



 Score = 32.0 bits (72), Expect = 2.5
 Identities = 30/146 (20%), Positives = 39/146 (26%), Gaps = 8/146 (5%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
           P       A+    Q P  + AP  P Q      A  QP A          S  A     
Sbjct: 403 PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462

Query: 112 SSGQPFISPPAQSAAFLAQPA---STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
            + Q +  P AQ   +             P        P      P YY   ++      
Sbjct: 463 QTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPAR---PPLYYFEEVEEKRARE 519

Query: 169 RGQVAQ--PPPPSESVTPIPLSPVSS 192
           R Q+A    P P     P P+     
Sbjct: 520 REQLAAWYQPIPEPVKEPEPIKSSLK 545



 Score = 31.2 bits (70), Expect = 4.0
 Identities = 24/114 (21%), Positives = 41/114 (35%), Gaps = 10/114 (8%)

Query: 5   PKRDTSDPQPEAKR-------LCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDP 57
            ++   +P+P  +        L + EE  EK  R   QL  A   +      + P     
Sbjct: 485 EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQL--AAWYQPIPEPVKEPEPIKS 542

Query: 58  GMASGVSDQSPPLSSAPA-NPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
            + +      PP+ +A A +P+ +          +A   AAP  S +    P P
Sbjct: 543 SLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRP 596


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 28/134 (20%), Positives = 47/134 (35%), Gaps = 5/134 (3%)

Query: 69  PLSSAPANPV-QASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAF 127
           P ++A A    +   P +      A    A  ++A   AA   +++  P   P A   A 
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425

Query: 128 LAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPL 187
           +A PA+ A  P +A     P      P+           A   +VA  P  + +      
Sbjct: 426 VAAPAAAA--PAAAPAAA-PAAVALAPAPP-AQAAPETVAIPVRVAPEPAVASAAPAPAA 481

Query: 188 SPVSSDQEDFSEED 201
           +P ++      E D
Sbjct: 482 APAAARLTPTEEGD 495



 Score = 40.9 bits (96), Expect = 0.004
 Identities = 24/104 (23%), Positives = 35/104 (33%), Gaps = 5/104 (4%)

Query: 38  QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
            A      +    +PA +    AS  +        AP     A+AP  A     A V  A
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALA 448

Query: 98  PGSSASVLAAPLPSSSG---QPFISPPAQSAAFLAQPASTASLP 138
           P  +    AAP   +      P  +  + + A  A PA+    P
Sbjct: 449 P--APPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490



 Score = 40.5 bits (95), Expect = 0.005
 Identities = 25/112 (22%), Positives = 38/112 (33%), Gaps = 4/112 (3%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQ-AAHQPSATVQAA 97
           A  A K +      A       + V+  +   + A A    ASAP    A  P A V A 
Sbjct: 373 AAPAEKKTPARPEAAAPAA---APVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP 429

Query: 98  PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
             ++ +   A  P++       P   +   +A P   A  P  A+    P  
Sbjct: 430 AAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAA 481



 Score = 35.1 bits (81), Expect = 0.21
 Identities = 25/142 (17%), Positives = 43/142 (30%), Gaps = 29/142 (20%)

Query: 87  AHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLY 146
           A +P+A  +AA  +     A P  ++     ++  A + A  A PA+ AS P +      
Sbjct: 363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPP---- 418

Query: 147 PLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDC 206
                            A  A+    A    P+ +   + L+P    Q            
Sbjct: 419 ------------AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQ------------ 454

Query: 207 NPPALFSFAPSTKEREPSIPDP 228
                    P     EP++   
Sbjct: 455 -AAPETVAIPVRVAPEPAVASA 475


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 44.5 bits (105), Expect = 3e-04
 Identities = 42/195 (21%), Positives = 62/195 (31%), Gaps = 32/195 (16%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
           AQ   A  A  ++    +   + P  A   +  +  +++APA    A   + AA Q SA 
Sbjct: 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA- 441

Query: 94  VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCD 153
            +   G+ A    AP P+++      P A         A+ A+ P  AA    P P    
Sbjct: 442 -RGPGGAPA---PAPAPAAAPAAAARPAAAGPR--PVAAAAAAAPARAAPAAAPAP---- 491

Query: 154 PSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFS 213
                                 PPP E + P   SP  +  +             PA   
Sbjct: 492 ------------------ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPAT-- 531

Query: 214 FAPSTKEREPSIPDP 228
            A      E   P P
Sbjct: 532 -ADPDDAFETLAPAP 545



 Score = 33.7 bits (77), Expect = 0.65
 Identities = 30/134 (22%), Positives = 42/134 (31%), Gaps = 26/134 (19%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
           A+ V A  AR+S   +   A               P  +  A P  A+ P  A  +P A 
Sbjct: 416 ARAVAAAPARRSPAPEALAAARQASARGPGG-APAPAPAPAAAPAAAARPAAAGPRPVAA 474

Query: 94  VQAAPGSSASVLAAPLPSSSGQP----------FISPPAQSAAFLAQ------------- 130
             AA  + A+  AAP P+    P            +P    AA                 
Sbjct: 475 AAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534

Query: 131 --PASTASLPPSAA 142
                T +  P+AA
Sbjct: 535 DDAFETLAPAPAAA 548



 Score = 32.9 bits (75), Expect = 1.1
 Identities = 26/110 (23%), Positives = 36/110 (32%), Gaps = 18/110 (16%)

Query: 50  QSPAVSDPGMASGVSDQSPP----------------LSSAPANPVQASAPVQAAHQPSAT 93
            +PA + P  A   +D  PP                  +APA  V  S P  A   P   
Sbjct: 478 AAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDA 537

Query: 94  VQA-APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
            +  AP  +A+       ++       PP  SA+ L         P  AA
Sbjct: 538 FETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLP-DMFDGDWPALAA 586


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 44.1 bits (104), Expect = 5e-04
 Identities = 34/184 (18%), Positives = 52/184 (28%), Gaps = 11/184 (5%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSA---SVLAAP 108
           PAV+  G   G        +        A+A   +A      V  A G++    +  AA 
Sbjct: 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAA 419

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
              +   P    P  +A      A   +  P+ A+        CD         S   ++
Sbjct: 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASA 479

Query: 169 RGQVAQPPPPSESVTP--------IPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKE 220
               A P    E                P +      S ED      PPA  +  P+   
Sbjct: 480 PASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539

Query: 221 REPS 224
             P+
Sbjct: 540 AAPA 543



 Score = 41.8 bits (98), Expect = 0.002
 Identities = 20/107 (18%), Positives = 33/107 (30%), Gaps = 1/107 (0%)

Query: 37  VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAP-ANPVQASAPVQAAHQPSATVQ 95
                    +    +P  +     +    +     + P A+   ASAP   A   +A   
Sbjct: 433 PATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEP 492

Query: 96  AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           A   ++ S         +  P  +    + A  A PA  A  P  AA
Sbjct: 493 APRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539



 Score = 41.4 bits (97), Expect = 0.003
 Identities = 33/156 (21%), Positives = 50/156 (32%), Gaps = 13/156 (8%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA-PGSSASVLAA 107
               A +           +   ++A      A+A   A      T  AA P + A   AA
Sbjct: 492 PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551

Query: 108 PLP---------SSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYG 158
            L          SS      +  A+ AA  A  A+     P  A  + P P     +   
Sbjct: 552 ALDVLRNAGMRVSSDRGARAAAAAKPAA--APAAAPKPAAPRVAVQV-PTPRARAATGDA 608

Query: 159 HYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQ 194
               +A      +    PPP E + P    P+S+D+
Sbjct: 609 PPNGAARAEQAAESRGAPPPWEDIPPDDYVPLSADE 644



 Score = 41.0 bits (96), Expect = 0.003
 Identities = 32/161 (19%), Positives = 49/161 (30%), Gaps = 7/161 (4%)

Query: 37  VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ-PSATVQ 95
             A     S+    +      G A      +   ++    P  A AP   A +   A   
Sbjct: 386 RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445

Query: 96  AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
            AP  + +   A   S   +    PPA S +  A PAS A  PP AA    P       +
Sbjct: 446 DAPVPAKANARASADSRCDERDAQPPADSGSASA-PASDA--PPDAAFEPAPRAAAPSAA 502

Query: 156 YYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQED 196
                  +   A+  +   P   +    P P +   +    
Sbjct: 503 TPAAVPDARAPAAASREDAPAAAAP---PAPEARPPTPAAA 540



 Score = 36.0 bits (83), Expect = 0.13
 Identities = 23/94 (24%), Positives = 29/94 (30%), Gaps = 7/94 (7%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
           D      D    +     S P S AP +     AP  AA   +     A    A   AA 
Sbjct: 460 DSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAAT---PAAVPDARAPAA- 515

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
             S    P  +  A  A     P   A+ P + A
Sbjct: 516 -ASREDAP--AAAAPPAPEARPPTPAAAAPAARA 546



 Score = 30.6 bits (69), Expect = 5.8
 Identities = 28/163 (17%), Positives = 46/163 (28%), Gaps = 24/163 (14%)

Query: 87  AHQPSATVQAAPGSSASVL-----------AAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
           A +P+ T   APG                 AA    +S  P ++    +A     P + A
Sbjct: 357 AFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA 416

Query: 136 SLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQE 195
           +   + A      P    P        +A +         P P+++           +++
Sbjct: 417 AAAATRAE---APPAAPAPP------ATADRGDDAADGDAPVPAKANARASADSRCDERD 467

Query: 196 DFSEEDEVVDCNPPALFSFAPSTKEREPSIP-DPDSELASQGV 237
                D      P    S AP     EP+      S      V
Sbjct: 468 AQPPADSGSASAPA---SDAPPDAAFEPAPRAAAPSAATPAAV 507


>gnl|CDD|238587 cd01182, INT_REC_C, DNA breaking-rejoining enzymes,
            intergrase/recombinases, C-terminal catalytic domain. The
            tyrosine recombinase/integrase family share the same
            catalytic domain containing six conserved active site
            residues. The best-studied members of this diverse family
            include the bacteriophage lambda integrase, the
            bacteriophage P1 Cre recombinase, the yeast Flp
            recombinase and the bacterial XerD/C recombinases. Their
            overall reaction mechanism is essentially identical and
            involves cleavage of a single strand of a DNA duplex by
            nucleophilic attack of a conserved tyrosine to give a 3'
            phosphotyrosyl protein-DNA adduct. In the second
            rejoining step, a terminal 5' hydroxyl attacks the
            covalent adduct to release the enzyme and generate duplex
            DNA. Many intergrase/recombinases also have N-terminal
            domains, which show little sequence or structure
            similarity.
          Length = 162

 Score = 40.6 bits (95), Expect = 0.001
 Identities = 37/154 (24%), Positives = 55/154 (35%), Gaps = 22/154 (14%)

Query: 935  HVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKTDSSSHLQSGWKIKENS 994
               ILLLL +G RV +L  L+L       LD   +      +KT                
Sbjct: 18   RALILLLLYTGLRVSEL--LALRWSDID-LDKGTITVR--RTKT--------------GK 58

Query: 995  SDPLFCIPTWIRHLST-LSQRRMGSRPLTSLFITTRGIVQPASRSVIAGWVKTALRGANI 1053
               +   P     L   L  RR   +P   LF + RG  +  +R  +   +K A + A I
Sbjct: 59   ERTVPLSPELAELLREYLELRRPAPKPDDYLFPSRRGGPKRLTRRAVRRLLKKAGKRAGI 118

Query: 1054 I--ASPGSMRSAVATFRYNSDLPLDSILRQGNWR 1085
                +P  +R   AT    + +PL+ I       
Sbjct: 119  PERLTPHDLRHTFATRLLEAGVPLEVIQELLGHS 152


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 42.0 bits (98), Expect = 0.002
 Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 11/102 (10%)

Query: 45  SSTLDQSPAVSDPGMA----SGVSDQSPPLSSAPANPV-------QASAPVQAAHQPSAT 93
           S+T   +P  S+P  +    S     + PLS+AP +          A+   +AA+QP+A 
Sbjct: 440 SNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAAN 499

Query: 94  VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTA 135
           +  A   +A+       +S G PF S P  SA +    A+TA
Sbjct: 500 LPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTA 541


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 41.8 bits (99), Expect = 0.002
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 1/79 (1%)

Query: 64  SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ 123
           +D  P  ++AP     A+A   +A   +   +A    + +  AA  P++  +P  +  A 
Sbjct: 34  ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93

Query: 124 SAAFLAQPASTASLPPSAA 142
           +A   A PA+ A+  P+AA
Sbjct: 94  AAPA-APPAAAAAAAPAAA 111



 Score = 38.7 bits (91), Expect = 0.018
 Identities = 17/82 (20%), Positives = 31/82 (37%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
           D  P  +    A+  +  +   + A A   +A A    A   +A   A P  +A+  AA 
Sbjct: 35  DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAA 94

Query: 109 LPSSSGQPFISPPAQSAAFLAQ 130
            P++      +    +AA   +
Sbjct: 95  APAAPPAAAAAAAPAAAAVEDE 116



 Score = 37.2 bits (87), Expect = 0.063
 Identities = 21/87 (24%), Positives = 35/87 (40%), Gaps = 3/87 (3%)

Query: 55  SDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSG 114
           +D G  S  +  +   ++A A    A+AP   A    A    A  + A   A P P+++ 
Sbjct: 34  ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA---APPKPAAAA 90

Query: 115 QPFISPPAQSAAFLAQPASTASLPPSA 141
               +P A  AA  A   + A++    
Sbjct: 91  AAAAAPAAPPAAAAAAAPAAAAVEDEV 117



 Score = 33.3 bits (77), Expect = 0.90
 Identities = 18/68 (26%), Positives = 28/68 (41%)

Query: 41  TARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGS 100
            A  ++    + A S P  A      + P  + PA    A+ P  AA   +A   AAP +
Sbjct: 42  AAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPA 101

Query: 101 SASVLAAP 108
           +A+  A  
Sbjct: 102 AAAAAAPA 109


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 40.8 bits (96), Expect = 0.003
 Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 5/108 (4%)

Query: 38  QAITARKSSTLDQSPAVSDPGMASGVS---DQSPPLSSAPANPVQASAPVQAAHQPSATV 94
           +A     ++  DQS A         V      +   ++ PA           +  P+   
Sbjct: 137 KAQQEEITTMADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT 196

Query: 95  QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
             AP       A   PS +     +    + A  A P   A LP   A
Sbjct: 197 APAPAVDPQQNAVVAPSQANVD--TAATPAPAAPATPDGAAPLPTDQA 242



 Score = 37.7 bits (88), Expect = 0.024
 Identities = 17/106 (16%), Positives = 33/106 (31%), Gaps = 3/106 (2%)

Query: 41  TARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAP-VQAAHQPSATVQAAPG 99
           +A  S    QS  +      +  +    P +     P  +  P V  A  P+   Q    
Sbjct: 151 SAELSQNSGQSVPLDT-STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAV 209

Query: 100 SSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHL 145
            + S       ++      + P  +A      A  ++ P +  + L
Sbjct: 210 VAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVST-PAADPNAL 254



 Score = 33.8 bits (78), Expect = 0.46
 Identities = 16/71 (22%), Positives = 25/71 (35%)

Query: 38  QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
            A     + T  Q+PAV+     +    Q+  ++ + AN   A+ P  AA          
Sbjct: 178 PAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPL 237

Query: 98  PGSSASVLAAP 108
           P   A V    
Sbjct: 238 PTDQAGVSTPA 248



 Score = 30.8 bits (70), Expect = 4.0
 Identities = 16/104 (15%), Positives = 28/104 (26%), Gaps = 1/104 (0%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
           A+L Q            +   +     +   D +P  S  PA     +  V    Q +  
Sbjct: 152 AELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV-DPQQNAVV 210

Query: 94  VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASL 137
             +      +   AP   ++       P   A      A   +L
Sbjct: 211 APSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNAL 254


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 40.4 bits (95), Expect = 0.004
 Identities = 34/177 (19%), Positives = 54/177 (30%), Gaps = 15/177 (8%)

Query: 23  EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASA 82
           EE  +KI+    +  +   A K       P    P             +S  + P     
Sbjct: 122 EEVAQKIKYAKWKAARIHKALKEGE---DPNPGPPLDEEDEDADVATTNSDNSFP---GE 175

Query: 83  PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
               A    +   ++     S  + P   SS      PPA        P+S  S  P  +
Sbjct: 176 DADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA--------PSSFQSDTPPPS 227

Query: 143 HHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSE 199
                 P    P           +      A+P PPS S TP P+  ++ D +  ++
Sbjct: 228 PESPTNPSP-PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDAIAK 283



 Score = 29.7 bits (67), Expect = 8.6
 Identities = 24/139 (17%), Positives = 37/139 (26%), Gaps = 19/139 (13%)

Query: 98  PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYY 157
           P       A    ++S   F    A         +++ S PPS++  +   P   +    
Sbjct: 153 PLDEEDEDADVATTNSDNSFPGEDAD------PASASPSDPPSSSPGVPSFPSPPEDPSS 206

Query: 158 GHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPS 217
                            PP PS   +  P     S               PP       S
Sbjct: 207 PSDSS-----------LPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLS 255

Query: 218 TKEREPSIPDPDSELASQG 236
           T   +P+ P   +  A  G
Sbjct: 256 T--AKPTPPSASATPAPIG 272


>gnl|CDD|205634 pfam13456, RVT_3, Reverse transcriptase-like.  This domain is found
           in plants and appears to be part of a retrotransposon.
          Length = 88

 Score = 37.2 bits (87), Expect = 0.004
 Identities = 19/63 (30%), Positives = 36/63 (57%), Gaps = 6/63 (9%)

Query: 829 VMVQSDNQTVVSYLRRQGGTKSL--SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADS 886
           ++V+SD+Q VV  ++ +   +S   +LL E+ K  LL +   + +    +P   N VAD+
Sbjct: 25  LIVESDSQLVVQQIQGEYEARSRLAALLREIRK--LLKKFDSVSV--SHVPRECNRVADA 80

Query: 887 LSR 889
           L++
Sbjct: 81  LAK 83


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 40.9 bits (96), Expect = 0.004
 Identities = 35/151 (23%), Positives = 47/151 (31%), Gaps = 25/151 (16%)

Query: 12  PQPEAKRLCHLE-------EKTEKIERCLA------QLVQAITARKSS----TLDQSPAV 54
           P P+A R   L        E  E+ ERCLA        +Q +T    S    T +Q    
Sbjct: 185 PDPQAVRTALLNSAIPCDPEDDEECERCLAGRLNIPGAIQLLTGETMSEGLVTSEQDGVE 244

Query: 55  SDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSG 114
              G     + +S   S  P       APV               +   + A  LP++  
Sbjct: 245 EASGCGVQGTIESST-SVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNA-- 301

Query: 115 QPFISPPAQSAAFLAQPASTASLPPSAAHHL 145
                 PA      +    T S  PSA   L
Sbjct: 302 ----ISPATPVRPASNGV-TPSQAPSAEPQL 327


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 40.7 bits (96), Expect = 0.005
 Identities = 20/69 (28%), Positives = 31/69 (44%)

Query: 48  LDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA 107
           L + P     G          P + A A    A+A   AA  P+A + A P ++A+  A 
Sbjct: 387 LQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAP 446

Query: 108 PLPSSSGQP 116
           P P+++ QP
Sbjct: 447 PAPAAAPQP 455



 Score = 31.0 bits (71), Expect = 3.7
 Identities = 13/75 (17%), Positives = 20/75 (26%)

Query: 36  LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
            ++ +     S           G A G    +    +A A P         A  P A   
Sbjct: 383 ALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAA 442

Query: 96  AAPGSSASVLAAPLP 110
           AA   + +    P  
Sbjct: 443 AAAPPAPAAAPQPAV 457


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 39.7 bits (93), Expect = 0.008
 Identities = 31/104 (29%), Positives = 40/104 (38%), Gaps = 15/104 (14%)

Query: 54  VSDPGMA-SGVSDQS------PPLSSAPANPVQASAPV-QAAHQP-------SATVQAAP 98
           +S PG A + V+ QS      P LS     P    AP  Q    P       S     AP
Sbjct: 177 LSGPGTAGTTVAKQSYALPARPDLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAP 236

Query: 99  GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
            +S+  L AP   + G    S P  +A   A   + A+  P AA
Sbjct: 237 VTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAA 280



 Score = 30.4 bits (69), Expect = 4.8
 Identities = 19/80 (23%), Positives = 29/80 (36%), Gaps = 15/80 (18%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPL------SSAPANPVQASAPVQAA 87
              V  ++   +STL        P  +SG       L       S P     +SAP    
Sbjct: 216 QGDVLPVS---NSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPA--- 269

Query: 88  HQPSATVQAAPGSSASVLAA 107
              +A   AAP ++A+  +A
Sbjct: 270 ---TAPAAAAPQAAATSSSA 286


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 38.5 bits (89), Expect = 0.015
 Identities = 24/105 (22%), Positives = 38/105 (36%)

Query: 37  VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQA 96
           ++  T  K      +  V+    +S V+  S    S+ A P  A  P     Q + T QA
Sbjct: 284 IEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQA 343

Query: 97  APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA 141
           +  + +S    P   +       P A+      QP ST     S+
Sbjct: 344 SAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSS 388



 Score = 37.4 bits (86), Expect = 0.042
 Identities = 24/131 (18%), Positives = 48/131 (36%), Gaps = 4/131 (3%)

Query: 21  HLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQA 80
           +L+ +   IE+  A  ++ I    +  +      S    +S ++  S  + S    P   
Sbjct: 276 NLDNRALDIEK--ATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSV 333

Query: 81  SAPVQAAHQPSATVQAAPGSSASVLAAPLPSS--SGQPFISPPAQSAAFLAQPASTASLP 138
           +       Q SA   ++ G   S +  P   +  + +P    P   +    Q +ST ++ 
Sbjct: 334 TTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNIT 393

Query: 139 PSAAHHLYPLP 149
            +A      LP
Sbjct: 394 STANGPTTSLP 404


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 39.0 bits (91), Expect = 0.017
 Identities = 25/123 (20%), Positives = 32/123 (26%), Gaps = 4/123 (3%)

Query: 65  DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
            Q+P L   P   +    P + A  P       P                Q F+  P+Q+
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226

Query: 125 AAFLAQPAST-ASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVT 183
            A    P       PP        L     P           +    Q    PPP    T
Sbjct: 227 PAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP---PPPQPPQQQQQPPQPQAQPPPQNQPT 283

Query: 184 PIP 186
           P P
Sbjct: 284 PHP 286



 Score = 33.2 bits (76), Expect = 0.91
 Identities = 21/85 (24%), Positives = 24/85 (28%), Gaps = 5/85 (5%)

Query: 62  GVSDQSPPLSSAPANPVQASAPVQAAHQ--PSATVQAAPGSSASVLAAPLPSSSGQPFIS 119
           G+S Q PP    P  P Q   P Q   Q  P       PG       APLP       + 
Sbjct: 250 GLSQQMPP--PPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQG-QNAPLPPPQQPQLLP 306

Query: 120 PPAQSAAFLAQPASTASLPPSAAHH 144
              Q       P     L   +   
Sbjct: 307 LVQQPQGQQRGPQFREQLVQLSQQQ 331



 Score = 29.7 bits (67), Expect = 9.7
 Identities = 26/122 (21%), Positives = 33/122 (27%), Gaps = 6/122 (4%)

Query: 68  PPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ---S 124
           P L      P         + QP     A    S   + A L      P +  P Q    
Sbjct: 123 PSLWGTAPKPEPQPPQAPES-QPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLP 181

Query: 125 AAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
                + A+     P      YP P    P       Q  + A     AQPP P +    
Sbjct: 182 QGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQP--QQFLPAPSQAPAQPPLPPQLPQQ 239

Query: 185 IP 186
            P
Sbjct: 240 PP 241


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 39.0 bits (91), Expect = 0.019
 Identities = 45/287 (15%), Positives = 70/287 (24%), Gaps = 46/287 (16%)

Query: 1   MASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLV--QAITARKSSTLDQSPAVSDPG 58
            A +     + P P            E        L      +  +  +       S   
Sbjct: 60  AACDRFEPPTGPPPGPG---TEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDP 116

Query: 59  MASGVSDQSPPLSSAPA-------------NPVQASAPVQAAHQPSATVQAAPGSSASVL 105
                   SPP S AP               P  +     A+    A+  A+   +A  L
Sbjct: 117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPL 176

Query: 106 AAPLPS--SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCD--------PS 155
           ++P  +  +   P   PP  +    A P       P +A    P P             S
Sbjct: 177 SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236

Query: 156 YYGHYLQSAMKASRGQVAQPPPP-----------------SESVTPIPLSPVSSDQEDFS 198
                 +S+      +   P P                    S  P P S  SS +E   
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296

Query: 199 EEDEVVDCNPPALFSFAPSTKEREPSIPDPDSELASQGVVCQKLGSP 245
                   + PA  S   ++     S     S  +S     +     
Sbjct: 297 SPSPSSPGSGPAP-SSPRASSSSSSSRESSSSSTSSSSESSRGAAVS 342



 Score = 34.8 bits (80), Expect = 0.30
 Identities = 19/109 (17%), Positives = 33/109 (30%), Gaps = 6/109 (5%)

Query: 45  SSTLDQSPAVSDPGMASGV------SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
           SST   S +     ++ G       S   PP  + P++P +   P +A   P+A+     
Sbjct: 327 SSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386

Query: 99  GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYP 147
              A    A            P  +         + +    +    L P
Sbjct: 387 RRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435



 Score = 30.9 bits (70), Expect = 4.6
 Identities = 23/105 (21%), Positives = 36/105 (34%), Gaps = 10/105 (9%)

Query: 45  SSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASV 104
           S T D     S P    G + + PP  S+ ++            +P+A    A G +   
Sbjct: 823 SHTPDGGSESSGPARPPGAAARPPPARSSESSK----------SKPAAAGGRARGKNGRR 872

Query: 105 LAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
              P    +     +PP  +AA     A      P+    L P+P
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMP 917


>gnl|CDD|238823 cd01645, RT_Rtv, RT_Rtv: Reverse transcriptases (RTs) from
           retroviruses (Rtvs). RTs catalyze the conversion of
           single-stranded RNA into double-stranded viral DNA for
           integration into host chromosomes. Proteins in this
           subfamily contain long terminal repeats (LTRs) and are
           multifunctional enzymes with RNA-directed DNA
           polymerase, DNA directed DNA polymerase, and
           ribonuclease hybrid (RNase H) activities. The viral RNA
           genome enters the cytoplasm as part of a nucleoprotein
           complex, and the process of reverse transcription
           generates in the cytoplasm forming a linear DNA duplex
           via an intricate series of steps. This duplex DNA is
           colinear with its RNA template, but contains terminal
           duplications known as LTRs that are not present in viral
           RNA. It has been proposed that two specialized template
           switches, known as strand-transfer reactions or "jumps",
           are required to generate the LTRs.
          Length = 213

 Score = 37.6 bits (88), Expect = 0.019
 Identities = 46/203 (22%), Positives = 82/203 (40%), Gaps = 20/203 (9%)

Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
           A++  + E L+ G ++   ST+ + + +F++ K +G  R + +L+ +N        +L  
Sbjct: 16  ALTELVTEQLKEGHIEP--STSPWNTPVFVIKKKSGKWRLLHDLRAVNAQTQDM-GALQP 72

Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR------FLALSYNGDVLAM----TCL 584
               P+ L KG  +I +DL   +F +P+   H        F   S N    A       L
Sbjct: 73  GLPHPAALPKGWPLIVLDLKDCFFSIPL---HPDDRERFAFTVPSINNKGPAKRYQWKVL 129

Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVV--YLDDFLLVNQDPRILEIQGKLAVSILG 642
           P G+  +P    S         R +   +V+  Y+DD L+ +     L    +     L 
Sbjct: 130 PQGMKNSPTICQSFVAQALEPFRKQYPDIVIYHYMDDILIASDLEGQLREIYEELRQTLL 189

Query: 643 SLGWIVNLQKSSLSPAPVLQFLG 665
             G  +  +K      P  Q+LG
Sbjct: 190 RWGLTIPPEKVQKE--PPFQYLG 210


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 38.5 bits (89), Expect = 0.021
 Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 17/121 (14%)

Query: 3   SNPKRDTSDPQPEAKRLCHL---EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGM 59
           S  K DT  P P A  L  +   E+ T ++   LA +  A  A      +++PA S+ G 
Sbjct: 164 STDKVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAIIGDANAAPAEPAEEEAPAPSEAG- 222

Query: 60  ASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFIS 119
                       S PA    A AP  AA  P A   A   ++A   AAP+ S    P+++
Sbjct: 223 ------------SEPAPDPAARAP-HAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT 269

Query: 120 P 120
           P
Sbjct: 270 P 270


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 38.0 bits (87), Expect = 0.024
 Identities = 33/129 (25%), Positives = 55/129 (42%), Gaps = 9/129 (6%)

Query: 23  EEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASA 82
           E+  E+  R  A    A   + ++    +P+      A+  +  +   + A A P +A+A
Sbjct: 182 EKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAA 241

Query: 83  PVQAAHQPSATVQAAPGSSAS--VLAAPLPSSSGQP---FISPPAQSAAFLAQ----PAS 133
               A    A   A P  +A+    AA  P+ +  P     +PPA++AA  A+    PA 
Sbjct: 242 APAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAK 301

Query: 134 TASLPPSAA 142
            A+ P  AA
Sbjct: 302 AAAAPAKAA 310



 Score = 36.5 bits (83), Expect = 0.064
 Identities = 26/105 (24%), Positives = 42/105 (40%), Gaps = 1/105 (0%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPS-ATVQAA 97
           A  A+ ++   ++ A      A+     +PP  +A      A+ P +AA  P+ A    A
Sbjct: 227 AAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPA 286

Query: 98  PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
             ++    AA  P+ +        A  A   A PA  A+ P  AA
Sbjct: 287 KAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAA 331



 Score = 34.2 bits (77), Expect = 0.40
 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 9/109 (8%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
           A  A+ ++   ++ A      A      +PP  +A      A+ P +AA  P A   A P
Sbjct: 234 APPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP-AKAAAPP 292

Query: 99  GSSASV---LAAPLPSSSGQP--FISPPAQSAAFLAQPASTASLPPSAA 142
             +A+     AA    ++  P    +PPA++A   A PA  A+ P  AA
Sbjct: 293 AKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAA---APPAKAATPPAKAA 338


>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1.  Insect cuticles
           are composite structures whose mechanical properties are
           optimised for biological function. The major components
           are the chitin filament system and the cuticular
           proteins, and the cuticle's properties are determined
           largely by the interactions between these two sets of
           molecules. The proteins can be ordered by species.
          Length = 164

 Score = 36.5 bits (84), Expect = 0.026
 Identities = 22/91 (24%), Positives = 30/91 (32%), Gaps = 5/91 (5%)

Query: 61  SGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
           S V      +S   A     +    A   P+    AAP  +A   AAP    +     + 
Sbjct: 34  SSVRKSDTRIS-NNAYQPAYAKTAYAYAAPAVYAAAAPVYAAHAYAAPAVHYAAAAHYAA 92

Query: 121 PAQSAAFLAQPASTA----SLPPSAAHHLYP 147
           PA +    A PA TA    + P       Y 
Sbjct: 93  PAYAKYAYAAPAVTAKAAYAAPAPVYKTAYA 123


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 37.8 bits (88), Expect = 0.027
 Identities = 30/179 (16%), Positives = 46/179 (25%), Gaps = 15/179 (8%)

Query: 59  MASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP--LPSSSGQP 116
             SGV++     S     PV           P+  +QA P         P   P+++   
Sbjct: 40  SQSGVAEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPA 99

Query: 117 FISPPAQSAAFLAQPASTASLPPSAAH--HLYPLPFFCDPSYYGHYLQ---------SAM 165
             + PA  A    Q       P          P     D       LQ         ++ 
Sbjct: 100 TFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASG 159

Query: 166 KASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDC--NPPALFSFAPSTKERE 222
           +    Q +        +   P       Q D + +DE         A  +     K  E
Sbjct: 160 QLPSQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAE 218


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
           Provisional.
          Length = 408

 Score = 37.9 bits (88), Expect = 0.029
 Identities = 23/102 (22%), Positives = 32/102 (31%), Gaps = 5/102 (4%)

Query: 37  VQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQA 96
           V         T D   A  D        +    L+SA     Q         QP   + A
Sbjct: 166 VLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAA 225

Query: 97  APGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
              + A V++ P P ++     SP         QP  TA+ P
Sbjct: 226 EAQAKAEVISTPSPVTAA---ASPTITPHQ--TQPLPTAAAP 262


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 38.1 bits (88), Expect = 0.031
 Identities = 18/107 (16%), Positives = 29/107 (27%), Gaps = 10/107 (9%)

Query: 84  VQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
           ++ A   + +   A  +  +V+AAP   +       P        + PA +         
Sbjct: 348 LKTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVP 407

Query: 144 HLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPV 190
                P    P        S            P P   V P P +P 
Sbjct: 408 QFCGDPGLVSPYNPQSPGTSY----------GPEPVGPVPPQPTNPY 444


>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain.  This
           domain is found at the C-terminus of the Sox family of
           transcription factors. It is found associated with
           pfam00505. It binds to the Armadillo repeats (pfam00514)
           in Catenin beta-1 (CTNNB1), which is involved in
           transcriptional regulation. It functions as a
           transactivating domain (TAD).
          Length = 197

 Score = 36.6 bits (85), Expect = 0.035
 Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 1/76 (1%)

Query: 107 APLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGH-YLQSAM 165
             +P      +    A + A L Q A T ++P  +              YYG  Y+    
Sbjct: 49  QMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGCPTPPQMYYGQMYVPECA 108

Query: 166 KASRGQVAQPPPPSES 181
           K    Q+ Q  PP ES
Sbjct: 109 KHHPVQLGQLSPPPES 124


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 37.5 bits (88), Expect = 0.040
 Identities = 26/157 (16%), Positives = 43/157 (27%), Gaps = 25/157 (15%)

Query: 71  SSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQ 130
              P  P Q++AP  +A   +A   A     A  +  P  S       +P    A  L +
Sbjct: 365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPAS-------APQQAPAVPLPE 417

Query: 131 PASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSES----VTPIP 186
             S      +A   L                 +  K S    A    P  S    +  + 
Sbjct: 418 TTSQLL---AARQQLQRAQ-----------GATKAKKSEPAAASRARPVNSALERLASVR 463

Query: 187 LSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREP 223
            +P + ++    +E        P      P    +  
Sbjct: 464 PAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKAL 500



 Score = 37.2 bits (87), Expect = 0.061
 Identities = 24/136 (17%), Positives = 34/136 (25%), Gaps = 26/136 (19%)

Query: 77  PVQASAPVQAAHQPSATV--QAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPAST 134
           P       +   Q +A      A  +  + +A P   +   P  S P Q       PA  
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQ------APAVP 414

Query: 135 ASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQ 194
                S                    L +  +  R Q A     SE        PV+S  
Sbjct: 415 LPETTSQL------------------LAARQQLQRAQGATKAKKSEPAAASRARPVNSAL 456

Query: 195 EDFSEEDEVVDCNPPA 210
           E  +           A
Sbjct: 457 ERLASVRPAPSALEKA 472



 Score = 31.8 bits (73), Expect = 2.6
 Identities = 18/123 (14%), Positives = 34/123 (27%), Gaps = 20/123 (16%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ---- 89
                A +A+ ++    + A             +P  + A   P   S  + A  Q    
Sbjct: 373 QSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRA 432

Query: 90  ----------PSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPP 139
                     P+A  +A P +SA    A +  +          + A    +     +  P
Sbjct: 433 QGATKAKKSEPAAASRARPVNSALERLASVRPA------PSALEKAPAKKEAYRWKATNP 486

Query: 140 SAA 142
              
Sbjct: 487 VEV 489


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 36.9 bits (86), Expect = 0.045
 Identities = 19/96 (19%), Positives = 27/96 (28%), Gaps = 1/96 (1%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
                   P   +    Q P     P  P+Q     Q A  P   V +AP  +       
Sbjct: 113 VPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAP-QPVHSAPQPAQQAFQPA 171

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
            P ++ QP             +      +   AAHH
Sbjct: 172 EPVAAPQPEPVAEPAPVMDKPKRKEAVIVMNVAAHH 207



 Score = 33.5 bits (77), Expect = 0.62
 Identities = 38/169 (22%), Positives = 46/169 (27%), Gaps = 18/169 (10%)

Query: 43  RKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSA 102
           R  S  D      D     GV +          N   A+A    A +PS   Q  P  ++
Sbjct: 43  RMKSKRDDDSYDEDVEDDEGVGEVRVH----RVNHAPANAQEHEAARPSPQHQYQPPYAS 98

Query: 103 SVLAAPLPSSSGQPFISPPAQSAAFLAQPA-STASLPPSAAHHLYPLPFFCDPSYYGHYL 161
              A P       P    P Q A   AQPA      P        PL     P       
Sbjct: 99  ---AQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQV----- 150

Query: 162 QSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPA 210
                A   Q     P        P  PV++ Q +   E   V   P  
Sbjct: 151 -----APAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.4 bits (86), Expect = 0.058
 Identities = 45/193 (23%), Positives = 65/193 (33%), Gaps = 20/193 (10%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASA-PVQAAHQPSATVQAAPGSSASVLAAPLP 110
           P++  P  A+      PP  SA A P Q S    Q A QP      +  S+ S+    LP
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238

Query: 111 SSSGQPFISPPAQSAAFLAQPASTASLP--------PSAAHHLYPLPFFCDPSYYGHYLQ 162
           S    P + P   S      PA ++  P        P   H L   P F           
Sbjct: 239 SP--HPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQP 296

Query: 163 SAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKERE 222
             +  S+      P  ++  +  P S  +   +    E  +    PP     APS    +
Sbjct: 297 FGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPL----PP-----APSMPHIK 347

Query: 223 PSIPDPDSELASQ 235
           P    P  +L +Q
Sbjct: 348 PPPTTPIPQLPNQ 360



 Score = 33.9 bits (77), Expect = 0.55
 Identities = 32/134 (23%), Positives = 47/134 (35%), Gaps = 21/134 (15%)

Query: 66  QSPPLSSAPANP----VQASAPVQAAHQPSATVQAAPGSSA--------SVLAAPLPSSS 113
           QS PL S PA P       S P +A+  P + + + P  S           L A  P  S
Sbjct: 411 QSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPS 470

Query: 114 GQP-FISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQV 172
                 + P ++++    P S        A    PLP            +  +  +  + 
Sbjct: 471 LPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI-------QIKEEPLDEAE-EP 522

Query: 173 AQPPPPSESVTPIP 186
             PPPP  S +P P
Sbjct: 523 ESPPPPPRSPSPEP 536



 Score = 33.1 bits (75), Expect = 1.1
 Identities = 39/199 (19%), Positives = 59/199 (29%), Gaps = 30/199 (15%)

Query: 48  LDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA 107
           +DQ    S P + S   ++S   SSA    +Q   P         ++Q  PG++ +  A 
Sbjct: 143 IDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGP--------PSIQVPPGAALAPSAP 194

Query: 108 PLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKA 167
           P           P   + A   Q +  A+ P        PL     PS +   L S    
Sbjct: 195 P-----------PTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243

Query: 168 SRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPD 227
            + Q A    P +   P    P SS               PP   +        +    +
Sbjct: 244 LQPQTASQQSP-QPPAPSSRHPQSSHHGPG----------PPMPHALQQGPVFLQHPSSN 292

Query: 228 PDSELASQGVVCQKLGSPA 246
           P             L  P+
Sbjct: 293 PPQPFGLAQSQVPPLPLPS 311



 Score = 32.0 bits (72), Expect = 2.2
 Identities = 65/277 (23%), Positives = 83/277 (29%), Gaps = 44/277 (15%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAP 98
           +I     + L  S     P  A  V  Q  P+++ PA   Q  +P+     PS   Q  P
Sbjct: 180 SIQVPPGAALAPSAPPPTPS-AQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238

Query: 99  G---------SSASVLAAPLPSSS---------GQPFISPPAQSAAFLAQPASTASLPPS 140
                     +S      P PSS          G P      Q   FL  P+S    P  
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298

Query: 141 AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP-----PPSESV--------TPIPL 187
            A    P      PS    +  +    S  Q  QPP     PP+ S+        TPIP 
Sbjct: 299 LAQSQVPPLPL--PSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQ 356

Query: 188 SPVSSDQE--DFSEEDEVVDCN-----PPALFSFAPSTKEREPSIPDPDSELASQGVVCQ 240
            P  S +                    PPAL   +       PS   P  +L  Q    Q
Sbjct: 357 LPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQS---Q 413

Query: 241 KLGSPAWNRVRYTESQRALQAGGVFQPLAIPSEFPHS 277
            L S        T+SQ             + S  P S
Sbjct: 414 PLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQS 450


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 36.9 bits (86), Expect = 0.061
 Identities = 25/112 (22%), Positives = 32/112 (28%), Gaps = 8/112 (7%)

Query: 40  ITARKSSTLD--QSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
           I A   S L     PA + P  A      +P   S PA P    AP    H    T Q  
Sbjct: 40  IVALADSDLAALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQRE 99

Query: 98  P------GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
                    +A    AP   +      +   + A   A          +AA 
Sbjct: 100 QLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAA 151



 Score = 31.9 bits (73), Expect = 2.2
 Identities = 14/97 (14%), Positives = 25/97 (25%), Gaps = 9/97 (9%)

Query: 52  PAVSDPGMASGVSDQS----PPLSSAPANPVQASAPV--QAAHQPSATVQAAPGSSASVL 105
           P  + P   S  +  +    P L             +  +AA   +   QA    +    
Sbjct: 65  PPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAAPAERAA 124

Query: 106 AAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           A        +   + P      +   A+ A      A
Sbjct: 125 AENAARRLARAAAAAP---RPRVPADAAAAVADAVKA 158


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score = 36.7 bits (86), Expect = 0.070
 Identities = 15/55 (27%), Positives = 23/55 (41%)

Query: 78  VQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
           V A+AP  AA   +A   AA  ++ +   A   + +     +  A   A  A PA
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKAPHASPA 248


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 36.4 bits (84), Expect = 0.072
 Identities = 30/90 (33%), Positives = 35/90 (38%), Gaps = 10/90 (11%)

Query: 33  LAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA 92
           L QL   +  RK   L Q P       A  V  Q PP  + P     A+ P      PSA
Sbjct: 126 LKQLDCELVIRKKEALPQPPPP-----APVVMMQPPPPHAMPPASPPAAQPA-----PSA 175

Query: 93  TVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
              + P + AS   A  P SS  P  SP A
Sbjct: 176 PASSPPPTPASPPPAKAPKSSHPPLKSPMA 205



 Score = 33.7 bits (77), Expect = 0.41
 Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 6/61 (9%)

Query: 79  QASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
              APV     P           A+  A   P+SS  P  +P +   A     A  +S P
Sbjct: 145 PPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPP--TPASPPPA----KAPKSSHP 198

Query: 139 P 139
           P
Sbjct: 199 P 199


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 36.7 bits (85), Expect = 0.074
 Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 12/111 (10%)

Query: 33  LAQLVQAITARKSSTLDQ---SPAVSDPGMASGVSDQSPPLSSAPA-NPVQASAPVQAAH 88
           LAQL Q           +    P  + P      +   P  ++A + +P Q+SA  Q + 
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQP-----AAAPQPSAAAAASPSPSQSSAAAQPSA 413

Query: 89  QPSATVQAA---PGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTAS 136
             SAT  A      S     A P+   S  P    PAQ       P S  S
Sbjct: 414 PQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVS 464



 Score = 34.4 bits (79), Expect = 0.44
 Identities = 31/173 (17%), Positives = 56/173 (32%), Gaps = 23/173 (13%)

Query: 29  IERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAH 88
           +E  L QL Q       ++  + P      + +  +    P ++A A+P  + +   A  
Sbjct: 352 VELTLIQLAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAA-- 409

Query: 89  QPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPL 148
           QPSA   A   +        +   +  P    P  +A    +PA                
Sbjct: 410 QPSAPQSATQPAGTPP-TVSVDPPAAVP--VNPPSTAPQAVRPAQFKEEKKIP------- 459

Query: 149 PFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEED 201
                 S       S ++  + +  Q          I  +P  + +E F+EED
Sbjct: 460 -----VSKVSSLGPSTLRPIQEKAEQAT------GNIKEAPTGTQKEIFTEED 501


>gnl|CDD|238822 cd01644, RT_pepA17, RT_pepA17: Reverse transcriptase (RTs) in
           retrotransposons. This subfamily represents the RT
           domain of a multifunctional enzyme. C-terminal to the RT
           domain is a domain homologous to aspartic proteinases
           (corresponding to Merops family A17) encoded by
           retrotransposons and retroviruses. RT catalyzes DNA
           replication from an RNA template and is responsible for
           the replication of retroelements.
          Length = 213

 Score = 35.4 bits (82), Expect = 0.095
 Identities = 34/172 (19%), Positives = 53/172 (30%), Gaps = 45/172 (26%)

Query: 535 HFRIPSFLQKGDYMISIDLSQAY--FHVPIKTTH-QRFLALSYNGD----VLAMTCLPFG 587
            FR      +G   +S D+ + +    V  +     RFL      +       MT +PFG
Sbjct: 54  RFR------QGKIAVSADIEKMFHQVKVRPEDRDVLRFLWRKDGDEPKPIEYRMTVVPFG 107

Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVV--------YLDDFL-----------LVNQDPR 628
            A+AP     L+N                       Y+DD L           +  +   
Sbjct: 108 AASAP----FLANRALKQHAEDHPHEAAAKIIKRNFYVDDILVSTDTLNEAVNVAKRLIA 163

Query: 629 ILEIQG-KLA------VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
           +L+  G  L         +L  L     L           + LG+ W+P  D
Sbjct: 164 LLKKGGFNLRKWASNSQEVLDDLPEERVLLDRDSDVT--EKTLGLRWNPKTD 213


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 35.6 bits (81), Expect = 0.10
 Identities = 47/194 (24%), Positives = 73/194 (37%), Gaps = 9/194 (4%)

Query: 7   RDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQ 66
           ++  + Q  A+     E K EK ++  AQ     T  K    + + A  +   A    ++
Sbjct: 37  KENEEAQAAAETTEVKEAKEEKPDK-DAQDTANKTEEKEGEKEAAAAKEEAPKAE--PEK 93

Query: 67  SPPLSSAPANPVQASAPVQ---AAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQ 123
           +   + A A P +AS P Q   AA  P+A  +A   S AS   A   + + +   S    
Sbjct: 94  TEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEEKSKEEG 153

Query: 124 SAAFLAQPASTASLPPS---AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSE 180
            A     PA+ A    S    A    P      PS       +   +S  + + P  P+E
Sbjct: 154 EAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASAPAAPAE 213

Query: 181 SVTPIPLSPVSSDQ 194
            V P      +SDQ
Sbjct: 214 EVKPSEAPAANSDQ 227


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 35.9 bits (83), Expect = 0.13
 Identities = 32/176 (18%), Positives = 56/176 (31%), Gaps = 13/176 (7%)

Query: 33  LAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA 92
           LA  VQA  A +   ++ +PA     +    + Q             A+A   A    SA
Sbjct: 385 LAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQ--------QQVQAANAEAVAEADASA 436

Query: 93  TVQAAPGSSASVLAAPLPS-SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP-- 149
                   +    +  L + ++ Q  I   AQS  F A  +S  +   +    +      
Sbjct: 437 EPADTVEQALDDESELLAALNAEQAVILSQAQSQGFEAS-SSLDADNSAVPEQIDSTAEQ 495

Query: 150 FFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVD 205
              +PS     +     AS    A             L     D+ D++++   +D
Sbjct: 496 SVVNPSVTDTQVDD-TSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLD 550



 Score = 30.8 bits (70), Expect = 4.7
 Identities = 35/167 (20%), Positives = 50/167 (29%), Gaps = 24/167 (14%)

Query: 2   ASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQS---PAVSDPG 58
           A +PK +    +  A R    + KT       A L +  ++  +S    S       +  
Sbjct: 628 ALSPK-EGDGKKSSADR----KPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELA 682

Query: 59  MASGVSDQSPPLSSAPANPVQA---------SAPVQAAHQPSATVQAAPGSSASVLAAPL 109
               V + +    SAPA P             AP  A+        A    S SV  A  
Sbjct: 683 THQSVPEAALASGSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASN 742

Query: 110 PSSSG-------QPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
                       QP +   AQS A       T+S       +L  L 
Sbjct: 743 SELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTELNLVLLS 789


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 35.2 bits (81), Expect = 0.20
 Identities = 16/108 (14%), Positives = 26/108 (24%), Gaps = 8/108 (7%)

Query: 36  LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
           +++A+     +     P  + P         S    +A A                  V 
Sbjct: 355 VIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAAN-IPPKEPVRETATPPPVP 413

Query: 96  AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAH 143
             P      +A P+P +         A        P  T   PP    
Sbjct: 414 PRP------VAPPVPHTPESAPKLTRAAIPVDEK-PKYTPPAPPKEEE 454


>gnl|CDD|238828 cd01651, RT_G2_intron, RT_G2_intron: Reverse transcriptases (RTs)
           with group II intron origin. RT transcribes DNA using
           RNA as template. Proteins in this subfamily are found in
           bacterial and mitochondrial group II introns. Their most
           probable ancestor was a retrotransposable element with
           both gag-like and pol-like genes. This subfamily of
           proteins appears to have captured the RT sequences from
           transposable elements, which lack long terminal repeats
           (LTRs).
          Length = 226

 Score = 34.5 bits (80), Expect = 0.23
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 608 SRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV--LQFLG 665
            R +R V Y DDF++  + P+  E   +L    L  LG  +N +K+ ++        FLG
Sbjct: 164 FRRLRYVRYADDFVIGVRGPKEAEEIKELIREFLEELGLELNPEKTRITHFKSEGFDFLG 223


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 35.0 bits (80), Expect = 0.23
 Identities = 32/138 (23%), Positives = 46/138 (33%), Gaps = 7/138 (5%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
                P   +  + Q PP +  P  P  A  P   A +P+A    A   +A+   A  P+
Sbjct: 681 ANTMLPIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPPAAAPGRARPPA 738

Query: 112 SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQ 171
           ++      P A         A+     P AA    P P    P        +  +  RG 
Sbjct: 739 AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQ--PPPQAP---PAPQQRPRGA 793

Query: 172 VAQPPPPSESVTPIPLSP 189
               PPP    T + L P
Sbjct: 794 PTPQPPPQAGPTSMQLMP 811



 Score = 30.4 bits (68), Expect = 6.2
 Identities = 34/157 (21%), Positives = 49/157 (31%), Gaps = 22/157 (14%)

Query: 51  SPAVSDPGMASGVSDQSPPLSSAPANPVQASA----PVQAAHQPSATVQAAPGSSA---- 102
           +   + P  A+    + P  +   A P  A+     P  AA   +    AAPG+      
Sbjct: 720 ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779

Query: 103 -SVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHY- 160
                AP     G P   PP Q     A P S   +P +A     P          G   
Sbjct: 780 PQAPPAPQQRPRGAPTPQPPPQ-----AGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834

Query: 161 -------LQSAMKASRGQVAQPPPPSESVTPIPLSPV 190
                    +A++        P P S +   I  +PV
Sbjct: 835 RGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPV 871


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 34.3 bits (79), Expect = 0.27
 Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 69  PLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
           P+   PA+P + +AP +     +    + P +S S  + P   S  QP     AQ+
Sbjct: 123 PVEPRPASPFENTAPARPTSSTATAAASKPVTSVS--SGPRALSRNQPQYPARAQA 176


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 34.8 bits (79), Expect = 0.28
 Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 12/87 (13%)

Query: 66  QSPPLSSAP-----ANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ----- 115
           Q P + S P     A PV A AP  AA  P+A   AA   +A +  A  P+  GQ     
Sbjct: 92  QLPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQAALQN 151

Query: 116 --PFISPPAQSAAFLAQPASTASLPPS 140
              F++P    A+ L   +     PP+
Sbjct: 152 AFTFLNPSNFDASSLLGQSVPTFAPPN 178


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 34.5 bits (79), Expect = 0.29
 Identities = 49/237 (20%), Positives = 68/237 (28%), Gaps = 42/237 (17%)

Query: 6   KRDTSDPQPEAKRLCHLEE-KTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVS 64
           KR+   P  E    C  ++      E+CL            S  D+ PA          +
Sbjct: 88  KREPQSPSKELNSSCSQKQPPYPYGEKCLYS---------YSAYDRKPASGFKPPTPPST 138

Query: 65  DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS-----SSGQPF-- 117
             SP         +Q S P+  +  PS      P  + S L  P+ S      S   F  
Sbjct: 139 PCSPVNPQETVRQLQPSGPLSNSSPPSPHT---PLPNQSPLPPPMSSPDSSYPSEHRFQR 195

Query: 118 -ISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP 176
            +S P           S    PP       PL  +    +   Y    +    G   Q P
Sbjct: 196 QLSEPCLPFPPPPGRGSRDGRPPYHRQMSEPLVPYPPQGFKQEYHDP-LYEEAGVPNQGP 254

Query: 177 PPSESVTPIPLSPVSSDQE--DFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDSE 231
            P          P+   QE  DF+ + EV  C          S+  R         E
Sbjct: 255 FP---------HPMMIKQEPRDFTYDSEVPGC---------HSSYGRAEGFYSNRHE 293


>gnl|CDD|216011 pfam00589, Phage_integrase, Phage integrase family.  Members of this
            family cleave DNA substrates by a series of staggered
            cuts, during which the protein becomes covalently linked
            to the DNA through a catalytic tyrosine residue at the
            carboxy end of the alignment. The catalytic site residues
            in CRE recombinase are Arg-173, His-289, Arg-292 and
            Tyr-324.
          Length = 170

 Score = 33.5 bits (77), Expect = 0.31
 Identities = 39/169 (23%), Positives = 63/169 (37%), Gaps = 23/169 (13%)

Query: 919  ASRVSAVVPNHFQVSRHVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKT 978
              R+ A +     +     + LLL +G R+ +L  LSL      +LD+  +   +  +KT
Sbjct: 8    VERLLAALEEPLNIRDRALVELLLLTGLRISEL--LSLRWSDI-DLDNGTIR--IGQTKT 62

Query: 979  DSSSHLQSGWKIKENSSDPLFCIPTWIRHLSTLSQRRMGSRPLTSLFITTRGIVQPASRS 1038
               S           S   L  +  W      L  R+        LF++ RG  +P SRS
Sbjct: 63   -RKSR------TVPLSDAALEALKEW------LGDRKEAEES-EFLFVSRRG--KPLSRS 106

Query: 1039 VIAGWVKTALRGANIIA--SPGSMRSAVATFRYNSDLPLDSILRQGNWR 1085
             +    + A + A I    +P  +R   AT    +  PL  I +     
Sbjct: 107  TVNRAFRRAGKRAGIEKDLTPHDLRHTFATHLAENGAPLRVIQKLLGHS 155


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 34.7 bits (79), Expect = 0.33
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 11/103 (10%)

Query: 52  PAVSDPGMASGVSDQS-PPLSSAPANPVQASAPVQAAHQ-----PSATVQAA----PGSS 101
           PA +    AS   D +  P S+A   P  A AP  AA +     P+    A+    P  +
Sbjct: 42  PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVA 101

Query: 102 ASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
             + AAP P ++ + F S      A      S AS  P  A H
Sbjct: 102 PQLAAAPKPDAA-EAFTSAAQAHEAPADAGTSAASKKPDPAAH 143


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 34.7 bits (79), Expect = 0.35
 Identities = 29/160 (18%), Positives = 45/160 (28%), Gaps = 18/160 (11%)

Query: 62  GVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA-----PGSSASVLAAPLPSSSGQP 116
             ++  PP   AP     A+   Q     + ++Q A     P  +         + +  P
Sbjct: 428 QENNDVPPWEDAPDEAQTAAGTAQT---SAKSIQTASEAETPPENQVSKNKAADNETDAP 484

Query: 117 FISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPP 176
               P       A P   A    + AH     PF      YG+               PP
Sbjct: 485 LSEVP-SENPIQATPNDEAVETETFAHEAPAEPF------YGYGFPDNDCPPEDGAEIPP 537

Query: 177 PPSESVTPIPLSPVSSDQE---DFSEEDEVVDCNPPALFS 213
           P  E   P   +   +D+E        +      PP   +
Sbjct: 538 PDWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFST 577



 Score = 30.8 bits (69), Expect = 5.4
 Identities = 36/193 (18%), Positives = 59/193 (30%), Gaps = 16/193 (8%)

Query: 51  SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLP 110
           +P  +    A+ V + +   S +     + +A  +   +P A     P  +AS  A P  
Sbjct: 359 APLAAASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSE 418

Query: 111 SSSGQPF-------ISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQS 163
             +  P        + P   +       A TA     +           +         S
Sbjct: 419 GKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTA------SEAETPPENQVS 472

Query: 164 AMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPST--KER 221
             KA+  +   P     S  PI  +P     E  +   E     P   + F  +    E 
Sbjct: 473 KNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHE-APAEPFYGYGFPDNDCPPED 531

Query: 222 EPSIPDPDSELAS 234
              IP PD E A+
Sbjct: 532 GAEIPPPDWEHAA 544


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 34.2 bits (79), Expect = 0.53
 Identities = 39/204 (19%), Positives = 62/204 (30%), Gaps = 37/204 (18%)

Query: 50   QSP-----AVSDPGMASG----------VSDQS-------PPLSSAPANPVQASAPVQAA 87
            QSP     A + P MASG            D           +   P       A     
Sbjct: 822  QSPMPLTVACASPEMASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEP 881

Query: 88   HQPSATVQAAPGSSAS--VLAAPLPSSSG-QPFISPPAQSAAFLAQPA----STASLPPS 140
               +  V+A         V+A P P          P   +A    QP     S  ++   
Sbjct: 882  VVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQE 941

Query: 141  AAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEE 200
             A H  P+    +P      ++ A + +   VA+P   ++   P+     +  +   + E
Sbjct: 942  VAEHAEPV---VEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVE 998

Query: 201  DEVVDCNPPA-----LFSFAPSTK 219
             EV     P        + AP T+
Sbjct: 999  PEVAPAQVPEATVEHNHATAPMTR 1022


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 33.9 bits (78), Expect = 0.57
 Identities = 19/80 (23%), Positives = 25/80 (31%), Gaps = 13/80 (16%)

Query: 57  PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
           P   S       P + A   P  A     AA    A  + AP        AP P ++   
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAP--------APAPPAA--- 430

Query: 117 FISPPAQSAAFLAQPASTAS 136
             + P   +A  A  AS   
Sbjct: 431 --AAPPARSADPAAAASAGD 448



 Score = 32.4 bits (74), Expect = 1.6
 Identities = 16/73 (21%), Positives = 26/73 (35%), Gaps = 6/73 (8%)

Query: 70  LSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLA 129
             + P+    A  P   A  P A     P       AAP   ++ +P  +P   +AA   
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPP------AAPARPAAARPAPAPAPPAAAAPP 434

Query: 130 QPASTASLPPSAA 142
             ++  +   SA 
Sbjct: 435 ARSADPAAAASAG 447



 Score = 30.1 bits (68), Expect = 7.7
 Identities = 16/83 (19%), Positives = 26/83 (31%), Gaps = 6/83 (7%)

Query: 22  LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
           L ++ E +ER  A    +      +    +         +      P   + PA    A 
Sbjct: 370 LLDRLEALERG-APAPPSAAWGAPTPAAPAAPPP-----AAAPPVPPAAPARPAAARPAP 423

Query: 82  APVQAAHQPSATVQAAPGSSASV 104
           AP   A        A P ++AS 
Sbjct: 424 APAPPAAAAPPARSADPAAAASA 446


>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
          Length = 617

 Score = 33.7 bits (78), Expect = 0.57
 Identities = 26/119 (21%), Positives = 41/119 (34%), Gaps = 9/119 (7%)

Query: 35  QLVQAITARKSSTLDQSPAVSDPGMASGVSD----QSPPLSSAPANPVQASAPVQAAHQP 90
            +VQAI       L QS  +     A+ V +    +  PL   P     +S P  +    
Sbjct: 316 LIVQAI----QEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSA 371

Query: 91  SATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA-AHHLYPL 148
                      +S    PL  ++ Q   +  A + A  A PA+      +A     +PL
Sbjct: 372 PPEQSEESQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEAAEEADSFPL 430


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 33.7 bits (77), Expect = 0.68
 Identities = 18/86 (20%), Positives = 29/86 (33%), Gaps = 12/86 (13%)

Query: 56  DPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQ 115
                +G+   +P L  AP        PV       A       S   V     P+S   
Sbjct: 331 AVCTDAGLLPDTPLLPDAPEGSSDPVVPV-------AAATPVDASLPDVRTGTAPTS--- 380

Query: 116 PFISPPAQSAAFLAQPASTASLPPSA 141
             ++  + +   +AQP   A+L  +A
Sbjct: 381 --LANVSHADPAVAQPTQAATLAGAA 404



 Score = 30.6 bits (69), Expect = 4.9
 Identities = 21/86 (24%), Positives = 30/86 (34%), Gaps = 8/86 (9%)

Query: 46  STLDQSP-AVSDPGMASGVSDQSPPLSSAPANPVQASA----PVQAAHQPSATVQAAPGS 100
           S+    P A + P  AS    ++    ++ AN   A      P QAA    A  + A   
Sbjct: 352 SSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAGAAPKGATHL 411

Query: 101 SASVLAAPLPSSSGQPFISPPAQSAA 126
               L   L    G P    P ++ A
Sbjct: 412 M---LEETLADLRGGPVRIVPPRNPA 434


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 33.3 bits (76), Expect = 0.71
 Identities = 20/121 (16%), Positives = 44/121 (36%), Gaps = 5/121 (4%)

Query: 22  LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
           ++E T +I R LA +  A   + + T             +  + +      A    V A 
Sbjct: 180 IDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK 239

Query: 82  APVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSA 141
              + A + +   +     +    A  +  ++ +      A++AA  A+ A+ A+   + 
Sbjct: 240 KAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK-----AAKAAAKAAKGAAKATKGKAK 294

Query: 142 A 142
           A
Sbjct: 295 A 295


>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
          Length = 746

 Score = 33.4 bits (76), Expect = 0.75
 Identities = 32/135 (23%), Positives = 44/135 (32%), Gaps = 12/135 (8%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
           D     S P M  G  D SP  S+APA              P      A GS+ +  A P
Sbjct: 617 DAVELPSTPSM--GTQDGSPAPSAAPAGYDIFEFAGDGTGAPH---PVASGSNGAQHADP 671

Query: 109 LPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKAS 168
           L         +  A S AF A   S A  P S      P         + + + S  +  
Sbjct: 672 LGDLFSGLPSTVGASSPAFQAASGSQA--PASP-----PTAASAIEDLFANGMGSGSQTV 724

Query: 169 RGQVAQPPPPSESVT 183
              ++  P  ++  T
Sbjct: 725 PLPISAAPQSADRDT 739


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 130

 Score = 31.7 bits (72), Expect = 0.79
 Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 3/47 (6%)

Query: 67  SPPLSSAPANPVQASAPV---QAAHQPSATVQAAPGSSASVLAAPLP 110
           +P  ++APA P     PV    +    +   Q A  + A  + +P+P
Sbjct: 23  APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69



 Score = 30.2 bits (68), Expect = 2.9
 Identities = 12/44 (27%), Positives = 16/44 (36%)

Query: 77  PVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
           P QA+AP Q A  P      A     +    P  ++      SP
Sbjct: 24  PAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSP 67


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 32.6 bits (74), Expect = 1.0
 Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 22/121 (18%)

Query: 92  ATVQAAP---GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTAS--LPPSAAHHLY 146
           A++QAA      S S   AP P     P       SA++L   +++AS  + PS     Y
Sbjct: 134 ASLQAARKEGLPSDSPAPAPSPRGRPHPASGSGRLSASYLQDLSTSASECIDPSVVFP-Y 192

Query: 147 PLPFFCDPSYYGHYLQSAMKASRGQVAQPPP-PSESVTPIPLSPVSSDQEDFSEEDEVVD 205
           PL                 ++   +VA P P       P   S   SD E   +E+E  +
Sbjct: 193 PLN---------------ERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEE 237

Query: 206 C 206
            
Sbjct: 238 E 238


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 32.2 bits (73), Expect = 1.0
 Identities = 33/160 (20%), Positives = 43/160 (26%), Gaps = 22/160 (13%)

Query: 59  MASGVSDQSPPLSSAPANPVQASAPVQAAHQPSA---TVQAAPGSSASVLAAPLPSSSGQ 115
           MA            A       +AP    H   A   T+   PG        P  + SG 
Sbjct: 7   MAMNHGRFPDGAKGARMGMNSLAAPNHGPHDLRALMGTLLHYPGGGMDASGRPRSAMSGP 66

Query: 116 PFISPPAQSAAFLAQPASTASLPP---------SAAHHLYPLPFFCDPSYYGHYLQSAMK 166
                P QS         ++   P          A+  L  L    +  Y GH    A  
Sbjct: 67  MGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKL----NTQYQGHAGAPAGH 122

Query: 167 ASRGQVAQP------PPPSESVTPIPLSPVSSDQEDFSEE 200
              G   Q       PP  + +    L P   D +   EE
Sbjct: 123 PGGGGPQQFRPGAGQPPGMQHMPAPALPPNVIDTDLIDEE 162


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 33.1 bits (75), Expect = 1.1
 Identities = 37/200 (18%), Positives = 54/200 (27%), Gaps = 27/200 (13%)

Query: 74  PANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPAS 133
           P  P +   P  A           P       +   P S   P   PP Q  +   +P  
Sbjct: 591 PEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEG 650

Query: 134 TASLPPSAAHHLYPLPFFCDPSY----YGHYLQSAMKASRGQVAQPPPPSESVTPIPLSP 189
              +           PF  DP +    Y  YL +A K+            E+ T + L  
Sbjct: 651 PKIIKSPKPPKSPKPPF--DPKFKEKFYDDYLDAAAKS-----------KETKTTVVLD- 696

Query: 190 VSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIP-----DPDSELASQGVVCQKLGS 244
             S +    E        P       P    R+   P     DPD+E        +    
Sbjct: 697 -ESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDD---IEFFTP 752

Query: 245 PAWNRVRYTESQRALQAGGV 264
           P   R  + E+        +
Sbjct: 753 PEEERTFFHETPADTPLPDI 772


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 17/61 (27%), Positives = 21/61 (34%), Gaps = 1/61 (1%)

Query: 82  APVQAAHQPS-ATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPS 140
           AP+     P   T   A   + +V   P  S+   P     A  A   A P S    PP 
Sbjct: 80  APLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPR 139

Query: 141 A 141
           A
Sbjct: 140 A 140


>gnl|CDD|182649 PRK10689, PRK10689, transcription-repair coupling factor;
            Provisional.
          Length = 1147

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 20/50 (40%), Positives = 32/50 (64%), Gaps = 3/50 (6%)

Query: 1046 TALRGANIIASPGSMRSAVATF--RYNSDLPLDSILRQGNWRGGQNFFNY 1093
            + +R  +IIA+P + R AV TF   Y+S +  ++ILR+   RGGQ ++ Y
Sbjct: 768  SGMRDLSIIATPPARRLAVKTFVREYDSLVVREAILRE-ILRGGQVYYLY 816


>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21). 
          Length = 326

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 21/97 (21%), Positives = 27/97 (27%), Gaps = 17/97 (17%)

Query: 58  GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP--------- 108
           G   GVS  + P   A    V  S    +   P  T   A  + ++  AA          
Sbjct: 228 GNDRGVSPGTAPSFDA-TPSVSPSGQPLSPAAPPGTSSVAGTALSASPAALFGDMVYVPL 286

Query: 109 -------LPSSSGQPFISPPAQSAAFLAQPASTASLP 138
                     +  QP         A LA PA     P
Sbjct: 287 DAYNQLLAGQAFNQPPDPQGPAPPAELAPPAPAPPPP 323


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 32.1 bits (73), Expect = 1.2
 Identities = 19/92 (20%), Positives = 34/92 (36%), Gaps = 1/92 (1%)

Query: 53  AVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSS 112
           A SD G  +   D + P + A A   Q++ P  ++   +      P ++A+    P P  
Sbjct: 86  APSDAGSQASPDDDAQPAAEAEAAD-QSAPPEASSTSATDEAATDPPATAAARDGPTPDP 144

Query: 113 SGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
           + QP      +S       +     P +   H
Sbjct: 145 TAQPATPDERRSPRQRPPVSGEPPTPSTPDAH 176


>gnl|CDD|187703 cd09279, RNase_HI_archaeal_like, RNAse HI family that includes
           Archaeal RNase HI.  Ribonuclease H (RNase H) is
           classified into two evolutionarily unrelated families,
           type 1 (prokaryotic RNase HI, eukaryotic RNase H1 and
           viral RNase H) and type 2 (prokaryotic RNase HII and
           HIII, and eukaryotic RNase H2). RNase H is an
           endonuclease that cleaves the RNA strand of an RNA/DNA
           hybrid in a sequence non-specific manner. RNase H is
           involved in DNA replication, repair and transcription.
           RNase H is widely present in various organisms,
           including bacteria, archaea and eukaryotes and most
           prokaryotic and eukaryotic genomes contain multiple
           RNase H genes. Despite the lack of amino acid sequence
           homology, Type 1 and type 2 RNase H share a main-chain
           fold and steric configurations of the four acidic
           active-site (DEDD) residues and have the same catalytic
           mechanism and functions in cells.  One of the important
           functions of RNase H is to remove Okazaki fragments
           during DNA replication. Most archaeal genomes contain
           only type 2 RNase H (RNase HII); however, a few contain
           RNase HI as well. Although archaeal RNase HI sequences
           conserve the DEDD active-site motif, they lack other
           common features important for catalytic function, such
           as the basic protrusion region. Archaeal RNase HI
           homologs are more closely related to retroviral RNase HI
           than bacterial and eukaryotic type I RNase H in
           enzymatic properties.
          Length = 128

 Score = 30.9 bits (71), Expect = 1.4
 Identities = 15/66 (22%), Positives = 32/66 (48%), Gaps = 1/66 (1%)

Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
           L    + +  D+Q VV+ ++ +   K+  L   +E+   L + +    + ++IP   N  
Sbjct: 60  LGIKKLEIYGDSQLVVNQIQGEYEVKNERLAPYLEEARELLKKFEEVEI-KWIPREENKE 118

Query: 884 ADSLSR 889
           AD+L+ 
Sbjct: 119 ADALAN 124


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 32.6 bits (74), Expect = 1.4
 Identities = 31/125 (24%), Positives = 41/125 (32%), Gaps = 13/125 (10%)

Query: 65  DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
           D  PP S +P  P   S  V +     A    AP ++A+  AAP P++        P   
Sbjct: 446 DLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPAN------MRPLSP 499

Query: 125 AAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
            A         S  P+A           +    G+       A     AQP P       
Sbjct: 500 YAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPR------ 553

Query: 185 IPLSP 189
            PLSP
Sbjct: 554 -PLSP 557


>gnl|CDD|153413 cd07959, Anticodon_Ia_Leu_AEc, Anticodon-binding domain of archaeal
           and eukaryotic cytoplasmic leucyl tRNA synthetases.
           This domain is found in leucyl tRNA synthetases (LeuRS),
           which belong to the class Ia aminoacyl tRNA synthetases.
           It lies C-terminal to the catalytic core domain. In
           contrast to other class Ia enzymes, the anticodon is not
           used as an identity element in LeuRS (with exceptions
           such as Saccharomyces cerevisiae and some other
           eukaryotes). No anticodon-binding site can be defined
           for this family, which includes archaeal and eukaryotic
           cytoplasmic members. LeuRS catalyzes the transfer of
           leucine to the 3'-end of its tRNA.
          Length = 117

 Score = 30.6 bits (70), Expect = 1.5
 Identities = 14/27 (51%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 416 RIGAELVGGRLRRFVDAWIRLGAP-AP 441
           R GA +    LRRF++ W RL AP AP
Sbjct: 81  RGGAGMNKDLLRRFIEVWTRLLAPFAP 107


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 32.2 bits (73), Expect = 1.5
 Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 4/78 (5%)

Query: 67  SPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAA 126
             P     A+P     P      PS T+ A   + A+  A   P + G P    P    A
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267

Query: 127 FLAQPASTASLPPSAAHH 144
               P + A+  P A+ +
Sbjct: 268 ----PPANATPAPEASRY 281


>gnl|CDD|173156 PRK14693, PRK14693, hypothetical protein; Provisional.
          Length = 552

 Score = 32.4 bits (73), Expect = 1.5
 Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 15/96 (15%)

Query: 39  AITARKSSTLDQSPAVSDPGMASGVSDQSPPLSS-------APANPVQASAPVQ-AAHQP 90
           A+T+   ST   +  V+  G A+G    +PP +        AP NP     PV   A  P
Sbjct: 25  AMTSSAVSTTGTTNTVASGGAAAGAPSFAPPAADAFPQADGAPRNPAVLQFPVPGGAPAP 84

Query: 91  SATVQAAP------GSSASVLAAPLPSSS-GQPFIS 119
           +    AAP      G  A+++ A L  +S   PF+S
Sbjct: 85  ADARVAAPVVYSAQGEQAAIMKAGLQQASWNNPFVS 120


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 32.2 bits (73), Expect = 1.6
 Identities = 23/106 (21%), Positives = 38/106 (35%), Gaps = 16/106 (15%)

Query: 62  GVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPP 121
           G  +   P  + P+     + P  A     AT  AAP ++A     P      +P     
Sbjct: 10  GKGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKK---DKPRRERK 66

Query: 122 AQSAA------FLAQPASTAS------LPPSAAHHLYPLPF-FCDP 154
            + A+      F+ +P    +      L P   H ++ L F +C P
Sbjct: 67  PKPASLWKLEDFVVEPQEGKTRFHDFNLAPELMHAIHDLGFPYCTP 112


>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
            synthase PfaA.  Members of the seed for this alignment
            are involved in omega-3 polyunsaturated fatty acid
            biosynthesis, such as the protein PfaA from the
            eicosapentaenoic acid biosynthesis operon in
            Photobacterium profundum strain SS9. PfaA is encoded
            together with PfaB, PfaC, and PfaD, and the functions of
            the individual polypeptides have not yet been described.
            More distant homologs of PfaA, also included with the
            reach of this model, appear to be involved in
            polyketide-like biosynthetic mechanisms of
            polyunsaturated fatty acid biosynthesis, an alternative
            to the more familiar iterated mechanism of chain
            extension and desaturation, and in most cases are encoded
            near genes for homologs of PfaB, PfaC, and/or PfaD.
          Length = 2582

 Score = 32.7 bits (74), Expect = 1.7
 Identities = 19/90 (21%), Positives = 29/90 (32%), Gaps = 3/90 (3%)

Query: 50   QSPAVSDPGMASGVSD-QSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAP 108
            Q+P +      + V         +AP  P   S PV +A    +   A     A V   P
Sbjct: 1125 QAPVIKSVVTQAPVVQVTISVAPAAPVLPAVVSPPVVSAAPAQSVATAVA--MAPVAEVP 1182

Query: 109  LPSSSGQPFISPPAQSAAFLAQPASTASLP 138
            +     Q     P+ + A   Q +   S  
Sbjct: 1183 IAVPVQQSVDYMPSVAQAAAPQASVNDSAI 1212


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 31.8 bits (72), Expect = 1.8
 Identities = 20/82 (24%), Positives = 27/82 (32%)

Query: 63  VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
           V     P  + P +    + P  A H  +    A      S LAA  P +      +  A
Sbjct: 78  VPTPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADA 137

Query: 123 QSAAFLAQPASTASLPPSAAHH 144
             AA  A P +     P AA  
Sbjct: 138 GDAAPDATPPAAGDASPPAAAQ 159


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 32.2 bits (73), Expect = 1.8
 Identities = 19/63 (30%), Positives = 28/63 (44%), Gaps = 3/63 (4%)

Query: 63  VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
           V+  +P  + APA   Q +A   AA QP     AAP ++ +   AP  + +  P     A
Sbjct: 191 VAGSTPATAPAPA-SAQPAAQSPAATQPEPA--AAPAAAKAQAPAPQQAGTQNPAKVDHA 247

Query: 123 QSA 125
             A
Sbjct: 248 APA 250



 Score = 30.2 bits (68), Expect = 7.0
 Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 51  SPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ-AAP 98
           +PA + P   S  + Q  P ++  A   QA AP QA  Q  A V  AAP
Sbjct: 201 APASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAKVDHAAP 249


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 31.7 bits (72), Expect = 2.1
 Identities = 23/81 (28%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 57  PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
           P +A    D +P   S  A P  A A   AA +  AT   A   + +  A      SG  
Sbjct: 229 PLLAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAA--AKNAKAPEAQPVSGTA 286

Query: 117 FISPPAQSAAFLAQPASTASL 137
              P  + AA  A  A+  +L
Sbjct: 287 AAEPAPKEAAKAAAAAAKPAL 307



 Score = 30.9 bits (70), Expect = 3.6
 Identities = 30/145 (20%), Positives = 44/145 (30%), Gaps = 10/145 (6%)

Query: 7   RDTSDPQPEAKRLCHLEEKTEK----IERCLAQLVQAITARKSST--LDQSPAVSDPGMA 60
           R  S+P   A  L  L E+  K      + LA+ +     R   T     +P   D    
Sbjct: 185 RYASEP---AGGLTSLTEEAGKARYNASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPV 241

Query: 61  SGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
              S+ + P S+          P  A    +   +A      S  AA  P+       + 
Sbjct: 242 PP-SEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAA 300

Query: 121 PAQSAAFLAQPASTASLPPSAAHHL 145
            A   A   +P       P  A  L
Sbjct: 301 AAAKPALEDKPRPLGIARPGGADDL 325


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 31.9 bits (72), Expect = 2.1
 Identities = 23/146 (15%), Positives = 37/146 (25%), Gaps = 14/146 (9%)

Query: 71  SSAPANPVQASAPVQAAHQPS-ATVQAAPGSSASVLAAP---LPSSSGQPF----ISPPA 122
             +      A+APV+AA  PS A    A  ++          L +   Q F     + P 
Sbjct: 117 PVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPV 176

Query: 123 QSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESV 182
             A   A   +  +  P+ A     L                +      +    P     
Sbjct: 177 PRAPVQAPVVAAPAPVPAIAA---ALAAHAAY---AQDDDEQLDDDGFDLDDALPQILPP 230

Query: 183 TPIPLSPVSSDQEDFSEEDEVVDCNP 208
             +P   V+                P
Sbjct: 231 AALPPIVVAPAAPAALAAVAAAAPAP 256


>gnl|CDD|178035 PLN02414, PLN02414, glycine dehydrogenase (decarboxylating).
          Length = 993

 Score = 32.0 bits (73), Expect = 2.2
 Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 5/52 (9%)

Query: 57  PGMAS-GVSDQSPPLSSAPANPV--QASAPVQAAHQPSATVQAAPGSSASVL 105
           PGM   GV     P    P++PV      P     QP  T+ AAP  SA +L
Sbjct: 740 PGMGPIGVKKHLAPF--LPSHPVVPTGGIPRPEKTQPLGTISAAPWGSALIL 789


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 31.6 bits (71), Expect = 2.3
 Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 12/132 (9%)

Query: 4   NPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGV 63
           N   + S+     +      E+ + +E+  A +  A     ++  +Q+ AV         
Sbjct: 97  NDPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEAR------ 150

Query: 64  SDQSPPLSSAPANPVQASA---PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
                     P     A A   PV+     S   +A     A         S+ +P  + 
Sbjct: 151 ---KQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETA 207

Query: 121 PAQSAAFLAQPA 132
           P    A   +P 
Sbjct: 208 PKADKADKTKPK 219


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 31.4 bits (72), Expect = 2.7
 Identities = 16/66 (24%), Positives = 30/66 (45%)

Query: 57  PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQP 116
            G+A+G+   +   +SA A+P++ +A   A    S+    A  +      A   +++G  
Sbjct: 335 AGVAAGLGGVARAGASAAASPLRRAASRAAESMKSSFRAGARSTGGGAGGAAAAAAAGAA 394

Query: 117 FISPPA 122
              PPA
Sbjct: 395 AAGPPA 400


>gnl|CDD|238231 cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal
            catalytic domain. The DNA breaking-rejoining enzyme
            superfamily includes type IB topoisomerases and tyrosine
            recombinases that share the same fold in their catalytic
            domain containing six conserved active site residues. The
            best-studied members of this diverse superfamily include
            human topoisomerase I, the bacteriophage lambda
            integrase, the bacteriophage P1 Cre recombinase, the
            yeast Flp recombinase and the bacterial XerD/C
            recombinases. Their overall reaction mechanism is
            essentially identical and involves cleavage of a single
            strand of a DNA duplex by nucleophilic attack of a
            conserved tyrosine to give a 3' phosphotyrosyl
            protein-DNA adduct. In the second rejoining step, a
            terminal 5' hydroxyl attacks the covalent adduct to
            release the enzyme and generate duplex DNA. The enzymes
            differ in that topoisomerases cleave and then rejoin the
            same 5' and 3' termini, whereas a site-specific
            recombinase transfers a 5' hydroxyl generated by
            recombinase cleavage to a new 3' phosphate partner
            located in a different duplex region.  Many DNA
            breaking-rejoining enzymes also have N-terminal domains,
            which show little sequence or structure similarity.
          Length = 164

 Score = 30.5 bits (69), Expect = 2.8
 Identities = 37/167 (22%), Positives = 60/167 (35%), Gaps = 30/167 (17%)

Query: 935  HVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKTDSSSHLQSGWKIKENS 994
            ++A+LLLL++G R+ +L   +L       LD  V    + G+KT                
Sbjct: 18   YLALLLLLATGLRISEL--CALRWSDID-LDKRV--IHITGTKT-KKERT---------- 61

Query: 995  SDPLFCIPTWIRHLSTLSQ--RRMGSRPLTS--LFITTRGIVQP-ASRSVIAGWVKTALR 1049
                  +P     L  L +  ++          LF + RG      SR  +    K A R
Sbjct: 62   ------VPLSEEALKLLKEYLKKRRPANGDEEYLFPSRRGGPAATLSRRNVKRIFKAAGR 115

Query: 1050 GANII---ASPGSMRSAVATFRYNSDLPLDSILRQGNWRGGQNFFNY 1093
             A I     +P S+R   A+   N+ L L+++              Y
Sbjct: 116  RAGIDKEGLTPHSLRHTFASALLNAGLDLEAVQDLLGHSSIAMTMRY 162


>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model represents one of several closely related clades
           of the dihydrolipoamide acetyltransferase subunit of the
           pyruvate dehydrogenase complex. It includes sequences
           from mitochondria and from alpha and beta branches of
           the proteobacteria, as well as from some other bacteria.
           Sequences from Gram-positive bacteria are not included.
           The non-enzymatic homolog protein X, which serves as an
           E3 component binding protein, falls within the clade
           phylogenetically but is rejected by its low score
           [Energy metabolism, Pyruvate dehydrogenase].
          Length = 436

 Score = 31.3 bits (71), Expect = 2.8
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 5/66 (7%)

Query: 49  DQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS----APVQAAHQPSATVQAAPGSSAS- 103
           D + A  +  + S  S    P   AP  P  A     AP + + +PS+    +   S   
Sbjct: 80  DVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLSDKESGDR 139

Query: 104 VLAAPL 109
           + A+PL
Sbjct: 140 IFASPL 145


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 31.6 bits (71), Expect = 2.9
 Identities = 42/182 (23%), Positives = 61/182 (33%), Gaps = 22/182 (12%)

Query: 64  SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL---PSSSGQPFISP 120
           SD+ PP +S P     ++ P   A   +     AP S+     A     P +SG     P
Sbjct: 558 SDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQP 617

Query: 121 PAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYY--------GHYLQSAMKASRGQV 172
           P+ +   +A       L       L        P  +        G  +Q    + R   
Sbjct: 618 PSSAPRDMAPSVVRMFL----RERLLEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPA 673

Query: 173 AQPPPPSESVTP--IPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDPDS 230
            Q  PP  S  P    L  V + +   SEE  +   +P       P + E +P   DPD 
Sbjct: 674 TQSTPPRPSWLPSVFVLPSVDAGRAQPSEESHLSSMSPT-----QPISHEEQPRYEDPDD 728

Query: 231 EL 232
            L
Sbjct: 729 PL 730


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 31.7 bits (72), Expect = 2.9
 Identities = 21/98 (21%), Positives = 27/98 (27%), Gaps = 2/98 (2%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
           P  +  G A G     P     PA+  + S   +          AA    A     P   
Sbjct: 646 PDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAAL--RAPQAPRPGGP 703

Query: 112 SSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLP 149
             G   + PP    A        +SL  S      P P
Sbjct: 704 PGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEP 741


>gnl|CDD|239981 cd04608, CBS_pair_PALP_assoc, This cd contains two tandem repeats
          of the cystathionine beta-synthase (CBS pair) domains
          associated with the pyridoxal-phosphate (PALP)
          dependent enzyme domain upstream.   The vitamin B6
          complex comprises pyridoxine, pyridoxal, and
          pyridoxamine, as well as the 5'-phosphate esters of
          pyridoxal (PALP) and pyridoxamine, the last two being
          the biologically active coenzyme derivatives.  The
          members of the PALP family are principally involved in
          the biosynthesis of amino acids and amino acid-derived
          metabolites, but they are also found in the
          biosynthetic pathways of amino sugars and other
          amine-containing compounds.  CBS is a small domain
          originally identified in cystathionine beta-synthase
          and subsequently found in a wide range of different
          proteins. CBS domains usually come in tandem repeats,
          which associate to form a so-called Bateman domain or a
          CBS pair which is reflected in this model. The
          interface between the two CBS domains forms a cleft
          that is a potential ligand binding site. The CBS pair
          coexists with a variety of other functional domains.
          It has been proposed that the CBS domain may play a
          regulatory role, although its exact function is
          unknown.
          Length = 124

 Score = 29.6 bits (67), Expect = 3.3
 Identities = 18/50 (36%), Positives = 22/50 (44%)

Query: 38 QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAA 87
          +AI   K    DQ P V + G   G+      LSS  +  VQ S PV  A
Sbjct: 15 EAIEILKEKGFDQLPVVDESGKILGMVTLGNLLSSLSSGKVQPSDPVSKA 64


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 31.3 bits (71), Expect = 3.3
 Identities = 24/179 (13%), Positives = 40/179 (22%), Gaps = 20/179 (11%)

Query: 35  QLVQAITARKSSTLDQSPAVSD------------PGMASGVSDQSPPLSSAPANPVQASA 82
           Q         +ST       S             P  A           + P   VQ+  
Sbjct: 240 QSESFSDVWPASTQSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYV 299

Query: 83  PVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           P  +  +PS     AP ++      P  +   +    P   +       A + +      
Sbjct: 300 PQSSDSRPSCFP--APSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTV 357

Query: 143 HHLYPLPFFCDPSY-YGHYLQSAMKASRGQ-----VAQPPPPSESVTPIPLSPVSSDQE 195
                 P     S      L S++  +         +  PP          S       
Sbjct: 358 RAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTP 416



 Score = 31.3 bits (71), Expect = 3.6
 Identities = 29/167 (17%), Positives = 48/167 (28%), Gaps = 13/167 (7%)

Query: 22  LEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQAS 81
           LE +   + +           +  S L  SP+ + P  +   SD  P  + +P    + S
Sbjct: 203 LETRASPLSQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFSDVWPASTQSPRE--ETS 260

Query: 82  APVQAAHQPSATVQAAP--GSSASVLAAPLPSSSGQPFISPPAQSAAFLAQ-PASTASLP 138
           A   A   PS+   +       A       P    Q ++  P  S +  +  PA + + P
Sbjct: 261 AEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYV--PQSSDSRPSCFPAPSTTQP 318

Query: 139 PSAAHHLYPLPFFCDPSYYGHYLQSAMKA-SRGQVAQPPPPSESVTP 184
                   P                  K      V+Q         P
Sbjct: 319 T-----FLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAP 360



 Score = 30.1 bits (68), Expect = 8.2
 Identities = 23/79 (29%), Positives = 31/79 (39%), Gaps = 5/79 (6%)

Query: 64  SDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA----PGSSASVLAAPLPSSSGQPFIS 119
           S QSPP S++P     A  P  AAHQP++ +  A        AS    P P SS  P   
Sbjct: 367 SGQSPP-SNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTPGSSLVPQPD 425

Query: 120 PPAQSAAFLAQPASTASLP 138
                 + +A       + 
Sbjct: 426 EQELGPSVMALLDRDQGIL 444


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 31.4 bits (71), Expect = 3.3
 Identities = 20/85 (23%), Positives = 30/85 (35%), Gaps = 1/85 (1%)

Query: 58  GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPF 117
           G     + Q PP SS P  P   S+ +  +   +   +    S    L  P P+    P 
Sbjct: 179 GGNIPGAIQPPPPSSLPGLPP-GSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPP 237

Query: 118 ISPPAQSAAFLAQPASTASLPPSAA 142
            S PA     + Q  +   +PP   
Sbjct: 238 ASIPAPPIPPVIQYVAPPPVPPPQP 262


>gnl|CDD|234994 PRK01973, PRK01973, septum formation inhibitor; Reviewed.
          Length = 271

 Score = 30.8 bits (70), Expect = 3.3
 Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 9/67 (13%)

Query: 58  GMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPL-------- 109
             A+  +D++   ++  A P  A+A   AA  P+A     P  + +  A           
Sbjct: 108 APAAKAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVID 167

Query: 110 -PSSSGQ 115
            P  SGQ
Sbjct: 168 RPLRSGQ 174


>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
           subunit E2; Reviewed.
          Length = 411

 Score = 31.3 bits (72), Expect = 3.4
 Identities = 13/50 (26%), Positives = 21/50 (42%)

Query: 73  APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPA 122
           A A     +AP   A +P+    AA  ++ +  AAP   ++     SP  
Sbjct: 82  AEAAAAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPAAPAAAAAKASPAV 131


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 30.6 bits (69), Expect = 3.5
 Identities = 28/117 (23%), Positives = 42/117 (35%), Gaps = 16/117 (13%)

Query: 16  AKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPA 75
           A+R+   E+  +K    + +L + I     + L    A S     S       P    PA
Sbjct: 47  AQRVLIQEQALKKASTQIKELEKRI-----AILQAGEAGS-GSFLSNAFKWGTP--QEPA 98

Query: 76  NPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS---SSGQPFISPPAQSAAFLA 129
            P  A  P + A  PS        S A     P P+   +    F+    Q+AA +A
Sbjct: 99  PPANAPPPKEPAAPPS-----WRSSPAGPTTQPSPAAANTRSSSFLGGALQTAAGVA 150


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 30.8 bits (69), Expect = 3.6
 Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 15/85 (17%)

Query: 163 SAMKASRGQVAQPPPPSESVTPIPLSPVSS-----------DQEDFSEEDEVVDCNPPAL 211
           S   A   QV    PP++  TP+P  P ++           D+ D  + DE    NP ++
Sbjct: 101 STPVARAPQVIDTAPPNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESV 160

Query: 212 FSFA----PSTKEREPSIPDPDSEL 232
            + A            + P   S L
Sbjct: 161 PTRAARETTEANRPTATPPQSSSAL 185



 Score = 29.7 bits (66), Expect = 8.6
 Identities = 17/75 (22%), Positives = 27/75 (36%), Gaps = 4/75 (5%)

Query: 68  PPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAF 127
            P     A P   + PV +   P AT +AA  +      A    +   P  +P +     
Sbjct: 107 APQVIDTAPPNDVATPVPSV--PEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTR- 163

Query: 128 LAQPASTASLPPSAA 142
            A   +T +  P+A 
Sbjct: 164 -AARETTEANRPTAT 177


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score = 31.3 bits (71), Expect = 3.6
 Identities = 19/154 (12%), Positives = 37/154 (24%), Gaps = 9/154 (5%)

Query: 42  ARKSSTLDQSPAVSDPGMASGVSDQSPPLS--SAPANPVQASAPVQAAHQPSATVQAAPG 99
            +      QS   +    +   +      +  SA    +          Q    +  A  
Sbjct: 5   KKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDQAAT 64

Query: 100 SSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPSYYGH 159
           +  +  A     S   P      + ++      +  + P    + L P+  F D      
Sbjct: 65  AMHNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPK-PTNPLVPVDLFEDQPPPIS 123

Query: 160 YLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSD 193
            L              PP + +      S  S  
Sbjct: 124 DLFLPPP------PIVPPLTTNFVGSEQSNCSPK 151


>gnl|CDD|223400 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
           replication, recombination, and repair].
          Length = 638

 Score = 31.2 bits (71), Expect = 3.7
 Identities = 30/195 (15%), Positives = 54/195 (27%), Gaps = 24/195 (12%)

Query: 85  QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHH 144
           Q    P A+V+A   +S  + A   PS   +  I     + +  ++  + +   P++   
Sbjct: 332 QQGLIPPASVEAPKSASQPLPAFQEPSPLPESRIQKSKVAKSGSSKSDAPSIAEPASGA- 390

Query: 145 LYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVV 204
             P P           +         +            P        + E  +  ++V 
Sbjct: 391 -SPSPASPSIRPLSKNILPESSPGSLKNEDRSYDDLLEEPAESEDKQEEAEQKAISEDV- 448

Query: 205 DCNPPALFSFAPSTK-----EREPSIPDPDSELASQGVVCQKLGSPAWNRVRYTESQRAL 259
               P   +           E E  +      L  Q             R+ Y + +  L
Sbjct: 449 ---FPLGEAIGQVHGTYILAEHEDGL-----VLVDQHAAH--------ERILYEKLKNEL 492

Query: 260 QAGGVFQPLAIPSEF 274
              G  QPL IP   
Sbjct: 493 GNVGELQPLLIPIRL 507


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 30.7 bits (69), Expect = 3.8
 Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 171 QVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPD 227
            V QP    +S +     P SSD++D   ED + D  PP LF+     KE+     D
Sbjct: 140 GVRQPGRTEKSGSDF-WEPPSSDEDDSDSEDSMSDLYPPELFTLKNPGKEQNGDEDD 195


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 155

 Score = 29.8 bits (68), Expect = 4.1
 Identities = 19/74 (25%), Positives = 28/74 (37%), Gaps = 17/74 (22%)

Query: 63  VSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAA---------PL---- 109
           +S  +    +  A    A APV AA   +A   AAP ++ +  AA         P+    
Sbjct: 32  ISRAAAAPVAPVAQQAAA-APVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGTF 90

Query: 110 ---PSSSGQPFISP 120
              PS    PF+  
Sbjct: 91  YRAPSPDAPPFVEV 104


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 30.8 bits (69), Expect = 4.1
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 66  QSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPF---ISPPA 122
           Q+  L+       Q+    Q+  Q + T QAAP  +        P+S+ QP+   +  PA
Sbjct: 158 QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP--RQSKPASTQQPYQDLLQTPA 215

Query: 123 QSAAFLAQPASTASLPPSAA 142
            +    AQ     + P + A
Sbjct: 216 HTT---AQSKPQQAAPVTRA 232


>gnl|CDD|238824 cd01646, RT_Bac_retron_I, RT_Bac_retron_I: Reverse transcriptases
           (RTs) in bacterial retrotransposons or retrons. The
           polymerase reaction of this enzyme leads to the
           production of a unique RNA-DNA complex called msDNA
           (multicopy single-stranded (ss)DNA) in which a small
           ssDNA branches out from a small ssRNA molecule via a
           2'-5'phosphodiester linkage. Bacterial retron RTs
           produce cDNA corresponding to only a small portion of
           the retron genome.
          Length = 158

 Score = 30.0 bits (68), Expect = 4.2
 Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 616 YLDDFLLV-NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
           Y+DD  +  +      EI  +L    L  LG  +NL K+ + P P     G      L  
Sbjct: 88  YVDDIRIFADSKEEAEEILEELK-EFLAELGLSLNLSKTEILPLP----EGTASKDFLGY 142


>gnl|CDD|212508 cd11649, RsmI_like, Uncharacterized subfamily of the tetrapyrrole
           methylase family similar to ribosomal RNA small subunit
           methyltransferase I (RsmI).  Tetrapyrrole methylase uses
           S-AdoMet (S-adenosyl-L-methionine or SAM) in the
           methylation of diverse substrates. This uncharacterized
           subfamily exhibits sequence similarity to the ribosomal
           RNA small subunit methyltransferase I (RsmI), which
           catalyzes the 2-O-methylation of the ribose of cytidine
           1402 (C1402) in 16S rRNA.
          Length = 229

 Score = 30.2 bits (69), Expect = 4.7
 Identities = 20/68 (29%), Positives = 25/68 (36%), Gaps = 26/68 (38%)

Query: 52  PAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPS 111
           P V+DPG                      +  V  AHQ    V    G S S+L A L +
Sbjct: 91  PGVADPG----------------------AELVALAHQLGIRVVPLVGPS-SILLA-LMA 126

Query: 112 S--SGQPF 117
           S  +GQ F
Sbjct: 127 SGLNGQSF 134


>gnl|CDD|148635 pfam07139, DUF1387, Protein of unknown function (DUF1387).  This
           family represents a conserved region approximately 300
           residues long within a number of hypothetical proteins
           of unknown function that seem to be restricted to
           mammals.
          Length = 301

 Score = 30.4 bits (68), Expect = 4.9
 Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 19/133 (14%)

Query: 40  ITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ---------- 89
           +T +K +   +S    +    S   +++ P     A P      V   H           
Sbjct: 3   VTGKKKNKKKKSKPKPEAPAKSASKEETTPE--EQAAPGDEKDEVNGFHANGSADDTESV 60

Query: 90  -------PSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
                   SA++ A    + ++ A P PSSS    +S         + P S+A   PS+ 
Sbjct: 61  DSLSEGLDSASLDAREPEAVTLDAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSD 120

Query: 143 HHLYPLPFFCDPS 155
            H     +   PS
Sbjct: 121 QHKNAKKYVSKPS 133


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 30.5 bits (69), Expect = 5.0
 Identities = 20/86 (23%), Positives = 30/86 (34%), Gaps = 6/86 (6%)

Query: 65  DQSPPLSSAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQS 124
           D++ P +   A P   S    A      T +    +       P+     +P+ISP    
Sbjct: 206 DEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVPKPYISPDEYP 265

Query: 125 AAFLAQPASTASLPPSAAHHLYPLPF 150
           +A L  P     L PS       +PF
Sbjct: 266 SAPLP-PELPQLLQPS-----LVIPF 285


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 30.8 bits (70), Expect = 5.2
 Identities = 14/76 (18%), Positives = 21/76 (27%), Gaps = 5/76 (6%)

Query: 30  ERCLAQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ 89
           ER +   +    A        +PA S     +     +      PA    A AP      
Sbjct: 256 ERIVRAFLTWADAAAQP-ATAAPAPSRMTDTNDSKSVTS----QPAAAAAAPAPPPNPPA 310

Query: 90  PSATVQAAPGSSASVL 105
                 A  G  ++ L
Sbjct: 311 TPPEPPARRGRGSAAL 326


>gnl|CDD|235737 PRK06197, PRK06197, short chain dehydrogenase; Provisional.
          Length = 306

 Score = 30.4 bits (69), Expect = 5.3
 Identities = 17/54 (31%), Positives = 20/54 (37%), Gaps = 4/54 (7%)

Query: 85  QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLP 138
            AA   +  V A PG S + LA  LP              A  LAQ     +LP
Sbjct: 197 AAAGATTIAVAAHPGVSNTELARNLPR----ALRPVATVLAPLLAQSPEMGALP 246


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 30.3 bits (68), Expect = 5.6
 Identities = 29/166 (17%), Positives = 51/166 (30%), Gaps = 12/166 (7%)

Query: 73  APANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPA 132
           AP     +   V  A +      AA   +++  A   P   G+P  + P   A  +    
Sbjct: 281 APGALRGSEYVVDEALEREPAELAAAAVASAASAVG-PVGPGEP--NQPDDVAEAVKAEV 337

Query: 133 STASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSS 192
           +  +   +A   +            G    +  + S   + +  P  +     P +    
Sbjct: 338 AEVTDEVAAESVVQV------ADRDGESTPAVEETSEADIEREQP-GDLAGQAPAAHQVD 390

Query: 193 DQEDFSEEDEVVDCNPPALFSFAPSTKEREPSIPDP--DSELASQG 236
            +   +  +E       A     P   E+   IPDP    ELA  G
Sbjct: 391 AEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAG 436


>gnl|CDD|131471 TIGR02418, acolac_catab, acetolactate synthase, catabolic.
           Acetolactate synthase (EC 2.2.1.6) combines two
           molecules of pyruvate to yield 2-acetolactate with the
           release of CO2. This reaction may be involved in either
           valine biosynthesis (biosynthetic) or conversion of
           pyruvate to acetoin and possibly to 2,3-butanediol
           (catabolic). The biosynthetic type, described by
           TIGR00118, is also capable of forming
           acetohydroxybutyrate from pyruvate and 2-oxobutyrate for
           isoleucine biosynthesis. The family described here, part
           of the same larger family of thiamine
           pyrophosphate-dependent enzymes (pfam00205, pfam02776)
           is the catabolic form, generally found associated with
           in species with acetolactate decarboxylase and usually
           found in the same operon. The model may not encompass
           all catabolic acetolactate synthases, but rather one
           particular clade in the larger TPP-dependent enzyme
           family [Energy metabolism, Fermentation].
          Length = 539

 Score = 30.5 bits (69), Expect = 5.6
 Identities = 37/136 (27%), Positives = 54/136 (39%), Gaps = 24/136 (17%)

Query: 33  LAQLVQAITARKSSTLDQS-PAVSD--PGMASGVSDQSPPLSSAPANPVQASAPVQAAHQ 89
           +AQ V  IT +    L  S P  S+   G+A+  S+  P +  A    V+ +  ++  HQ
Sbjct: 51  MAQAVGRITGKPGVALVTSGPGCSNLVTGLATANSEGDPVV--AIGGQVKRADLLKLTHQ 108

Query: 90  P-------------SATVQAAPGSSASVLAAPLPSSSGQP---FISPPAQSAAFLAQPAS 133
                         SA VQ     S  V  A   + SG+P   F+S P      +  P S
Sbjct: 109 SMDNVALFRPITKYSAEVQDPDALSEVVANAFRAAESGKPGAAFVSLPQD---VVDSPVS 165

Query: 134 TASLPPSAAHHLYPLP 149
             ++P S A  L   P
Sbjct: 166 VKAIPASYAPKLGAAP 181


>gnl|CDD|178145 PLN02530, PLN02530, histidine-tRNA ligase.
          Length = 487

 Score = 30.5 bits (69), Expect = 5.8
 Identities = 21/106 (19%), Positives = 28/106 (26%), Gaps = 23/106 (21%)

Query: 96  AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
            A  SSA+ ++A        P       S    A          SAA             
Sbjct: 1   TAIASSATRVSASAIRRPSLP--LSSRCSFLLSASSPRGGRCAASAA------------- 45

Query: 156 YYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEED 201
                  +    S G  A P    +    I ++P      DF  ED
Sbjct: 46  -------AGGGRSGGTTAPPSVQEDGKPKIDVNP-PKGTRDFPPED 83


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 29.4 bits (66), Expect = 5.9
 Identities = 17/50 (34%), Positives = 22/50 (44%), Gaps = 5/50 (10%)

Query: 72  SAPANPVQASAPVQAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISPP 121
           SA    V   AP  A   PS     AP   A    AP P+S+G+  ++ P
Sbjct: 46  SAVQEQVPTPAPAPAPAVPS-----APTPVAPAAPAPAPASAGENVVTAP 90



 Score = 29.4 bits (66), Expect = 6.2
 Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 4/55 (7%)

Query: 60  ASGVSDQSPPLSSAPANPVQASAPVQ-AAHQPSATVQAAPGSSA---SVLAAPLP 110
           A G+      +      P  A AP   +A  P A    AP  ++   +V+ AP+P
Sbjct: 38  AKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.3 bits (68), Expect = 6.4
 Identities = 30/116 (25%), Positives = 44/116 (37%), Gaps = 6/116 (5%)

Query: 71  SSAPANPVQASAPVQAAHQPSA-TVQAAPGSSASVLA-APLPSSSGQPFISPPAQSAAFL 128
           S A +  V+  AP     QP      A P +  +  A  P P+S+  P   PP   +A L
Sbjct: 383 SPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPL 442

Query: 129 AQPASTASLPPSAAHHLYPLPFFCDPSYYGHYLQSAMKASRGQVAQPPPPSESVTP 184
             P+  AS P + A     +      S+ G ++      SR Q  Q      + T 
Sbjct: 443 -PPSPQASAPRNVASGKPGVDL---GSWQGKFMNFTRNGSRKQPVQASSSDAAQTG 494


>gnl|CDD|233595 TIGR01845, outer_NodT, efflux transporter, outer membrane factor
           (OMF) lipoprotein, NodT family.  Members of This model
           comprise a subfamily of the Outer Membrane Factor (TCDB
           1.B.17) porins. OMF proteins operate in conjunction with
           a primary transporter of the RND, MFS, ABC, or PET
           systems, and a MFP (membrane fusion protein) to tranport
           substrates across membranes. The complex thus formed
           allows transport (export) of various solutes (heavy
           metal cations; drugs, oligosaccharides, proteins, etc.)
           across the two envelopes of the Gram-negative bacterial
           cell envelope in a single energy-coupled step. Current
           data suggest that the OMF (and not the MFP) is largely
           responsible for the formation of both the trans-outer
           membrane and trans-periplasmic channels. The roles
           played by the MFP have yet to be determined [Cellular
           processes, Detoxification, Transport and binding
           proteins, Porins].
          Length = 454

 Score = 30.5 bits (69), Expect = 6.5
 Identities = 27/107 (25%), Positives = 43/107 (40%), Gaps = 15/107 (14%)

Query: 29  IERCLAQLVQAITA------------RKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPAN 76
           ++  +AQ   A+ A             +   LDQ P      + S +  + P + +A   
Sbjct: 227 LDVQIAQARNALAALLGKGPSRGLAIARPLLLDQLPPDLPLSLPSDLLRRRPDIRAAERR 286

Query: 77  PVQASAPV---QAAHQPSATVQAAPGSSASVLAAPLPSSSGQPFISP 120
              A+A +   +AA  PS T+ A+ G SAS L+      S    I P
Sbjct: 287 LAAANAQIGVAKAAFFPSITLSASIGLSASQLSRLFDGGSRFWSIGP 333


>gnl|CDD|117527 pfam08961, DUF1875, Domain of unknown function (DUF1875).  The MIT
           domain, found in Nuclear receptor-binding factor 2, has
           no known function.
          Length = 243

 Score = 29.8 bits (66), Expect = 7.3
 Identities = 19/71 (26%), Positives = 27/71 (38%), Gaps = 8/71 (11%)

Query: 348 KRAECLESRRRLVEPRDPHLASLLL------RARRGKKSSSPQNLEPPGRVSLKVQTLQK 401
           K  E  ++   L   RD H+  LLL      RA+R ++  + Q  E      L  QT   
Sbjct: 2   KLTESEQAHLSLELQRDSHMKQLLLIQERWKRAKREERLKAQQATEQDAARHL--QTSHP 59

Query: 402 PQRCSSPVNPP 412
           P    +    P
Sbjct: 60  PSVEDAEGQSP 70


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 28.5 bits (64), Expect = 7.8
 Identities = 23/93 (24%), Positives = 25/93 (26%), Gaps = 13/93 (13%)

Query: 64  SDQSPP--LSSAPANPVQASAPVQAAHQPSAT---VQAAPGSSASVLAAPLPSSSGQPFI 118
           S  SPP         PV    P      P  +   V   P   A+    P P   G    
Sbjct: 27  SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPG---- 82

Query: 119 SPPAQSAAFLAQPASTASLPPSAAH----HLYP 147
                S      PA    LPP        H YP
Sbjct: 83  PAGPPSPLAPPAPARKPPLPPPRPQRRLLHTYP 115


>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
          Length = 336

 Score = 29.9 bits (67), Expect = 8.2
 Identities = 20/64 (31%), Positives = 24/64 (37%), Gaps = 5/64 (7%)

Query: 65  DQSPPLSSAPANPVQA-SAPVQAAHQPSATV-QAAPGSSASVLAAPLPSSSGQPFISPPA 122
                L + P     A SA   A    +A V QAAP        A LP     P ++P A
Sbjct: 261 AAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDV---PQVAPVA 317

Query: 123 QSAA 126
             AA
Sbjct: 318 APAA 321


>gnl|CDD|223009 PHA03211, PHA03211, serine/threonine kinase US3; Provisional.
          Length = 461

 Score = 29.9 bits (67), Expect = 8.3
 Identities = 18/101 (17%), Positives = 28/101 (27%), Gaps = 20/101 (19%)

Query: 2   ASNPKRDTSDPQPEAKRLCHLEEKTEKIERCLAQLVQAITARKSSTLDQSPAVSDPGMAS 61
             +        + EA RLC ++E        LA++             +    S      
Sbjct: 51  PEHDPPSPHGARDEAARLCQIQE-------LLAEMRS----------SEEYPDSGAEAED 93

Query: 62  GVSDQSPPLSSAP---ANPVQASAPVQAAHQPSATVQAAPG 99
              D +P   + P   A            H P+    A PG
Sbjct: 94  DDDDDAPDDVAYPDEYAEDDFLPGDGAPDHDPAPCGPAPPG 134


>gnl|CDD|215184 PLN02325, PLN02325, nudix hydrolase.
          Length = 144

 Score = 28.7 bits (64), Expect = 8.4
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 361 EPRDPHLASLLLRARRGKKSSSPQNLEP 388
           EP+  H  ++ +RA     S  PQNLEP
Sbjct: 81  EPKPSHYVTVFMRAVLADPSQVPQNLEP 108


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 29.4 bits (66), Expect = 8.6
 Identities = 21/109 (19%), Positives = 40/109 (36%), Gaps = 2/109 (1%)

Query: 34  AQLVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSAT 93
           + L +A +  + +T ++   V+ P    G+ +  P +      P  A+A  QAA  P A 
Sbjct: 94  SDLQKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAP 153

Query: 94  VQAAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
            +    S      A    +           +AA   +P +  +  P+  
Sbjct: 154 AKPRAKSPRPAAKAAPKPTETITAKKAKKTAAA--PKPTADKTATPAKK 200


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 29.5 bits (66), Expect = 9.4
 Identities = 29/107 (27%), Positives = 42/107 (39%), Gaps = 3/107 (2%)

Query: 36  LVQAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQ 95
           L Q      +STL   PA   P   + V     P   A   PV+A A  + A +P A   
Sbjct: 116 LSQLNNVAVTSTLPTEPATVAPVRNASVPTAERP---AITRPVRAQAVSEPAVEPKAAKT 172

Query: 96  AAPGSSASVLAAPLPSSSGQPFISPPAQSAAFLAQPASTASLPPSAA 142
           A    +    A+P  + +  P     A S    + P+S  +L  SA+
Sbjct: 173 ATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYTLQLSAS 219


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.133    0.399 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 55,937,345
Number of extensions: 5552964
Number of successful extensions: 6571
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5864
Number of HSP's successfully gapped: 327
Length of query: 1097
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 990
Effective length of database: 6,191,724
Effective search space: 6129806760
Effective search space used: 6129806760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.6 bits)