RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4
         (150 letters)



>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 23/109 (21%), Positives = 30/109 (27%), Gaps = 14/109 (12%)

Query: 30  GHSRASPAPAPRPTPATPA--------PVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
           G +   P  AP   P  P             Q     P  A       +   A TA GVA
Sbjct: 89  GGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPGSFLGGAAQTAAGVA 148

Query: 82  IGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTG 130
            G  +G+ +      +F G      A    A        G   A +   
Sbjct: 149 GGMLLGNGL----ENLFGGHSQ--PAEIVDAIGEGGDGSGPAPADDTGI 191


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 39.2 bits (92), Expect = 4e-04
 Identities = 26/111 (23%), Positives = 34/111 (30%)

Query: 21  GRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
               +  +   +  +PA A     A PAP  A    PAP  AP          A  A   
Sbjct: 397 AAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSP 456

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
              +A       A       + +   AP A  APA A    +  AA     
Sbjct: 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAD 507



 Score = 38.0 bits (89), Expect = 0.001
 Identities = 30/111 (27%), Positives = 40/111 (36%)

Query: 21  GRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
           G + +P +   S A+ APA  P PA  AP  A    PA    P   P       + AG  
Sbjct: 389 GGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNA 448

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
             G A       A +   + + +    P A  APAP       +A     A
Sbjct: 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 28/116 (24%), Positives = 35/116 (30%)

Query: 16  KMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAA 75
                   P+P S G    +  PA    PA PA         AP  A       +A    
Sbjct: 593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEH 652

Query: 76  TAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
               VA+  A     G       +   +   APA  A  APA    +Q A  P   
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708



 Score = 33.4 bits (77), Expect = 0.030
 Identities = 22/108 (20%), Positives = 32/108 (29%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
           P+P     + A  + AP P  A P        +P         P      A  A   A  
Sbjct: 627 PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPA 686

Query: 84  SAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
            A       A     + + +          PA    Q +Q A+ P+ A
Sbjct: 687 PAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPA 734



 Score = 33.4 bits (77), Expect = 0.034
 Identities = 23/102 (22%), Positives = 28/102 (27%), Gaps = 2/102 (1%)

Query: 27  VSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAV 86
           + R    A  A AP     + A            AAP          AA        +  
Sbjct: 381 LERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAA--PAPAAAPQPAPAPAPA 438

Query: 87  GHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEP 128
                 A      G+ S   A A  A PAPA     +  A P
Sbjct: 439 PAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP 480



 Score = 33.0 bits (76), Expect = 0.039
 Identities = 28/124 (22%), Positives = 37/124 (29%)

Query: 8   PSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           P+P ++  +      S  P       A+PA    P    PA   A P   +   AP    
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAA 649

Query: 68  GLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAE 127
                        + G                      AAPAAPA  APA    + +A  
Sbjct: 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709

Query: 128 PTGA 131
           P G 
Sbjct: 710 PAGQ 713



 Score = 32.7 bits (75), Expect = 0.060
 Identities = 21/127 (16%), Positives = 31/127 (24%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
                   +         +  P +     A  APAP    A PA   A P          
Sbjct: 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH 653

Query: 65  QGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQS 124
                +   +    G    +              +   +   A  A  APAPA    +  
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713

Query: 125 AAEPTGA 131
           A +P   
Sbjct: 714 ADDPAAQ 720



 Score = 31.5 bits (72), Expect = 0.13
 Identities = 24/108 (22%), Positives = 29/108 (26%), Gaps = 18/108 (16%)

Query: 20  RGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGG 79
              +P+   +     +PAPAP           A    PA   + Q  P   A    TA  
Sbjct: 421 AAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP 480

Query: 80  VAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAE 127
                A                    A  AAPAAPA          A 
Sbjct: 481 APAPPAAP------------------APAAAPAAPAAPAAPAGADDAA 510



 Score = 31.5 bits (72), Expect = 0.14
 Identities = 28/111 (25%), Positives = 34/111 (30%), Gaps = 1/111 (0%)

Query: 21  GRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
           G +P         A  +  P    A PA   A     AP  A        A  A   G  
Sbjct: 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVA 648

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
           A      H      +    G  +K A  AAPAAP PA    + +A      
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAK-AGGAAPAAPPPAPAPAAPAAPAGAAP 698



 Score = 31.1 bits (71), Expect = 0.20
 Identities = 24/132 (18%), Positives = 33/132 (25%), Gaps = 5/132 (3%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
                 V++     +    P     G    + A    P    PAP  A P  PA  A  Q
Sbjct: 641 AAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700

Query: 65  QGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDS-----KEAAPAAPAAPAPAYQ 119
             P   A   A               G +     +          +  P    APA    
Sbjct: 701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPP 760

Query: 120 QGSQSAAEPTGA 131
             + + A    A
Sbjct: 761 PPAPAPAAAPAA 772



 Score = 30.7 bits (70), Expect = 0.27
 Identities = 23/100 (23%), Positives = 27/100 (27%), Gaps = 16/100 (16%)

Query: 18  VRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATA 77
                +P P        +P       PA  AP       P+   AP          A   
Sbjct: 423 PAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAP 482

Query: 78  GGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPA 117
              A  +                     AAPAAPAAPA A
Sbjct: 483 APPAAPAPAAA----------------PAAPAAPAAPAGA 506



 Score = 30.3 bits (69), Expect = 0.34
 Identities = 27/129 (20%), Positives = 37/129 (28%), Gaps = 2/129 (1%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
           G    P  +          P+  +   + A+PAPA        A     P + AP   P+
Sbjct: 596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPK 655

Query: 65  QGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEA--APAAPAAPAPAYQQGS 122
                 A           G A       A       + +  A   PA   A  P   Q  
Sbjct: 656 HVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQAD 715

Query: 123 QSAAEPTGA 131
             AA+P  A
Sbjct: 716 DPAAQPPQA 724



 Score = 30.3 bits (69), Expect = 0.40
 Identities = 18/124 (14%), Positives = 24/124 (19%), Gaps = 2/124 (1%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
            P           +    +P +   + A  APA     A   P  A    P    A    
Sbjct: 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA 718

Query: 67  PGLMAQMAATAGGVAIGSAVG--HTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQS 124
                     +                                PA   A APA       
Sbjct: 719 AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778

Query: 125 AAEP 128
            +E 
Sbjct: 779 PSEE 782



 Score = 27.3 bits (61), Expect = 4.0
 Identities = 13/106 (12%), Positives = 18/106 (16%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
                     A+P       PA   P  AQ       AA    P             A  
Sbjct: 697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPA 756

Query: 84  SAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPT 129
                          +       +     A   A     +   +  
Sbjct: 757 QPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802



 Score = 27.3 bits (61), Expect = 4.1
 Identities = 14/55 (25%), Positives = 16/55 (29%)

Query: 19  RRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQM 73
               +PS        A+P P   P PA PA         AP A         A  
Sbjct: 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 23/109 (21%), Positives = 27/109 (24%), Gaps = 10/109 (9%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIG- 91
            A   PAP        P  A    P   A P       A  A    G A   A       
Sbjct: 585 EAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPA 644

Query: 92  ---------HAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
                         +   SD  +  PA     APA    + + A P   
Sbjct: 645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAP 693


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 37.2 bits (87), Expect = 0.002
 Identities = 18/91 (19%), Positives = 27/91 (29%), Gaps = 5/91 (5%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIG 91
           +   P P   P  A PA        P    AP Q P +    A+             T  
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421

Query: 92  -----HAVTGMFSGSDSKEAAPAAPAAPAPA 117
                  +      + +K++ PAA +   P 
Sbjct: 422 LLAARQQLQRAQGATKAKKSEPAAASRARPV 452


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 36.1 bits (83), Expect = 0.004
 Identities = 39/115 (33%), Positives = 43/115 (37%), Gaps = 2/115 (1%)

Query: 3    RRGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAA 62
            R GR+    S   +  RR   P+  S       P P P P PA  A V A P  P P AA
Sbjct: 2669 RLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728

Query: 63   PQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPA 117
             Q  P L A  A  A  V  G A             +      A PAAPAA  P 
Sbjct: 2729 RQASPALPAAPAPPA--VPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781



 Score = 31.8 bits (72), Expect = 0.13
 Identities = 29/139 (20%), Positives = 40/139 (28%), Gaps = 8/139 (5%)

Query: 4    RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMA-- 61
            R  +  P +     VRR   P+      S A P   P   P   AP   QP    P    
Sbjct: 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928

Query: 62   ---APQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDS---KEAAPAAPAAPA 115
                P   P     +A T      G   G      +  +  G  +         AP+  A
Sbjct: 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988

Query: 116  PAYQQGSQSAAEPTGACAW 134
            PA      +    +   +W
Sbjct: 2989 PASSTPPLTGHSLSRVSSW 3007



 Score = 29.5 bits (66), Expect = 0.79
 Identities = 29/125 (23%), Positives = 41/125 (32%), Gaps = 4/125 (3%)

Query: 4    RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
            R  +   V+S  +      SP   +   +      A  P  A+PA     PT   P  AP
Sbjct: 2781 RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-TAP 2839

Query: 64   QQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQ 123
               PG         G VA G  V        +   +   +  A P       PA  + ++
Sbjct: 2840 PPPPGPPPPSLPLGGSVAPGGDVRR---RPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896

Query: 124  SAAEP 128
            S A P
Sbjct: 2897 SFALP 2901



 Score = 27.2 bits (60), Expect = 3.9
 Identities = 29/105 (27%), Positives = 33/105 (31%), Gaps = 12/105 (11%)

Query: 37   APAPRPTPATPA-----------PVQAQPTMPAPMAAPQQGPGLMAQMAATAG-GVAIGS 84
            APAP   PA PA           P  A P  PAP AAP  GP       A A    +  S
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797

Query: 85   AVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPT 129
                              +     A+PA P P       +A  P 
Sbjct: 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842



 Score = 26.8 bits (59), Expect = 5.2
 Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 1/76 (1%)

Query: 3   RRGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRP-TPATPAPVQAQPTMPAPMA 61
           R   +P           R  +P P S      +P PA  P  PATP P     +   P  
Sbjct: 389 RHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAP 448

Query: 62  APQQGPGLMAQMAATA 77
            P++ P   A   A  
Sbjct: 449 PPERQPPAPATEPAPD 464



 Score = 26.4 bits (58), Expect = 8.2
 Identities = 16/94 (17%), Positives = 23/94 (24%)

Query: 22   RSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
            +S  P +    R  P     P+P  P      P  P+P  A  +                
Sbjct: 2594 QSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653

Query: 82   IGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPA 115
               A G             +    + P  P   A
Sbjct: 2654 DDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687



 Score = 26.1 bits (57), Expect = 9.4
 Identities = 21/112 (18%), Positives = 35/112 (31%)

Query: 7    SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
            +P+P +        G    P     +   PAPAP   PA   P +      A ++  ++ 
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797

Query: 67   PGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAY 118
                   A     V   +A           +   + ++  AP  P  P P  
Sbjct: 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 35.8 bits (83), Expect = 0.005
 Identities = 28/126 (22%), Positives = 42/126 (33%), Gaps = 1/126 (0%)

Query: 4   RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
              + +   +  K   R  + +P +   ++A+ APAP   PA  A   A P   A   AP
Sbjct: 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPP-AAAPPAP 425

Query: 64  QQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQ 123
              P   A  AA A   A  +        A     +        PA  +A        + 
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485

Query: 124 SAAEPT 129
           +   PT
Sbjct: 486 ARLTPT 491



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 19/93 (20%), Positives = 25/93 (26%), Gaps = 1/93 (1%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
           P+  +   + A      RP  A PA         AP  A        +  AA        
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP-AAAASAPAAPPAAAPPA 424

Query: 84  SAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAP 116
                          +   +   APA PA  AP
Sbjct: 425 PVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAP 457



 Score = 26.6 bits (59), Expect = 6.1
 Identities = 27/102 (26%), Positives = 33/102 (32%), Gaps = 10/102 (9%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGH 92
            A+   A      TPA  +A     AP+A     P   A  AA A   A   A       
Sbjct: 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAP---- 422

Query: 93  AVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACAW 134
                     +  AA A  AAPA A    + + A P  A   
Sbjct: 423 ------PAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPE 458


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 36.0 bits (83), Expect = 0.005
 Identities = 24/110 (21%), Positives = 34/110 (30%), Gaps = 4/110 (3%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
           P+P +   + A+PAPA  P     AP  A        A  ++ P   A  AA        
Sbjct: 385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444

Query: 84  SAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACA 133
                          +   +     AA   P  A    + + A P  A A
Sbjct: 445 GGAPA----PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA 490



 Score = 34.5 bits (79), Expect = 0.016
 Identities = 33/102 (32%), Positives = 38/102 (37%), Gaps = 1/102 (0%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGH 92
            A+ AP  +P PA  AP  A P   AP AAP   P   A   A A   A  S     +  
Sbjct: 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAA 435

Query: 93  AVTGMFSGSDSKEAAPAAPAA-PAPAYQQGSQSAAEPTGACA 133
           A      G     A   APAA PA A +  +        A A
Sbjct: 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAA 477



 Score = 30.2 bits (68), Expect = 0.39
 Identities = 33/139 (23%), Positives = 39/139 (28%), Gaps = 15/139 (10%)

Query: 6   RSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPAT------PAPVQAQPTMPAP 59
           RSP+P +         R P          + APA    PA        A   A P   AP
Sbjct: 426 RSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485

Query: 60  MAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEA---------APAA 110
            AAP                 A  +        A     S  D   A         APA 
Sbjct: 486 AAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAP 545

Query: 111 PAAPAPAYQQGSQSAAEPT 129
            AAPAP     ++    P 
Sbjct: 546 AAAPAPRAAAATEPVVAPR 564


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 35.5 bits (82), Expect = 0.006
 Identities = 16/56 (28%), Positives = 18/56 (32%)

Query: 26  PVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
           P     +   PA AP   PA PA   A    PAP       P   +   A A    
Sbjct: 392 PTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAG 447



 Score = 27.0 bits (60), Expect = 4.2
 Identities = 23/104 (22%), Positives = 27/104 (25%), Gaps = 31/104 (29%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIG 91
             A   P+      TPA   A P   AP   P       A   A                
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPA---------------- 422

Query: 92  HAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACAWE 135
                          APA PAA AP  +    +AA  +    W 
Sbjct: 423 --------------PAPAPPAAAAPPARSADPAAA-ASAGDRWR 451


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 34.8 bits (80), Expect = 0.013
 Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPA-TPAPVQAQPTMPAPMAAPQQGP 67
           SPSP     +    AP     PA TP  V   P    P+  P   P
Sbjct: 399 SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAP 444



 Score = 27.0 bits (60), Expect = 4.1
 Identities = 8/41 (19%), Positives = 15/41 (36%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
           P+   +  + A+ +P+P  + A   P   Q         P 
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426


>gnl|CDD|235571 PRK05704, PRK05704, dihydrolipoamide succinyltransferase;
           Validated.
          Length = 407

 Score = 33.7 bits (78), Expect = 0.027
 Identities = 30/98 (30%), Positives = 36/98 (36%), Gaps = 6/98 (6%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHA 93
           A+ A A     A  AP QAQ    A  +     P   A+  A   G+   +  G   G  
Sbjct: 84  AAAAAAAAAAAAAAAPAQAQAAAAAEQSNDALSPA--ARKLAAENGLDASAVKGTGKGGR 141

Query: 94  VTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
           VT      D   A  AA AAPA        +A  P GA
Sbjct: 142 VTK----EDVLAALAAAAAAPAAPAAAAPAAAPAPLGA 175



 Score = 32.9 bits (76), Expect = 0.050
 Identities = 24/97 (24%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 27  VSRGHSRASPAPAP-RPTPATPAPVQAQPTMPAPMAAPQQGPG---LMAQMAATAGGVAI 82
           +  G +  + A A      A  AP QAQ    A  +     P    L A+    A  V  
Sbjct: 76  IDEGAAAGAAAAAAAAAAAAAAAPAQAQAAAAAEQSNDALSPAARKLAAENGLDASAVK- 134

Query: 83  GSAVGHTI--GHAVTGMFSGSDSKEAAPAAPAAPAPA 117
           G+  G  +     +  + + + +  A  AA  A APA
Sbjct: 135 GTGKGGRVTKEDVLAALAAAAAAPAAPAAAAPAAAPA 171



 Score = 28.6 bits (65), Expect = 1.2
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 9/77 (11%)

Query: 57  PAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAP 116
           PAP A      G+++++ A  G       VG  +G    G  +G+ +  AA AA AA AP
Sbjct: 49  PAPAA------GVLSEILAEEGDTV---TVGQVLGRIDEGAAAGAAAAAAAAAAAAAAAP 99

Query: 117 AYQQGSQSAAEPTGACA 133
           A  Q + +A +   A +
Sbjct: 100 AQAQAAAAAEQSNDALS 116


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 33.7 bits (78), Expect = 0.030
 Identities = 26/108 (24%), Positives = 33/108 (30%), Gaps = 27/108 (25%)

Query: 21  GRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
           G + +P +   + A+ A AP   PA  AP    P  PA  A         A     A   
Sbjct: 39  GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAP--------AAPPKPAAAA 90

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEP 128
           A  +A                    AAP A AA A       +    P
Sbjct: 91  AAAAA-------------------PAAPPAAAAAAAPAAAAVEDEVTP 119



 Score = 29.9 bits (68), Expect = 0.51
 Identities = 16/72 (22%), Positives = 26/72 (36%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
           +P+  ++         + +P ++  +  +PAP     PA P    A     A  AAP   
Sbjct: 43  APTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAA 102

Query: 67  PGLMAQMAATAG 78
               A  AA   
Sbjct: 103 AAAAAPAAAAVE 114



 Score = 28.3 bits (64), Expect = 2.0
 Identities = 21/99 (21%), Positives = 23/99 (23%), Gaps = 32/99 (32%)

Query: 35  SPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAV 94
            P     PT A  A   A     A  AA            A A                 
Sbjct: 37  GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAA------PAPA----------------- 73

Query: 95  TGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACA 133
                      AA A  A P PA    + +A     A A
Sbjct: 74  ---------PPAAAAPAAPPKPAAAAAAAAAPAAPPAAA 103


>gnl|CDD|114268 pfam05537, DUF759, Borrelia burgdorferi protein of unknown function
           (DUF759).  This family consists of several
           uncharacterized proteins from the Lyme disease
           spirochete Borrelia burgdorferi.
          Length = 439

 Score = 33.5 bits (76), Expect = 0.032
 Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 7/41 (17%)

Query: 60  MAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSG 100
           M A  QG GL+A+       +AIGSA+G+ IG+A++ +  G
Sbjct: 125 MLAASQGKGLVAK-------IAIGSALGNIIGNAMSKVGGG 158


>gnl|CDD|148383 pfam06747, CHCH, CHCH domain.  we have identified a conserved motif
           in the LOC118487 protein that we have called the CHCH
           motif. Alignment of this protein with related members
           showed the presence of three subgroups of proteins,
           which are called the S (Small), N (N-terminal extended)
           and C (C-terminal extended) subgroups. All three
           sub-groups of proteins have in common that they contain
           a predicted conserved [coiled coil 1]-[helix 1]-[coiled
           coil 2]-[helix 2] domain (CHCH domain). Within each
           helix of the CHCH domain, there are two cysteines
           present in a C-X9-C motif. The N-group contains an
           additional double helix domain, and each helix contains
           the C-X9-C motif. This family contains a number of
           characterized proteins: Cox19 protein - a nuclear gene
           of Saccharomyces cerevisiae, codes for an 11-kDa protein
           (Cox19p) required for expression of cytochrome oxidase.
           Because cox19 mutants are able to synthesise the
           mitochondrial and nuclear gene products of cytochrome
           oxidase, Cox19p probably functions post-translationally
           during assembly of the enzyme. Cox19p is present in the
           cytoplasm and mitochondria, where it exists as a soluble
           intermembrane protein. This dual location is similar to
           what was previously reported for Cox17p, a low molecular
           weight copper protein thought to be required for
           maturation of the CuA centre of subunit 2 of cytochrome
           oxidase. Cox19p have four conserved potential metal
           ligands, these are three cysteines and one histidine.
           Mrp10 - belongs to the class of yeast mitochondrial
           ribosomal proteins that are essential for translation.
           Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa
           (NDUFA8) subunit. The CHCH domain was previously called
           DUF657.
          Length = 35

 Score = 30.3 bits (69), Expect = 0.046
 Identities = 8/17 (47%), Positives = 9/17 (52%)

Query: 132 CAWEIKQFLQCANTQSD 148
           C  E K FL+C    SD
Sbjct: 1   CGEEFKAFLKCLKENSD 17


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 32.8 bits (74), Expect = 0.059
 Identities = 31/118 (26%), Positives = 40/118 (33%), Gaps = 12/118 (10%)

Query: 6   RSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPA-------TPAPVQAQPTMPA 58
           + P P  + H+   R   P+      +   P  A  PTPA        PAP QA    P 
Sbjct: 38  QKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97

Query: 59  PMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAP 116
           P  APQ     +A         A  SA       A  G  + S   + A     +P P
Sbjct: 98  PAVAPQ-----LAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPPP 150



 Score = 30.9 bits (69), Expect = 0.25
 Identities = 33/127 (25%), Positives = 47/127 (37%), Gaps = 11/127 (8%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTM-PAPMAAPQQ 65
           +  P+   H      + P P    H  AS AP P   P + A  +      P P A+ + 
Sbjct: 24  TNIPIPELHTSAAT-QKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKF 82

Query: 66  GPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAA-PAPAYQQGSQS 124
            P      AA+    A   AV   +  A        D+ EA  +A  A  APA    S +
Sbjct: 83  DPAPAPHQAASR---APDPAVAPQLAAAPK-----PDAAEAFTSAAQAHEAPADAGTSAA 134

Query: 125 AAEPTGA 131
           + +P  A
Sbjct: 135 SKKPDPA 141


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.5 bits (74), Expect = 0.070
 Identities = 17/76 (22%), Positives = 25/76 (32%), Gaps = 2/76 (2%)

Query: 1   MVRRGRSPSPVSSFH-KMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAP 59
            +R+    SP+     +    G+ P     G     P  +  PTP  P     +P   AP
Sbjct: 381 RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGG-PLRPNGLAP 439

Query: 60  MAAPQQGPGLMAQMAA 75
           M A +         A 
Sbjct: 440 MNAVRAPSRNAQNAAQ 455


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 32.5 bits (74), Expect = 0.078
 Identities = 26/131 (19%), Positives = 37/131 (28%), Gaps = 4/131 (3%)

Query: 6   RSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQP-TMPAPMAAPQ 64
           R+ +P ++        R            + A A     +      AQP       +AP 
Sbjct: 422 RAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPA 481

Query: 65  QG--PGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGS 122
               P    + A  A   +  +        A            AAP AP A  P     +
Sbjct: 482 SDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAA 541

Query: 123 QSAAEPTGACA 133
             AA   GA A
Sbjct: 542 -PAARAGGAAA 551



 Score = 31.0 bits (70), Expect = 0.20
 Identities = 22/102 (21%), Positives = 28/102 (27%), Gaps = 3/102 (2%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIG 91
           + A PAP  R   A  A      T     A     P   A  AAT       +       
Sbjct: 377 AGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATA 436

Query: 92  HAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACA 133
                    +D     PA   A A A  +  +  A+P     
Sbjct: 437 DRGD---DAADGDAPVPAKANARASADSRCDERDAQPPADSG 475



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 18/111 (16%), Positives = 25/111 (22%), Gaps = 5/111 (4%)

Query: 21  GRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
           G   +      +  +         A  AP  A         AP   P             
Sbjct: 384 GARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPA-----PPATADR 438

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
              +A G     A     + +DS+     A             S A P  A
Sbjct: 439 GDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489



 Score = 27.1 bits (60), Expect = 5.0
 Identities = 26/108 (24%), Positives = 32/108 (29%), Gaps = 1/108 (0%)

Query: 10  PVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGL 69
           P S          +P   +   +  +  P  R   A           P    A    P  
Sbjct: 480 PASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPA- 538

Query: 70  MAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPA 117
            A  AA AGG A    V    G  V+       +  A PAA  A AP 
Sbjct: 539 AAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPK 586


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 31.8 bits (72), Expect = 0.092
 Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 4/72 (5%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHA 93
           A P   P   P+  +      T P+P AA  +    +     TA GVA G  + + +   
Sbjct: 103 APPPKEPAAPPSWRSSPAGPTTQPSPAAANTRSSSFLGGALQTAAGVAGGMLLANGL--- 159

Query: 94  VTGMFSGSDSKE 105
             GMF    S+E
Sbjct: 160 -EGMFRHHKSEE 170


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 31.9 bits (72), Expect = 0.11
 Identities = 30/116 (25%), Positives = 46/116 (39%), Gaps = 8/116 (6%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGL--MAQMAATAGGVA 81
             P     +RA  A    P PA      A P   AP+++   GP +  + +  A   GV 
Sbjct: 223 SEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVTPLVRKLAKDKGVD 282

Query: 82  IGSAVGHTIGHAV------TGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
           + +  G  +G  +          +  +++ AA A  AA APA    +   AEP  A
Sbjct: 283 LSTVKGTGVGGRIRKQDVLAAAKAAEEARAAAAAPAAAAAPAAPAAAAKPAEPDTA 338



 Score = 30.4 bits (68), Expect = 0.31
 Identities = 29/127 (22%), Positives = 45/127 (35%), Gaps = 18/127 (14%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPATPAPVQA-QPTMPAPMAAPQQGPGLMAQMAATAG--- 78
           +        + +     P P PA  AP  A  P  PAP  A    P   A +++      
Sbjct: 208 AEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPY 267

Query: 79  ------------GVAIGSAVGHTIGHAV--TGMFSGSDSKEAAPAAPAAPAPAYQQGSQS 124
                       GV + +  G  +G  +    + + + + E A AA AAPA A    + +
Sbjct: 268 VTPLVRKLAKDKGVDLSTVKGTGVGGRIRKQDVLAAAKAAEEARAAAAAPAAAAAPAAPA 327

Query: 125 AAEPTGA 131
           AA     
Sbjct: 328 AAAKPAE 334



 Score = 26.5 bits (58), Expect = 6.8
 Identities = 16/43 (37%), Positives = 16/43 (37%), Gaps = 5/43 (11%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPT----MPAPMAAPQQGPG 68
           G   A  AP P   P   AP    PT     PAP A    G G
Sbjct: 83  GSEPAPAAPEPEAAPEPEAP-APAPTPAAEAPAPAAPQAGGSG 124


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 31.9 bits (73), Expect = 0.11
 Identities = 27/110 (24%), Positives = 37/110 (33%), Gaps = 8/110 (7%)

Query: 23  SPSPVSRGHSRASPAPAPRPT--PATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGV 80
            P+  +   ++  PA AP     PA PA   A   +        Q   L+A+ AA A   
Sbjct: 53  PPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPE 112

Query: 81  AIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTG 130
           A   A       A          +  A AA AAP P     + +A     
Sbjct: 113 AQAPAAPAERAAAENAA------RRLARAAAAAPRPRVPADAAAAVADAV 156


>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit.  This
           model describes the bacterial oxaloacetate decarboxylase
           alpha subunit and its equivalents in archaea. The
           oxaloacetate decarboxylase Na+ pump is the paradigm of
           the family of Na+ transport decarboxylases that present
           in bacteria and archaea. It a multi subunit enzyme
           consisting of a peripheral alpha-subunit and integral
           membrane subunits beta and gamma. The energy released by
           the decarboxylation reaction of oxaloacetate is coupled
           to Na+ ion pumping across the membrane [Transport and
           binding proteins, Cations and iron carrying compounds,
           Energy metabolism, Other].
          Length = 582

 Score = 31.3 bits (71), Expect = 0.17
 Identities = 13/61 (21%), Positives = 23/61 (37%), Gaps = 8/61 (13%)

Query: 28  SRGHSRASPA-PAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGG-VAIGSA 85
               S+ + + PA        A   A   + AP+A      G + ++  + G  VA G  
Sbjct: 491 GGDVSQITASAPANTSGGTVAAKAGAGTPVTAPIA------GSIVKVKVSEGQTVAEGEV 544

Query: 86  V 86
           +
Sbjct: 545 L 545


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 31.4 bits (72), Expect = 0.17
 Identities = 13/79 (16%), Positives = 17/79 (21%), Gaps = 1/79 (1%)

Query: 16  KMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAA 75
           +             G   A  APA    P   A   A    PA   A        A    
Sbjct: 389 EGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAG-GPAAALAAVPDAAAAAAAPP 447

Query: 76  TAGGVAIGSAVGHTIGHAV 94
                   +   ++    V
Sbjct: 448 APAAAPQPAVRLNSFEDIV 466



 Score = 26.4 bits (59), Expect = 7.4
 Identities = 6/51 (11%), Positives = 11/51 (21%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMP 57
             +P ++           +        A P  A             QP + 
Sbjct: 408 PGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVR 458


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score = 30.9 bits (71), Expect = 0.19
 Identities = 19/53 (35%), Positives = 20/53 (37%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAV 86
           A+PA A  P  A PA   A    PAP AA        A  AA         AV
Sbjct: 197 AAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKAPHASPAV 249



 Score = 30.6 bits (70), Expect = 0.31
 Identities = 14/39 (35%), Positives = 16/39 (41%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPG 68
           G + A+ APA    PA  A     P   AP AA     G
Sbjct: 79  GAAAAAAAPAAAAAPAAAAAAAPAPAAAAPAAAAAAAGG 117



 Score = 30.2 bits (69), Expect = 0.35
 Identities = 20/48 (41%), Positives = 21/48 (43%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
           A+ APA    PA  AP  A    PAP  A    P   A  AA A G A
Sbjct: 195 AAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKA 242



 Score = 26.3 bits (59), Expect = 8.5
 Identities = 13/55 (23%), Positives = 15/55 (27%), Gaps = 4/55 (7%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGS 84
             + A+ A      PA  A     P   A  A     P      AA  G     S
Sbjct: 196 AAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAP----AAAAAPGKAPHAS 246


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin
          carboxyl carrier protein subunit; Validated.
          Length = 153

 Score = 30.2 bits (68), Expect = 0.23
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 1/30 (3%)

Query: 34 ASPAPAPRP-TPATPAPVQAQPTMPAPMAA 62
           +PAPAP P  P+ P PV      PAP +A
Sbjct: 53 PTPAPAPAPAVPSAPTPVAPAAPAPAPASA 82


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 30.5 bits (69), Expect = 0.29
 Identities = 13/88 (14%), Positives = 19/88 (21%), Gaps = 5/88 (5%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRA-----SPAPAPRPTPATPAPVQAQPTMPAP 59
            + P      +    +G       +    A     +  P P   P  P P+Q        
Sbjct: 193 QQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLS 252

Query: 60  MAAPQQGPGLMAQMAATAGGVAIGSAVG 87
              P   P    Q        A      
Sbjct: 253 QQMPPPPPQPPQQQQQPPQPQAQPPPQN 280


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.7 bits (69), Expect = 0.30
 Identities = 23/94 (24%), Positives = 30/94 (31%), Gaps = 6/94 (6%)

Query: 22  RSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
           R P P       A PA  P P  A P P Q  P   +    P         +A+   GV 
Sbjct: 404 RHPGPAKPEAPGARPAELPSPASA-PTPEQQPPVARSAPLPPSPQASAPRNVASGKPGVD 462

Query: 82  IGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPA 115
           +GS  G                K+   A+ +  A
Sbjct: 463 LGSWQG-----KFMNFTRNGSRKQPVQASSSDAA 491


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 30.4 bits (69), Expect = 0.33
 Identities = 13/67 (19%), Positives = 19/67 (28%), Gaps = 3/67 (4%)

Query: 2   VRRGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPA--TPAPVQAQPTMPAP 59
           V R       +  H+  R           ++ A P    +  P    P     +P  PAP
Sbjct: 68  VHRVNHAPANAQEHEAARPSPQHQYQPP-YASAQPRQPVQQPPEAQVPPQHAPRPAQPAP 126

Query: 60  MAAPQQG 66
               Q  
Sbjct: 127 QPVQQPA 133



 Score = 29.2 bits (66), Expect = 0.76
 Identities = 15/52 (28%), Positives = 19/52 (36%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAA 75
           P    +       APAP+P  + P P Q       P+AAPQ  P        
Sbjct: 138 PEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVM 189


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 30.3 bits (69), Expect = 0.35
 Identities = 23/102 (22%), Positives = 31/102 (30%), Gaps = 2/102 (1%)

Query: 29  RGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMA--ATAGGVAIGSAV 86
            G + A+   A        A   A     A  A      G+ A +   A AG  A  S +
Sbjct: 297 GGGAAAAGGAAAAARGGAAAAGGASSAYSAGAAGGSGAAGVAAGLGGVARAGASAAASPL 356

Query: 87  GHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEP 128
                 A   M S   +   +    A  A A      +AA P
Sbjct: 357 RRAASRAAESMKSSFRAGARSTGGGAGGAAAAAAAGAAAAGP 398



 Score = 30.3 bits (69), Expect = 0.40
 Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 2/74 (2%)

Query: 62  APQQGPGLMAQMAATAGGVAI--GSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQ 119
           APQ G G        AGG A+   +  G   G         + +     AA    + AY 
Sbjct: 266 APQLGAGAAVGTGLAAGGAAVAAAAGAGLAAGGGAAAAGGAAAAARGGAAAAGGASSAYS 325

Query: 120 QGSQSAAEPTGACA 133
            G+   +   G  A
Sbjct: 326 AGAAGGSGAAGVAA 339


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 30.2 bits (68), Expect = 0.36
 Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 26  PVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPG 68
           P +  H+  +P  AP P+ A P P       P P+ AP    G
Sbjct: 388 PANLLHN--APQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASG 428



 Score = 26.4 bits (58), Expect = 7.1
 Identities = 11/44 (25%), Positives = 13/44 (29%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAA 75
           +    AP   P P+  AP       P P       P      AA
Sbjct: 390 NLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAA 433


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 155

 Score = 29.8 bits (68), Expect = 0.37
 Identities = 25/81 (30%), Positives = 29/81 (35%), Gaps = 17/81 (20%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIG 91
           SRA+ AP             A P   AP AA           AA A   A  +A     G
Sbjct: 33  SRAAAAPV----APVAQQAAAAPVAAAPAAAAA---------AAAAPAAAPAAAAAEAEG 79

Query: 92  HAVT----GMFSGSDSKEAAP 108
           H VT    G F  + S +A P
Sbjct: 80  HVVTSPMVGTFYRAPSPDAPP 100


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 30.3 bits (68), Expect = 0.42
 Identities = 13/62 (20%), Positives = 20/62 (32%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGH 92
           + +   A   T  TP    A P+  +   A   G    +    T+G V    A    +  
Sbjct: 260 KGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASGSTQIPTSGSVLGPGAAATELQQ 319

Query: 93  AV 94
            V
Sbjct: 320 VV 321


>gnl|CDD|237793 PRK14703, PRK14703, glutaminyl-tRNA synthetase/YqeY domain fusion
           protein; Provisional.
          Length = 771

 Score = 30.1 bits (68), Expect = 0.43
 Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 14  FHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQ 65
           F++++    +    +R  +R   A AP+ T   P   +A+    A    P+Q
Sbjct: 546 FNRIITLKDTWGARAREAAREKRAAAPKKTAK-PRRSKAEARAEAAALNPEQ 596


>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631).  The
           members of this family are sequences derived from a
           group of hypothetical proteins expressed by certain
           bacterial species. The region concerned is approximately
           440 amino acid residues in length.
          Length = 729

 Score = 29.9 bits (68), Expect = 0.44
 Identities = 11/74 (14%), Positives = 22/74 (29%), Gaps = 1/74 (1%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
            +  +    R+   PA        A    Q    +  A      GL+AQ  +++   A  
Sbjct: 224 GTKATSAGPRSQAPPAASGPSTDDAA-SGQGGQESAEALFAALQGLLAQARSSSAAPAAP 282

Query: 84  SAVGHTIGHAVTGM 97
           +         ++  
Sbjct: 283 AGQAPGSAQPLSTR 296



 Score = 26.9 bits (60), Expect = 4.8
 Identities = 17/69 (24%), Positives = 24/69 (34%)

Query: 58  APMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPA 117
           +     Q  P         A     G      +  A+ G+ + + S  AAPAAPA  AP 
Sbjct: 229 SAGPRSQAPPAASGPSTDDAASGQGGQESAEALFAALQGLLAQARSSSAAPAAPAGQAPG 288

Query: 118 YQQGSQSAA 126
             Q   +  
Sbjct: 289 SAQPLSTRE 297


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 29.7 bits (67), Expect = 0.52
 Identities = 20/85 (23%), Positives = 30/85 (35%), Gaps = 5/85 (5%)

Query: 35  SPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPG--LMAQMAATAGGVAIGSAVGHTIGH 92
           + A  P  +        A     AP AA  +        ++A    G+ + +  G  +  
Sbjct: 80  ATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEH-GIDLSAVPGTGVTG 138

Query: 93  AVTGMFSGSDSKEAAPAAPAAPAPA 117
            VT        K  APA+   PAPA
Sbjct: 139 RVTK--EDIIKKTEAPASAQQPAPA 161



 Score = 28.5 bits (64), Expect = 1.5
 Identities = 23/98 (23%), Positives = 32/98 (32%), Gaps = 12/98 (12%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHA 93
           A PA +            A     A    P   P    ++A   G + + +  G  +   
Sbjct: 83  APPAKSG-EEKEETPAASAAAAPTAAANRPSLSPA-ARRLAKEHG-IDLSAVPGTGVTGR 139

Query: 94  VTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
           VT        KE       APA A QQ + +AA    A
Sbjct: 140 VT--------KEDIIKKTEAPASA-QQPAPAAAAKAPA 168


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 29.7 bits (67), Expect = 0.53
 Identities = 22/102 (21%), Positives = 29/102 (28%), Gaps = 14/102 (13%)

Query: 25  SPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGS 84
             V+       P  A        +   A  T+P     PQ  P    + AAT   +   +
Sbjct: 190 VEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQP----KKAATEEELIADA 245

Query: 85  AVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAA 126
               T          G  +  AA AA   P P  QQ      
Sbjct: 246 KKAAT----------GEPAANAAKAAKPEPMPDDQQKEAEQL 277


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 29.8 bits (67), Expect = 0.55
 Identities = 10/57 (17%), Positives = 15/57 (26%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
           + +P           R  +  +       P       P  P    A P    P +AP
Sbjct: 373 AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429



 Score = 29.0 bits (65), Expect = 0.96
 Identities = 20/95 (21%), Positives = 30/95 (31%), Gaps = 12/95 (12%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAI 82
            P+  +    R +PAP+ RP  A  A +  +     P+      P +  +  A       
Sbjct: 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKE----PVRETATPPPVPPRPVAP------ 419

Query: 83  GSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPA 117
              V HT   A     +     E     P AP   
Sbjct: 420 --PVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKE 452



 Score = 26.3 bits (58), Expect = 7.3
 Identities = 11/62 (17%), Positives = 16/62 (25%), Gaps = 4/62 (6%)

Query: 18  VRRGRSPSPVSRGHSRASPAPAPRP----TPATPAPVQAQPTMPAPMAAPQQGPGLMAQM 73
             R  +  P       A P P              PV  +P    P    ++   L+A  
Sbjct: 402 PVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIADG 461

Query: 74  AA 75
             
Sbjct: 462 DV 463


>gnl|CDD|235658 PRK05972, ligD, ATP-dependent DNA ligase; Reviewed.
          Length = 860

 Score = 29.9 bits (68), Expect = 0.57
 Identities = 14/46 (30%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 19  RRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
            +GR+P P       A  A A R   A  A    +  +P    APQ
Sbjct: 193 GKGRAPKPFMTPKGNAGLAAAARAAAAAAAKKAKKKALPD-FLAPQ 237


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 29.5 bits (66), Expect = 0.62
 Identities = 13/35 (37%), Positives = 14/35 (40%)

Query: 34 ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPG 68
          A P+PA  P P  PA   A  T  A  AA      
Sbjct: 20 APPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAE 54



 Score = 28.7 bits (64), Expect = 1.4
 Identities = 13/44 (29%), Positives = 16/44 (36%)

Query: 23 SPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
               +     A+PAP P    A PA   A P   AP A   + 
Sbjct: 15 EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKK 58



 Score = 26.4 bits (58), Expect = 8.2
 Identities = 9/43 (20%), Positives = 13/43 (30%)

Query: 24 PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
          P+P     + A   P P  T A      A          P++ 
Sbjct: 17 PAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKD 59


>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
           protein.  This model is designed to identify biotin
           carboxyl carrier protein as a peptide of acetyl-CoA
           carboxylase. Scoring below the trusted cutoff is a
           related protein encoded in a region associated with
           polyketide synthesis in the prokaryote Saccharopolyspora
           hirsuta, and a reported chloroplast-encoded biotin
           carboxyl carrier protein that may be highly derived from
           the last common ancestral sequence. Scoring below the
           noise cutoff are biotin carboxyl carrier domains of
           other enzymes such as pyruvate carboxylase.The gene name
           is accB or fabE [Fatty acid and phospholipid metabolism,
           Biosynthesis].
          Length = 156

 Score = 29.0 bits (65), Expect = 0.65
 Identities = 15/68 (22%), Positives = 20/68 (29%)

Query: 41  RPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSG 100
           +   A       Q   P P   P          A  A   A     GH +   + G F  
Sbjct: 34  KAAAAAKKSAVQQAAAPVPAQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGTFYR 93

Query: 101 SDSKEAAP 108
           + S +A P
Sbjct: 94  APSPDAKP 101



 Score = 28.3 bits (63), Expect = 1.0
 Identities = 12/43 (27%), Positives = 18/43 (41%)

Query: 34 ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAAT 76
          A+P PA  P   +          PAP  A ++G  + + M  T
Sbjct: 48 AAPVPAQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGT 90


>gnl|CDD|236999 PRK11854, aceF, pyruvate dehydrogenase dihydrolipoyltransacetylase;
           Validated.
          Length = 633

 Score = 29.2 bits (66), Expect = 0.79
 Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 8/55 (14%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGH 88
           A+PA AP       AP  A     AP AAP       A  A      A   A  H
Sbjct: 283 AAPAAAPAK-QEAAAPAPAAAKAEAPAAAP-------AAKAEGKSEFAENDAYVH 329


>gnl|CDD|173954 cd08195, DHQS, Dehydroquinate synthase (DHQS) catalyzes the
           conversion of DAHP to DHQ in shikimate pathway for
           aromatic compounds synthesis.  Dehydroquinate synthase
           (DHQS) catalyzes the conversion of
           3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) to
           dehydroquinate (DHQ) in the second step of the shikimate
           pathway. This pathway, which involves seven sequential
           enzymatic steps in the conversion of erythrose
           4-phosphate and phosphoenolpyruvate into chorismate for
           subsequent synthesis of aromatic compounds,  is found in
           bacteria, microbial eukaryotes, and plants, but not in
           mammals. Therefore, enzymes of this pathway are
           attractive targets for the development of non-toxic
           antimicrobial compounds, herbicides and anti-parasitic
           agents. The activity of DHQS requires nicotinamide
           adenine dinucleotide (NAD) as cofactor. A single active
           site in DHQS catalyzes five sequential reactions
           involving alcohol oxidation, phosphate elimination,
           carbonyl reduction, ring opening, and intramolecular
           aldol condensation. The binding of substrates and
           ligands induces domain conformational changes. In some
           fungi and protozoa, this domain is fused with the other
           four domains in shikimate pathway and forms a
           penta-domain AROM protein, which catalyzes steps 2-6 in
           the shikimate pathway.
          Length = 345

 Score = 28.9 bits (66), Expect = 1.0
 Identities = 6/8 (75%), Positives = 7/8 (87%)

Query: 87  GHTIGHAV 94
           GHT GHA+
Sbjct: 236 GHTFGHAI 243


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 29.1 bits (65), Expect = 1.0
 Identities = 20/74 (27%), Positives = 29/74 (39%), Gaps = 1/74 (1%)

Query: 2   VRRGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMA 61
           VR G +P+   +  ++   G      + G   A+ AP P   P  P      P    P A
Sbjct: 879 VRVGYAPTLAEALDQVFGPGTGRVATAPG-GDAASAPPPGAGPPAPPQAVPPPRTTQPPA 937

Query: 62  APQQGPGLMAQMAA 75
           AP +GP +     A
Sbjct: 938 APPRGPDVPPAAVA 951


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 28.9 bits (64), Expect = 1.1
 Identities = 18/41 (43%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 28  SRGHS-RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           S GH  R +P P  +PT   PA    QPT+  P   PQQ P
Sbjct: 192 SHGHRHRHAPKPTQQPTVQNPAQ---QPTVQNPAQQPQQQP 229



 Score = 26.9 bits (59), Expect = 5.3
 Identities = 10/30 (33%), Positives = 12/30 (40%)

Query: 38  PAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           PA +PT   PA    Q     P+   QQ  
Sbjct: 212 PAQQPTVQNPAQQPQQQPQQQPVQPAQQPT 241



 Score = 26.2 bits (57), Expect = 9.2
 Identities = 9/46 (19%), Positives = 13/46 (28%)

Query: 22  RSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           + P+  +         PA +P              P P    QQ P
Sbjct: 205 QQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPP 250


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 28.9 bits (65), Expect = 1.1
 Identities = 24/87 (27%), Positives = 31/87 (35%), Gaps = 19/87 (21%)

Query: 36  PAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVT 95
           P   P PTPA PA         A  A P   PGL+++                    A+ 
Sbjct: 535 PDVPPAPTPAEPAAPVVAAAPKAAAATPPAQPGLLSRFFG-----------------ALK 577

Query: 96  GMFSGSDSKEAA--PAAPAAPAPAYQQ 120
            +FSG +  +    PA  A   P  QQ
Sbjct: 578 ALFSGGEETKPQEQPAPKAEAKPERQQ 604


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 28.5 bits (64), Expect = 1.3
 Identities = 15/79 (18%), Positives = 22/79 (27%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAI 82
             +P +      + APAP   P   A V          A P          AA       
Sbjct: 183 DTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQA 242

Query: 83  GSAVGHTIGHAVTGMFSGS 101
           G +      +A+   F+  
Sbjct: 243 GVSTPAADPNALVMNFTAD 261


>gnl|CDD|223941 COG1009, NuoL, NADH:ubiquinone oxidoreductase subunit 5 (chain
           L)/Multisubunit Na+/H+ antiporter, MnhA subunit [Energy
           production and conversion / Inorganic ion transport and
           metabolism].
          Length = 606

 Score = 28.8 bits (65), Expect = 1.3
 Identities = 9/27 (33%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 68  GLMAQMAATAGGVAIGSAVGHTIGHAV 94
           G M  +A   G +A  +A+ H + HA 
Sbjct: 308 GYMF-VALGLGSLAYEAAIFHLVTHAF 333


>gnl|CDD|215932 pfam00465, Fe-ADH, Iron-containing alcohol dehydrogenase. 
          Length = 323

 Score = 28.3 bits (64), Expect = 1.3
 Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 3/30 (10%)

Query: 72  QMAATAGGVAIGSA---VGHTIGHAVTGMF 98
            +A+T  G+A  +A     H + HA+ G+F
Sbjct: 234 LLASTLAGLAFSNAGLGAAHALAHALGGLF 263


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 28.5 bits (63), Expect = 1.4
 Identities = 14/68 (20%), Positives = 21/68 (30%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
           GR+  P ++  +      +P       +    A  P   P  P P       PAP   P+
Sbjct: 732 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPR 791

Query: 65  QGPGLMAQ 72
             P     
Sbjct: 792 GAPTPQPP 799



 Score = 28.5 bits (63), Expect = 1.6
 Identities = 15/68 (22%), Positives = 22/68 (32%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
           GR+  P ++  +      +P       +    A  P   P    P  A P  P P   PQ
Sbjct: 722 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ 781

Query: 65  QGPGLMAQ 72
             P    +
Sbjct: 782 APPAPQQR 789



 Score = 28.1 bits (62), Expect = 2.2
 Identities = 34/122 (27%), Positives = 45/122 (36%), Gaps = 5/122 (4%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTP-ATPAPVQAQPTMPAPMAAP 63
           GR+  P ++  +      +P   +R  + A  AP P+P P A PAP Q     P P   P
Sbjct: 742 GRARPPAAAPGRARPPAAAPGR-ARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP 800

Query: 64  QQGPG---LMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQ 120
           Q GP    LM + A    G           G    G  S          A A P P+   
Sbjct: 801 QAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGS 860

Query: 121 GS 122
           G+
Sbjct: 861 GT 862


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
          family of proteins is found in bacteria, eukaryotes and
          viruses. Proteins in this family are typically between
          126 and 281 amino acids in length. The function of this
          domain is unknown. The structure of this domain has
          been found to contain five helices with a long flexible
          loop between helices one and two.
          Length = 124

 Score = 27.7 bits (62), Expect = 1.5
 Identities = 16/48 (33%), Positives = 21/48 (43%)

Query: 31 HSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAG 78
           + A+PAPA  P  A  A   A P    P AA      ++  +AA  G
Sbjct: 13 AAAAAPAPAAAPATAPAAAAAAAPAATPPAAASVDVAAVLDALAAKKG 60


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 28.2 bits (63), Expect = 1.6
 Identities = 14/64 (21%), Positives = 24/64 (37%), Gaps = 5/64 (7%)

Query: 2   VRRGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASP-----APAPRPTPATPAPVQAQPTM 56
            R   +P+ + S   +      P P S G SR+S        +   + ATP   ++  + 
Sbjct: 195 PRSLSNPTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST 254

Query: 57  PAPM 60
           P   
Sbjct: 255 PTSG 258


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.3 bits (63), Expect = 1.8
 Identities = 13/54 (24%), Positives = 19/54 (35%)

Query: 25   SPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAG 78
            S   R   +    PA     A   P  A+   PA + + Q+    M + A   G
Sbjct: 1325 SLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIG 1378



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 24/113 (21%), Positives = 36/113 (31%), Gaps = 6/113 (5%)

Query: 11   VSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPT-MPAPMAA----PQQ 65
            VS   ++ ++G      +   +   PA A +  PAT    Q   T M  P  A    P++
Sbjct: 1324 VSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEK 1383

Query: 66   GPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPA-APAAPAPA 117
                M          ++            +   SGS S E       A P P 
Sbjct: 1384 KVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKPRPQ 1436


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 28.2 bits (63), Expect = 1.8
 Identities = 16/93 (17%), Positives = 21/93 (22%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHT 89
           G     P   P  T       +  P  P    A  + P          GG  +       
Sbjct: 658 GVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLP 717

Query: 90  IGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGS 122
                    S   +   AP  P  P      G+
Sbjct: 718 AAAGPAPCGSSLIASPTAPPEPEPPGAEQADGA 750


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 28.0 bits (62), Expect = 2.2
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 5/53 (9%)

Query: 7   SPSPVSSFHKM----VRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPT 55
           S +  SS   +    V RG SPS +S  + R++      P+P   A   A P 
Sbjct: 145 SDTSTSSSGALPSTSVVRGFSPSHISSSY-RSTAQLNKAPSPTKSAEPTAAPQ 196



 Score = 26.1 bits (57), Expect = 8.8
 Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 2   VRRGRSPSPVSSFHK-MVRRGRSPSPVSRGHSRASPAPAPRP 42
           V RG SPS +SS ++   +  ++PSP       A+P   P  
Sbjct: 160 VVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQAKPEL 201


>gnl|CDD|227832 COG5545, COG5545, Predicted P-loop ATPase and inactivated
           derivatives [General function prediction only].
          Length = 517

 Score = 28.0 bits (62), Expect = 2.2
 Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 8/61 (13%)

Query: 1   MVRRGRSPSPV------SSFHKMVRRGRSPSPVSRGHSRASPAP--APRPTPATPAPVQA 52
           M   GR P+        +  +K  R     S  +R     + AP  AP+ TP T A  + 
Sbjct: 441 MKELGRDPTDGEPCVEDTWTNKRQRWPNFKSREARWVREQTAAPFGAPQETPPTEANEEG 500

Query: 53  Q 53
           Q
Sbjct: 501 Q 501


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 9/52 (17%), Positives = 13/52 (25%)

Query: 32  SRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG 83
              +  PA       P P         P    +    L   +   A   A+G
Sbjct: 289 KSVTSQPAAAAAAPAPPPNPPATPPEPPARRGRGSAALWFVVVVLACAAAVG 340


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 27.8 bits (62), Expect = 2.7
 Identities = 24/107 (22%), Positives = 29/107 (27%), Gaps = 15/107 (14%)

Query: 34  ASPAPAPRPTPA---------TPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGS 84
           A P  A  P P             P   Q     P  +P   P     + A   G  I +
Sbjct: 55  APPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQT 114

Query: 85  AVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
             G      V  M          PA      PA Q+  Q   +  GA
Sbjct: 115 EPGQLYPVQVPVM------VTQNPANSPLDQPAQQRALQQLQQRYGA 155


>gnl|CDD|130609 TIGR01546, GAPDH-II_archae, glyceraldehyde-3-phosphate
           dehydrogenase, type II.  This model describes the type
           II glyceraldehyde-3-phosphate dehydrogenases which are
           limited to archaea. These enzymes catalyze the
           interconversion of 1,3-diphosphoglycerate and
           glyceraldehyde-3-phosphate, a central step in glycolysis
           and gluconeogenesis. In archaea, either NAD or NADP may
           be utilized as the cofactor. The class I GAPDH's from
           bacteria and eukaryotes are covered by TIGR01534. All of
           the members of the seed are characterized. See, for
           instance. This model is very solid, there are no species
           falling between trusted and noise at this time. The
           closest relatives scoring in the noise are the class I
           GAPDH's.
          Length = 333

 Score = 27.5 bits (61), Expect = 2.9
 Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 5/30 (16%)

Query: 17  MVRRGRSPSPVSRGHSRASPAPAPRPTPAT 46
           MVRR   P+ V +G     P  A  P P T
Sbjct: 162 MVRRAADPNDVKKG-----PINAIVPDPVT 186


>gnl|CDD|216334 pfam01153, Glypican, Glypican. 
          Length = 559

 Score = 27.7 bits (62), Expect = 2.9
 Identities = 10/28 (35%), Positives = 10/28 (35%)

Query: 19  RRGRSPSPVSRGHSRASPAPAPRPTPAT 46
           R  RSP   S         P  RPT A 
Sbjct: 346 RSARSPEEASFKKRFRKYTPEERPTTAA 373


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 27.3 bits (61), Expect = 3.1
 Identities = 11/41 (26%), Positives = 15/41 (36%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
           + +P     + + P P     P  PAP    P  P   A P
Sbjct: 147 AAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADP 187


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.5 bits (61), Expect = 3.2
 Identities = 12/62 (19%), Positives = 22/62 (35%), Gaps = 5/62 (8%)

Query: 4   RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
            G+   P+++ +   R G    PV+      + +PA +PT     P    P         
Sbjct: 123 YGKQAGPITNPNVKRRTGVGLPPVA-----PAASPALKPTANGKRPSSKPPKSIMSPEVK 177

Query: 64  QQ 65
            +
Sbjct: 178 VK 179


>gnl|CDD|223182 COG0104, PurA, Adenylosuccinate synthase [Nucleotide transport and
           metabolism].
          Length = 430

 Score = 27.2 bits (61), Expect = 3.3
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 76  TAGGVAIGSAVGHTIGHAVTGMF 98
           TAGGVA GS VG T    V G+F
Sbjct: 245 TAGGVATGSGVGPTKIDYVIGVF 267


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 27.5 bits (61), Expect = 3.4
 Identities = 19/98 (19%), Positives = 22/98 (22%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHA 93
            +  P   P     APV          A  +    L     A         A        
Sbjct: 225 GTEVPLLAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSG 284

Query: 94  VTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGA 131
                        A AA A PA   +      A P GA
Sbjct: 285 TAAAEPAPKEAAKAAAAAAKPALEDKPRPLGIARPGGA 322


>gnl|CDD|224496 COG1580, FliL, Flagellar basal body-associated protein [Cell
           motility and secretion].
          Length = 159

 Score = 26.9 bits (60), Expect = 3.5
 Identities = 10/50 (20%), Positives = 13/50 (26%), Gaps = 10/50 (20%)

Query: 69  LMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAY 118
           L+  +   A   A                 S S+  EA  A       AY
Sbjct: 22  LIVLIVLLALAGAGYFFW----------FGSKSEDAEAESAQTPVEILAY 61


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 23/115 (20%), Positives = 30/115 (26%), Gaps = 17/115 (14%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQ 64
              P+  S           PS  S     +SP+ +  P   +       P  P     P 
Sbjct: 176 DADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPS 235

Query: 65  QGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQ 119
             PG  A        V                    S +K   P+A A PAP   
Sbjct: 236 PPPGPAAPPPPPVQQV-----------------PPLSTAKPTPPSASATPAPIGG 273


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
          virus readthrough protein. This is generated via a
          readthrough of open reading frame 3 a coat protein
          allowing transcription of open reading frame 5 to give
          an extended coat protein with a large c-terminal
          addition or read through domain. The readthrough
          protein is thought to play a role in the circulative
          aphid transmission of potato leaf roll virus. Also in
          the family is open reading frame 6 from beet western
          yellows virus and potato leaf roll virus both
          luteovirus and an unknown protein from cucurbit
          aphid-borne yellows virus a closterovirus.
          Length = 460

 Score = 27.4 bits (61), Expect = 3.6
 Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 32 SRASPAPAPRPTPATPAPVQAQPT 55
              P P P PTP  PAP   +PT
Sbjct: 3  GEPGPKPGPTPTPPPPAP-TPEPT 25


>gnl|CDD|113156 pfam04375, HemX, HemX.  This family consists of several bacterial
          HemX proteins. The hemX gene is not essential for haem
          synthesis in B. subtilis. HemX is a polytopic membrane
          protein which by an unknown mechanism down-regulates
          the level of HemA.
          Length = 372

 Score = 27.2 bits (60), Expect = 3.6
 Identities = 13/53 (24%), Positives = 18/53 (33%)

Query: 34 ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAV 86
          + PA     T  T +PV   P   +  A       L+  +   AGG   G   
Sbjct: 2  SVPAETASQTTVTSSPVAPGPAAKSGGATGLAALALLVALGLGAGGWYFGLQQ 54


>gnl|CDD|236126 PRK07899, rpsA, 30S ribosomal protein S1; Reviewed.
          Length = 486

 Score = 27.3 bits (61), Expect = 3.7
 Identities = 14/49 (28%), Positives = 20/49 (40%)

Query: 85  AVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTGACA 133
           A       A T     + + EAA AA ++ + +    S S AE  G  A
Sbjct: 422 AEAERRWEAHTAQVEKAAAAEAAAAAASSESSSSASSSSSEAEAGGTLA 470


>gnl|CDD|234565 PRK00002, aroB, 3-dehydroquinate synthase; Reviewed.
          Length = 358

 Score = 27.0 bits (61), Expect = 3.8
 Identities = 6/8 (75%), Positives = 7/8 (87%)

Query: 87  GHTIGHAV 94
           GHT GHA+
Sbjct: 243 GHTFGHAI 250


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 27.3 bits (60), Expect = 4.0
 Identities = 16/95 (16%), Positives = 26/95 (27%), Gaps = 10/95 (10%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAP---------MAAPQQGPGLMAQMAATAGGVAIG 83
            A+    P  +    AP   +                   +         + T+  +A  
Sbjct: 523 TAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAA 582

Query: 84  SAVGHTIGHAVTGMFSG-SDSKEAAPAAPAAPAPA 117
           +  G     A+ G          A PA  + PAPA
Sbjct: 583 AGQGSDTAEALAGAIETLLTQASAQPAGLSLPAPA 617


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 26.8 bits (59), Expect = 4.4
 Identities = 10/33 (30%), Positives = 12/33 (36%)

Query: 31  HSRASPAPAPRPTPATPAPVQAQPTMPAPMAAP 63
           +  A    A RP  A     +  P  PAP  A 
Sbjct: 129 NQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161



 Score = 25.7 bits (56), Expect = 9.8
 Identities = 11/46 (23%), Positives = 16/46 (34%), Gaps = 1/46 (2%)

Query: 20  RGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQ 65
           R +      +G    + + APR     P     Q + PAP     Q
Sbjct: 114 RPQGDDQGGQGGGNYNQS-APRQQAQRPQQAAQQQSRPAPQQPAPQ 158


>gnl|CDD|226830 COG4395, COG4395, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 281

 Score = 27.1 bits (60), Expect = 4.4
 Identities = 13/71 (18%), Positives = 21/71 (29%), Gaps = 9/71 (12%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHT 89
           G   ++  P+ +  PA  A         AP A P    G++  +A        G   G  
Sbjct: 1   GTRSSTATPSTQTAPAAAAA--------APAARPSFFSGMLGGLAGGLLMGLSGMFFGGL 52

Query: 90  IGHAVTGMFSG 100
               +      
Sbjct: 53  FF-GIAAFLIF 62


>gnl|CDD|119350 cd02871, GH18_chitinase_D-like, GH18 domain of Chitinase D (ChiD). 
           ChiD, a chitinase found in Bacillus circulans,
           hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine
           in chitin and chitodextrins.  The domain architecture of
           ChiD includes a catalytic glycosyl hydrolase family 18
           (GH18) domain, a chitin-binding domain, and a
           fibronectin type III domain. The chitin-binding and
           fibronectin type III domains are located either
           N-terminal or C-terminal to the catalytic domain.  This
           family includes exochitinase Chi36 from Bacillus cereus.
          Length = 312

 Score = 26.9 bits (60), Expect = 4.4
 Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 1/65 (1%)

Query: 46  TPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKE 105
           T  P+        P+ A +   GL A  +A  GG    S V   +   + G   GS    
Sbjct: 223 TGFPIAGND-RFPPLPADKVVIGLPASPSAAGGGYVSPSEVIKALDCLMKGTNCGSYYPA 281

Query: 106 AAPAA 110
               +
Sbjct: 282 GGYPS 286


>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 702

 Score = 26.9 bits (59), Expect = 4.5
 Identities = 24/121 (19%), Positives = 31/121 (25%), Gaps = 11/121 (9%)

Query: 10  PVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGL 69
           PV    +       P P         P P P P P            P P   PQ    L
Sbjct: 400 PVEVISQPAMVEPEPEPEPEPEPEPEPEPEPEPEPE-----------PEPEPEPQPNQDL 448

Query: 70  MAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPT 129
           M         + + SAV       +   F     ++             +    S AEP 
Sbjct: 449 MVFDPNHHELIGLESAVVQETVSVLEEDFIPVPEQKLVQVQAETQVKQIEPEPASTAEPI 508

Query: 130 G 130
           G
Sbjct: 509 G 509


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 767

 Score = 27.1 bits (60), Expect = 4.6
 Identities = 16/43 (37%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 34  ASPAPAPRPTP-ATPAPVQAQPTMPAPMAAPQQGPGLMAQMAA 75
           A PAPAP P     PAP  A    PA + AP     L  ++ +
Sbjct: 59  APPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGDLRGELQS 101



 Score = 26.7 bits (59), Expect = 6.2
 Identities = 15/45 (33%), Positives = 17/45 (37%)

Query: 34 ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAG 78
          A  A A    P  PAP+ A    PAP AA    P  +    A   
Sbjct: 50 AVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAPGAIGD 94


>gnl|CDD|233374 TIGR01357, aroB, 3-dehydroquinate synthase.  This model represents
           3-dehydroquinate synthase, the enzyme catalyzing the
           second of seven steps in the shikimate pathway of
           chorismate biosynthesis. Chorismate is the last common
           intermediate in the biosynthesis of all three aromatic
           amino acids [Amino acid biosynthesis, Aromatic amino
           acid family].
          Length = 344

 Score = 26.8 bits (60), Expect = 4.7
 Identities = 7/13 (53%), Positives = 8/13 (61%)

Query: 87  GHTIGHAVTGMFS 99
           GHTIGHA+     
Sbjct: 233 GHTIGHAIEAEAG 245


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 27.0 bits (60), Expect = 5.0
 Identities = 11/56 (19%), Positives = 16/56 (28%)

Query: 4  RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAP 59
             S S   S      R R PS  S  H+ +        +P+ P+           
Sbjct: 19 SSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQT 74


>gnl|CDD|225766 COG3225, GldG, ABC-type uncharacterized transport system involved
           in gliding motility, auxiliary component [Cell motility
           and secretion].
          Length = 538

 Score = 26.7 bits (59), Expect = 5.1
 Identities = 11/65 (16%), Positives = 21/65 (32%), Gaps = 1/65 (1%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTM-PAPMAAP 63
             +  P     ++  +   P+P +   S  +          +PA + A   +  A MA  
Sbjct: 393 NANLVPSGKPEELAAKIPGPAPSAFIASPENDQEGKSDRENSPAILVADSDLLSAYMANQ 452

Query: 64  QQGPG 68
            Q   
Sbjct: 453 SQQVV 457


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 26.8 bits (59), Expect = 5.4
 Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 1/56 (1%)

Query: 11  VSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPV-QAQPTMPAPMAAPQQ 65
           V +F K V R ++    +    R+   PA  P      P+ +   +   P+  P  
Sbjct: 253 VRNFAKTVWRQKNTPAQTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAM 308


>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
          Length = 509

 Score = 26.7 bits (60), Expect = 5.4
 Identities = 10/43 (23%), Positives = 16/43 (37%)

Query: 44  PATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIGSAV 86
           P  P  V A P   A   A ++     A        +A+ +A+
Sbjct: 369 PPPPIQVSAAPAAAAAAPAAKEEEKKPASPWRKYALMALAAAL 411


>gnl|CDD|240324 PTZ00234, PTZ00234, variable surface protein Vir12; Provisional.
          Length = 433

 Score = 26.9 bits (59), Expect = 5.7
 Identities = 17/94 (18%), Positives = 32/94 (34%), Gaps = 10/94 (10%)

Query: 37  APAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATAGGVAIG--SAVGHTIGHAV 94
           + +P+  P+  +P   +P   A + +P   P    + A   G  A    ++V  +    V
Sbjct: 242 SSSPKTAPSKGSPGAQEPAKTAVVKSPGGIPSAGGKRAGAEGAGANKGLASVAPSQSREV 301

Query: 95  TGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEP 128
                   ++   P AP   +   Q G       
Sbjct: 302 LNG-----TQNPVPVAPLGSS---QAGVHMPHGS 327


>gnl|CDD|223414 COG0337, AroB, 3-dehydroquinate synthetase [Amino acid transport
           and metabolism].
          Length = 360

 Score = 26.8 bits (60), Expect = 5.8
 Identities = 6/8 (75%), Positives = 7/8 (87%)

Query: 87  GHTIGHAV 94
           GHT GHA+
Sbjct: 244 GHTFGHAI 251


>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
           secretion system, contains a FHA domain [Intracellular
           trafficking, secretion, and vesicular    transport;
           Signal transduction mechanisms].
          Length = 430

 Score = 26.7 bits (59), Expect = 5.9
 Identities = 19/111 (17%), Positives = 30/111 (27%), Gaps = 3/111 (2%)

Query: 5   GRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMP--APMAA 62
           G S  P+S+  +       P P     S +  + AP   PA+   +          P  A
Sbjct: 159 GFSADPLSA-LQSESLIAQPDPAGGAPSISRNSEAPLSDPASAGGIDTPFPDDLDVPPIA 217

Query: 63  PQQGPGLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAA 113
                       + +       AV       +       DS +A      A
Sbjct: 218 ASPPGPQEGPTISASQPAQRDHAVLAPFLRGLGVSLPDLDSDDAEAFLEEA 268


>gnl|CDD|140267 PTZ00240, PTZ00240, 60S ribosomal protein P0; Provisional.
          Length = 323

 Score = 26.5 bits (58), Expect = 5.9
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 99  SGSDSKEAAPAAPAAPAPAYQQGSQSAAE 127
            GS + EAA AAPAA + A ++  + + E
Sbjct: 286 GGSAAAEAAAAAPAAASAAAKEEEEESDE 314


>gnl|CDD|226338 COG3817, COG3817, Predicted membrane protein [Function unknown].
          Length = 313

 Score = 26.6 bits (59), Expect = 6.0
 Identities = 14/43 (32%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 68  GLMAQMAATAGGVAIGSAVGHTIGHAVTGMFSGSDSKEAAPAA 110
            ++ QM A  G V   + VG  I H +      +DS+  A AA
Sbjct: 168 AILPQMLAALGAVFASAGVGDVIAH-LISEIIPADSRFIAVAA 209


>gnl|CDD|216685 pfam01761, DHQ_synthase, 3-dehydroquinate synthase.  The
           3-dehydroquinate synthase EC:4.6.1.3 domain is present
           in isolation in various bacterial 3-dehydroquinate
           synthases and also present as a domain in the
           pentafunctional AROM polypeptide. 3-dehydroquinate (DHQ)
           synthase catalyzes the formation of dehydroquinate (DHQ)
           and orthophosphate from 3-deoxy-D-arabino heptulosonic 7
           phosphate. This reaction is part of the shikimate
           pathway which is involved in the biosynthesis of
           aromatic amino acids.
          Length = 259

 Score = 26.3 bits (59), Expect = 6.4
 Identities = 6/8 (75%), Positives = 7/8 (87%)

Query: 87  GHTIGHAV 94
           GHT GHA+
Sbjct: 180 GHTFGHAI 187


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 26.4 bits (58), Expect = 6.4
 Identities = 12/43 (27%), Positives = 19/43 (44%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAAT 76
           A+P  AP PTP+ PA   +  +     A  ++ P   +  A  
Sbjct: 89  ATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATA 131


>gnl|CDD|236234 PRK08312, PRK08312, putative indolepyruvate oxidoreductase subunit
           B; Reviewed.
          Length = 510

 Score = 26.5 bits (59), Expect = 6.4
 Identities = 17/68 (25%), Positives = 23/68 (33%), Gaps = 5/68 (7%)

Query: 1   MVRRGRSPSPVSSFHKMVRRG-----RSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPT 55
           +   G  P     F  ++R        S    + G + A    A  P PA  AP   +P 
Sbjct: 174 LAGSGALPFAREQFEAVIRASGKGVKASLRAFAAGFAAAQQGGAAPPPPAAAAPTAPRPR 233

Query: 56  MPAPMAAP 63
            PA  A  
Sbjct: 234 PPALAARA 241


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 26.7 bits (59), Expect = 6.4
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 26 PVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAA 62
                   +  PAP  +   PA     P   A  AA
Sbjct: 7  AKEEPAKEEATPPAPAASAPAPAAAAPAPVAAAAPAA 43


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 26.2 bits (58), Expect = 6.4
 Identities = 9/44 (20%), Positives = 13/44 (29%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           P    +       +P     PA P    A      P+ +   GP
Sbjct: 117 PKREVKPVEPRPASPFENTAPARPTSSTATAAASKPVTSVSSGP 160


>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
          Length = 537

 Score = 26.4 bits (58), Expect = 6.6
 Identities = 8/53 (15%), Positives = 16/53 (30%)

Query: 7   SPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAP 59
           S +    +H+      SP  +        P   P P P +   + +   +   
Sbjct: 50  SGTTARCWHRFKDWEGSPDTLLWNKRYHHPIVTPLPPPPSSPSLPSSLLLDHF 102


>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
            synthase PfaA.  Members of the seed for this alignment
            are involved in omega-3 polyunsaturated fatty acid
            biosynthesis, such as the protein PfaA from the
            eicosapentaenoic acid biosynthesis operon in
            Photobacterium profundum strain SS9. PfaA is encoded
            together with PfaB, PfaC, and PfaD, and the functions of
            the individual polypeptides have not yet been described.
            More distant homologs of PfaA, also included with the
            reach of this model, appear to be involved in
            polyketide-like biosynthetic mechanisms of
            polyunsaturated fatty acid biosynthesis, an alternative
            to the more familiar iterated mechanism of chain
            extension and desaturation, and in most cases are encoded
            near genes for homologs of PfaB, PfaC, and/or PfaD.
          Length = 2582

 Score = 26.5 bits (58), Expect = 6.6
 Identities = 28/109 (25%), Positives = 41/109 (37%), Gaps = 8/109 (7%)

Query: 23   SPSPVSRGHSRASPAPAPRPTPATPAPV-QAQPTMPAPMAAPQQGPGLMAQMAATAGGVA 81
            + +PV +     +P      + A  APV  A  + P   AAP Q     A   A    V 
Sbjct: 1124 TQAPVIKSVVTQAPVVQVTISVAPAAPVLPAVVSPPVVSAAPAQS-VATAVAMAPVAEVP 1182

Query: 82   IGSAVGHTIGHAVTGMFSGSDSKEAAPAAPAAPAPAYQQGSQSAAEPTG 130
            I   V  ++       +  S ++ AAP A    +   Q   +  AE TG
Sbjct: 1183 IAVPVQQSVD------YMPSVAQAAAPQASVNDSAIQQVMMEVVAEKTG 1225


>gnl|CDD|222783 PHA00350, PHA00350, putative assembly protein.
          Length = 399

 Score = 26.4 bits (58), Expect = 6.8
 Identities = 9/40 (22%), Positives = 12/40 (30%), Gaps = 3/40 (7%)

Query: 30  GHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGL 69
             S  +PA     TP+    V        P    +  P L
Sbjct: 252 SDSVKTPAEHQAQTPSGVPTVSGPAR---PAQTGKPAPVL 288


>gnl|CDD|185045 PRK15088, PRK15088, PTS system mannose-specific transporter
           subunits IIAB; Provisional.
          Length = 322

 Score = 26.1 bits (58), Expect = 7.2
 Identities = 6/34 (17%), Positives = 7/34 (20%)

Query: 33  RASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQG 66
           +A  A         PA     P    P       
Sbjct: 127 KALKAKPVEKAAPAPAAAAPAPAAATPAKPMGPN 160


>gnl|CDD|111576 pfam02691, VacA, Vacuolating cyotoxin.  This family consists of
           Vacuolating cyotoxin proteins form Proteobacteria. These
           proteins are an important virulence determinate in H.
           pylori and induce cytoplasmic vacuolation in a variety
           of mammalian cell lines.
          Length = 1002

 Score = 26.6 bits (58), Expect = 7.4
 Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 6/43 (13%)

Query: 73  MAATAGGVAIGSAVGHTIGHAVTGMFS-GSDSKEAAPAAPAAP 114
           + A  GG+A G+AVG      V+G+   G    E A  AP  P
Sbjct: 8   IPAIVGGIATGAAVG-----TVSGLLGWGLKQAEEANKAPDKP 45


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 26.4 bits (58), Expect = 7.4
 Identities = 17/63 (26%), Positives = 24/63 (38%), Gaps = 1/63 (1%)

Query: 6   RSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQ 65
           + P P  S      R  S    +R  +   P P+ RP  A PAP + +      +    Q
Sbjct: 254 KKPDPSPSMASPETRESSKRTETRPRTSLRP-PSARPASARPAPPRVKRKEIVTVLQDAQ 312

Query: 66  GPG 68
           G G
Sbjct: 313 GVG 315


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 26.4 bits (58), Expect = 8.1
 Identities = 16/43 (37%), Positives = 19/43 (44%)

Query: 23  SPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQ 65
           S +P   G S A P+  P   PA P P   Q   P P+  PQ 
Sbjct: 220 SFNPFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQP 262


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 26.4 bits (58), Expect = 8.3
 Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 2/44 (4%)

Query: 24  PSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGP 67
           P+P S   +  SPA       A PA  +AQ   PAP  A  Q P
Sbjct: 200 PAPASAQPAAQSPAATQPEPAAAPAAAKAQA--PAPQQAGTQNP 241


>gnl|CDD|237592 PRK14040, PRK14040, oxaloacetate decarboxylase; Provisional.
          Length = 593

 Score = 26.0 bits (58), Expect = 8.4
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 34  ASPAPAPRPTPATPAPVQAQPTMPAPMA 61
           A+PA AP    A      A   + AP+A
Sbjct: 505 AAPAAAPAAAAAAAPAAAAGEPVTAPLA 532


>gnl|CDD|173956 cd08197, DOIS, 2-deoxy-scyllo-inosose synthase (DOIS) catalyzes
           carbocycle formation from D-glucose-6-phosphate to
           2-deoxy-scyllo-inosose through a multistep reaction in
           the biosynthesis of aminoglycoside antibiotics.
           2-deoxy-scyllo-inosose synthase (DOIS) catalyzes
           carbocycle formation from D-glucose-6-phosphate to
           2-deoxy-scyllo-inosose through a multistep reaction in
           the biosynthesis of aminoglycoside antibiotics.
           2-Deoxystreptamine (DOS)-containing aminoglycoside
           antibiotics includes neomycin, kanamycin, gentamicin,
           and ribostamycin. They are important antibacterial
           agents. DOIS is a homologue of the dehydroquinate
           synthase which catalyzes the cyclization of
           3-deoxy-D-arabino-heputulosonate-7-phosphate to
           dehydroquinate (DHQ) in the shikimate pathway.
          Length = 355

 Score = 26.1 bits (58), Expect = 8.6
 Identities = 7/8 (87%), Positives = 8/8 (100%)

Query: 87  GHTIGHAV 94
           GHT+GHAV
Sbjct: 235 GHTVGHAV 242


>gnl|CDD|215213 PLN02374, PLN02374, pyruvate dehydrogenase (acetyl-transferring).
          Length = 433

 Score = 26.1 bits (57), Expect = 9.2
 Identities = 10/60 (16%), Positives = 17/60 (28%), Gaps = 2/60 (3%)

Query: 1  MVRRGRSPSPVSSFHKM--VRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPA 58
            R     SP+    K      G +    S     A+         A  A V+ + +  +
Sbjct: 18 SSRDDAPSSPLRGALKRSSAFTGSTSKLSSLRGLNAANGRRRSTVVAVSAVVKEKNSKAS 77


>gnl|CDD|236851 PRK11114, PRK11114, cellulose synthase regulator protein;
          Provisional.
          Length = 756

 Score = 26.0 bits (58), Expect = 9.4
 Identities = 12/63 (19%), Positives = 17/63 (26%), Gaps = 1/63 (1%)

Query: 17 MVRRGRSPSPVSRGHSRASPAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGL-MAQMAA 75
           +  G S  P     +  +  P             A        AAP +   L  AQ+  
Sbjct: 10 ALALGMSAFPAFATAAAPATQPLANAAGQVMPAAPAAAAPVVAQAAPSRDVTLTFAQLGP 69

Query: 76 TAG 78
            G
Sbjct: 70 APG 72


>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen.  The core
           antigen of hepatitis viruses possesses a carboxyl
           terminus rich in arginine. On this basis it was
           predicted that the core antigen would bind DNA. There is
           some experimental evidence to support this.
          Length = 182

 Score = 25.5 bits (56), Expect = 9.5
 Identities = 15/35 (42%), Positives = 18/35 (51%), Gaps = 10/35 (28%)

Query: 4   RGRSPSPVSSFHKMVRRGRSPSPVSRGHSRASPAP 38
           R R+PSP        RR RS SP  R  S+ SP+ 
Sbjct: 156 RRRTPSP--------RRRRSQSP-RRRRSQ-SPSS 180


>gnl|CDD|237529 PRK13841, PRK13841, conjugal transfer protein TrbL; Provisional.
          Length = 391

 Score = 26.0 bits (57), Expect = 9.8
 Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 58  APMAAPQQGPGLMAQMAATAGGVAIGSAV--GHTIGHAVTGMFSGSDSKEAAPAAPAAPA 115
           A +AA   G G  A  AA AGG ++  A   G   G    G  +GS +KE A  +P A A
Sbjct: 301 AFLAAGAAGAGFAAASAARAGGSSLAGAALRGMGAGIGSAGSAAGSAAKEKAIGSPGAYA 360


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 25.9 bits (57), Expect = 9.9
 Identities = 21/84 (25%), Positives = 29/84 (34%), Gaps = 12/84 (14%)

Query: 36  PAPAPRPTPATPAPVQAQPTMPAPMAAPQQGPGLMAQMAATA-----GGVAIGSAV--GH 88
           P  A    PAT AP +    +  P   P        ++   A      G+A G+A     
Sbjct: 645 PDGAVLLHPATVAPGETLADVLRPWLDPPVSAERQPEVDPAAVTRVLEGIACGTADAGAA 704

Query: 89  TIGHAVTGMFS-----GSDSKEAA 107
                V G F      G+ +K AA
Sbjct: 705 HTWIDVDGRFRLGVLRGAWAKPAA 728


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.123    0.371 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,444,688
Number of extensions: 661291
Number of successful extensions: 4603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2926
Number of HSP's successfully gapped: 738
Length of query: 150
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 62
Effective length of database: 7,034,450
Effective search space: 436135900
Effective search space used: 436135900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 54 (24.8 bits)