RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16142
         (426 letters)



>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model represents one of several closely related clades
           of the dihydrolipoamide acetyltransferase subunit of the
           pyruvate dehydrogenase complex. It includes sequences
           from mitochondria and from alpha and beta branches of
           the proteobacteria, as well as from some other bacteria.
           Sequences from Gram-positive bacteria are not included.
           The non-enzymatic homolog protein X, which serves as an
           E3 component binding protein, falls within the clade
           phylogenetically but is rejected by its low score
           [Energy metabolism, Pyruvate dehydrogenase].
          Length = 436

 Score =  353 bits (907), Expect = e-119
 Identities = 175/368 (47%), Positives = 222/368 (60%), Gaps = 50/368 (13%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +PALSPTM  G +  W KKEGDK+N GD++AEIETDKATM FE  EEGYLAKILVP G
Sbjct: 2   ITMPALSPTMTTGNLAKWLKKEGDKVNPGDVIAEIETDKATMEFEAVEEGYLAKILVPEG 61

Query: 145 SKDVPIGKLVCIIVENESDVA-AFKDFK--DDAPPAAGASAPAPPPPKVAAAPPPPPPKA 201
           +KDVP+ K + ++VE + DVA AFK++K    A PA   S  AP  P  A  P P P K 
Sbjct: 62  TKDVPVNKPIAVLVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQ 121

Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL----- 256
           +P PS  P P      G R++ASPLAK+LA EKG+DLS++ AGSG  G I   D+     
Sbjct: 122 SPEPSS-PAPLSDKESGDRIFASPLAKKLAKEKGIDLSAV-AGSGPNGRIVKKDIESFVP 179

Query: 257 -----SKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------- 301
                +    A    A   +A P + G + D+P+S +R +IAKRLL+SKQ          
Sbjct: 180 QSPASANQQAAATTPATYPAAAPVSTGSYEDVPLSNIRKIIAKRLLESKQTIPHYYVSIE 239

Query: 302 -----VIKLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVD 356
                ++ LR+++N    +   KLS+NDFIIKA+ALA R VPEANSSW D FIR Y +VD
Sbjct: 240 CNVDKLLALRKELNAMASEV-YKLSVNDFIIKASALALREVPEANSSWTDNFIRRYKNVD 298

Query: 357 VSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEG 416
           +SVAV T  GL TPIV +AD KGL  I                   SN++K LA +A+  
Sbjct: 299 ISVAVATPDGLITPIVRNADAKGLSTI-------------------SNEIKDLAKRARNN 339

Query: 417 KLQPHEFQ 424
           KL+P EFQ
Sbjct: 340 KLKPEEFQ 347



 Score = 70.2 bits (172), Expect = 4e-13
 Identities = 29/40 (72%), Positives = 35/40 (87%)

Query: 39  KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV 78
           +GGT +ISNLGMFGIK+F+AIINPPQACILAVG++    V
Sbjct: 347 QGGTFTISNLGMFGIKDFTAIINPPQACILAVGAVEDVAV 386


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score =  306 bits (785), Expect = 2e-99
 Identities = 150/360 (41%), Positives = 208/360 (57%), Gaps = 44/360 (12%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +P+LSPTM  G I  W KKEGDK++ G++L E+ETDKAT+  E  EEGYLAKI+   G
Sbjct: 115 IGMPSLSPTMTEGNIARWLKKEGDKVSPGEVLCEVETDKATVEMECMEEGYLAKIVKGDG 174

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAP--PAAGASAPAPPPPKVAAAPPPPPPKAA 202
           +K++ +G+++ I VE E D+  FKD+K  +   PAA  + P+PPPPK      P      
Sbjct: 175 AKEIKVGEVIAITVEEEEDIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEP 234

Query: 203 PAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSK--AS 260
            A  P+  P    S G R++ASPLA++LA +  + LSSI  G+G  G I  AD+    AS
Sbjct: 235 KASKPSAPP----SSGDRIFASPLARKLAEDNNVPLSSI-KGTGPDGRIVKADIEDYLAS 289

Query: 261 KAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQVI---------------KL 305
                 AP  +        +TD+P + +R V A RLLQSKQ I                L
Sbjct: 290 GGKGATAPPSTDSKAPALDYTDIPNTQIRKVTASRLLQSKQTIPHYYLTVDTRVDKLMAL 349

Query: 306 REQMNKALEKRGAK-LSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTD 364
           R Q+N   E  G K +S+ND +IKA ALA R+VP+ NSSW D +IR+YH+V+++VAV T+
Sbjct: 350 RSQLNSLQEASGGKKISVNDLVIKAAALALRKVPQCNSSWTDDYIRQYHNVNINVAVQTE 409

Query: 365 KGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
            GL+ P+V DADKKGL  I                   + +VK LA KA+E  L+P +++
Sbjct: 410 NGLYVPVVKDADKKGLSTI-------------------AEEVKQLAQKARENSLKPEDYE 450



 Score = 61.4 bits (149), Expect = 3e-10
 Identities = 26/40 (65%), Positives = 32/40 (80%), Gaps = 1/40 (2%)

Query: 40  GGTVSISNLG-MFGIKNFSAIINPPQACILAVGSLSQRLV 78
           GGT ++SNLG  FGIK F AIINPPQ+ ILAVGS  +R++
Sbjct: 451 GGTFTVSNLGGPFGIKQFCAIINPPQSAILAVGSAEKRVI 490


>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
           subunit E2; Reviewed.
          Length = 411

 Score =  282 bits (723), Expect = 2e-91
 Identities = 132/358 (36%), Positives = 181/358 (50%), Gaps = 60/358 (16%)

Query: 87  LPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
           +P L   M  G IV W  K GD + EG  LAE+ETDKAT+   +P  G +AK+LV  G  
Sbjct: 7   MPDLGEGMTEGEIVEWLVKVGDTVKEGQPLAEVETDKATVEIPSPVAGTVAKLLVEEGDV 66

Query: 147 DVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
            VP+G ++ +I E     AA       A   A   APAP P   AAA     P AA AP+
Sbjct: 67  -VPVGSVIAVIEEEGEAEAA-------AAAEAAPEAPAPEPAPAAAAAAAAAPAAAAAPA 118

Query: 207 PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVA 266
                + K        ASP  ++LA E G+DLS++  GSG  G IT  D+  A+ A A A
Sbjct: 119 APAAAAAK--------ASPAVRKLARELGVDLSTV-KGSGPGGRITKEDVEAAAAAAAPA 169

Query: 267 APSKSAKP-----TANGPFTDLPVSGVRGVIAKRLLQSKQVI---------------KLR 306
           A + +A        A      +P+SG+R  IAKR+++SK+ I                LR
Sbjct: 170 AAAAAAAAAAPPAAAAEGEERVPLSGMRKAIAKRMVESKREIPHFTLTDEVDVTALLALR 229

Query: 307 EQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKG 366
           +Q+     K    L++ DF+IKA ALA ++ PE N+SW D  I     V++ +AV TD G
Sbjct: 230 KQLKAIGVK----LTVTDFLIKAVALALKKFPELNASWDDDAIVLKKYVNIGIAVATDGG 285

Query: 367 LFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
           L  P++ DADKK L ++                   + ++K LA KA+EGKL+P E Q
Sbjct: 286 LIVPVIRDADKKSLFEL-------------------AREIKDLAEKAREGKLKPEELQ 324



 Score = 71.4 bits (176), Expect = 2e-13
 Identities = 23/40 (57%), Positives = 30/40 (75%)

Query: 39  KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV 78
           +GGT +ISNLGMFG   F+ IINPP+  IL VG++ +R V
Sbjct: 324 QGGTFTISNLGMFGGDYFTPIINPPEVAILGVGAIVERPV 363


>gnl|CDD|223582 COG0508, AceF, Pyruvate/2-oxoglutarate dehydrogenase complex,
           dihydrolipoamide acyltransferase (E2) component, and
           related enzymes [Energy production and conversion].
          Length = 404

 Score =  239 bits (612), Expect = 5e-75
 Identities = 132/361 (36%), Positives = 187/361 (51%), Gaps = 69/361 (19%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +P L  TM  GTIV W KK GDK+ EGD+L E+ETDKATM    P+ G LAKILV  G
Sbjct: 5   IKMPDLGETMTEGTIVEWLKKVGDKVKEGDVLVEVETDKATMEVPAPDAGVLAKILVEEG 64

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
              VP+G ++  I E                   GA APA      AA  PP P  AAPA
Sbjct: 65  DT-VPVGAVIARIEE------------------EGADAPA------AAEAPPEPAAAAPA 99

Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL----SKAS 260
            +P    S       RV ASP  +RLA E G+DLS +  G+G  G IT  D+    ++ +
Sbjct: 100 SAPATAASAAAG---RVLASPAVRRLAREAGIDLSKV-KGTGPGGRITKKDVEAAVAEKA 155

Query: 261 KAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------------VIKL 305
            A A  AP+ +A  +A G    +P+S +R  IA+R+++SKQ               ++ L
Sbjct: 156 AAAAAPAPAAAAPASAAGEEERVPMSRIRKAIAERMVESKQTIPHLTLFNEVDMTKLMAL 215

Query: 306 REQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW--QDTFIREYHSVDVSVAVNT 363
           R+++ +  EK+G KL+   F++KA   A ++ PE N+S       I  +  V++ +AV+T
Sbjct: 216 RKKLKEEFEKKGVKLTFLSFLVKAVVKALKKFPEVNASIDGDGEEIVYHKYVNIGIAVDT 275

Query: 364 DKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEF 423
            +GL  P++ DADKK L +I                   + ++K LA KA++GKL P E 
Sbjct: 276 PRGLVVPVIRDADKKSLAEI-------------------AKEIKDLAKKARDGKLTPEEM 316

Query: 424 Q 424
           Q
Sbjct: 317 Q 317



 Score = 60.1 bits (146), Expect = 7e-10
 Identities = 27/53 (50%), Positives = 35/53 (66%)

Query: 38  KKGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPAL 90
            +GGT +ISNLGMFG   F+ IINPPQ  IL VG++ +R V  G + V  P +
Sbjct: 316 MQGGTFTISNLGMFGSLMFTPIINPPQVAILGVGAIEERPVVVGGEIVVRPMM 368


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score =  182 bits (465), Expect = 5e-52
 Identities = 109/363 (30%), Positives = 158/363 (43%), Gaps = 69/363 (19%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
             ++ W  K GD + E   L  +ETDKATM   +P  G + +I V  G K V +G L+ +
Sbjct: 133 VEVIEWLVKVGDTVEEDQSLITVETDKATMEIPSPVAGVVKEIKVKVGDK-VSVGSLLVV 191

Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           I             +  A   A A+APA   P  AAA  P P  AA A      P+   +
Sbjct: 192 I-------------EVAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAA 238

Query: 217 GGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAG-AVAAPSKSAKPT 275
            G   +ASP  +RLA E G+DLS +  G+G  G IT  D+    K   + AA + +A   
Sbjct: 239 PGKAPHASPAVRRLARELGVDLSQV-KGTGKKGRITKEDVQAFVKGAMSAAAAAAAAAAA 297

Query: 276 ANGPFTDL-----------------PVSGVRGVIAKRLLQSKQVI--------------- 303
           A G    L                 P+S ++ + A  L +S   I               
Sbjct: 298 AGGGGLGLLPWPKVDFSKFGEIETKPLSRIKKISAANLHRSWVTIPHVTQFDEADITDLE 357

Query: 304 KLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQD--TFIREYHSVDVSVAV 361
            LR+Q+ K  EK G KL++  F IKA   A +  P  N+S  +    +      ++  AV
Sbjct: 358 ALRKQLKKEAEKAGVKLTMLPFFIKAVVAALKEFPVFNASLDEDGDELTYKKYFNIGFAV 417

Query: 362 NTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPH 421
           +T  GL  P++ D DKK L++I                + E  +   LA KA++GKL+P 
Sbjct: 418 DTPNGLVVPVIKDVDKKSLLEI----------------AREIAE---LAKKARDGKLKPD 458

Query: 422 EFQ 424
           + Q
Sbjct: 459 DMQ 461



 Score = 73.7 bits (182), Expect = 4e-14
 Identities = 36/110 (32%), Positives = 48/110 (43%), Gaps = 9/110 (8%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
             ++ W  KEGD + E   L  +ETDKATM   +P  G + +I V  G   V +G L+  
Sbjct: 16  VEVIEWLVKEGDTVEEDQPLVTVETDKATMEIPSPAAGVVKEIKVKVGDT-VSVGGLL-A 73

Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           ++E     AA       A   A A+APA       A     P  AA A  
Sbjct: 74  VIEAAGAAAA-------AAAPAAAAAPAAAAAAAPAPAAAAPAAAAAAAG 116



 Score = 46.7 bits (112), Expect = 1e-05
 Identities = 19/48 (39%), Positives = 27/48 (56%)

Query: 39  KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVA 86
           +GG  +IS+LG  G   F+ IIN P+  IL VG    + V +G + V 
Sbjct: 461 QGGCFTISSLGGIGGTAFTPIINAPEVAILGVGKSQMKPVWDGKEFVP 508


>gnl|CDD|235571 PRK05704, PRK05704, dihydrolipoamide succinyltransferase;
           Validated.
          Length = 407

 Score =  175 bits (446), Expect = 2e-50
 Identities = 108/360 (30%), Positives = 157/360 (43%), Gaps = 71/360 (19%)

Query: 88  PALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKD 147
           P L  ++   TI +W KK GD +   ++L EIETDK  +    P  G L++IL   G   
Sbjct: 8   PTLPESVTEATIATWHKKPGDAVKRDEVLVEIETDKVVLEVPAPAAGVLSEILAEEGD-T 66

Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           V +G+++  I E                 AAGA+A A      AAA P     AA A   
Sbjct: 67  VTVGQVLGRIDEGA---------------AAGAAAAAAAAAAAAAAAPAQAQAAAAAEQS 111

Query: 208 TPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAA 267
                           SP A++LAAE GLD S++  G+G  G +T  D+  A  A A A 
Sbjct: 112 NDA------------LSPAARKLAAENGLDASAV-KGTGKGGRVTKEDVLAALAAAAAAP 158

Query: 268 PSKSAKPTANGPFTD-------LPVSGVRGVIAKRLLQSKQ---------------VIKL 305
            + +A   A  P          +P++ +R  IA+RLL+++                V+ L
Sbjct: 159 AAPAAAAPAAAPAPLGARPEERVPMTRLRKTIAERLLEAQNTTAMLTTFNEVDMTPVMDL 218

Query: 306 REQMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTD 364
           R+Q   A EK+ G KL    F +KA   A +R PE N+S     I  ++  D+ +AV T 
Sbjct: 219 RKQYKDAFEKKHGVKLGFMSFFVKAVVEALKRYPEVNASIDGDDIVYHNYYDIGIAVGTP 278

Query: 365 KGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
           +GL  P++ DAD+    +I                      +  LA KA++GKL   E  
Sbjct: 279 RGLVVPVLRDADQLSFAEI-------------------EKKIAELAKKARDGKLSIEELT 319



 Score = 41.7 bits (99), Expect = 5e-04
 Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 4/56 (7%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP----ALS 91
           GGT +I+N G+FG    + IINPPQ+ IL +  + +R V      V  P    ALS
Sbjct: 320 GGTFTITNGGVFGSLMSTPIINPPQSAILGMHKIKERPVAVNGQIVIRPMMYLALS 375


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score =  154 bits (391), Expect = 9e-43
 Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 69/361 (19%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +P L+ ++  GT+  W KK GD +   + + EIETDK  +   +P +G L +IL   G
Sbjct: 3   IKVPELAESITEGTVAEWHKKVGDTVKRDENIVEIETDKVVLEVPSPADGVLQEILFKEG 62

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
              V  G+++ I+ E            +DA  A  A +        AA+    P  AA  
Sbjct: 63  DT-VESGQVLAILEEG-----------NDATAAPPAKSGEEKEETPAASAAAAPTAAANR 110

Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA 264
           PS                 SP A+RLA E G+DLS++  G+G+ G +T  D+ K ++A A
Sbjct: 111 PS----------------LSPAARRLAKEHGIDLSAV-PGTGVTGRVTKEDIIKKTEAPA 153

Query: 265 VAA--PSKSAKPTANG---PFTDLPVSGVRGVIAKRLLQSKQ---------------VIK 304
            A      +A         P   + ++ +R  IA+RL +++                V++
Sbjct: 154 SAQQPAPAAAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVME 213

Query: 305 LREQMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNT 363
           LR++  +  EK+ G KL    F +KA   A +R PE N+      I      D+SVAV+T
Sbjct: 214 LRKRYKEEFEKKHGVKLGFMSFFVKAVVAALKRFPEVNAEIDGDDIVYKDYYDISVAVST 273

Query: 364 DKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEF 423
           D+GL  P+V +AD+    DI                     ++  L  KA++GKL   + 
Sbjct: 274 DRGLVVPVVRNADRMSFADI-------------------EKEIADLGKKARDGKLTLEDM 314

Query: 424 Q 424
            
Sbjct: 315 T 315



 Score = 38.2 bits (89), Expect = 0.007
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP----ALS 91
           GGT +I+N G+FG    + IINPPQ+ IL +  + +R V         P    ALS
Sbjct: 316 GGTFTITNGGVFGSLMSTPIINPPQSAILGMHGIKERPVAVNGQIEIRPMMYLALS 371


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score =  138 bits (351), Expect = 2e-36
 Identities = 57/128 (44%), Positives = 74/128 (57%), Gaps = 3/128 (2%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +PALSPTME GT+  W KKEGDK+  GD++AEIETDKATM  E  +EG L KILVP G
Sbjct: 5   ILMPALSPTMEEGTLAKWLKKEGDKVKSGDVIAEIETDKATMEVEAVDEGTLGKILVPEG 64

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
           ++ V +   + +++E     +   D       AA A+A AP     AAA    P  AAPA
Sbjct: 65  TEGVKVNTPIAVLLEEGESAS---DAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPA 121

Query: 205 PSPTPVPS 212
                V +
Sbjct: 122 APAAEVAA 129


>gnl|CDD|236999 PRK11854, aceF, pyruvate dehydrogenase dihydrolipoyltransacetylase;
           Validated.
          Length = 633

 Score =  130 bits (329), Expect = 8e-33
 Identities = 91/367 (24%), Positives = 144/367 (39%), Gaps = 77/367 (20%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
             +     K GDK+     L  +E DKA+M    P  G + +I V  G K          
Sbjct: 219 VEVTEVMVKVGDKVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDK---------- 268

Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT- 215
            V+  S +  F+           A A AP   + AA  P      APA +P      K+ 
Sbjct: 269 -VKTGSLIMRFEV-------EGAAPAAAPAKQEAAAPAPAAAKAEAPAAAPAAKAEGKSE 320

Query: 216 --SGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAK 273
                  V+A+PL +RLA E G++L+ +  G+G  G I   D+    K     A +  A 
Sbjct: 321 FAENDAYVHATPLVRRLAREFGVNLAKVK-GTGRKGRILKEDVQAYVKDAVKRAEAAPAA 379

Query: 274 PTANGPFTDL-----------------PVSGVRGVIAKRLLQSKQVI------------- 303
             A G    L                  +  ++ +    L ++  +I             
Sbjct: 380 AAAGGGGPGLLPWPKVDFSKFGEIEEVELGRIQKISGANLHRNWVMIPHVTQFDKADITE 439

Query: 304 --KLREQMNKALEKR--GAKLSINDFIIKATALASRRVPEANSSW-QDTFIREYHS-VDV 357
               R+Q N   EKR  G K++   FI+KA A A  ++P  NSS  +D         V++
Sbjct: 440 LEAFRKQQNAEAEKRKLGVKITPLVFIMKAVAAALEQMPRFNSSLSEDGQRLTLKKYVNI 499

Query: 358 SVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGK 417
            +AV+T  GL  P+  D +KKG++++                   S ++  ++ KA++GK
Sbjct: 500 GIAVDTPNGLVVPVFKDVNKKGIIEL-------------------SRELMDISKKARDGK 540

Query: 418 LQPHEFQ 424
           L   + Q
Sbjct: 541 LTAGDMQ 547



 Score = 58.9 bits (143), Expect = 2e-09
 Identities = 31/106 (29%), Positives = 43/106 (40%), Gaps = 16/106 (15%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
             +     K GDK+     L  +E DKA+M   +P+ G + +I V  G K V  G L+ I
Sbjct: 15  VEVTEILVKVGDKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVKVGDK-VETGALIMI 73

Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
                           ++   A  +APA    K  AAP   P  AA
Sbjct: 74  F---------------ESADGAADAAPAQAEEKKEAAPAAAPAAAA 104



 Score = 51.9 bits (125), Expect = 4e-07
 Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 8/97 (8%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
             +     K GD +     L  +E DKA+M    P  G + +I V  G K V  G L+ +
Sbjct: 118 VEVTEILVKVGDTVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDK-VSTGSLIMV 176

Query: 157 IVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAA 193
                   AA       AP AA A+APA  P   A  
Sbjct: 177 FEVAGEAPAA-------APAAAEAAAPAAAPAAAAGV 206



 Score = 38.8 bits (91), Expect = 0.006
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREG 81
           GG  +IS++G  G  +F+ I+N P+  IL V   +   V  G
Sbjct: 548 GGCFTISSIGGLGTTHFTPIVNAPEVAILGVSKSAMEPVWNG 589


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score =  129 bits (325), Expect = 1e-32
 Identities = 97/367 (26%), Positives = 160/367 (43%), Gaps = 71/367 (19%)

Query: 95  EMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
           E  T++    K GD ++    L  +E+DKA+M    P  G +  + V  G   VP G L+
Sbjct: 128 EKVTVIEVLVKVGDTVSADQSLITLESDKASMEVPAPASGVVKSVKVKVGDS-VPTGDLI 186

Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVP 211
             +    S            P  A A A A P    P      P   P AA A +P P  
Sbjct: 187 LTLSVAGS-----------TPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQ 235

Query: 212 SQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
           +   +     +A+P  +RLA E G+DLS++  G+G+ G I   D+ +  K  +V A + +
Sbjct: 236 AGTQNPAKVDHAAPAVRRLAREFGVDLSAV-KGTGIKGRILREDVQRFVKEPSVRAQAAA 294

Query: 272 AKPTANGP---------------FTDLPVSGVRGVIAKRLLQSKQVI------------- 303
           A      P                 ++ +S +R +    L ++  +I             
Sbjct: 295 ASAAGGAPGALPWPNVDFSKFGEVEEVDMSRIRKISGANLTRNWTMIPHVTHFDKADITE 354

Query: 304 --KLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW----QDTFIREYHSVDV 357
               R+Q N A+EK G KL++   ++KA A A ++ P+ N+S     +   +++Y  V++
Sbjct: 355 MEAFRKQQNAAVEKEGVKLTVLHILMKAVAAALKKFPKFNASLDLGGEQLILKKY--VNI 412

Query: 358 SVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGK 417
            VAV+T  GL  P++ D D+KG+ ++                   + ++  LA KA++GK
Sbjct: 413 GVAVDTPNGLLVPVIKDVDRKGITEL-------------------ALELSDLAKKARDGK 453

Query: 418 LQPHEFQ 424
           L P E Q
Sbjct: 454 LTPDEMQ 460



 Score = 49.9 bits (119), Expect = 2e-06
 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 8/112 (7%)

Query: 95  EMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
           E G ++    K GDK+  G  L  +E+DKA+M   +   G + +I V  G   +P+G ++
Sbjct: 12  EEGEVIEVLVKPGDKVEAGQSLITLESDKASMEVPSSAAGIIKEIKVKVGDT-LPVGGVI 70

Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
             + E  +   A  + K +A PA  A APAP      AA     P A  +  
Sbjct: 71  ATL-EVGAGAQAQAEAKKEAAPAPTAGAPAP------AAQAQAAPAAGQSSG 115



 Score = 32.9 bits (75), Expect = 0.35
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 39  KGGTVSISNLGMFGIKNFSAIINPPQACILAV 70
           +G   +IS+LG  G   F+ I+N P+  IL V
Sbjct: 460 QGACFTISSLGGIGGTAFTPIVNAPEVAILGV 491


>gnl|CDD|215782 pfam00198, 2-oxoacid_dh, 2-oxoacid dehydrogenases acyltransferase
           (catalytic domain).  These proteins contain one to three
           copies of a lipoyl binding domain followed by the
           catalytic domain.
          Length = 212

 Score =  120 bits (303), Expect = 6e-32
 Identities = 42/124 (33%), Positives = 70/124 (56%), Gaps = 20/124 (16%)

Query: 302 VIKLREQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSWQ-DTFIREYHSVDVSVA 360
           ++ LRE++ +  +K G KL+  DF++KA ALA +  PE N+SW  D  I     V++ +A
Sbjct: 22  LLALREKLKEDAKKEGLKLTFLDFLVKAVALALKEFPELNASWDGDAEIVYKKYVNIGIA 81

Query: 361 VNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQP 420
           V T  GL  P++ +AD+K L++I                   + ++K LA +A++GKL+P
Sbjct: 82  VATPDGLIVPVIRNADRKSLLEI-------------------AKEIKDLAERARDGKLKP 122

Query: 421 HEFQ 424
            + Q
Sbjct: 123 EDLQ 126



 Score = 71.0 bits (175), Expect = 3e-14
 Identities = 24/44 (54%), Positives = 31/44 (70%)

Query: 39  KGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGN 82
           +GGT +ISNLGMFG+  F+ IINPPQ  IL VG++ +R V    
Sbjct: 126 QGGTFTISNLGMFGVTFFTPIINPPQVAILGVGAIRKRPVVVDG 169


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score =  124 bits (314), Expect = 9e-32
 Identities = 79/352 (22%), Positives = 127/352 (36%), Gaps = 92/352 (26%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +P +  ++  GT+V W KK GD + E +++  IETDK ++    P  G + KI    G
Sbjct: 47  IKVPTMGDSISEGTVVEWKKKVGDYVKEDEVICIIETDKVSVDIRAPASGVITKIFAEEG 106

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP--APPPPKVAAAPPPPPPKAA 202
              V +G  +  I                APPAA  +A   A         P    P   
Sbjct: 107 D-TVEVGAPLSEIDTGG------------APPAAAPAAAAAAKAEKTTPEKPKAAAPTPE 153

Query: 203 PAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKA 262
           P  +  P P                                              K  + 
Sbjct: 154 PPAASKPTPPAAA------------------------------------------KPPEP 171

Query: 263 GAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQ---------------VIKLRE 307
              A P  +    A+   T +P+S +R  IA+RL  S+                +++LR+
Sbjct: 172 APAAKPPPTPVARADPRETRVPMSRMRQRIAERLKASQNTCAMLTTFNECDMSALMELRK 231

Query: 308 QMNKALEKR-GAKLSINDFIIKATALASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKG 366
           +     +K+ G KL      +KA+ +A +++P  N+      I   + VD+SVAV T  G
Sbjct: 232 EYKDDFQKKHGVKLGFMSAFVKASTIALKKMPIVNAYIDGDEIVYRNYVDISVAVATPTG 291

Query: 367 LFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKL 418
           L  P++ + + K   +I    + L                  LA KA+  KL
Sbjct: 292 LVVPVIRNCENKSFAEI---EKELAD----------------LAEKARNNKL 324



 Score = 44.3 bits (105), Expect = 9e-05
 Identities = 23/49 (46%), Positives = 32/49 (65%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALP 88
           GGT +ISN G+FG    + IINPPQ+ IL + ++ +R V  GN+ V  P
Sbjct: 331 GGTFTISNGGVFGSLMGTPIINPPQSAILGMHAIKKRPVVVGNEIVIRP 379


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score =  125 bits (315), Expect = 3e-31
 Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 61/378 (16%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           V +P L  ++  GT+ SW K  GD +   + L E+ TDK      +P  G L +I  P  
Sbjct: 129 VKMPELGESVTEGTVTSWLKAVGDTVEVDEPLLEVSTDKVDTEIPSPVAGTLLEIRAPE- 187

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPP---PKVAAAPPPPPPKA 201
              V +G ++ II  + +   A    ++   P+   S PAP P      AA  PP P  A
Sbjct: 188 DDTVEVGTVLAII-GDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPA 246

Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASK 261
               +     +  +SG +  Y +PL ++LA +KG+DLS++  G+G+ G I   D+  A+K
Sbjct: 247 PAKTAAPAAAAPVSSGDSGPYVTPLVRKLAKDKGVDLSTV-KGTGVGGRIRKQDVLAAAK 305

Query: 262 AG-----AVAAPSKSAKPTANGPFTDLP-------------VSGVRGVIAKRLLQSKQ-- 301
           A      A AAP+ +A P A                     ++ +R + A + ++S Q  
Sbjct: 306 AAEEARAAAAAPAAAAAPAAPAAAAKPAEPDTAKLRGTTQKMNRIRQITADKTIESLQTS 365

Query: 302 -------------VIKLREQMNKA-LEKRGAKLSINDFIIKATALASRRVPEANSSW-QD 346
                        V  LR +     LEK G  L+   F ++A   A +  P  N+S+  +
Sbjct: 366 AQLTQVHEVDMTRVAALRARAKNDFLEKNGVNLTFLPFFVQAVTEALKAHPNVNASYNAE 425

Query: 347 TFIREYH-SVDVSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESND 405
           T    YH    V +AV+T +GL  P++ +A    L  +                   +  
Sbjct: 426 TKEVTYHDVEHVGIAVDTPRGLLVPVIHNAGDLSLPGL-------------------AKA 466

Query: 406 VKTLAAKAKEGKLQPHEF 423
           +  LAA+A++ KL+P E 
Sbjct: 467 INDLAARARDNKLKPDEL 484



 Score = 65.0 bits (158), Expect = 3e-11
 Identities = 44/142 (30%), Positives = 60/142 (42%), Gaps = 22/142 (15%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           V +PAL  ++  GT+ SW K  GD +   + L E+ TDK      +P  G L +I  P  
Sbjct: 5   VKMPALGESVTEGTVTSWLKAVGDTVEADEPLLEVSTDKVDTEIPSPAAGVLLEIRAPE- 63

Query: 145 SKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP- 203
              V +G ++ II E                P    S PAP  P+  AAP P  P  AP 
Sbjct: 64  DDTVEVGGVLAIIGE----------------PGEAGSEPAPAAPEPEAAPEPEAPAPAPT 107

Query: 204 ----APSPTPVPSQKTSGGTRV 221
               AP+P    +  +   T V
Sbjct: 108 PAAEAPAPAAPQAGGSGEATEV 129



 Score = 32.7 bits (74), Expect = 0.46
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 4/52 (7%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLV----REGNDRVAL 87
           GGT +I+N+G  G    + I+NPPQA IL  G++ +R       +G + +A+
Sbjct: 486 GGTFTITNIGSGGALFDTPILNPPQAAILGTGAIVKRPRVIKDEDGGESIAI 537


>gnl|CDD|133458 cd06849, lipoyl_domain, Lipoyl domain of the dihydrolipoyl
           acyltransferase component (E2) of 2-oxo acid
           dehydrogenases. 2-oxo acid dehydrogenase multienzyme
           complexes, like pyruvate dehydrogenase (PDH),
           2-oxoglutarate dehydrogenase (OGDH) and branched-chain
           2-oxo acid dehydrogenase (BCDH), contain at least three
           different enzymes, 2-oxo acid dehydrogenase (E1),
           dihydrolipoyl acyltransferase (E2) and dihydrolipoamide
           dehydrogenase (E3) and play a key role in redox
           regulation. E2, the central component of the complex,
           catalyzes the transfer of the acyl group of CoA from E1
           to E3 via reductive acetylation of a lipoyl group
           covalently attached to a lysine residue.
          Length = 74

 Score =  102 bits (256), Expect = 7e-27
 Identities = 38/71 (53%), Positives = 49/71 (69%), Gaps = 1/71 (1%)

Query: 87  LPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
           +P L  +M  GTIV W  KEGD + EGD+LAE+ETDKAT+  E P  G LAKILV  G  
Sbjct: 5   MPDLGESMTEGTIVEWLVKEGDSVEEGDVLAEVETDKATVEVEAPAAGVLAKILVEEGDT 64

Query: 147 DVPIGKLVCII 157
            VP+G+++ +I
Sbjct: 65  -VPVGQVIAVI 74


>gnl|CDD|133456 cd06663, Biotinyl_lipoyl_domains, Biotinyl_lipoyl_domains are
           present in biotin-dependent carboxylases/decarboxylases,
           the dihydrolipoyl acyltransferase component (E2) of
           2-oxo acid dehydrogenases, and the H-protein of the
           glycine cleavage system (GCS). These domains transport
           CO2, acyl, or methylamine, respectively, between
           components of the complex/protein via a biotinyl or
           lipoyl group, which is covalently attached to a highly
           conserved lysine residue.
          Length = 73

 Score = 75.2 bits (185), Expect = 4e-17
 Identities = 31/73 (42%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           + +P L+  +  GT+V W KK GDK+ +GD+LAEIE  KAT   E P+ G + K+LV  G
Sbjct: 2   ILIPDLAQHLGDGTVVKWLKKVGDKVKKGDVLAEIEAMKATSDVEAPKSGTVKKVLVKEG 61

Query: 145 SKDVPIGKLVCII 157
           +K V     +  I
Sbjct: 62  TK-VEGDTPLVKI 73


>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
           dihydrolipoyllysine-residue acetyltransferase;
           Provisional.
          Length = 371

 Score = 78.1 bits (193), Expect = 7e-16
 Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 8/110 (7%)

Query: 84  RVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPA 143
            + +P    +M  G +  W  +EGD++ +GD L ++ETDK T   E P  G L + +   
Sbjct: 4   PITMPKWGLSMTEGKVAGWLVQEGDEVEKGDELLDVETDKITNEVEAPAAGTLRRQVAQE 63

Query: 144 GSKDVPIGKLVCIIVENE-SD------VAAFKDFKDDAPPAAGASAPAPP 186
           G   +P+G L+ ++ + E SD      +A F             + PAP 
Sbjct: 64  GE-TLPVGALLAVVADAEVSDAEIDAFIAPFARRFAPEGIDEEDAGPAPR 112


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 78.3 bits (192), Expect = 1e-15
 Identities = 89/349 (25%), Positives = 137/349 (39%), Gaps = 89/349 (25%)

Query: 70  VGSLSQRLVR----EGNDRV--ALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDK 123
           V S  QR VR    E  D V   +P +  ++  GT+ ++ KK G+++   + +A+IETDK
Sbjct: 73  VSSTLQRWVRPFSSESGDTVEAVVPHMGESITDGTLATFLKKPGERVQADEAIAQIETDK 132

Query: 124 ATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP 183
            T+   +P  G + + LV  G    P G  V II ++E               AA    P
Sbjct: 133 VTIDIASPASGVIQEFLVKEGDTVEP-GTKVAIISKSED--------------AASQVTP 177

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGA 243
           +   P+     P PP +    P              +V ++P+A++              
Sbjct: 178 SQKIPETTDPKPSPPAEDKQKP--------------KVESAPVAEK-------------- 209

Query: 244 GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD----LPVSGVRGVIAKRLLQS 299
                            KA +   P K +      P  +    +P++ +R  +A RL  S
Sbjct: 210 ----------------PKAPSSPPPPKQSAKEPQLPPKERERRVPMTRLRKRVATRLKDS 253

Query: 300 KQ---------------VIKLREQMNKA-LEKRGAKLSINDFIIKATALASRRVPEANS- 342
           +                ++KLR Q   A  EK G KL +    IKA   A +  P  N+ 
Sbjct: 254 QNTFALLTTFNEVDMTNLMKLRSQYKDAFYEKHGVKLGLMSGFIKAAVSALQHQPVVNAV 313

Query: 343 -SWQDTFIREYHSVDVSVAVNTDKGLFTPIVFDADKKGLVDISNDSRSL 390
               D   R+Y  VD+S+AV T KGL  P++  ADK    +I      L
Sbjct: 314 IDGDDIIYRDY--VDISIAVGTSKGLVVPVIRGADKMNFAEIEKTINGL 360



 Score = 36.3 bits (83), Expect = 0.027
 Identities = 19/51 (37%), Positives = 30/51 (58%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPAL 90
           GG+ ++SN G++G    + IINPPQ+ IL + S+  R +  G   V  P +
Sbjct: 376 GGSFTVSNGGVYGSLISTPIINPPQSAILGMHSIVSRPMVVGGSVVPRPMM 426


>gnl|CDD|201182 pfam00364, Biotin_lipoyl, Biotin-requiring enzyme.  This family
           covers two Prosite entries, the conserved lysine residue
           binds biotin in one group and lipoic acid in the other.
           Note that the HMM does not currently recognise the
           Glycine cleavage system H proteins.
          Length = 73

 Score = 63.0 bits (154), Expect = 8e-13
 Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 85  VALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           +  P +  +++ GT   W  K GDK+  G +L E+E  K  M    P  G + +ILV  G
Sbjct: 3   IKSPMIGESVKEGT-AEWLVKVGDKVKAGQVLCEVEAMKMEMEIPAPVAGVVKEILVKEG 61

Query: 145 SKDVPIGKLVCII 157
              V +G  +  I
Sbjct: 62  DT-VEVGDPLAKI 73


>gnl|CDD|184847 PRK14843, PRK14843, dihydrolipoamide acetyltransferase;
           Provisional.
          Length = 347

 Score = 67.2 bits (164), Expect = 3e-12
 Identities = 58/230 (25%), Positives = 102/230 (44%), Gaps = 49/230 (21%)

Query: 224 SPLAKRLAAEKGLDLSSIGAGSGLFGSITSAD----LSKASKAGAVAAPSKSAK------ 273
           SPLAKR+A E  +    I  G+G  G I   D    L +  +  ++ +P++  K      
Sbjct: 52  SPLAKRIALEHNIAWQEI-QGTGHRGKIMKKDVLALLPENIENDSIKSPAQIEKVEEVPD 110

Query: 274 -PTANGPFTDLPVSGVRGVIAKRLLQS---------------KQVIKLREQ-MNKALEKR 316
             T  G    +P++ +R VIA+R+++S                +++ LR++ +   +E  
Sbjct: 111 NVTPYGEIERIPMTPMRKVIAQRMVESYLTAPTFTLNYEVDMTEMLALRKKVLEPIMEAT 170

Query: 317 GAKLSINDFIIKATALASRRVPEANSSWQD--TFIREYHSVDVSVAVNTDKGLFTPIVFD 374
           G K ++ D +  A      + P  N+S  +    I  ++ V++++AV  D GL TP+V++
Sbjct: 171 GKKTTVTDLLSLAVVKTLMKHPYINASLTEDGKTIITHNYVNLAMAVGMDNGLMTPVVYN 230

Query: 375 ADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKLQPHEFQ 424
           A+K  L                   SE     K +  +  +GKL P E Q
Sbjct: 231 AEKMSL-------------------SELVVAFKDVIGRTLDGKLAPSELQ 261



 Score = 46.4 bits (110), Expect = 2e-05
 Identities = 24/62 (38%), Positives = 35/62 (56%)

Query: 33  LAGLRKKGGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPALSP 92
           LA    +  T +ISNLGMFG+++F  IIN P + IL V S  ++ V    + V  P +S 
Sbjct: 255 LAPSELQNSTFTISNLGMFGVQSFGPIINQPNSAILGVSSTIEKPVVVNGEIVIRPIMSL 314

Query: 93  TM 94
            +
Sbjct: 315 GL 316



 Score = 36.4 bits (84), Expect = 0.023
 Identities = 45/170 (26%), Positives = 74/170 (43%), Gaps = 18/170 (10%)

Query: 223 ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA---NGP 279
           A+P A++LA + G++L  + +GSG  G +   D+ +  K   V   S  AK  A   N  
Sbjct: 8   ATPAARKLADDLGINLYDV-SGSGANGRVHKEDV-ETYKDTNVVRISPLAKRIALEHNIA 65

Query: 280 FTDLPVSGVRGVIAKR----LL---QSKQVIKLREQMNKALEKRGAKLSINDF-IIKATA 331
           + ++  +G RG I K+    LL        IK   Q+ K  E         +   I  T 
Sbjct: 66  WQEIQGTGHRGKIMKKDVLALLPENIENDSIKSPAQIEKVEEVPDNVTPYGEIERIPMTP 125

Query: 332 L---ASRRVPEANSSWQDTFIREYHSVDVSVAVNTDKGLFTPIVFDADKK 378
           +    ++R+ E+  +   TF   Y  VD++  +   K +  PI+    KK
Sbjct: 126 MRKVIAQRMVESYLT-APTFTLNY-EVDMTEMLALRKKVLEPIMEATGKK 173


>gnl|CDD|215289 PLN02528, PLN02528, 2-oxoisovalerate dehydrogenase E2 component.
          Length = 416

 Score = 67.4 bits (165), Expect = 3e-12
 Identities = 67/320 (20%), Positives = 130/320 (40%), Gaps = 63/320 (19%)

Query: 99  IVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGK-LVCII 157
           ++ W  KEGD++ E   L E+++DKAT+   +  +G +A+I    G   V +G+ L+ I+
Sbjct: 15  LLRWFVKEGDQVEEFQPLCEVQSDKATIEITSRYKGKVAQINFSPGDI-VKVGETLLKIM 73

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           VE+   + +                                P  +          ++ S 
Sbjct: 74  VEDSQHLRS---------------------------DSLLLPTDSSNIVSLAESDERGSN 106

Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSAD-LSKASKAGAVAAPS------- 269
            + V ++P  + LA + G+DL+ I  G+G  G +   D L  A++ G V   S       
Sbjct: 107 LSGVLSTPAVRHLAKQYGIDLNDI-LGTGKDGRVLKEDVLKYAAQKGVVKDSSSAEEATI 165

Query: 270 ---------KSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQV--------------IKLR 306
                     S     +     +P+ G +  + K +  + +V              ++L+
Sbjct: 166 AEQEEFSTSVSTPTEQSYEDKTIPLRGFQRAMVKTMTAAAKVPHFHYVEEINVDALVELK 225

Query: 307 EQMNKALEKRGAKLSINDFIIKATALASRRVPEANSSW-QDTF-IREYHSVDVSVAVNTD 364
               +       K +   F+IK+ ++A  + P  NS + ++T  IR   S ++ VA+ T+
Sbjct: 226 ASFQENNTDPTVKHTFLPFLIKSLSMALSKYPLLNSCFNEETSEIRLKGSHNIGVAMATE 285

Query: 365 KGLFTPIVFDADKKGLVDIS 384
            GL  P + +     L++I+
Sbjct: 286 HGLVVPNIKNVQSLSLLEIT 305



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 19/50 (38%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 40  GGTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGNDRVALPA 89
           GGT+++SN+G  G K  S ++N P+  I+A+G + Q++ R  +D    PA
Sbjct: 327 GGTITLSNIGAIGGKFGSPVLNLPEVAIIALGRI-QKVPRFVDDGNVYPA 375


>gnl|CDD|237002 PRK11857, PRK11857, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 306

 Score = 61.3 bits (149), Expect = 2e-10
 Identities = 62/246 (25%), Positives = 107/246 (43%), Gaps = 68/246 (27%)

Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADL--------SKASKAGAVAAPS-- 269
           ++ A+P+A+ LA + G+D+S +  GSG  G I + D+        S  + A A +  S  
Sbjct: 1   KILATPIARALAKKLGIDISLL-KGSGRDGKILAEDVENFIKSLKSAPTPAEAASVSSAQ 59

Query: 270 ---KSAKPTANGPFTDL---PVSGVRGVIAKRLLQSKQVI---------------KLREQ 308
              K+A P A  P  +     V+ +R  IA+ +  S   +                LR+ 
Sbjct: 60  QAAKTAAPAAAPPKLEGKREKVAPIRKAIARAMTNSWSNVAYVNLVNEIDMTKLWDLRKS 119

Query: 309 MNKALEK-RGAKLSINDFIIKATALASRRVP-------EANSS--WQDTFIREYHSVDVS 358
           +   + K  G KL+   FI KA  +A +  P       EA S   + DT       +++ 
Sbjct: 120 VKDPVLKTEGVKLTFLPFIAKAILIALKEFPIFAAKYDEATSELVYPDT-------LNLG 172

Query: 359 VAVNTDKGLFTPIVFDADKKGLVDISNDSRSLISFRRPVKFSEESNDVKTLAAKAKEGKL 418
           +AV+T+ GL  P++ +A K  +V+I                   + ++  LA  A+E K+
Sbjct: 173 IAVDTEAGLMVPVIKNAQKLSIVEI-------------------AKEISRLAKAARERKI 213

Query: 419 QPHEFQ 424
           +P E +
Sbjct: 214 KPDEMK 219



 Score = 29.4 bits (66), Expect = 3.5
 Identities = 10/47 (21%), Positives = 13/47 (27%), Gaps = 3/47 (6%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
             DV   ++F      A   +  A       AA    P  A P    
Sbjct: 33  AEDV---ENFIKSLKSAPTPAEAASVSSAQQAAKTAAPAAAPPKLEG 76


>gnl|CDD|202412 pfam02817, E3_binding, e3 binding domain.  This family represents a
           small domain of the E2 subunit of 2-oxo-acid
           dehydrogenases responsible for the binding of the E3
           subunit.
          Length = 39

 Score = 50.5 bits (122), Expect = 8e-09
 Identities = 21/39 (53%), Positives = 28/39 (71%), Gaps = 1/39 (2%)

Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSK 258
           RV+ASP A++LAAEKG+DLS +  G+G  G IT  D+  
Sbjct: 2   RVFASPAARKLAAEKGIDLSQV-KGTGPGGRITKEDVLA 39


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 49.1 bits (118), Expect = 3e-06
 Identities = 25/64 (39%), Positives = 28/64 (43%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
             A  AA ASAPA  P   A A P P P AA AP+  P P+   +      A P A   A
Sbjct: 47  AAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAA 106

Query: 232 AEKG 235
           A   
Sbjct: 107 APAA 110



 Score = 46.0 bits (110), Expect = 3e-05
 Identities = 18/58 (31%), Positives = 22/58 (37%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
           AP    A+APA PP   AAA     P A PA +    P+              A  +A
Sbjct: 71  APAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGAAAAVA 128



 Score = 43.0 bits (102), Expect = 3e-04
 Identities = 20/73 (27%), Positives = 26/73 (35%), Gaps = 5/73 (6%)

Query: 167 FKDFKDDAPPAAGASAP-----APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           F D+   +  A  A+A      A  P    AA  P  P  AP  +  P    K +     
Sbjct: 33  FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92

Query: 222 YASPLAKRLAAEK 234
            A+P A   AA  
Sbjct: 93  AAAPAAPPAAAAA 105



 Score = 42.2 bits (100), Expect = 5e-04
 Identities = 14/45 (31%), Positives = 16/45 (35%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
              A   A    PA      AA   PP   AA AP+   V  + T
Sbjct: 74  PPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 14/60 (23%), Positives = 19/60 (31%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           AP AA A+     P     A   P     PA +     +          A+P A  +  E
Sbjct: 57  APAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 14/46 (30%), Positives = 15/46 (32%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
             AP A    A A       AAPP     AAPA +          G
Sbjct: 77  AAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRG 122



 Score = 41.4 bits (98), Expect = 9e-04
 Identities = 12/57 (21%), Positives = 15/57 (26%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
           A   A  +   P     AA P P    AA A    P  +   +             L
Sbjct: 64  AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120



 Score = 40.6 bits (96), Expect = 0.001
 Identities = 43/197 (21%), Positives = 66/197 (33%), Gaps = 48/197 (24%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
            ++++F  D  P + A+  A      AAA  P    AA AP+  P P+   +        
Sbjct: 27  PSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAA-PAPAPPAAAAPAAPPK 85

Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
           P A                               A+ A   A P+ +A         +  
Sbjct: 86  PAAA-----------------------------AAAAAAPAAPPAAAAAAAPAAAAVEDE 116

Query: 285 VSGVRGV---IAKRLLQSKQV----------IKL----REQMNKALEK-RGAKLSINDFI 326
           V+ +RG    +AK +  S +V           KL    R  +N  L++ RG K+S    I
Sbjct: 117 VTPLRGAAAAVAKNMDASLEVPTATSVRAVPAKLLIDNRIVINNHLKRTRGGKVSFTHLI 176

Query: 327 IKATALASRRVPEANSS 343
             A   A +  P  N  
Sbjct: 177 GYALVQALKAFPNMNRH 193


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 47.0 bits (112), Expect = 1e-05
 Identities = 22/61 (36%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 174 APPAAGASAPAPPPPKVAA-APPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
           A P A A A AP     AA AP   P  AA AP+  P  +          A+P A   AA
Sbjct: 382 ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441

Query: 233 E 233
            
Sbjct: 442 P 442



 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 18/51 (35%), Positives = 22/51 (43%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
              AA A APA  P   A+AP  PP  A PAP   P  +   +      A+
Sbjct: 394 VAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444



 Score = 42.4 bits (100), Expect = 4e-04
 Identities = 31/123 (25%), Positives = 43/123 (34%), Gaps = 10/123 (8%)

Query: 166 AFK-DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           AFK     +A   A    PA P     AA P     AAPAP+  P  +  ++      A+
Sbjct: 363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP-AAAASAPAAPPAAA 421

Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
           P A   A       ++  A          A ++ A    A AAP   A P    P   + 
Sbjct: 422 PPAPVAAPAAAAPAAAPAAA--------PAAVALAPAPPAQAAPETVAIPVRVAPEPAVA 473

Query: 285 VSG 287
            + 
Sbjct: 474 SAA 476



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 15/47 (31%), Positives = 17/47 (36%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
             APPA  A      P +VA  P       APA +P       T  G
Sbjct: 448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494



 Score = 35.5 bits (82), Expect = 0.061
 Identities = 20/68 (29%), Positives = 27/68 (39%), Gaps = 5/68 (7%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-----PSQKTSGGTRVYASPLA 227
            AP AA  +A     P   A  P PP +AAP     PV     P+  ++      A   A
Sbjct: 427 AAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAA 486

Query: 228 KRLAAEKG 235
           +    E+G
Sbjct: 487 RLTPTEEG 494


>gnl|CDD|133459 cd06850, biotinyl_domain, The biotinyl-domain or biotin carboxyl
           carrier protein (BCCP) domain is present in all
           biotin-dependent enzymes, such as acetyl-CoA
           carboxylase, pyruvate carboxylase, propionyl-CoA
           carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA
           carboxylase, oxaloacetate decarboxylase,
           methylmalonyl-CoA decarboxylase, transcarboxylase and
           urea amidolyase. This domain functions in transferring
           CO2 from one subsite to another, allowing carboxylation,
           decarboxylation, or transcarboxylation. During this
           process, biotin is covalently attached to a specific
           lysine.
          Length = 67

 Score = 40.1 bits (95), Expect = 9e-05
 Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCI 156
           GT+V    KEGDK+  G  LA +E  K       P  G + +ILV  G + V  G+L+ +
Sbjct: 8   GTVVKVLVKEGDKVEAGQPLAVLEAMKMENEVTAPVAGVVKEILVKEGDQ-VEAGQLLVV 66

Query: 157 I 157
           I
Sbjct: 67  I 67


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 44.2 bits (105), Expect = 1e-04
 Identities = 17/40 (42%), Positives = 20/40 (50%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           A PAA  +  A  P   AA  P   P+ APAP+P P P  
Sbjct: 404 AAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPS 443



 Score = 38.8 bits (91), Expect = 0.005
 Identities = 15/39 (38%), Positives = 18/39 (46%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           A  A   +A AP     A A  P P  AAPA +  P P+
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPA 426



 Score = 38.4 bits (90), Expect = 0.006
 Identities = 16/52 (30%), Positives = 19/52 (36%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           AP  A  +APA   P   A  P   P A  A  P   P Q   G +    + 
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAA 735



 Score = 37.7 bits (88), Expect = 0.011
 Identities = 18/46 (39%), Positives = 20/46 (43%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
            AP A  A APA  P   AA  P PP   APA +P    +     G
Sbjct: 460 AAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505



 Score = 37.7 bits (88), Expect = 0.012
 Identities = 20/41 (48%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 174 APPAAGASAPAPPPPKVAAAPP---PPPPKAAPAPSPTPVP 211
           AP  A A APAP PP  A   P    P P  A APS  P P
Sbjct: 428 APQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAP 468



 Score = 37.3 bits (87), Expect = 0.017
 Identities = 18/61 (29%), Positives = 22/61 (36%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            A PAA   APAP  P   A   P  P  APA +P    +   +      A   +    A
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPA 734

Query: 233 E 233
            
Sbjct: 735 A 735



 Score = 36.9 bits (86), Expect = 0.019
 Identities = 15/42 (35%), Positives = 17/42 (40%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
             A   +   APAP       A P P P AAPAP+  P    
Sbjct: 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498



 Score = 36.9 bits (86), Expect = 0.020
 Identities = 17/55 (30%), Positives = 20/55 (36%), Gaps = 1/55 (1%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
            AP  A A            AP PPP  AAP+  P P P+           +P A
Sbjct: 433 PAPAPAPAPPSPAGNAPAGGAPSPPPA-AAPSAQPAPAPAAAPEPTAAPAPAPPA 486



 Score = 36.9 bits (86), Expect = 0.022
 Identities = 14/39 (35%), Positives = 16/39 (41%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            AP AA A APA  P    A  P P P +    +P    
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGA 453



 Score = 36.9 bits (86), Expect = 0.023
 Identities = 16/42 (38%), Positives = 18/42 (42%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
                A  A  PAP P   AA     P + APAP+ TP   Q
Sbjct: 672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713



 Score = 36.1 bits (84), Expect = 0.037
 Identities = 13/39 (33%), Positives = 14/39 (35%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            A      +APAP PP   A    P   AAPA       
Sbjct: 470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508



 Score = 36.1 bits (84), Expect = 0.042
 Identities = 14/57 (24%), Positives = 21/57 (36%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
            +     A +  PA  P    A  P   P+   AP+P P  +   +      A+P A
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAA 502



 Score = 35.7 bits (83), Expect = 0.053
 Identities = 21/103 (20%), Positives = 30/103 (29%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
            PPA  +S P     + AA   P  P A         P++ ++      A+P        
Sbjct: 599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA 658

Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
                          G    A    A    A AAP+ +A    
Sbjct: 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701



 Score = 35.7 bits (83), Expect = 0.055
 Identities = 14/54 (25%), Positives = 16/54 (29%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
             A PA   +A AP     A AP    P+  P     P  S    G        
Sbjct: 623 APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 37/170 (21%), Positives = 53/170 (31%), Gaps = 6/170 (3%)

Query: 116 LAEIETDKATMGFETPEEGYL------AKILVPAGSKDVPIGKLVCIIVENESDVAAFKD 169
           +  +  D   +GF T            A++LV A ++++     V  +V      A  + 
Sbjct: 540 VLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEAVVGPAPGAAGGEG 599

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
               A       A  P  P   AAP  P P  A A       +         +       
Sbjct: 600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659

Query: 230 LAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
             A  G D     AG     +   A    A  A A AAP++ A   A  P
Sbjct: 660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709



 Score = 33.8 bits (78), Expect = 0.18
 Identities = 20/55 (36%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 172 DDAPPAAGASAPA-PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           D  P  AG +APA PPP    AAP  P   A   P+P P  +          A P
Sbjct: 667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721



 Score = 33.0 bits (76), Expect = 0.35
 Identities = 23/97 (23%), Positives = 29/97 (29%), Gaps = 1/97 (1%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            AP A  A A A P     A    PP   A  P+  P P           A+     L  
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP-PQAAQGASAPSPAADDPVPLPP 743

Query: 233 EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPS 269
           E        GA +        A  +  + A   + PS
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780



 Score = 32.7 bits (75), Expect = 0.45
 Identities = 16/40 (40%), Positives = 16/40 (40%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           DD P  AGA A  PPPP  A A  P        PS     
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785



 Score = 32.3 bits (74), Expect = 0.56
 Identities = 12/42 (28%), Positives = 14/42 (33%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
             P AA A AP   P   AA   P  P A           ++
Sbjct: 474 PEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRER 515



 Score = 31.9 bits (73), Expect = 0.70
 Identities = 14/40 (35%), Positives = 16/40 (40%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
              P  A A  P PP P  AAAP   PP + P+       
Sbjct: 748 PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 9/46 (19%), Positives = 13/46 (28%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
               PA  A+  A PPP   +         AP+         +   
Sbjct: 760 PPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVA 805



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           D PP    +   PPPP   A    P     P+P       
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785



 Score = 30.7 bits (70), Expect = 1.9
 Identities = 12/43 (27%), Positives = 14/43 (32%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
             P      A AP  P    AP P    AA  P   P   ++ 
Sbjct: 743 PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785



 Score = 30.7 bits (70), Expect = 2.0
 Identities = 14/62 (22%), Positives = 19/62 (30%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
             PP   A APA  P       PP   +         +  +       V    L + L A
Sbjct: 756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEELGA 815

Query: 233 EK 234
           +K
Sbjct: 816 KK 817



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 12/59 (20%), Positives = 17/59 (28%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
               A   A        A A    P    PAP+P    +   +   +   +P A   A 
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712



 Score = 30.0 bits (68), Expect = 3.1
 Identities = 19/80 (23%), Positives = 24/80 (30%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
           K  A P A       P     AAP  PPP  APA    P  +          A+P A + 
Sbjct: 655 KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714

Query: 231 AAEKGLDLSSIGAGSGLFGS 250
                    +    S    +
Sbjct: 715 DDPAAQPPQAAQGASAPSPA 734



 Score = 30.0 bits (68), Expect = 3.2
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           D   D   A     P P P   AA    PPP            
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788



 Score = 30.0 bits (68), Expect = 3.3
 Identities = 16/42 (38%), Positives = 17/42 (40%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           D  P       P  P    A  PPPP P  A AP+  P PS 
Sbjct: 737 DPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778



 Score = 29.6 bits (67), Expect = 3.5
 Identities = 24/107 (22%), Positives = 28/107 (26%), Gaps = 3/107 (2%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            AP A  A APA      A A   P P  A  P   P               P     AA
Sbjct: 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAA-PEHHPKHVAVPDASDGGDGWPAKAGGAA 677

Query: 233 EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
                 +   A         +A    A    A     ++  P A  P
Sbjct: 678 PAAPPPAPAPAAPA--APAGAAPAQPAPAPAATPPAGQADDPAAQPP 722



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 16/56 (28%), Positives = 17/56 (30%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
             AP A   +APA P P  AA PP        A  P                 PL 
Sbjct: 687 PAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742



 Score = 28.8 bits (65), Expect = 6.8
 Identities = 13/44 (29%), Positives = 14/44 (31%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
                AP          P    AAAPPP PP      +    PS
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPS 792



 Score = 28.4 bits (64), Expect = 8.1
 Identities = 13/61 (21%), Positives = 18/61 (29%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
            +   A       AP  A  + PAP P     A     P A P  +     +   +    
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDP 738

Query: 221 V 221
           V
Sbjct: 739 V 739



 Score = 28.4 bits (64), Expect = 8.7
 Identities = 14/42 (33%), Positives = 17/42 (40%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           D A     A+  A  P   A  P P PP+    P P   P+Q
Sbjct: 716 DPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ 757


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 43.7 bits (103), Expect = 1e-04
 Identities = 31/132 (23%), Positives = 41/132 (31%), Gaps = 6/132 (4%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTR 220
           +AA +      P  A A APAP     AAA P    P P AA A +     +   +    
Sbjct: 433 LAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPA 492

Query: 221 VYASPLAKRLA---AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
               P  + L    A              +  SI     +    A    AP+ +A P   
Sbjct: 493 DDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPR 552

Query: 278 GPFTDLPVSGVR 289
                 PV   R
Sbjct: 553 AAAATEPVVAPR 564



 Score = 43.3 bits (102), Expect = 2e-04
 Identities = 31/109 (28%), Positives = 41/109 (37%), Gaps = 5/109 (4%)

Query: 173 DAPPAAGASAPAPPPPKVAAAP-----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
            A PA  A+AP   P   AAAP      P  P AAPA +P    + +         SP  
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430

Query: 228 KRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
           + LAA +       G       +  +A  + A  A A   P  +A   A
Sbjct: 431 EALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479



 Score = 35.2 bits (81), Expect = 0.074
 Identities = 25/113 (22%), Positives = 35/113 (30%), Gaps = 10/113 (8%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
                 A A A A     VAAAP    P      +     ++   G      +P A   A
Sbjct: 401 APPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAA 460

Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
           A          A     G    A  + A+ A A  A + +       P+ +LP
Sbjct: 461 A----------ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP 503



 Score = 33.3 bits (76), Expect = 0.25
 Identities = 12/45 (26%), Positives = 15/45 (33%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           P  A             AA P P   AA  P   P P + ++ G 
Sbjct: 529 PATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGL 573



 Score = 32.2 bits (73), Expect = 0.68
 Identities = 13/53 (24%), Positives = 19/53 (35%)

Query: 163 DVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           + A+    + DA PA   +   P P              APA +P P  +  T
Sbjct: 505 EFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAAT 557



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 16/62 (25%), Positives = 23/62 (37%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
           DDA      +  A P P+ AAA  P      P  S + +P         + A    + LA
Sbjct: 535 DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPVRGLA 594

Query: 232 AE 233
            +
Sbjct: 595 QQ 596



 Score = 29.5 bits (66), Expect = 4.2
 Identities = 11/48 (22%), Positives = 14/48 (29%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
            +       F+  AP  A A AP           P PP  +A      
Sbjct: 529 PATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 43.0 bits (101), Expect = 2e-04
 Identities = 17/38 (44%), Positives = 18/38 (47%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
              AA A APA PPP    A PP P    P  +P P P
Sbjct: 48  PSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85



 Score = 40.6 bits (95), Expect = 0.001
 Identities = 23/59 (38%), Positives = 25/59 (42%), Gaps = 5/59 (8%)

Query: 174 APPAAGASAPAPPPPK-----VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           A PA   + PAPPPP       A AP  PPP   PA  P P P    +      A P A
Sbjct: 32  ALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNA 90



 Score = 38.3 bits (89), Expect = 0.005
 Identities = 17/38 (44%), Positives = 18/38 (47%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            PP    +AP  P P    A PPPPP    AP P PV 
Sbjct: 60  PPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVD 97



 Score = 38.0 bits (88), Expect = 0.006
 Identities = 17/37 (45%), Positives = 18/37 (48%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           D  P     + A   P  AA PPPPPP A PAP P  
Sbjct: 40  DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDD 76



 Score = 36.8 bits (85), Expect = 0.016
 Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 1/60 (1%)

Query: 174 APPAAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            PP+  A+APAP  P     P  PP P+     +  P P    +        P A    A
Sbjct: 46  PPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105



 Score = 34.9 bits (80), Expect = 0.065
 Identities = 21/47 (44%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPP--PKAAPAPSPTPVPSQKTSGG 218
           AP     +A  PPPP    APPPPP  P A P P+P P       GG
Sbjct: 71  APQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRIDNAVGG 117



 Score = 34.1 bits (78), Expect = 0.11
 Identities = 19/49 (38%), Positives = 20/49 (40%), Gaps = 5/49 (10%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPP-----PKAAPAPSPTPVPSQKTSG 217
            PP   A+ PAP P    AAPPPPP     P   P     P P     G
Sbjct: 61  PPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPG 109



 Score = 31.8 bits (72), Expect = 0.56
 Identities = 17/39 (43%), Positives = 17/39 (43%), Gaps = 5/39 (12%)

Query: 178 AGASAPAPPPPKVAAA-----PPPPPPKAAPAPSPTPVP 211
           AGASA A   P  A A     PPPP   AA      P P
Sbjct: 24  AGASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPP 62


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 28/111 (25%), Positives = 37/111 (33%), Gaps = 2/111 (1%)

Query: 174  APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
             PP   A+  A P    A APP  P   A    P       T+ G    A P A      
Sbjct: 2721 LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP 2780

Query: 234  KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
            + L   ++ + S    S+ S      +   A      +A P A  P   LP
Sbjct: 2781 RRLTRPAVASLSESRESLPSP--WDPADPPAAVLAPAAALPPAASPAGPLP 2829



 Score = 41.5 bits (97), Expect = 0.001
 Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 3/60 (5%)

Query: 173  DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT---PVPSQKTSGGTRVYASPLAKR 229
            D P A  A A A PP    A P PPP  A P   P    P P     GG+      + +R
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRR 2865



 Score = 38.8 bits (90), Expect = 0.006
 Identities = 17/64 (26%), Positives = 22/64 (34%), Gaps = 7/64 (10%)

Query: 172  DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
            DD     G + P+P PP        P P   P PSP+P  ++           P   R  
Sbjct: 2603 DDRGDPRGPAPPSPLPP----DTHAPDP---PPPSPSPAANEPDPHPPPTVPPPERPRDD 2655

Query: 232  AEKG 235
               G
Sbjct: 2656 PAPG 2659



 Score = 37.6 bits (87), Expect = 0.013
 Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 173  DAPPAAGASAPAPPPPKVAAAPPPPP-PKAAPAPSPTPVPSQKTSGG 218
             APP        PPPP+    PPPPP P+   AP+  P  + + SG 
Sbjct: 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958



 Score = 36.5 bits (84), Expect = 0.033
 Identities = 27/106 (25%), Positives = 36/106 (33%), Gaps = 6/106 (5%)

Query: 183  PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIG 242
            P  PP   A  PP P P+  P P P P P             PLA       G    S  
Sbjct: 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP-----QPPLAPT-TDPAGAGEPSGA 2958

Query: 243  AGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGV 288
                  G++    ++         APS+ A  ++  P T   +S V
Sbjct: 2959 VPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004



 Score = 36.5 bits (84), Expect = 0.035
 Identities = 27/112 (24%), Positives = 33/112 (29%), Gaps = 8/112 (7%)

Query: 176  PAAGASAPAPPPP----KVAAAPPPPPPKAAPAPSP----TPVPSQKTSGGTRVYASPLA 227
            P       A  P        A PPPPPP   PAP      TP+P    +      A P A
Sbjct: 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738

Query: 228  KRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
                A      +  G         T+   + A  A   A P +     A   
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790



 Score = 35.7 bits (82), Expect = 0.049
 Identities = 17/36 (47%), Positives = 20/36 (55%)

Query: 178  AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
            A   A  PPPP   AAPP  P ++ P P P P PS+
Sbjct: 2544 ASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSE 2579



 Score = 35.7 bits (82), Expect = 0.052
 Identities = 14/43 (32%), Positives = 15/43 (34%)

Query: 175  PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
            PP      P PPPP     P  P    A A  P+    Q   G
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965



 Score = 35.7 bits (82), Expect = 0.060
 Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 5/67 (7%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT-----PVPSQKTSGGTRVYASPLAKRL 230
               AS P P P  V A+ PPPP    P+  P        P ++         +P     
Sbjct: 409 APVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDD 468

Query: 231 AAEKGLD 237
           A  K LD
Sbjct: 469 ATRKALD 475



 Score = 35.3 bits (81), Expect = 0.080
 Identities = 29/108 (26%), Positives = 39/108 (36%), Gaps = 4/108 (3%)

Query: 172  DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
              APPA   + PA P      A P  PP  A  P+P P  +       R+    +A    
Sbjct: 2738 APAPPAV-PAGPATP---GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793

Query: 232  AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
            + + L      A         +A L  A+       P  SA+PTA  P
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841



 Score = 33.8 bits (77), Expect = 0.20
 Identities = 15/58 (25%), Positives = 18/58 (31%)

Query: 175  PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            P       P P P       P PPP   P P P   P+   +G      +     L A
Sbjct: 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA 2966



 Score = 33.8 bits (77), Expect = 0.21
 Identities = 23/114 (20%), Positives = 31/114 (27%), Gaps = 11/114 (9%)

Query: 174  APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
            A PAA A    P  P     P  P      A  P P P    + G     +  A    +E
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793

Query: 234  KGLDLSS-----------IGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
                L S           +   + L  + + A       +    AP     P  
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847



 Score = 33.0 bits (75), Expect = 0.39
 Identities = 14/39 (35%), Positives = 15/39 (38%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
            PP  G  A   P     A  PPPPP+AA        P 
Sbjct: 257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPD 295



 Score = 32.2 bits (73), Expect = 0.64
 Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 174 APPAAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKTSGGT 219
           A    G     P  P  A+ P P P P  A AP P   P      G+
Sbjct: 396 ARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGS 442



 Score = 31.4 bits (71), Expect = 1.0
 Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 20/75 (26%)

Query: 175  PPAAGASAPAPPPPKVAAAPPPPPP--------------------KAAPAPSPTPVPSQK 214
            PPAA  + P PPP       PPPPP                    +  P+ SP   P+  
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878

Query: 215  TSGGTRVYASPLAKR 229
                 R  A P   R
Sbjct: 2879 ARPPVRRLARPAVSR 2893



 Score = 31.4 bits (71), Expect = 1.0
 Identities = 18/76 (23%), Positives = 23/76 (30%), Gaps = 15/76 (19%)

Query: 169  DFKDDAPPAAGASAPAPPPP---------------KVAAAPPPPPPKAAPAPSPTPVPSQ 213
            D +  APP+        P P                    PPP  P+  PAP     P +
Sbjct: 2607 DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666

Query: 214  KTSGGTRVYASPLAKR 229
                G    AS   +R
Sbjct: 2667 ARRLGRAAQASSPPQR 2682



 Score = 30.3 bits (68), Expect = 2.3
 Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 176  PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
            P  GA  P          P P P + APA S  P+     S  +  +AS LA
Sbjct: 2962 PWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS-WASSLA 3012



 Score = 30.3 bits (68), Expect = 2.5
 Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 14/55 (25%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPP--------------PKAAPAPSPTPVPS 212
            D  P     A  PPPP  AAAP                  P A PAP   P P+
Sbjct: 265 ADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA 319



 Score = 29.9 bits (67), Expect = 3.7
 Identities = 30/118 (25%), Positives = 42/118 (35%), Gaps = 7/118 (5%)

Query: 174 APPAAGASAPAPPPPKVAAAP-----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
             P   ++ P P  P  +A P     P PPP+  P    T          TR     L +
Sbjct: 420 PTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRE 479

Query: 229 RLAAE-KGLDLSSI-GAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLP 284
           R   E  G DL+ + G      G++      +A+ A  VA  S+        PF   P
Sbjct: 480 RRPPEPPGADLAELLGRHPDTAGTVVRLAAREAAIAREVAECSRLTINALRSPFPASP 537



 Score = 29.1 bits (65), Expect = 6.7
 Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPP-PKAAPAPSPTPVPSQKTSGG 218
           P     + PA P P     P P P P +AP P  TP+PS +    
Sbjct: 399 PGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSD 443



 Score = 28.8 bits (64), Expect = 8.6
 Identities = 16/65 (24%), Positives = 21/65 (32%), Gaps = 7/65 (10%)

Query: 171 KDDAPPAAGASAPAPPPPKV-----AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY--A 223
               P AA  +  A PP  V     A AP   P    P P      +++           
Sbjct: 278 PPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVV 337

Query: 224 SPLAK 228
           SPL +
Sbjct: 338 SPLPR 342



 Score = 28.4 bits (63), Expect = 9.1
 Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 5/43 (11%)

Query: 169  DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            D  D   PA  +  P          P PPPP  +PA +     
Sbjct: 2604 DRGDPRGPAPPSPLPPDTHA-----PDPPPPSPSPAANEPDPH 2641


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 40.5 bits (95), Expect = 3e-04
 Identities = 21/59 (35%), Positives = 23/59 (38%), Gaps = 5/59 (8%)

Query: 168 KDFKDDAPPAAGASA-----PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           +D      P   ASA     P PPPP  A A  PPPP+A   P   P P         V
Sbjct: 51  EDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTV 109


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 42.5 bits (101), Expect = 3e-04
 Identities = 22/101 (21%), Positives = 32/101 (31%), Gaps = 14/101 (13%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
            + PP + A A +        A   PP   A  P P   P Q  +       S L   LA
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL---LA 424

Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSA 272
           A + L  +             +    K+  A A  A   ++
Sbjct: 425 ARQQLQRAQ-----------GATKAKKSEPAAASRARPVNS 454



 Score = 37.5 bits (88), Expect = 0.011
 Identities = 16/58 (27%), Positives = 20/58 (34%)

Query: 156 IIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
            + E E    +          AA  +A APP       PP   P+ APA       SQ
Sbjct: 364 PLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421



 Score = 33.3 bits (77), Expect = 0.27
 Identities = 12/47 (25%), Positives = 13/47 (27%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           F   AP       P    P  +A     P  A   P    VP    S
Sbjct: 359 FHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPAS 405



 Score = 31.0 bits (71), Expect = 1.3
 Identities = 17/67 (25%), Positives = 25/67 (37%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
           K   P AA  + P     +  A+  P P     AP+       K +    V   P+A   
Sbjct: 439 KKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498

Query: 231 AAEKGLD 237
           A +K L+
Sbjct: 499 ALKKALE 505



 Score = 28.7 bits (65), Expect = 7.9
 Identities = 13/61 (21%), Positives = 17/61 (27%), Gaps = 3/61 (4%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPP---PPPPKAAPAPSPTPVPSQK 214
           +E  + V       + AP    A       P      P   P   K A     TP  + K
Sbjct: 456 LERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAK 515

Query: 215 T 215
            
Sbjct: 516 L 516


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 41.5 bits (96), Expect = 6e-04
 Identities = 40/118 (33%), Positives = 50/118 (42%), Gaps = 6/118 (5%)

Query: 165 AAFKDFKDDAPPAAGASAP--APPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGG 218
           AA    K  APPA  A+AP  A   P  AAAPP     PP KAA  P+    P  K +  
Sbjct: 225 AAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAP 284

Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
               A+P AK  AA      +   A +    +      + A  A A   P+K+A P A
Sbjct: 285 PAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPA 342



 Score = 39.9 bits (92), Expect = 0.002
 Identities = 35/112 (31%), Positives = 46/112 (41%), Gaps = 6/112 (5%)

Query: 171 KDDAPPAAGASAP--APPPPKVAAAPP----PPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           K  A PA  A+ P  A   P  AAA P     PP KAA  P+    P  K +      A+
Sbjct: 224 KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAA 283

Query: 225 PLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
           P AK  A       +   A +    +  +   + A  A A A P+K+A P A
Sbjct: 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPA 335



 Score = 38.0 bits (87), Expect = 0.007
 Identities = 38/116 (32%), Positives = 49/116 (42%), Gaps = 10/116 (8%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAP-----PP-----PPPKAAPAPSPTPVPSQKTSGGTR 220
           K  A P+   SA A  P K AAAP     PP      P KAA AP+    P  K +    
Sbjct: 206 KKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPA 265

Query: 221 VYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
             A+P AK  A           A +    +  +   + A+ A A AAP+K+A P A
Sbjct: 266 KAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPA 321



 Score = 37.6 bits (86), Expect = 0.009
 Identities = 32/88 (36%), Positives = 39/88 (44%), Gaps = 6/88 (6%)

Query: 165 AAFKDFKDDAPPAAGASAPA---PPPPKVAAAPP---PPPPKAAPAPSPTPVPSQKTSGG 218
           AA    K  APPA  A+ PA    PP K AAAP      P KAA AP+    P  K +  
Sbjct: 267 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAP 326

Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSG 246
               A+P AK  A       + +G  +G
Sbjct: 327 PAKAATPPAKAAAPPAKAAAAPVGKKAG 354



 Score = 36.9 bits (84), Expect = 0.016
 Identities = 37/110 (33%), Positives = 44/110 (40%), Gaps = 6/110 (5%)

Query: 165 AAFKDFKDDAPPAAGASAP--APPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGG 218
           AA    K  A PA  A+ P  A  PP  AAAPP     PP KAA  P+    P  K +  
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAA 298

Query: 219 TRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAP 268
               A+  AK  AA          A +    + T    + A  A A AAP
Sbjct: 299 PAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAP 348



 Score = 31.5 bits (70), Expect = 0.91
 Identities = 29/100 (29%), Positives = 37/100 (37%), Gaps = 2/100 (2%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGL 236
           AA   A AP   K A A  P   KAA AP+    P  K +      A+  AK  A     
Sbjct: 203 AAAKKAAAPSGKKSAKAAAPA--KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKA 260

Query: 237 DLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
                 A +    +      + A  A A A P+K+A   A
Sbjct: 261 AAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPA 300



 Score = 29.5 bits (65), Expect = 3.2
 Identities = 22/70 (31%), Positives = 31/70 (44%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           AA    K  A PA  A+APA      A A  PP   AAP       P++  +   +  A+
Sbjct: 288 AAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAA 347

Query: 225 PLAKRLAAEK 234
           P+ K+   +K
Sbjct: 348 PVGKKAGGKK 357


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 41.5 bits (98), Expect = 6e-04
 Identities = 30/117 (25%), Positives = 41/117 (35%), Gaps = 8/117 (6%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           +SD+AA       APPAA  +APA   P  AAAP      AAPA  P P   +     T 
Sbjct: 45  DSDLAAL------APPAA--AAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTA 96

Query: 221 VYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
                +A+  A       +             +  L++A+ A         A     
Sbjct: 97  QREQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVA 153



 Score = 29.5 bits (67), Expect = 3.7
 Identities = 25/137 (18%), Positives = 47/137 (34%), Gaps = 8/137 (5%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           A   +  +A APP     AA  PPP  A  A S    P+ + +     +    AKRL A+
Sbjct: 42  ALADSDLAALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPA----PWLVEHAKRLTAQ 97

Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
           +   ++   A +    +  +   +  ++  A    ++     A         +     +A
Sbjct: 98  REQLVARAAAPA----APEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVA 153

Query: 294 KRLLQSKQVIKLREQMN 310
             +    + I     M 
Sbjct: 154 DAVKARIERIVNDTVMQ 170



 Score = 28.8 bits (65), Expect = 5.8
 Identities = 7/35 (20%), Positives = 9/35 (25%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           + A     A   A           P    AA A +
Sbjct: 121 ERAAAENAARRLARAAAAAPRPRVPADAAAAVADA 155


>gnl|CDD|181304 PRK08225, PRK08225, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 70

 Score = 37.5 bits (87), Expect = 8e-04
 Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 9/50 (18%)

Query: 78  VREGNDRVALPALSPTMEM-------GTIVSWAKKEGDKLNEGDLLAEIE 120
           V EG D V L   S  ME+       GT+     +EGD +NEGD+L EIE
Sbjct: 23  VEEGQDVVILE--SMKMEIPIVAEEAGTVKKINVQEGDFVNEGDVLLEIE 70


>gnl|CDD|236712 PRK10547, PRK10547, chemotaxis protein CheA; Provisional.
          Length = 670

 Score = 41.3 bits (97), Expect = 0.001
 Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 5/85 (5%)

Query: 136 LAKILVPAGSKDVPIGKLVCIIVENES----DVAAFKDFKDDAPPAAGASAPAPPPPKVA 191
           L   L  + ++D  I  ++C ++E +        A    K +        +P    P V 
Sbjct: 196 LEATLPGSVAED-DITAVLCFVIEADQITFETAVAAPQEKAEETTEVVEVSPKISVPPVL 254

Query: 192 AAPPPPPPKAAPAPSPTPVPSQKTS 216
                  P        T   S+ TS
Sbjct: 255 KLAAEQAPAGRVEREKTARSSESTS 279



 Score = 28.2 bits (63), Expect = 9.5
 Identities = 8/54 (14%), Positives = 13/54 (24%), Gaps = 2/54 (3%)

Query: 182 APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
             A   P+  A       + +P  S  PV                 ++ A    
Sbjct: 225 ETAVAAPQEKAEETTEVVEVSPKISVPPVLKLAAEQAPA--GRVEREKTARSSE 276


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 40.2 bits (93), Expect = 0.002
 Identities = 15/38 (39%), Positives = 17/38 (44%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           P   GA+AP P P    AA PP  P  A      P+P 
Sbjct: 100 PGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPD 137



 Score = 32.1 bits (72), Expect = 0.60
 Identities = 14/34 (41%), Positives = 14/34 (41%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           AA   APAP P     A P P      AP P  V
Sbjct: 106 AAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAV 139



 Score = 31.7 bits (71), Expect = 0.70
 Identities = 17/37 (45%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP-SPTP 209
           AP  A A APA  PP  AA  P     AAP P +  P
Sbjct: 107 APVPAPAPAPAAAPP--AAPAPAADTPAAPIPDAVQP 141



 Score = 31.7 bits (71), Expect = 0.76
 Identities = 13/35 (37%), Positives = 14/35 (40%), Gaps = 3/35 (8%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
             AP  A A   AP P   AA  P  P   A  P+
Sbjct: 111 APAPAPAAAPPAAPAP---AADTPAAPIPDAVQPA 142



 Score = 31.3 bits (70), Expect = 1.1
 Identities = 18/50 (36%), Positives = 20/50 (40%), Gaps = 6/50 (12%)

Query: 172 DDAPPAAGASAPAPPPPKV-----AAAP-PPPPPKAAPAPSPTPVPSQKT 215
           D  P      A  P  P +     AAAP P P P  A AP   P P+  T
Sbjct: 81  DLGPQVGSFLAQLPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADT 130


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 130

 Score = 38.3 bits (89), Expect = 0.002
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
           GA A A  P + A+ P P P +A+P       P    + G     SP+
Sbjct: 22  GAPAQAAAPAQPASTPVPVPTEASPQVEAQA-PQPAAAAGADAMPSPM 68



 Score = 34.8 bits (80), Expect = 0.023
 Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 6/42 (14%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPK-AAPA-----PSPTP 209
           A PA  AS P P P + +       P+ AA A     PSP P
Sbjct: 28  AAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69



 Score = 31.3 bits (71), Expect = 0.36
 Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 193 APPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           AP      A PA +P PVP++ +         P A
Sbjct: 23  APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAA 57


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 40.2 bits (94), Expect = 0.002
 Identities = 16/70 (22%), Positives = 26/70 (37%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
             P    + P P PP+  A P P  P++AP  +   +P  +    T        ++    
Sbjct: 400 KEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIA 459

Query: 234 KGLDLSSIGA 243
            G  L  + A
Sbjct: 460 DGDVLEQLEA 469



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 16/58 (27%), Positives = 18/58 (31%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
            P A A+A  PP   V     PPP    P   P P   +     TR       K    
Sbjct: 388 RPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYT 445



 Score = 37.5 bits (87), Expect = 0.013
 Identities = 19/60 (31%), Positives = 22/60 (36%), Gaps = 4/60 (6%)

Query: 171 KDDAPPAAGASAPAPPP--PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT--RVYASPL 226
              AP  A  +A AP P  P  A +  P    AA  P   PV    T      R  A P+
Sbjct: 362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV 421



 Score = 32.9 bits (75), Expect = 0.35
 Identities = 16/56 (28%), Positives = 17/56 (30%), Gaps = 3/56 (5%)

Query: 175 PPAAGASAPA-PPPPKVAAAPPPPPPKAAPA--PSPTPVPSQKTSGGTRVYASPLA 227
            P     AP+  P    AA  PP  P    A  P   P P       T   A  L 
Sbjct: 377 SPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLT 432



 Score = 32.9 bits (75), Expect = 0.37
 Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 5/51 (9%)

Query: 174 APPAAGASAP-----APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
            P  A ++ P     A  PPK        PP   P P   PVP    S   
Sbjct: 380 RPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPK 430


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 14/42 (33%), Positives = 16/42 (38%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           PP   A  PA PP    A   P         SP P  + K+S
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS 196



 Score = 38.7 bits (90), Expect = 0.003
 Identities = 14/39 (35%), Positives = 18/39 (46%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
            PPA       PPP  +  A PP    A  AP+ +P P+
Sbjct: 145 PPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPT 183



 Score = 36.0 bits (83), Expect = 0.026
 Identities = 15/40 (37%), Positives = 17/40 (42%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
            A P A   AP+ P       P  PPP  AP  S  P+ S
Sbjct: 163 PASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKS 202



 Score = 31.7 bits (72), Expect = 0.62
 Identities = 14/51 (27%), Positives = 16/51 (31%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
            PPP  V    PPPP    PA  P   P+      +            A K
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPK 194



 Score = 28.3 bits (63), Expect = 6.7
 Identities = 16/46 (34%), Positives = 18/46 (39%), Gaps = 3/46 (6%)

Query: 183 PAPPPPKVAAAPPPPP---PKAAPAPSPTPVPSQKTSGGTRVYASP 225
           P PP P V   PPPP    P + PA  P P     +   T     P
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPP 189


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 39.8 bits (93), Expect = 0.003
 Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           P AA A++P+P     AA P  P     PA +P  V     +
Sbjct: 392 PSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPA 433



 Score = 38.2 bits (89), Expect = 0.007
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           AP  + A+A +P P + +AA  P  P++A  P+ TP  
Sbjct: 389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426



 Score = 35.5 bits (82), Expect = 0.059
 Identities = 10/42 (23%), Positives = 15/42 (35%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           +P  + A+A    P         PP  +   P+  PV    T
Sbjct: 401 SPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPST 442



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 19/67 (28%), Positives = 21/67 (31%), Gaps = 5/67 (7%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP-----TPVPSQKTSGG 218
           +A      DDA    G      P     AA P P   AA +PSP        PS   S  
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSAT 418

Query: 219 TRVYASP 225
                 P
Sbjct: 419 QPAGTPP 425



 Score = 33.6 bits (77), Expect = 0.20
 Identities = 15/45 (33%), Positives = 17/45 (37%), Gaps = 3/45 (6%)

Query: 174 APPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           AP +A   A  PP       AA P  PP  A  A  P     +K 
Sbjct: 413 APQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKK 457


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 39.2 bits (91), Expect = 0.003
 Identities = 16/61 (26%), Positives = 19/61 (31%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
           PP    S    PPP  +   P PP    P   P P P +  + G            A   
Sbjct: 255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRD 314

Query: 235 G 235
           G
Sbjct: 315 G 315



 Score = 30.7 bits (69), Expect = 1.4
 Identities = 20/71 (28%), Positives = 24/71 (33%), Gaps = 8/71 (11%)

Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPP------PKVAAAPP 195
           P   +D   G+      E E    A +D      P  G   PAP        P + A   
Sbjct: 289 PGPVEDGAPGRETG--GEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITF 346

Query: 196 PPPPKAAPAPS 206
           PPP  A PA  
Sbjct: 347 PPPTPATPAVP 357


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 39.4 bits (92), Expect = 0.003
 Identities = 20/113 (17%), Positives = 30/113 (26%), Gaps = 14/113 (12%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
                           A A   P    AAA        AP  +P P  +           
Sbjct: 390 AVGASAVPAVTAVTGAAGAALAPK---AAAAAAATRAEAPPAAPAPPATADRGDDAADGD 446

Query: 224 SPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
           +P+  +  A    D                 D    + +G+ +AP+  A P A
Sbjct: 447 APVPAKANARASADSRCD-----------ERDAQPPADSGSASAPASDAPPDA 488



 Score = 36.4 bits (84), Expect = 0.028
 Identities = 20/104 (19%), Positives = 28/104 (26%), Gaps = 5/104 (4%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           AP AA A+A        AA  PP         +    P        R  A        A+
Sbjct: 411 APKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPA-KANARASADSRCDERDAQ 469

Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
              D  S  A +         D +      A A  + +     +
Sbjct: 470 PPADSGSASAPAS----DAPPDAAFEPAPRAAAPSAATPAAVPD 509



 Score = 35.2 bits (81), Expect = 0.072
 Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 3/96 (3%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
            D+      + P       +A     PP AA  P+P        +      A   A   A
Sbjct: 459 ADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA---A 515

Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAA 267
           A +    ++    +      T A  + A++AG  AA
Sbjct: 516 ASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551



 Score = 34.8 bits (80), Expect = 0.092
 Identities = 22/101 (21%), Positives = 34/101 (33%), Gaps = 4/101 (3%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
              G +     P +VA A P P  +AA A   + VP+     G    A+  A    A   
Sbjct: 362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTG----AAGAALAPKAAAA 417

Query: 236 LDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
              +   A         +AD    +  G    P+K+    +
Sbjct: 418 AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARAS 458



 Score = 34.8 bits (80), Expect = 0.11
 Identities = 18/107 (16%), Positives = 27/107 (25%), Gaps = 3/107 (2%)

Query: 173 DAPPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
            APPA             P  A A       +         P+   S       +P    
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489

Query: 230 LAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTA 276
                     S    + +  +   A  S+     A A P+  A+P  
Sbjct: 490 FEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT 536



 Score = 33.7 bits (77), Expect = 0.20
 Identities = 15/61 (24%), Positives = 18/61 (29%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
              E    A             A APA    + A A   PP   A  P+P        +G
Sbjct: 488 AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547

Query: 218 G 218
           G
Sbjct: 548 G 548



 Score = 32.1 bits (73), Expect = 0.72
 Identities = 15/58 (25%), Positives = 18/58 (31%), Gaps = 4/58 (6%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
           D    AA A+ PA  P     A P P         PTP     T       A+   + 
Sbjct: 566 DRGARAAAAAKPAAAPA----AAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQA 619



 Score = 31.0 bits (70), Expect = 1.6
 Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 3/54 (5%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
           GA A A   P   AA P   PK A       VP+ +    T       A R   
Sbjct: 568 GARAAAAAKP---AAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ 618


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 36.3 bits (83), Expect = 0.003
 Identities = 25/75 (33%), Positives = 30/75 (40%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGA 243
            P P   A  PPPPP   AP P PT  P    +   R   S   ++     G     I  
Sbjct: 12  PPAPLPPAPVPPPPPAPPAPVPEPTVKPVNAEAPKIREAQSKRGQKQQKAGGTASLRIPK 71

Query: 244 GSGLFGSITSADLSK 258
            SG  GSI++   SK
Sbjct: 72  QSGTQGSISAGGTSK 86


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 20/44 (45%), Positives = 21/44 (47%), Gaps = 5/44 (11%)

Query: 175 PPAAGASAPAPP--PPKVAAAPPPPPPKAAPA---PSPTPVPSQ 213
             A GA  PA P  PP  AA P PP   A PA   P+P P P  
Sbjct: 386 SAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPA 429



 Score = 38.6 bits (90), Expect = 0.005
 Identities = 20/51 (39%), Positives = 23/51 (45%), Gaps = 4/51 (7%)

Query: 174 APPAAGASAPAPPPPKVAA----APPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           APP A A    P  P   A    AP P PP AA  P+ +  P+   S G R
Sbjct: 399 APPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAGDR 449



 Score = 35.5 bits (82), Expect = 0.047
 Identities = 17/46 (36%), Positives = 20/46 (43%), Gaps = 3/46 (6%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPP---PPKAAPAPSPTPVPSQKTS 216
            P A  A  PA  PP   AAP  P    P  APAP     P  +++
Sbjct: 393 TPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSA 438



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 12/33 (36%), Positives = 13/33 (39%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           AP PP  A   P P   AAP P+  P       
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAP 413



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 11/51 (21%), Positives = 13/51 (25%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
             GA AP            P  P  A AP   P    + +      A    
Sbjct: 378 ERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPP 428


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 39.1 bits (91), Expect = 0.004
 Identities = 13/50 (26%), Positives = 17/50 (34%), Gaps = 1/50 (2%)

Query: 172 DDAPPAAGASAPAP-PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           +   PA    +PA  P P   A    P  KAA   +  P   +      R
Sbjct: 13  EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPR 62



 Score = 34.1 bits (78), Expect = 0.13
 Identities = 14/50 (28%), Positives = 15/50 (30%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           AA       A  AA A+  A P      A  P   K      P P    K
Sbjct: 25  AAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWK 74



 Score = 32.2 bits (73), Expect = 0.52
 Identities = 12/37 (32%), Positives = 14/37 (37%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           G +    P P   AA P PPP A  A   T   +   
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAA 48



 Score = 32.2 bits (73), Expect = 0.54
 Identities = 16/35 (45%), Positives = 17/35 (48%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
             A  PAP PP  AAAP PPPP    AP+      
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAP 46



 Score = 28.7 bits (64), Expect = 6.0
 Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 1/50 (2%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           + +   A       A PA  A+APA   P+ A  P    P+    P P  
Sbjct: 23  SPAAAPAPPPPAKTAAPATKAAAPAAAAPR-AEKPKKDKPRRERKPKPAS 71



 Score = 28.7 bits (64), Expect = 7.0
 Identities = 14/56 (25%), Positives = 19/56 (33%), Gaps = 5/56 (8%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
           G      P P      PP P  A   P P    +  T       A+P A++   +K
Sbjct: 10  GKGEAEQPAP-----APPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDK 60


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 20/64 (31%), Positives = 24/64 (37%), Gaps = 1/64 (1%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPP-PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
           D A P+   +  APP   V   P P  PPK  P   P P P  +          P  K +
Sbjct: 79  DAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPV 138

Query: 231 AAEK 234
             EK
Sbjct: 139 VEEK 142



 Score = 33.4 bits (77), Expect = 0.14
 Identities = 11/42 (26%), Positives = 12/42 (28%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           AP       P   P              AP P P PV  +K 
Sbjct: 102 APVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143



 Score = 32.3 bits (74), Expect = 0.31
 Identities = 20/59 (33%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 174 APPAAGASAPAPPPPKVAAAPPP----PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
           A  A  A+AP+  P  VA    P    P P   P P P   P  K     +V A P  K
Sbjct: 74  AVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPK 132



 Score = 30.7 bits (70), Expect = 1.2
 Identities = 10/29 (34%), Positives = 12/29 (41%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P P P +   APP P P+  P       P
Sbjct: 117 PKPKPQQKVEAPPAPKPEPKPVVEEKAAP 145


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 155

 Score = 36.7 bits (86), Expect = 0.007
 Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 10/63 (15%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG--------GTRVY 222
           +  A P A  +  A   P  AA        AAPA +P    + +  G        GT  Y
Sbjct: 34  RAAAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAA-AAAEAEGHVVTSPMVGT-FY 91

Query: 223 ASP 225
            +P
Sbjct: 92  RAP 94


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 38.0 bits (88), Expect = 0.007
 Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 6/51 (11%)

Query: 174 APPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGG 218
           + P    S PA P   ++AAP        P  ++A A +P    +   S G
Sbjct: 93  SAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSG 143



 Score = 37.6 bits (87), Expect = 0.009
 Identities = 14/46 (30%), Positives = 18/46 (39%), Gaps = 3/46 (6%)

Query: 174 APPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
            P A   SAP P   PP   A+     P  A    SP+   +  T+
Sbjct: 86  PPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATA 131



 Score = 36.0 bits (83), Expect = 0.034
 Identities = 17/62 (27%), Positives = 22/62 (35%), Gaps = 1/62 (1%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDL 238
           G  AP    P  A  P P PP A+PA   +  P+      +    S  A    +      
Sbjct: 82  GKVAPPAATPTSAPTPTPSPP-ASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVP 140

Query: 239 SS 240
           SS
Sbjct: 141 SS 142



 Score = 31.8 bits (72), Expect = 0.75
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 189 KVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
               APP   P +AP P+P+P  S  + 
Sbjct: 81  TGKVAPPAATPTSAPTPTPSPPASPASG 108


>gnl|CDD|234994 PRK01973, PRK01973, septum formation inhibitor; Reviewed.
          Length = 271

 Score = 37.4 bits (87), Expect = 0.009
 Identities = 26/68 (38%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP---PKAAPAPSPTPVPSQKT 215
              +D AA    +  AP AA A+  A   P  AAAP PPP   P+A  A S T V  +  
Sbjct: 111 AKAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVIDRPL 170

Query: 216 SGGTRVYA 223
             G ++YA
Sbjct: 171 RSGQQIYA 178



 Score = 37.0 bits (86), Expect = 0.014
 Identities = 20/59 (33%), Positives = 22/59 (37%), Gaps = 3/59 (5%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP---SQKTSGGTRVYASPL 226
           K     AA A+  A P    AA      P AA AP P P P   +      T V   PL
Sbjct: 112 KAADEAAAAAAEAAAPAAAAAAEAAAAAPAAAAAPEPPPAPAPEAVAAQSQTLVIDRPL 170


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 37.6 bits (88), Expect = 0.011
 Identities = 15/43 (34%), Positives = 17/43 (39%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
             P      P+ P  K AA  P PPP  A  P P P    + S
Sbjct: 383 PSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPS 425



 Score = 34.6 bits (80), Expect = 0.12
 Identities = 14/42 (33%), Positives = 18/42 (42%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
            P+   +   P  PK   A   P P  A A  P PVP++ T 
Sbjct: 382 NPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423



 Score = 33.4 bits (77), Expect = 0.21
 Identities = 13/52 (25%), Positives = 17/52 (32%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           +  A     P   PP  A  PP P P A   P   P P    +  +    + 
Sbjct: 518 SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569



 Score = 32.6 bits (75), Expect = 0.42
 Identities = 7/42 (16%), Positives = 9/42 (21%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
            P          P           P   PA +  P+P     
Sbjct: 380 APNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEP 421



 Score = 32.6 bits (75), Expect = 0.46
 Identities = 8/43 (18%), Positives = 12/43 (27%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
             P+A  +  A   P    A   PP      P+         +
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432



 Score = 32.2 bits (74), Expect = 0.52
 Identities = 24/135 (17%), Positives = 39/135 (28%), Gaps = 20/135 (14%)

Query: 86  ALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIE-----TDKATMGFETPEEGYLAKIL 140
            L  L+  + +  I+ W +           L   E     + +  +  E    G L    
Sbjct: 315 ELCKLAKQINLERILQWQQH----------LKGSEYQLKNSTQPRLWLEVTLLGLLPSAF 364

Query: 141 VPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK 200
           +   +              +             + P    +A  P PP   A+PP P P 
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPI-----QPSAPKTKQAATTPSPPPAKASPPIPVPA 419

Query: 201 AAPAPSPTPVPSQKT 215
               PSPTP  +   
Sbjct: 420 EPTEPSPTPPANAAN 434



 Score = 32.2 bits (74), Expect = 0.54
 Identities = 16/46 (34%), Positives = 17/46 (36%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           A   A    P    P   A  PP P   A AP PTP P   T+   
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQA 562



 Score = 32.2 bits (74), Expect = 0.58
 Identities = 15/35 (42%), Positives = 17/35 (48%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
           +PP A AS P P P +     P PP  AA AP   
Sbjct: 405 SPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439



 Score = 32.2 bits (74), Expect = 0.59
 Identities = 15/44 (34%), Positives = 19/44 (43%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
              PP A  ++   P    A + PPPP    P PSPT   S + 
Sbjct: 553 PPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596



 Score = 31.1 bits (71), Expect = 1.4
 Identities = 11/37 (29%), Positives = 14/37 (37%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           G+++     P      PPPP    P P PT      T
Sbjct: 515 GSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPT 551



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 11/39 (28%), Positives = 13/39 (33%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           A   +   A A PP  V A P  P P      +  P   
Sbjct: 401 ATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 11/37 (29%), Positives = 17/37 (45%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           A+ P+PPP K +   P P     P+P+P    +    
Sbjct: 401 ATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPP 437



 Score = 29.9 bits (68), Expect = 3.0
 Identities = 13/44 (29%), Positives = 14/44 (31%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
               A     PAP P    A   P  PK   A +    P  K S
Sbjct: 369 ANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAS 412



 Score = 29.5 bits (67), Expect = 4.0
 Identities = 16/42 (38%), Positives = 16/42 (38%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           PPA     P P        PPPPPP A  A S  P      S
Sbjct: 533 PPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADS 574



 Score = 29.2 bits (66), Expect = 5.6
 Identities = 7/42 (16%), Positives = 10/42 (23%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
                  +  PA   P +     P  P   P  +    P   
Sbjct: 398 KQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSL 439



 Score = 28.4 bits (64), Expect = 9.7
 Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 8/63 (12%)

Query: 180 ASAPAPPPPKVA--------AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
           AS  A  PP             P P P   AP P+P P P   T   +   A   A    
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSP 576

Query: 232 AEK 234
              
Sbjct: 577 PPP 579


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 767

 Score = 37.9 bits (88), Expect = 0.011
 Identities = 24/96 (25%), Positives = 36/96 (37%), Gaps = 7/96 (7%)

Query: 156 IIVENES-----DVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           +++ N S     ++ A  D    A  A+  +   P P  + AA   P P AA   +P  V
Sbjct: 27  LVLSNRSVAEGVEIVAMLDEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAV 86

Query: 211 PSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSG 246
           P+    G  R      + R   E+ L      AG  
Sbjct: 87  PAPGAIGDLR--GELQSMRGMLERQLAGLLWAAGEV 120


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 36.0 bits (83), Expect = 0.012
 Identities = 17/40 (42%), Positives = 17/40 (42%), Gaps = 3/40 (7%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
              P     S PA PP   A A   P P AAP PS  P P
Sbjct: 101 PAVPTPPPTSTPAVPP---APAAAVPAPAAAPPPSDPPQP 137



 Score = 35.2 bits (81), Expect = 0.020
 Identities = 16/43 (37%), Positives = 18/43 (41%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
                + A A   PPP    A PP P  A PAP+  P PS   
Sbjct: 93  PPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135



 Score = 30.2 bits (68), Expect = 0.97
 Identities = 17/54 (31%), Positives = 20/54 (37%), Gaps = 3/54 (5%)

Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP---TPVPSQKTS 216
           A K      PPA       P     A A P PPP + PA  P     VP+   +
Sbjct: 75  ARKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAA 128



 Score = 28.6 bits (64), Expect = 3.7
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKA 201
           A P A A+A   P      + PP PP+A
Sbjct: 113 AVPPAPAAAVPAPAAAPPPSDPPQPPRA 140



 Score = 28.3 bits (63), Expect = 5.0
 Identities = 11/32 (34%), Positives = 12/32 (37%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
            P +  A  PAP     A A  PPP      P
Sbjct: 107 PPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPP 138



 Score = 27.9 bits (62), Expect = 5.8
 Identities = 14/45 (31%), Positives = 16/45 (35%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           P  PP  V       P  A P P PT  P+   +    V A   A
Sbjct: 84  PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAA 128



 Score = 27.5 bits (61), Expect = 9.1
 Identities = 13/31 (41%), Positives = 13/31 (41%), Gaps = 1/31 (3%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPP-PKAAP 203
            P    A A A P P  A  P  PP P  AP
Sbjct: 111 TPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 36.0 bits (83), Expect = 0.013
 Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           D +        PAP P     + P P   AAPAP+P   
Sbjct: 44  DLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASA 82



 Score = 34.5 bits (79), Expect = 0.042
 Identities = 13/36 (36%), Positives = 14/36 (38%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            A     P P P    A P  P P A  AP+P P  
Sbjct: 46  SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPAS 81



 Score = 29.8 bits (67), Expect = 1.5
 Identities = 9/37 (24%), Positives = 12/37 (32%), Gaps = 1/37 (2%)

Query: 177 AAGASAPAPPPPKVAAAP-PPPPPKAAPAPSPTPVPS 212
           A G         +    P P P P    AP+P    +
Sbjct: 38  AKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAA 74


>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
          Length = 509

 Score = 37.5 bits (88), Expect = 0.013
 Identities = 9/31 (29%), Positives = 11/31 (35%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           PPP +V+AAP       A         S   
Sbjct: 370 PPPIQVSAAPAAAAAAPAAKEEEKKPASPWR 400



 Score = 31.3 bits (72), Expect = 0.96
 Identities = 9/29 (31%), Positives = 11/29 (37%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           PPPP   +A P     A  A      P+ 
Sbjct: 369 PPPPIQVSAAPAAAAAAPAAKEEEKKPAS 397



 Score = 30.6 bits (70), Expect = 2.0
 Identities = 10/33 (30%), Positives = 13/33 (39%), Gaps = 4/33 (12%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
            PP   ++APA       AAP     +  PA  
Sbjct: 370 PPPIQVSAAPAAAA----AAPAAKEEEKKPASP 398


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 35.6 bits (82), Expect = 0.013
 Identities = 15/42 (35%), Positives = 16/42 (38%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
              PP +    P  P P    A  PPPP A P P   P P  
Sbjct: 77  FPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPG 118



 Score = 28.7 bits (64), Expect = 2.9
 Identities = 12/37 (32%), Positives = 12/37 (32%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
             P       PA  PP   A PPP      P P   P
Sbjct: 87  PPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDP 123


>gnl|CDD|223585 COG0511, AccB, Biotin carboxyl carrier protein [Lipid metabolism].
          Length = 140

 Score = 35.4 bits (82), Expect = 0.016
 Identities = 17/49 (34%), Positives = 23/49 (46%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGS 145
           GT+     + GD +  G  LA IE  K     E P +G + +ILV  G 
Sbjct: 79  GTVYKPFVEVGDTVKAGQTLAIIEAMKMENEIEAPADGVVKEILVKNGD 127


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 36.8 bits (85), Expect = 0.021
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           PA          P  +AA P P  + AP P P   P+
Sbjct: 388 PANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPT 424



 Score = 35.2 bits (81), Expect = 0.073
 Identities = 25/89 (28%), Positives = 37/89 (41%)

Query: 133 EGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAA 192
           E   A++L+P+ S            +E   D++   +   +AP AA A + A P PK   
Sbjct: 354 EVVCARLLLPSASDAESALLQRVERIETRLDMSIPANLLHNAPQAAPAPSAAAPEPKHQP 413

Query: 193 APPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           AP P P  A    S  P  +   S  + V
Sbjct: 414 APEPRPVLAPTPASGEPNAAAVRSMWSTV 442



 Score = 29.5 bits (66), Expect = 3.7
 Identities = 26/133 (19%), Positives = 39/133 (29%), Gaps = 27/133 (20%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY----------- 222
           A P         P P +A  P    P AA   S       K    +R             
Sbjct: 405 AAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDKVRQRSRTTEVMLAGATVRA 464

Query: 223 -----------ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
                      ++PLA+RL+ ++  D+ +      L       +     + G  AA +  
Sbjct: 465 LEGNTLVLTHESAPLARRLSEQRNADVLA----EALK-DALGVNWRVRCETGKPAAAASP 519

Query: 272 AKPTANGPFTDLP 284
           A   AN P     
Sbjct: 520 AGGGANAPPAKPV 532


>gnl|CDD|223968 COG1038, PycA, Pyruvate carboxylase [Energy production and
            conversion].
          Length = 1149

 Score = 36.9 bits (86), Expect = 0.022
 Identities = 18/50 (36%), Positives = 28/50 (56%)

Query: 97   GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
            G +V    K+GDK+ +GD+LA IE  K       P +G + ++LV  G +
Sbjct: 1088 GVVVEVKVKKGDKVKKGDVLAVIEAMKMETTISAPFDGTVKEVLVKDGDQ 1137


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 36.8 bits (85), Expect = 0.022
 Identities = 20/79 (25%), Positives = 26/79 (32%), Gaps = 12/79 (15%)

Query: 176 PAAGASAPAP-----PPPKVAAAPPPPPPKAAPAPSPTPV------PSQKTSGGTRVYAS 224
           P  G  A AP       P   A PP PP    P  +  P       P    +    +  +
Sbjct: 897 PGTGRVATAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAELRET 956

Query: 225 PLAKRLAAEKGLDLSSIGA 243
               R A   G D ++ GA
Sbjct: 957 LADLRSAQRSG-DFTAYGA 974



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 2/38 (5%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
            P  GA  PAPP       P    P AAP   P   P+
Sbjct: 913 APPPGAGPPAPPQA--VPPPRTTQPPAAPPRGPDVPPA 948


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 36.6 bits (85), Expect = 0.022
 Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
                  + +    P   AAAP PPP   A  P P     
Sbjct: 281 RMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 7/61 (11%)

Query: 156 IIVENESDVAAFKDFKDDAPPAAGASAPAP-------PPPKVAAAPPPPPPKAAPAPSPT 208
           I +    D    + F   A  AA  +  AP            +    P    AAPAP P 
Sbjct: 248 ITLTGAGDERIVRAFLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPN 307

Query: 209 P 209
           P
Sbjct: 308 P 308



 Score = 30.5 bits (69), Expect = 1.9
 Identities = 10/32 (31%), Positives = 11/32 (34%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
            P AA A+   PP P      PP       A 
Sbjct: 294 QPAAAAAAPAPPPNPPATPPEPPARRGRGSAA 325



 Score = 29.7 bits (67), Expect = 4.0
 Identities = 14/43 (32%), Positives = 17/43 (39%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
             S +    D K      A A+A   PPP   A PP PP +  
Sbjct: 278 APSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 12/35 (34%), Positives = 13/35 (37%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
                  A+A AP PP    A PP PP      S 
Sbjct: 290 SVTSQPAAAAAAPAPPPNPPATPPEPPARRGRGSA 324



 Score = 28.9 bits (65), Expect = 6.6
 Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)

Query: 164 VAAFKDFKDD-APPAAGASAPAPPP---------PKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           V AF  + D  A PA  A AP+             + AAA   P P   P  +P   P++
Sbjct: 259 VRAFLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPAR 318

Query: 214 KTSGGTRVYASPLAKRLAAEKG 235
           +  G   ++   +    AA  G
Sbjct: 319 RGRGSAALWFVVVVLACAAAVG 340


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 35.9 bits (83), Expect = 0.022
 Identities = 11/43 (25%), Positives = 14/43 (32%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
             P  A    P P   +V     P PP       P P P ++ 
Sbjct: 123 ADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEP 165


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 34.7 bits (80), Expect = 0.023
 Identities = 18/58 (31%), Positives = 23/58 (39%), Gaps = 1/58 (1%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
            D PP   A    PPP +  AAPPP P    P P+  P P    +   +    P   +
Sbjct: 51  VDTPPPVSAVWVLPPPSE-PAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107



 Score = 33.9 bits (78), Expect = 0.038
 Identities = 16/53 (30%), Positives = 19/53 (35%), Gaps = 11/53 (20%)

Query: 171 KDDAPPAAGASAPAPP-----------PPKVAAAPPPPPPKAAPAPSPTPVPS 212
              +PP +      PP           PP V+A    PPP    AP P P P 
Sbjct: 27  SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPP 79


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 36.5 bits (84), Expect = 0.030
 Identities = 21/82 (25%), Positives = 27/82 (32%), Gaps = 5/82 (6%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP-----PKAAPAPSPTPVPSQKTSGG 218
            A   D K  AP          P P    AP   P     P +AP P   P  ++     
Sbjct: 384 PAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLP 443

Query: 219 TRVYASPLAKRLAAEKGLDLSS 240
               AS      + + G+DL S
Sbjct: 444 PSPQASAPRNVASGKPGVDLGS 465



 Score = 33.0 bits (75), Expect = 0.30
 Identities = 22/112 (19%), Positives = 33/112 (29%), Gaps = 8/112 (7%)

Query: 174 APPAAGA----SAPAPPPPKVAAAPPPPPPKAA---PAPSPTPVPSQKTSGGTRVYASPL 226
             PA        AP P  P+    P P  P+A    PA  P+P  +        V  S  
Sbjct: 382 PSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAP 441

Query: 227 AKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANG 278
                        + G      GS         ++ G+   P +++   A  
Sbjct: 442 LPPSPQASAPRNVASGKPGVDLGSWQGK-FMNFTRNGSRKQPVQASSSDAAQ 492


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
           family of proteins is found in bacteria, eukaryotes and
           viruses. Proteins in this family are typically between
           126 and 281 amino acids in length. The function of this
           domain is unknown. The structure of this domain has been
           found to contain five helices with a long flexible loop
           between helices one and two.
          Length = 124

 Score = 34.2 bits (79), Expect = 0.035
 Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 7/60 (11%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDL 238
           G +A A P P  A A  P    AA AP+ TP  +        V  + +   LAA+KG  L
Sbjct: 11  GEAAAAAPAPAAAPATAPAAA-AAAAPAATPPAAAS------VDVAAVLDALAAKKGQKL 63



 Score = 32.7 bits (75), Expect = 0.12
 Identities = 9/32 (28%), Positives = 10/32 (31%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
            A A+APAP      A         A  P   
Sbjct: 12  EAAAAAPAPAAAPATAPAAAAAAAPAATPPAA 43



 Score = 32.3 bits (74), Expect = 0.17
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
           +  A   A A+APA  P   AAA P   P AA +
Sbjct: 12  EAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45



 Score = 31.9 bits (73), Expect = 0.18
 Identities = 14/38 (36%), Positives = 16/38 (42%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
           A F +    AP  A A A AP     AA    PP  A+
Sbjct: 8   AIFGEAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45



 Score = 30.4 bits (69), Expect = 0.66
 Identities = 11/36 (30%), Positives = 13/36 (36%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
           F + A  A   +A     P  AAA  P     A A 
Sbjct: 10  FGEAAAAAPAPAAAPATAPAAAAAAAPAATPPAAAS 45


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 35.8 bits (82), Expect = 0.035
 Identities = 24/107 (22%), Positives = 38/107 (35%), Gaps = 1/107 (0%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
            +A  P      ++   P   A P+P+  P      S  T   AS +A   A     D++
Sbjct: 301 VAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATT-TQASAVALSSAGVLPSDVT 359

Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVS 286
             G  +       +      S      + + +   TANGP T LP +
Sbjct: 360 LPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAA 406


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
           represents proteins encoded by genes which are always
           found in type III secretion operons, although their
           function in the processes of secretion and virulence is
           unclear. Hpa stands for Hrp-associated gene, where Hrp
           stands for hypersensitivity response and virulence. see
           also PMID:18584024.
          Length = 185

 Score = 35.1 bits (81), Expect = 0.036
 Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 1/39 (2%)

Query: 172 DDAPPAAGASAPAPP-PPKVAAAPPPPPPKAAPAPSPTP 209
             A  A     PAPP P + A    P  P+ APAP   P
Sbjct: 21  RRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59



 Score = 32.0 bits (73), Expect = 0.30
 Identities = 13/38 (34%), Positives = 15/38 (39%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
             PA  A    P  P+ A APP PP        P P+ 
Sbjct: 35  PGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 13/40 (32%), Positives = 14/40 (35%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           APP     A    P +   AP PP P  A      P P  
Sbjct: 33  APPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72



 Score = 29.0 bits (65), Expect = 3.4
 Identities = 11/39 (28%), Positives = 14/39 (35%), Gaps = 6/39 (15%)

Query: 180 ASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPS 212
            +A A PP   A   P        P +   AP+P   P 
Sbjct: 22  RAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPR 60


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 35.0 bits (81), Expect = 0.044
 Identities = 12/59 (20%), Positives = 18/59 (30%), Gaps = 7/59 (11%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           E+D+ A  +          A+AP      V +          P P P P    +   G 
Sbjct: 107 EADLLALME-------EEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKSGM 158


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 36.0 bits (83), Expect = 0.048
 Identities = 21/76 (27%), Positives = 24/76 (31%), Gaps = 9/76 (11%)

Query: 163 DVAAFKDFKDDAPPAAGASAPAPPPP-------KVAAAPP--PPPPKAAPAPSPTPVPSQ 213
           + A        A   A    P P PP         AA  P   P P AAPAP+    PS 
Sbjct: 58  EAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVEPSL 117

Query: 214 KTSGGTRVYASPLAKR 229
             +         L  R
Sbjct: 118 AANPFAAAIEWLLGGR 133



 Score = 30.2 bits (68), Expect = 2.5
 Identities = 21/76 (27%), Positives = 24/76 (31%), Gaps = 13/76 (17%)

Query: 173 DAPPAAGASAPAPPPPKVA-----AAPPPPPPKA--------APAPSPTPVPSQKTSGGT 219
           +   A  A+  AP P           PPP PP          A  PS  P PS   +   
Sbjct: 52  EQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAE 111

Query: 220 RVYASPLAKRLAAEKG 235
            V  S  A   AA   
Sbjct: 112 PVEPSLAANPFAAAIE 127


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 35.6 bits (83), Expect = 0.048
 Identities = 17/70 (24%), Positives = 23/70 (32%), Gaps = 5/70 (7%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
            PP  G +  AP       A    P    PA +   VP           A+P A   A +
Sbjct: 400 GPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVP-----DAAAAAAAPPAPAAAPQ 454

Query: 234 KGLDLSSIGA 243
             + L+S   
Sbjct: 455 PAVRLNSFED 464



 Score = 29.9 bits (68), Expect = 3.1
 Identities = 13/62 (20%), Positives = 15/62 (24%), Gaps = 7/62 (11%)

Query: 174 APPAAGASAPAPPPPK-------VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
                     AP  P         AAAP    P AA A  P    +            P 
Sbjct: 397 GGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPA 456

Query: 227 AK 228
            +
Sbjct: 457 VR 458


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 35.9 bits (83), Expect = 0.051
 Identities = 19/94 (20%), Positives = 28/94 (29%), Gaps = 5/94 (5%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
           P  + + +P+P  P    A P  P K          P+      TR      A+   A +
Sbjct: 343 PGPSPSRSPSPSRPP-PPADPSSPRKRPRPSRAPSSPAASAGRPTR----RRARAAVAGR 397

Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAP 268
                + G         +  D   AS A     P
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARYP 431



 Score = 34.4 bits (79), Expect = 0.14
 Identities = 22/116 (18%), Positives = 36/116 (31%), Gaps = 9/116 (7%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
             +  + GA+    P P  + +P  PPP A P+         +        A    +R A
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA 390

Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSG 287
                  +     +G F        +   +   + A + S    A  P   L  SG
Sbjct: 391 RAAVAGRARRRDATGRFP-------AGRPRPSPLDAGAASGAFYARYP--LLTPSG 437



 Score = 33.6 bits (77), Expect = 0.22
 Identities = 14/60 (23%), Positives = 19/60 (31%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
             PP       +PPP           P  +P P P   P    +    V +   + R AA
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAA 173



 Score = 33.2 bits (76), Expect = 0.27
 Identities = 19/122 (15%), Positives = 29/122 (23%), Gaps = 1/122 (0%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           A PAA AS  A           P     AP+  P   P                    + 
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRR-SSPISA 215

Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
                +     S    +  S+  S +S++       ++  P        LP         
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275

Query: 294 KR 295
             
Sbjct: 276 NG 277



 Score = 33.2 bits (76), Expect = 0.28
 Identities = 26/125 (20%), Positives = 41/125 (32%), Gaps = 15/125 (12%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           + S +A     ++ +P   G S+P PPPP    A PPP P    +    PV S       
Sbjct: 91  SLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA 150

Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
              A+  +    A                 + +S   +    +    A + S+ P    P
Sbjct: 151 SPPAAGASPAAVAS---------------DAASSRQAALPLSSPEETARAPSSPPAEPPP 195

Query: 280 FTDLP 284
            T   
Sbjct: 196 STPPA 200



 Score = 32.5 bits (74), Expect = 0.56
 Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 2/65 (3%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPP--PKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
            AP +  A  P   PP  A+  PP    P +A A SP P P +  +      +S  +   
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE 243

Query: 231 AAEKG 235
           ++  G
Sbjct: 244 SSGCG 248



 Score = 31.7 bits (72), Expect = 0.91
 Identities = 15/47 (31%), Positives = 18/47 (38%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           A  K+ +    P    + P    P  AAA  PP    AP P P P  
Sbjct: 865 ARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRV 911



 Score = 30.9 bits (70), Expect = 1.6
 Identities = 17/44 (38%), Positives = 18/44 (40%), Gaps = 1/44 (2%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
            P A   A APP    AAAPP   P   P P+P         GG
Sbjct: 878 EPRARPGAAAPPKAA-AAAPPAGAPAPRPRPAPRVKLGPMPPGG 920



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 20/69 (28%), Positives = 24/69 (34%), Gaps = 1/69 (1%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
             A P +     A  P    A PPP  P AA +P P P  S   S      A    +  A
Sbjct: 171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAA 229

Query: 232 AEKGLDLSS 240
            + G   S 
Sbjct: 230 DDAGASSSD 238



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 11/49 (22%), Positives = 16/49 (32%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
                  P P     A  PP   AA  P+  P P  + +   ++   P 
Sbjct: 870 GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPP 918



 Score = 29.8 bits (67), Expect = 3.3
 Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 10/132 (7%)

Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK----------AAPAPSPTPVPSQKTSG 217
            +     PPAA +  P      ++A+   P P           A+ + S +   S    G
Sbjct: 191 AEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWG 250

Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
                  P    +     +  +S   G        S+  S   ++ + +  S  + P  +
Sbjct: 251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS 310

Query: 278 GPFTDLPVSGVR 289
            P      S  R
Sbjct: 311 SPRASSSSSSSR 322



 Score = 29.0 bits (65), Expect = 5.9
 Identities = 16/121 (13%), Positives = 37/121 (30%), Gaps = 7/121 (5%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
            P++     +        +P P P      P+P+   +  +S  +R   S  +   ++ +
Sbjct: 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR--ESSSSSTSSSSE 334

Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAK 294
               +++  G        S   S +         S   +P  +   +    S  R    +
Sbjct: 335 SSRGAAVSPGPSP-----SRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389

Query: 295 R 295
            
Sbjct: 390 A 390



 Score = 28.6 bits (64), Expect = 7.6
 Identities = 12/38 (31%), Positives = 13/38 (34%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           A P   A+A  P         P P  K  P P   P P
Sbjct: 886 AAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923



 Score = 28.6 bits (64), Expect = 7.8
 Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 8/71 (11%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTS--GGTRVY 222
               PPA  +S    P P  A + P      P   +A  A +        T      R  
Sbjct: 354 SRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413

Query: 223 ASPLAKRLAAE 233
            SPL    A+ 
Sbjct: 414 PSPLDAGAASG 424



 Score = 28.6 bits (64), Expect = 8.4
 Identities = 15/62 (24%), Positives = 19/62 (30%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
                       PP   A      PPKAA A  P   P+ +     RV   P+       
Sbjct: 864 RARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923

Query: 234 KG 235
           +G
Sbjct: 924 RG 925


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 35.8 bits (82), Expect = 0.052
 Identities = 17/35 (48%), Positives = 18/35 (51%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           PPAA   AP P PP  A   P   P+ AP P P P
Sbjct: 766 PPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP 800



 Score = 35.0 bits (80), Expect = 0.084
 Identities = 25/61 (40%), Positives = 29/61 (47%), Gaps = 8/61 (13%)

Query: 174 APPAAGASAPAPPPPKVAAAPPP-PPPKAAPAP------SPTPVPSQKTSGGTRVYASPL 226
           A P A A   A PP     AP P PPP+A PAP      +PTP P    +G T +   P 
Sbjct: 754 ARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQP-PPQAGPTSMQLMPR 812

Query: 227 A 227
           A
Sbjct: 813 A 813



 Score = 32.7 bits (74), Expect = 0.45
 Identities = 22/61 (36%), Positives = 25/61 (40%), Gaps = 9/61 (14%)

Query: 174 APPAAGASAPAPPP--------PKVAAAPPPPPPKAAP-APSPTPVPSQKTSGGTRVYAS 224
           A P A A   A PP        P  AA     PP AAP AP+P P P    +   R   +
Sbjct: 734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGA 793

Query: 225 P 225
           P
Sbjct: 794 P 794



 Score = 31.6 bits (71), Expect = 0.93
 Identities = 13/60 (21%), Positives = 17/60 (28%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
           PP A      P  P   A  P      A  P+  P  ++  +        P A    A  
Sbjct: 697 PPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP 756



 Score = 28.9 bits (64), Expect = 6.3
 Identities = 17/64 (26%), Positives = 19/64 (29%), Gaps = 10/64 (15%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAP--------APSPTPVPSQKTSGGTRVYASP 225
             P A     A  P   AA     PP AAP        AP     P+          A+P
Sbjct: 704 MRPPAAPPGRAQRP--AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAP 761

Query: 226 LAKR 229
              R
Sbjct: 762 GRAR 765



 Score = 28.9 bits (64), Expect = 7.4
 Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 1/61 (1%)

Query: 175 PPAAGASAPAPPPPKVAAA-PPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           P   GA+   P         PPP  P     P+  P  +Q+ +  T     P A    A 
Sbjct: 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR 735

Query: 234 K 234
            
Sbjct: 736 P 736



 Score = 28.5 bits (63), Expect = 9.5
 Identities = 15/56 (26%), Positives = 20/56 (35%), Gaps = 2/56 (3%)

Query: 174 APPAAGASAPAPPPPKV--AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           A   A A+  A PP      A PP   P  A  P+  P  ++  +        P A
Sbjct: 714 AQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 769


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 35.3 bits (81), Expect = 0.062
 Identities = 16/33 (48%), Positives = 16/33 (48%), Gaps = 3/33 (9%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           P A  AS PAPP P V     PPP    P P P
Sbjct: 233 PSAPPASIPAPPIPPVIQYVAPPPV---PPPQP 262



 Score = 34.1 bits (78), Expect = 0.15
 Identities = 14/40 (35%), Positives = 16/40 (40%), Gaps = 2/40 (5%)

Query: 176 PAAGASAPAPPPPKVAAAPPPP--PPKAAPAPSPTPVPSQ 213
           P     +PA P    A+ P PP  P     AP P P P  
Sbjct: 223 PFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQP 262


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 34.7 bits (79), Expect = 0.066
 Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 2/76 (2%)

Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           VP             D  A  +  D+AP     S P     +   A  P      P  S 
Sbjct: 126 VPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPTA--TPPQSSS 183

Query: 208 TPVPSQKTSGGTRVYA 223
             V ++    G RVYA
Sbjct: 184 ALVITKPLRSGQRVYA 199



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 30/128 (23%), Positives = 39/128 (30%), Gaps = 21/128 (16%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAP---SPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
            GA      P +++      P     AP     TPVP           + P A   AA K
Sbjct: 88  QGARDAGLVPVELSTPVARAPQVIDTAPPNDVATPVP-----------SVPEATAEAAAK 136

Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAK 294
                    G         AD + A    +V   +      AN P    P S    VI K
Sbjct: 137 AGPQDDEADGE-------QADEAPAHNPESVPTRAARETTEANRPTATPPQSSSALVITK 189

Query: 295 RLLQSKQV 302
            L   ++V
Sbjct: 190 PLRSGQRV 197



 Score = 28.5 bits (63), Expect = 6.6
 Identities = 12/52 (23%), Positives = 17/52 (32%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
            P A         PP   A P P  P+A    +    P    + G +   +P
Sbjct: 102 TPVARAPQVIDTAPPNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAP 153


>gnl|CDD|235658 PRK05972, ligD, ATP-dependent DNA ligase; Reviewed.
          Length = 860

 Score = 35.3 bits (82), Expect = 0.072
 Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 165 AAFKDFKDDAPPAA-GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           A FK  ++D P     A  PAPP    A A P  P  AA A +     ++    G R+
Sbjct: 521 AVFKGLREDKPAREVVAERPAPP----ATAEPAAPAAAAAAAATAAAAAKAEVAGVRI 574



 Score = 32.6 bits (75), Expect = 0.41
 Identities = 14/70 (20%), Positives = 20/70 (28%)

Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
            K   + A P          P  VA+        A    +P P  + K + G    A   
Sbjct: 158 IKHRDEFARPGDEYDVLEALPDSVASGRTMAAIAAGKGRAPKPFMTPKGNAGLAAAARAA 217

Query: 227 AKRLAAEKGL 236
           A   A +   
Sbjct: 218 AAAAAKKAKK 227


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 34.7 bits (79), Expect = 0.095
 Identities = 14/37 (37%), Positives = 15/37 (40%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P AA A +  PP P   A PP   P AA        P
Sbjct: 185 PGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQP 221



 Score = 29.7 bits (66), Expect = 4.3
 Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 3/68 (4%)

Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           E++SD +A +       P    S   PP   +A + PPP P A   P      + + +  
Sbjct: 161 ESDSDSSAQQQLLQ---PQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ 217

Query: 219 TRVYASPL 226
            +  +   
Sbjct: 218 PQQPSPLS 225



 Score = 29.3 bits (65), Expect = 5.5
 Identities = 13/45 (28%), Positives = 18/45 (40%), Gaps = 8/45 (17%)

Query: 174 APPAAGASAPAPPPPKVAAAPP--------PPPPKAAPAPSPTPV 210
           +        P  PP ++   P         PPPP  +P+P PT V
Sbjct: 495 SSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVV 539



 Score = 28.9 bits (64), Expect = 7.1
 Identities = 11/49 (22%), Positives = 16/49 (32%), Gaps = 4/49 (8%)

Query: 175 PPAAGASAPAPPP----PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           P +   S P   P    P  +   P   P  +   S    P + +SG  
Sbjct: 439 PHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQ 487


>gnl|CDD|236851 PRK11114, PRK11114, cellulose synthase regulator protein;
           Provisional.
          Length = 756

 Score = 34.5 bits (80), Expect = 0.10
 Identities = 14/43 (32%), Positives = 16/43 (37%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
            P  A A+APA  P   AA    P   AA AP        +  
Sbjct: 18  FPAFATAAAPATQPLANAAGQVMPAAPAAAAPVVAQAAPSRDV 60



 Score = 33.8 bits (78), Expect = 0.18
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           A PA  A+A AP    +A A     P A  A +P    +  +   T  +A 
Sbjct: 17  AFPAF-ATAAAPATQPLANAAGQVMPAAPAAAAPVVAQAAPSRDVTLTFAQ 66


>gnl|CDD|237263 PRK12999, PRK12999, pyruvate carboxylase; Reviewed.
          Length = 1146

 Score = 34.7 bits (81), Expect = 0.11
 Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 8/54 (14%)

Query: 97   GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFET----PEEGYLAKILVPAGSK 146
            G++V+   KEGD++  GD LA IE     M  ET    P +G + ++LV AG +
Sbjct: 1085 GSVVTVLVKEGDEVKAGDPLAVIE----AMKMETTITAPVDGTVKRVLVKAGDQ 1134



 Score = 30.1 bits (69), Expect = 2.5
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 97   GTIVSWAKKEGDKLNEGDLLAEIE 120
            GT+     K GD++  GDLL E+E
Sbjct: 1122 GTVKRVLVKAGDQVEAGDLLVELE 1145


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 34.4 bits (79), Expect = 0.12
 Identities = 12/37 (32%), Positives = 13/37 (35%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
            P  G     PPP   AAA P P   +  A    P  
Sbjct: 702 GPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPE 738



 Score = 32.8 bits (75), Expect = 0.40
 Identities = 32/191 (16%), Positives = 52/191 (27%), Gaps = 38/191 (19%)

Query: 41  GTVSISNLGMFGIKNFSAIINPPQACILAVGSLSQRLVREGND-RVALPALSPTMEMGT- 98
           G V++      G+ + ++    P+     V     R  R     R    A    M +   
Sbjct: 581 GNVTVRVTRPTGLASGASNAEDPE-----VRRARTRGARALAQARTFGRATVGEMIISGF 635

Query: 99  ----IVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLV 154
                 +  + + ++  E             M   T     +A+          P+    
Sbjct: 636 PPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARG--------DPVR--- 684

Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAP------APPPPKVAAAPPPPPPKAAPAPSPT 208
                              AP A     P       PPPP + AA  P P  ++   SPT
Sbjct: 685 ----------PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPT 734

Query: 209 PVPSQKTSGGT 219
             P  +  G  
Sbjct: 735 APPEPEPPGAE 745



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 10/40 (25%), Positives = 13/40 (32%), Gaps = 1/40 (2%)

Query: 175 PPAAGASAPAPP-PPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           PP   A+A   P    + A+P  PP    P          
Sbjct: 713 PPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAEN 752



 Score = 30.2 bits (68), Expect = 2.6
 Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 7/61 (11%)

Query: 175 PPAAGASAPAPPPPKV-------AAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           P A  A+  AP  P+           PPPP   AA  P+P       +         P A
Sbjct: 685 PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744

Query: 228 K 228
           +
Sbjct: 745 E 745



 Score = 29.8 bits (67), Expect = 3.3
 Identities = 9/54 (16%), Positives = 13/54 (24%), Gaps = 10/54 (18%)

Query: 182 APAPPPPKVAAAPPPPPPKA----------APAPSPTPVPSQKTSGGTRVYASP 225
            P  P    AA   P  P+            P P           G + + +  
Sbjct: 681 DPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPT 734


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 32.4 bits (74), Expect = 0.12
 Identities = 12/37 (32%), Positives = 12/37 (32%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
              P  A     APPP        P PP   P P P 
Sbjct: 55  HLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPP 91



 Score = 30.8 bits (70), Expect = 0.39
 Identities = 15/52 (28%), Positives = 17/52 (32%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           A P +  S   PPP       P P      AP P   P         VY +P
Sbjct: 37  AQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTP 88



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 1/38 (2%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           APP  G   P  P P     P PPPP +     P   P
Sbjct: 67  APPPYGPPPPYYPAPP-GVYPTPPPPNSGYMADPQEPP 103



 Score = 27.4 bits (61), Expect = 6.0
 Identities = 12/46 (26%), Positives = 12/46 (26%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
             P  GA     P P       PPP    P   P P     T    
Sbjct: 46  YYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPP 91



 Score = 27.0 bits (60), Expect = 8.8
 Identities = 11/50 (22%), Positives = 12/50 (24%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           PP        P P     A PPP     P     P            Y +
Sbjct: 48  PPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMA 97


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 32.5 bits (74), Expect = 0.13
 Identities = 19/43 (44%), Positives = 23/43 (53%), Gaps = 7/43 (16%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-PSQKTSGGT 219
           +G+SAPAPP        P P P A PAP+   V  S  +SGG 
Sbjct: 12  SGSSAPAPPAG------PGPGPNAPPAPAAPGVDSSAGSSGGE 48



 Score = 27.1 bits (60), Expect = 9.7
 Identities = 13/45 (28%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
                + A AP   P     APP P   AAP    +   S   +G
Sbjct: 9   GVSSGSSAPAPPAGPGPGPNAPPAP---AAPGVDSSAGSSGGEAG 50


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 34.1 bits (78), Expect = 0.13
 Identities = 16/74 (21%), Positives = 27/74 (36%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSS 240
           +     P  VA  P    P A  A +  P  +   +       +P+     A+     ++
Sbjct: 170 ATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229

Query: 241 IGAGSGLFGSITSA 254
            G  +G  GS+ SA
Sbjct: 230 GGKTAGNVGSLKSA 243



 Score = 29.1 bits (65), Expect = 4.8
 Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 3/47 (6%)

Query: 174 APPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           A P A   APA    P PK  A   P    +    + TP    KT+G
Sbjct: 189 AAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAG 235



 Score = 28.4 bits (63), Expect = 6.9
 Identities = 8/45 (17%), Positives = 12/45 (26%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
              A   A    P   +   P      AP  + +P  +  T    
Sbjct: 186 TEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAG 230


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 34.1 bits (78), Expect = 0.13
 Identities = 11/55 (20%), Positives = 17/55 (30%), Gaps = 11/55 (20%)

Query: 166 AFKDFKDDAP-----PAAGASAPAPPPPKVAAAPPPPPPKAAPA------PSPTP 209
           A + +K+  P      A   +   PP    A +P    P  +          PT 
Sbjct: 250 ALEQYKELLPKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASGSTQIPTS 304


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 32.5 bits (74), Expect = 0.14
 Identities = 14/47 (29%), Positives = 15/47 (31%), Gaps = 6/47 (12%)

Query: 173 DAPPA------AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           DAPPA           P       A+AP P P    P      V  Q
Sbjct: 3   DAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQ 49



 Score = 31.7 bits (72), Expect = 0.31
 Identities = 12/49 (24%), Positives = 13/49 (26%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
           P A    A AP P      P        P  S  P+          VY 
Sbjct: 20  PQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYP 68


>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
           Members of this protein family are FHA
           (forkhead-associated) domain-containing proteins that
           are part of type VI secretion loci in a considerable
           number of bacteria, most of which are known pathogens.
           Species include Pseudomonas aeruginosa PAO1, Aeromonas
           hydrophila, Yersinia pestis, Burkholderia mallei, etc
           [Protein fate, Protein and peptide secretion and
           trafficking, Cellular processes, Pathogenesis].
          Length = 396

 Score = 33.9 bits (78), Expect = 0.15
 Identities = 18/67 (26%), Positives = 23/67 (34%), Gaps = 3/67 (4%)

Query: 158 VENESDVAAFKDFKD---DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           ++ E   AA  D        P  A   A A P        PP  P  AP P+P P     
Sbjct: 144 LDQEPLSAADLDDLSAPLFPPLDARLPAFAAPIDAEPTMVPPFVPLPAPEPAPAPASQAP 203

Query: 215 TSGGTRV 221
           +S    +
Sbjct: 204 SSDAVAL 210


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 33.9 bits (77), Expect = 0.15
 Identities = 30/141 (21%), Positives = 45/141 (31%), Gaps = 7/141 (4%)

Query: 137 AKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPP 196
           A +L P           +   + N++ V++ K   ++ PP    +APA   P     P P
Sbjct: 321 AAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLP-QAAPAVVKPGPMEIPTP 379

Query: 197 PPPKAAPAPSPTPVPS--QKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLF----GS 250
            PP     PS    P     T       ASP  K    +  +   ++             
Sbjct: 380 VPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKED 439

Query: 251 ITSADLSKASKAGAVAAPSKS 271
            TS D    +  G  AA    
Sbjct: 440 QTSEDGKMLAIMGEAAAALAL 460


>gnl|CDD|130302 TIGR01235, pyruv_carbox, pyruvate carboxylase.  This enzyme plays a
            role in gluconeogensis but not glycolysis [Energy
            metabolism, Glycolysis/gluconeogenesis].
          Length = 1143

 Score = 34.0 bits (78), Expect = 0.16
 Identities = 15/50 (30%), Positives = 26/50 (52%)

Query: 97   GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
            G I+      G  +N+GD L  +E  K     + P++G + ++LV AG +
Sbjct: 1083 GVIIEVKVSSGQAVNKGDPLVVLEAMKMETAIQAPKDGTIKEVLVKAGEQ 1132


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 33.5 bits (77), Expect = 0.17
 Identities = 12/39 (30%), Positives = 13/39 (33%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
              PA      AP P + A  P  P     P P   P P
Sbjct: 149 QVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAP 187



 Score = 32.7 bits (75), Expect = 0.32
 Identities = 9/42 (21%), Positives = 14/42 (33%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           + P     S    P P+   + P P  +A     P   P  +
Sbjct: 139 EQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPE 180



 Score = 32.7 bits (75), Expect = 0.33
 Identities = 12/42 (28%), Positives = 14/42 (33%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
               P    SAP P       A P   P+  P   P PV  +
Sbjct: 150 VAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDK 191



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 17/51 (33%), Positives = 19/51 (37%), Gaps = 5/51 (9%)

Query: 175 PPAAGASAPAPPPPKVAAAPP-----PPPPKAAPAPSPTPVPSQKTSGGTR 220
            P +   APAP P   A  P      P  P AAP P P   P+       R
Sbjct: 144 QPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 10/47 (21%), Positives = 11/47 (23%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
                A +       PA  A  PA P       P   P      P  
Sbjct: 148 PQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKPKR 194



 Score = 28.8 bits (65), Expect = 4.9
 Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 1/50 (2%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA-PSPTPVPSQ 213
           AA    +    P   ++ P  P  +   A  PP     PA P+P PV   
Sbjct: 83  AARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQP 132



 Score = 28.8 bits (65), Expect = 5.4
 Identities = 11/46 (23%), Positives = 13/46 (28%), Gaps = 1/46 (2%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
                P    A  P P  P      P   P   P     P P+Q+ 
Sbjct: 123 QPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVH-SAPQPAQQA 167


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 33.8 bits (77), Expect = 0.17
 Identities = 10/47 (21%), Positives = 13/47 (27%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
             P+   S P+   P  +     P     P    TP P     G   
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAP 268



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 13/62 (20%), Positives = 18/62 (29%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           AP    A    P  P+        P       + T  PS      +   A+P A      
Sbjct: 194 APRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253

Query: 234 KG 235
           +G
Sbjct: 254 EG 255



 Score = 29.2 bits (65), Expect = 5.0
 Identities = 8/48 (16%), Positives = 15/48 (31%), Gaps = 3/48 (6%)

Query: 175 PPAAGASAP--APPPPKVAAAPPPPPPKAAP-APSPTPVPSQKTSGGT 219
           P     ++P   P P    + P    P  +    +P    + +  G  
Sbjct: 210 PTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTP 257


>gnl|CDD|139048 PRK12538, PRK12538, RNA polymerase sigma factor; Provisional.
          Length = 233

 Score = 33.3 bits (76), Expect = 0.17
 Identities = 13/32 (40%), Positives = 13/32 (40%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           PA  A A   P P VA AP        P P P
Sbjct: 10  PAEAAPATTAPSPIVAEAPDAASVSIVPDPPP 41


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 33/165 (20%), Positives = 53/165 (32%), Gaps = 9/165 (5%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           A     A+A A    K  AA   P  +AA A +     ++         A   A+R  A+
Sbjct: 58  AEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAK 117

Query: 234 KGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIA 293
           K                  +A   K  K G      +  +          P   +   + 
Sbjct: 118 KAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREV- 176

Query: 294 KRLLQSKQVIKLREQMN-------KALEKRGAKLSINDFIIKATA 331
             + ++  V +L E+M        K L K G   +IN  + + TA
Sbjct: 177 -VIPETITVAELAEKMAVKAAEVIKKLFKLGVMATINQSLDQETA 220


>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal. 
          Length = 313

 Score = 33.3 bits (76), Expect = 0.20
 Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           +    P AA  SA         +APPPPPP   P+   
Sbjct: 209 WSKKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVPT 246



 Score = 28.6 bits (64), Expect = 5.9
 Identities = 11/54 (20%), Positives = 19/54 (35%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
            +A     P V+++ P  PP   P P P+      +       +      + AE
Sbjct: 215 VAAAKSALPAVSSSAPSAPPPPPPPPPPSVPTISNSVESASSDSKGGRGAVFAE 268


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 33.1 bits (75), Expect = 0.22
 Identities = 19/66 (28%), Positives = 24/66 (36%), Gaps = 7/66 (10%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS-----PTPVPSQKTSGGTRVYA--SPLA 227
           P +    AP   PP      P   P A   PS     P   P+  T   T+V    SP  
Sbjct: 33  PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQSPTT 92

Query: 228 KRLAAE 233
           K++  E
Sbjct: 93  KQVPTE 98


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members of
           this family confer resistance to the metalloid element
           tellurium and its salts.
          Length = 98

 Score = 31.2 bits (71), Expect = 0.22
 Identities = 12/22 (54%), Positives = 12/22 (54%)

Query: 182 APAPPPPKVAAAPPPPPPKAAP 203
             AP PP   A P P PP AAP
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAP 22



 Score = 30.8 bits (70), Expect = 0.32
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 191 AAAPPPPPPKAAPAPSPTPVPS 212
            AAP PPP  A PAP+P P   
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAP 22



 Score = 29.7 bits (67), Expect = 0.73
 Identities = 12/23 (52%), Positives = 12/23 (52%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPP 198
           PAA    PAP PP  A  P  PP
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPP 23



 Score = 29.3 bits (66), Expect = 1.3
 Identities = 11/30 (36%), Positives = 13/30 (43%)

Query: 187 PPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           P      P P PP  AP P+  PV   K +
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPPVSLSKIT 30



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 12/23 (52%), Positives = 12/23 (52%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAP 205
           PA P P  A APP P P  A  P
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPP 23



 Score = 27.0 bits (60), Expect = 8.3
 Identities = 9/23 (39%), Positives = 9/23 (39%)

Query: 174 APPAAGASAPAPPPPKVAAAPPP 196
                   APAPP P    A PP
Sbjct: 1   PAAPVPPPAPAPPAPAPPPAAPP 23


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 32.9 bits (75), Expect = 0.22
 Identities = 14/45 (31%), Positives = 17/45 (37%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
            P AA    P  PP +  A   P  P   P P P   P+   +G 
Sbjct: 106 VPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVAEPAAAPTGQ 150



 Score = 29.9 bits (67), Expect = 2.1
 Identities = 12/37 (32%), Positives = 12/37 (32%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           A     A  P P  P         P K  P P P PV
Sbjct: 104 AGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPV 140



 Score = 29.5 bits (66), Expect = 2.7
 Identities = 11/41 (26%), Positives = 12/41 (29%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
              AG    A  P  V     PP       P+P P P    
Sbjct: 101 STPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVA 141



 Score = 28.7 bits (64), Expect = 5.6
 Identities = 17/65 (26%), Positives = 22/65 (33%), Gaps = 2/65 (3%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT--RVYASPLAKR 229
           D    AA +  P P       + P   P AA  P P   P Q  +G    +    P  K 
Sbjct: 80  DAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKP 139

Query: 230 LAAEK 234
           +A   
Sbjct: 140 VAEPA 144


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 4/29 (13%)

Query: 186 PPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           P PK    P PPPP    AP+P P P++ 
Sbjct: 5   PGPKPGPTPTPPPP----APTPEPTPAKH 29



 Score = 32.4 bits (74), Expect = 0.44
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSP 207
             P P     PPPP P   P P+ 
Sbjct: 5   PGPKPGPTPTPPPPAPTPEPTPAK 28


>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
          Length = 249

 Score = 32.8 bits (75), Expect = 0.23
 Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 2/51 (3%)

Query: 175 PPAAGASAPAPPPPKVA--AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYA 223
           PP+     P  P P  A       P     PA  PT V +    GG ++YA
Sbjct: 108 PPSGARERPLEPSPAEAEPEPAKAPEKPPEPAVRPTRVITTPVRGGQQIYA 158


>gnl|CDD|133457 cd06848, GCS_H, Glycine cleavage H-protein. Glycine cleavage
           H-proteins are part of the glycine cleavage system (GCS)
           found in bacteria, archea and the mitochondria of
           eukaryotes. GCS is a multienzyme complex consisting of 4
           different components (P-, H-, T- and L-proteins) which
           catalyzes the oxidative cleavage of glycine. The
           H-protein shuttles the methylamine group of glycine from
           the P-protein (glycine dehydrogenase) to the T-protein
           (aminomethyltransferase) via a lipoyl group, attached to
           a completely conserved lysine residue.
          Length = 96

 Score = 31.0 bits (71), Expect = 0.24
 Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 1/41 (2%)

Query: 95  EMGTIVS-WAKKEGDKLNEGDLLAEIETDKATMGFETPEEG 134
            +G IV     + G ++ +GD    +E+ KA     +P  G
Sbjct: 27  LLGDIVFVELPEVGTEVKKGDPFGSVESVKAASDLYSPVSG 67


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 33.1 bits (76), Expect = 0.25
 Identities = 13/61 (21%), Positives = 21/61 (34%), Gaps = 4/61 (6%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPP----KVAAAPPPPPPKAAPAPSPTPVPSQKT 215
              D         D P ++      P PP      + +  PP P +  + +P P P   T
Sbjct: 173 PGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232

Query: 216 S 216
           +
Sbjct: 233 N 233



 Score = 32.3 bits (74), Expect = 0.37
 Identities = 21/114 (18%), Positives = 34/114 (29%), Gaps = 16/114 (14%)

Query: 112 EGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFK 171
           E +      T+          +         A   D P          +   V +F    
Sbjct: 157 EDEDADVATTNSDNSFPGEDADP------ASASPSDPP---------SSSPGVPSFPSPP 201

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           +D    + +S P P P    +  PPP P++   PSP P P+       +     
Sbjct: 202 EDPSSPSDSSLP-PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPL 254



 Score = 31.6 bits (72), Expect = 0.74
 Identities = 16/54 (29%), Positives = 20/54 (37%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           D + P A +S  +  PP    +P  P P   PA  P P   Q     T     P
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPP 262



 Score = 29.7 bits (67), Expect = 2.9
 Identities = 17/54 (31%), Positives = 19/54 (35%), Gaps = 6/54 (11%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPP------PPKAAPAPSPTPVPSQKTSGGT 219
           D  PP+  +     PPP  AA PPPP         A P P           G T
Sbjct: 222 DTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275


>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
          Length = 136

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 8/37 (21%), Positives = 10/37 (27%), Gaps = 2/37 (5%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
             D       +  +P       APP   P A     P
Sbjct: 96  NFDQTGNGRPTPSSPDAA--GPAPPDATPLAPGERPP 130



 Score = 27.3 bits (61), Expect = 8.2
 Identities = 6/40 (15%), Positives = 8/40 (20%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
              D     G            ++P    P    A    P
Sbjct: 86  LDGDDSLFTGNFDQTGNGRPTPSSPDAAGPAPPDATPLAP 125


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 33.3 bits (76), Expect = 0.27
 Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 3/50 (6%)

Query: 174 APPAAGASAPAPPP---PKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           AP AA  +A AP P   P+    P P P ++A AP      +       R
Sbjct: 380 APTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLALR 429



 Score = 31.7 bits (72), Expect = 0.70
 Identities = 12/57 (21%), Positives = 19/57 (33%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           + +    +  SAP        A  P   P+    P+P P  S +        A+P  
Sbjct: 369 RLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAG 425



 Score = 29.0 bits (65), Expect = 5.0
 Identities = 11/69 (15%), Positives = 15/69 (21%), Gaps = 4/69 (5%)

Query: 176 PAAGASAPAPPPPKVAAAP----PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
           PA           +V +             AP P   P P    +      A        
Sbjct: 360 PADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAP 419

Query: 232 AEKGLDLSS 240
           A     L+ 
Sbjct: 420 AAAPAGLAL 428



 Score = 28.3 bits (63), Expect = 8.5
 Identities = 9/35 (25%), Positives = 9/35 (25%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           D  P   G  AP P     A     P    A    
Sbjct: 394 DFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLAL 428


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 33.2 bits (75), Expect = 0.27
 Identities = 36/142 (25%), Positives = 52/142 (36%), Gaps = 22/142 (15%)

Query: 148 VPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS- 206
           +PI +L       + D A        AP  A + AP P     +AA   P    AP P+ 
Sbjct: 26  IPIPELHTSAATQKPDPAP-------APHQAASRAPDPAVAPTSAASRKPDLAQAPTPAA 78

Query: 207 ---PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAG 263
                P P       +R     +A +LAA    D +          + TSA  +  + A 
Sbjct: 79  SEKFDPAP-APHQAASRAPDPAVAPQLAAAPKPDAAE---------AFTSAAQAHEAPAD 128

Query: 264 A-VAAPSKSAKPTANGPFTDLP 284
           A  +A SK   P A+   +  P
Sbjct: 129 AGTSAASKKPDPAAHTQHSPPP 150


>gnl|CDD|221079 pfam11321, DUF3123, Protein of unknown function (DUF3123).  This
           eukaryotic family of proteins has no known function.
          Length = 113

 Score = 31.5 bits (71), Expect = 0.27
 Identities = 16/43 (37%), Positives = 17/43 (39%)

Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
           A KD K    PAA A A        AA       + A AP PT
Sbjct: 55  ARKDVKLIGAPAAAAPAVTSSTATDAAVRRARSRRNAAAPRPT 97


>gnl|CDD|205711 pfam13533, Biotin_lipoyl_2, Biotin-lipoyl like. 
          Length = 50

 Score = 29.8 bits (68), Expect = 0.28
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDK 123
           G +V+   KEG  + +GD+L  +++++
Sbjct: 11  GRVVAVNVKEGQSVKKGDVLFTLDSEE 37


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 31.2 bits (71), Expect = 0.29
 Identities = 10/36 (27%), Positives = 13/36 (36%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
             P  PP +      PP P+  P  SP      + S
Sbjct: 31  PPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           PP        PP P+    P   P +  P    + +   K S GT
Sbjct: 36  PPQQQQGHFMPPQPQ--PYPKQSPQQQQPPQFSSFLSQFKNSDGT 78


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 33.1 bits (76), Expect = 0.29
 Identities = 13/32 (40%), Positives = 13/32 (40%), Gaps = 3/32 (9%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
           D PPA     PA P   V AA P       PA
Sbjct: 536 DVPPA---PTPAEPAAPVVAAAPKAAAATPPA 564



 Score = 32.7 bits (75), Expect = 0.42
 Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           P  A  + P  PP    A P  P   AAP  +    P+Q
Sbjct: 527 PALATFAMPDVPPAPTPAEPAAPVVAAAPKAAAATPPAQ 565



 Score = 29.6 bits (67), Expect = 3.6
 Identities = 10/40 (25%), Positives = 10/40 (25%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
               A      AP    VA     P   A P P    V  
Sbjct: 874 PVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVE 913


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
           domain forms a compact bundle of four antiparallel
           alpha-helices, which are arranged in a left-handed
           topology. Binding of F-actin to the F-actin binding
           domain may result in cytoplasmic retention and
           subcellular distribution of the protein, as well as
           possible inhibition of protein function.
          Length = 179

 Score = 32.0 bits (72), Expect = 0.31
 Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 1/103 (0%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
              PA P P+  A P   PP   P PS +P PS K + GT+  ++     ++    L  +
Sbjct: 6   PVPPAVPKPQSTAKPVGTPPSPVPLPSTSPSPS-KMANGTQPSSAAFIPLISTRVSLRKT 64

Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD 282
                    G IT   +  +++A  +A    S +  ++    +
Sbjct: 65  RQPPERIASGKITKGVVLDSTEALCLAISRNSEQMASHSAVLE 107


>gnl|CDD|182725 PRK10781, rcsF, outer membrane lipoprotein; Reviewed.
          Length = 133

 Score = 31.6 bits (72), Expect = 0.32
 Identities = 10/30 (33%), Positives = 11/30 (36%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           S     P +  A PP   P    AP   PV
Sbjct: 20  SRSPVEPVQSTAPPPKAEPAKPKAPRAAPV 49


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 32.7 bits (74), Expect = 0.33
 Identities = 10/38 (26%), Positives = 13/38 (34%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           PA  P +     P  P +     +P   P Q   G  R
Sbjct: 221 PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKR 258



 Score = 30.8 bits (69), Expect = 1.3
 Identities = 9/43 (20%), Positives = 12/43 (27%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           P       P   P + A  P P  P   P  +       +  G
Sbjct: 221 PAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRSREQG 263



 Score = 28.5 bits (63), Expect = 8.4
 Identities = 8/39 (20%), Positives = 11/39 (28%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
              A       P  +    P   P + A  P+P     Q
Sbjct: 210 QNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQ 248



 Score = 28.5 bits (63), Expect = 8.4
 Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 2/32 (6%)

Query: 176 PAAGASAPAPPPPKVAAAP--PPPPPKAAPAP 205
           PA G   PA     +A+    PP PP+ APAP
Sbjct: 152 PAGGTYIPAGGTYILASGTYIPPNPPREAPAP 183


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 3/46 (6%)

Query: 172 DDAPPAAGASAPAP---PPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           D  PP A    P P   P P+    P PP       P P P P  K
Sbjct: 57  DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPK 102



 Score = 31.2 bits (71), Expect = 0.90
 Identities = 16/34 (47%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 183 PAPPPPKVAAAP-PPPPPKAAPAPSPTPVPSQKT 215
           P P PPK A    P P PK  P P P P P +K 
Sbjct: 80  PIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKV 113



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 13/33 (39%), Positives = 13/33 (39%), Gaps = 4/33 (12%)

Query: 183 PAPPPP----KVAAAPPPPPPKAAPAPSPTPVP 211
           PAP  P     VA A   PP    P P P   P
Sbjct: 42  PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEP 74



 Score = 29.7 bits (67), Expect = 2.6
 Identities = 15/44 (34%), Positives = 16/44 (36%), Gaps = 6/44 (13%)

Query: 174 APPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVP 211
           AP          PPP+    P       P PPK AP   P P P
Sbjct: 54  APADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEP 97


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.8 bits (75), Expect = 0.35
 Identities = 18/68 (26%), Positives = 23/68 (33%), Gaps = 7/68 (10%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           ++S VA F    D +P A     P P        P PPP +A  A          T  G+
Sbjct: 40  SQSGVAEF--PWDPSPQA-----PPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGS 92

Query: 220 RVYASPLA 227
                P  
Sbjct: 93  PAANPPAT 100


>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           It is often found on plasmids. This protein family the
           member related to ParB, and is designated PRTRC system
           ParB family protein.
          Length = 554

 Score = 32.8 bits (75), Expect = 0.35
 Identities = 22/83 (26%), Positives = 30/83 (36%), Gaps = 9/83 (10%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPP----PPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           N + VAA K  +  A  AA   A        P K  +        AA   +   VPS + 
Sbjct: 314 NTTKVAARKKAERAAAAAAQKPAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQP 373

Query: 216 SGG-----TRVYASPLAKRLAAE 233
           S        +V+   LA+ LA  
Sbjct: 374 SNRVKDYREKVWRKALARELALN 396


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 32.7 bits (75), Expect = 0.36
 Identities = 13/42 (30%), Positives = 16/42 (38%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           D +     A+ PAP  P          P  A AP+P   P Q
Sbjct: 165 DTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206



 Score = 30.4 bits (69), Expect = 2.0
 Identities = 9/48 (18%), Positives = 13/48 (27%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           +   PA   +      P+  A   P       A +P P       G  
Sbjct: 188 NSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAA 235



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 15/65 (23%), Positives = 17/65 (26%), Gaps = 4/65 (6%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPP----KAAPAPSPTPVPSQKTSGGTRVYASPLA 227
             AP A   + P        A  P P       A  APS   V +  T           A
Sbjct: 175 TPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234

Query: 228 KRLAA 232
             L  
Sbjct: 235 APLPT 239



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 17/91 (18%), Positives = 23/91 (25%), Gaps = 6/91 (6%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLS 239
            + PA  P   A     P     PA +  P P+        V  S      AA       
Sbjct: 169 TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAAT------ 222

Query: 240 SIGAGSGLFGSITSADLSKASKAGAVAAPSK 270
              A              +A  +   A P+ 
Sbjct: 223 PAPAAPATPDGAAPLPTDQAGVSTPAADPNA 253



 Score = 29.6 bits (67), Expect = 3.2
 Identities = 10/42 (23%), Positives = 11/42 (26%), Gaps = 1/42 (2%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
               PAA            A A  P P       +    PSQ
Sbjct: 174 TTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVA-PSQ 214



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 3/43 (6%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP---TPVPSQ 213
             PA      A   P  A       P  A   +P    P+P+ 
Sbjct: 198 PAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTD 240



 Score = 28.8 bits (65), Expect = 5.8
 Identities = 11/56 (19%), Positives = 15/56 (26%), Gaps = 1/56 (1%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
              P  A      P   +  A    P P A        V   + +  T    +P A
Sbjct: 173 ATTPAPAAPVDTTPTNSQTPAVATAPAP-AVDPQQNAVVAPSQANVDTAATPAPAA 227


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 32.3 bits (74), Expect = 0.41
 Identities = 12/49 (24%), Positives = 15/49 (30%), Gaps = 5/49 (10%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAA-----PAPSPTPVPSQKTSG 217
             PA   + P PP       P     K A     PA S  P     ++ 
Sbjct: 188 KVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236



 Score = 31.6 bits (72), Expect = 0.80
 Identities = 8/46 (17%), Positives = 9/46 (19%), Gaps = 1/46 (2%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS-QKTSGG 218
            P       PA    K A    PP     P        +       
Sbjct: 199 PPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASARALSSAPAS 244



 Score = 28.5 bits (64), Expect = 6.2
 Identities = 7/39 (17%), Positives = 9/39 (23%), Gaps = 3/39 (7%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
           AP                  PP  P    PA +     +
Sbjct: 182 APSKGAKVPATAETHPT---PPQKPATKKPAVNHHKTAT 217


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 32.3 bits (73), Expect = 0.41
 Identities = 16/83 (19%), Positives = 29/83 (34%), Gaps = 4/83 (4%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKG 235
           P A   A  P P +   A  P   +     + +P P  +     +  ++P  ++ A    
Sbjct: 93  PHAQPEASGPGPAR--GARGPAGSRGRGRRAESPSP--RDPPNPKGASAPRGRKSACADS 148

Query: 236 LDLSSIGAGSGLFGSITSADLSK 258
             L    A +      T A L++
Sbjct: 149 AALLDAPAPAAPKRQKTPAGLAR 171


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 29.4 bits (66), Expect = 0.44
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPT 208
           PP    S P P    V  +PPPP  K+ P P   
Sbjct: 17  PPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYV 50



 Score = 27.4 bits (61), Expect = 1.9
 Identities = 11/33 (33%), Positives = 13/33 (39%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           PP     +P PP        PPPP   +P P  
Sbjct: 16  PPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPK 48



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 17/51 (33%), Positives = 20/51 (39%), Gaps = 2/51 (3%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           PP        PPP    +  PPPP K+    SP P   +       VY SP
Sbjct: 6   PPPPVKQYSPPPPYYYKS--PPPPVKSPVYKSPPPPVYKSPPPPKYVYKSP 54



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 14/38 (36%), Positives = 17/38 (44%), Gaps = 1/38 (2%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           +K   PP       +PPPP V  +PPPP       P P
Sbjct: 21  YKSPPPPVKSPVYKSPPPP-VYKSPPPPKYVYKSPPPP 57



 Score = 25.9 bits (57), Expect = 8.1
 Identities = 17/41 (41%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
            PPP V    PPPP      P P   P  K S    VY SP
Sbjct: 5   SPPPPVKQYSPPPPYYYKSPPPPVKSPVYK-SPPPPVYKSP 44


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 32.0 bits (72), Expect = 0.47
 Identities = 20/46 (43%), Positives = 21/46 (45%), Gaps = 3/46 (6%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPV---PSQKTSGGTRVYASP 225
           P    P    APPPPPP A P P P P    P+  T GG     SP
Sbjct: 217 PNHFLPTPMLAPPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSP 262


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
           protein. 
          Length = 314

 Score = 32.2 bits (73), Expect = 0.47
 Identities = 12/49 (24%), Positives = 17/49 (34%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
             +    PAPPP  ++       P    APS     +  +     V AS
Sbjct: 117 EPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSPRRVIVQAS 165


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 32.6 bits (74), Expect = 0.47
 Identities = 14/45 (31%), Positives = 15/45 (33%), Gaps = 7/45 (15%)

Query: 174 APPAAGASAPAPPPPKV-------AAAPPPPPPKAAPAPSPTPVP 211
            P          PPP         A A P PP    PAP P P+ 
Sbjct: 161 GPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAPHPHPIA 205



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 13/37 (35%), Positives = 13/37 (35%), Gaps = 5/37 (13%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
              DD    AGA A   PP       PPP P   P  
Sbjct: 174 PPDDDEGDEAGAPATPAPP-----LHPPPAPHPHPIA 205


>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
          Length = 248

 Score = 31.8 bits (72), Expect = 0.49
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
           F+     AA +  P PPPP   A P PP  + AP
Sbjct: 108 FRAQYERAAVSPPPPPPPPPARAEPAPPVARPAP 141


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 31.7 bits (72), Expect = 0.49
 Identities = 14/46 (30%), Positives = 17/46 (36%), Gaps = 2/46 (4%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           P+ G++ P    P     P   P K A  P      S  T GG  V
Sbjct: 157 PSDGSATPNTSAPPTPGNPAAQPEKPAETPKGN--GSSFTFGGLTV 200


>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
          Length = 574

 Score = 32.3 bits (73), Expect = 0.54
 Identities = 15/48 (31%), Positives = 22/48 (45%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVY 222
           PP A +++P  PP   AA  P  P  A    +    P++ +   TR Y
Sbjct: 317 PPGATSNSPGQPPAPAAAGAPGTPAAANGQAAAAAAPTESSKSATRNY 364


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 31.7 bits (72), Expect = 0.55
 Identities = 14/33 (42%), Positives = 14/33 (42%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           P PP PK    P   P K  P P P P P  K 
Sbjct: 79  PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 17/104 (16%), Positives = 27/104 (25%), Gaps = 6/104 (5%)

Query: 175 PPAAGASAPAPPPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAK 228
            P      P PP        P        P        P P P  K     +V   P  K
Sbjct: 59  EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK 118

Query: 229 RLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSA 272
           +  ++      +         S  SA  +    + +  +  + A
Sbjct: 119 KPPSKTAAKAPAAPNQPARPPSAASASGAATGPSASYLSGLRRA 162


>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
           Provisional.
          Length = 517

 Score = 32.3 bits (74), Expect = 0.55
 Identities = 16/60 (26%), Positives = 16/60 (26%), Gaps = 4/60 (6%)

Query: 152 KLVC--IIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           KLV   I    E             P      A A P     A  P PP   A   SP  
Sbjct: 99  KLVAPVIWYSEEEGEEILAS--SPPPIPEAVPATAEPSESANAQFPAPPLTRAVCQSPAK 156



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           AS+P P P  V A   P     A  P+P    +   S
Sbjct: 117 ASSPPPIPEAVPATAEPSESANAQFPAPPLTRAVCQS 153


>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 30.1 bits (68), Expect = 0.55
 Identities = 9/49 (18%), Positives = 14/49 (28%)

Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           K+ + D    A A+A   P       P         A S       + +
Sbjct: 44  KEMQSDNKAEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDSTEARPA 92


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 32.0 bits (73), Expect = 0.58
 Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 2/44 (4%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
             A G    + P P   ++ P   P AA   +     S   SG 
Sbjct: 248 TLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSS--ASGN 289



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 15/58 (25%), Positives = 20/58 (34%), Gaps = 5/58 (8%)

Query: 172 DDAPPAAGAS-----APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           +D   A   S     AP    P V     P P   + AP+  P  +   +  T   AS
Sbjct: 230 EDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSSAS 287


>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
          Length = 262

 Score = 31.8 bits (72), Expect = 0.60
 Identities = 15/42 (35%), Positives = 15/42 (35%), Gaps = 2/42 (4%)

Query: 179 GASAPAPP--PPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           GA  P     P   AA PP P   AAP   P         GG
Sbjct: 203 GARRPRSRRRPRGGAAGPPVPAEPAAPERPPADPTPTSARGG 244



 Score = 28.3 bits (63), Expect = 7.9
 Identities = 11/32 (34%), Positives = 12/32 (37%), Gaps = 3/32 (9%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           P    A  P P  P   AAP  PP    P  +
Sbjct: 213 PRGGAAGPPVPAEP---AAPERPPADPTPTSA 241


>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
           Reviewed.
          Length = 275

 Score = 31.8 bits (73), Expect = 0.61
 Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 13/44 (29%)

Query: 171 KDDAPPAAG-------AS------APAPPPPKVAAAPPPPPPKA 201
           +D APP  G       AS       P  P  K    PPPP P A
Sbjct: 224 EDQAPPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPPA 267


>gnl|CDD|221020 pfam11191, DUF2782, Protein of unknown function (DUF2782).  This is
           a bacterial family of proteins whose function is
           unknown.
          Length = 104

 Score = 30.0 bits (68), Expect = 0.65
 Identities = 10/30 (33%), Positives = 10/30 (33%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
           AP  A A A     P    A P  P    P
Sbjct: 10  APALAAADAATDDDPIPLPAIPTTPEDKEP 39


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 31.1 bits (70), Expect = 0.69
 Identities = 14/33 (42%), Positives = 15/33 (45%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           P   G   P PP P     PPPPP   +P P P
Sbjct: 116 PGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148



 Score = 30.7 bits (69), Expect = 0.91
 Identities = 18/41 (43%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 174 APPAAGASAPAPPPP---KVAAAPPPPPPKAAPAPSPTPVP 211
            PP  G +   PPPP     A  PPPP P  A  P P P+P
Sbjct: 27  PPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67



 Score = 29.9 bits (67), Expect = 1.5
 Identities = 18/37 (48%), Positives = 19/37 (51%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P  AG   P P P   A  PPPP P AA  P P P+P
Sbjct: 55  PGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91



 Score = 29.5 bits (66), Expect = 1.8
 Identities = 14/37 (37%), Positives = 15/37 (40%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P       P P P      PPPP P  A  P P P+P
Sbjct: 19  PGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLP 55



 Score = 29.1 bits (65), Expect = 2.5
 Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 174 APPAAGASAPAPPPP--KVAAAPPPPP-PKAAPAPSPTPVP 211
            PP  G +A  PPPP   VA  PPPPP P A   P P P+P
Sbjct: 39  PPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79



 Score = 29.1 bits (65), Expect = 2.6
 Identities = 15/36 (41%), Positives = 17/36 (47%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
            PP  GA    PPPP + + PPPP     PA    P
Sbjct: 125 PPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160



 Score = 29.1 bits (65), Expect = 2.8
 Identities = 18/37 (48%), Positives = 18/37 (48%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P  AG   P PP P  AA PPPPP    P   P P P
Sbjct: 91  PGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPP 127



 Score = 28.8 bits (64), Expect = 3.8
 Identities = 17/36 (47%), Positives = 17/36 (47%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P A    P PP P  A  PPPPPP    A  P P P
Sbjct: 79  PGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPP 114



 Score = 27.6 bits (61), Expect = 8.6
 Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           P   G   P P P  V   PPPP P     P P P+P
Sbjct: 7   PGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43


>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus.  This family includes the
           NUT protein. The gene encoding for NUT protein (Nuclear
           Testis protein) is found fused to BRD3 or BRD4 genes, in
           some aggressive types of carcinoma, due to chromosomal
           translocations. Proteins of the BRD family contain two
           bromodomains that bind transcriptionally active
           chromatin through associations with acetylated histones
           H3 and H4. Such proteins are crucial for the regulation
           of cell cycle progression. On the other hand, little is
           known about NUT protein. NUT is known to have a Nuclear
           Export Sequence (NES) as well as a Nuclear Localization
           Signal (NLS), both located towards the C-terminal end of
           the protein. A fused NUT-GFP protein showed either
           cytoplasmic or nuclear localization, suggesting that it
           is subject to nuclear/cytoplasmic shuttling. Consistent
           with this possibility, treatment with leptomycin B an
           inhibitor of CRM1-dependent nuclear export resulted in
           re-distribution of NUT-GFP to the nucleus. Inspection of
           NUT revealed a C-terminal sequence similar to known
           nuclear export sequences (NES) which are often regulated
           by phosphorylation.
          Length = 328

 Score = 31.5 bits (71), Expect = 0.71
 Identities = 15/52 (28%), Positives = 15/52 (28%), Gaps = 5/52 (9%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPT-----PVPSQKTSGGTRVYAS 224
            G   P PPPP     P      A P P        P   QK S G      
Sbjct: 151 LGLPLPPPPPPVAQLVPIVSLENAWPGPQGATGEGGPAAIQKPSPGDYSSKP 202


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 31.6 bits (72), Expect = 0.72
 Identities = 13/40 (32%), Positives = 17/40 (42%)

Query: 179 GASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           G S P PPPP   ++ PPP    A   S       ++  G
Sbjct: 208 GGSGPGPPPPGFKSSFPPPYGPGAGPSSGYGSGGTRSGQG 247


>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631).  The
           members of this family are sequences derived from a
           group of hypothetical proteins expressed by certain
           bacterial species. The region concerned is approximately
           440 amino acid residues in length.
          Length = 729

 Score = 31.9 bits (73), Expect = 0.75
 Identities = 13/44 (29%), Positives = 18/44 (40%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           + AF+   + A       A AP  P+VA       P+ A AP  
Sbjct: 595 LEAFQAPPELAAQLTALPAVAPAEPRVAEDIVAVAPEEARAPRL 638


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 31.7 bits (72), Expect = 0.76
 Identities = 14/61 (22%), Positives = 18/61 (29%), Gaps = 5/61 (8%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           +  A   SA A     +      P   A  A +P   P   T+      A P  K  A  
Sbjct: 244 SEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAA-----AEPAPKEAAKA 298

Query: 234 K 234
            
Sbjct: 299 A 299



 Score = 31.3 bits (71), Expect = 0.94
 Identities = 15/53 (28%), Positives = 20/53 (37%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           E++ A           A  A AP   P    AA  P P +AA A +    P+ 
Sbjct: 255 ETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPAL 307



 Score = 30.5 bits (69), Expect = 1.6
 Identities = 25/113 (22%), Positives = 32/113 (28%), Gaps = 9/113 (7%)

Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           +  A  +D  D      G   P   P +  AAP PP   AA   S     +         
Sbjct: 208 ASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPVPPSE-AARPKSADAETNAALKTPATA 266

Query: 222 YASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKP 274
             +      A E      +  A          A    A  A A A P+   KP
Sbjct: 267 PKAAAKNAKAPEAQPVSGTAAA--------EPAPKEAAKAAAAAAKPALEDKP 311


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 31.4 bits (71), Expect = 0.77
 Identities = 13/56 (23%), Positives = 17/56 (30%)

Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           E   + A       +AP      + AP P      P  P  +  P     P P  K
Sbjct: 126 EQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVK 181


>gnl|CDD|218839 pfam05983, Med7, MED7 protein.  This family consists of several
           eukaryotic proteins which are homologues of the yeast
           MED7 protein. Activation of gene transcription in
           metazoans is a multi-step process that is triggered by
           factors that recognise transcriptional enhancer sites in
           DNA. These factors work with co-activators such as MED7
           to direct transcriptional initiation by the RNA
           polymerase II apparatus.
          Length = 161

 Score = 30.7 bits (70), Expect = 0.79
 Identities = 15/62 (24%), Positives = 23/62 (37%), Gaps = 17/62 (27%)

Query: 181 SAPAPPPP----------KVAAAPPPPPPKA-------APAPSPTPVPSQKTSGGTRVYA 223
           S   PPPP          ++   PPPPPP                 +PS ++ G  ++Y 
Sbjct: 4   SLYPPPPPYYKLFTDENLELRFLPPPPPPTEGSYRVFGETYSLEDKLPSLESQGIEQLYP 63

Query: 224 SP 225
           + 
Sbjct: 64  TS 65


>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
          Length = 427

 Score = 31.6 bits (72), Expect = 0.79
 Identities = 14/48 (29%), Positives = 15/48 (31%)

Query: 160 NESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           N S            PP  G   PA  P   A   P P   AA   +P
Sbjct: 1   NTSSDPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRGAASGLNP 48



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 13/46 (28%), Positives = 15/46 (32%), Gaps = 3/46 (6%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
             P +AG+    PP P     P   P   A AP          SG 
Sbjct: 4   SDPFSAGSGGFVPPNPG-DRTPAAAPA--AGAPFQPRPGRGAASGL 46


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 31.0 bits (70), Expect = 0.84
 Identities = 29/96 (30%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 186 PPPKVAAAPP------PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL-AAEKGLDL 238
           PPP  A  PP      P  P  +   SP  +PS +  G T +     AK     +KG   
Sbjct: 75  PPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAKSSDNKDKGSIK 134

Query: 239 SSIG--AGSGLFGSITSADLSKASKAGAVAAPSKSA 272
            S G     GL  S  + +  KAS + A    S+S+
Sbjct: 135 KSKGSDGSLGLAMSGKNGESGKASGSSANGGSSQSS 170


>gnl|CDD|177677 PLN00045, PLN00045, photosystem I reaction center subunit IV;
           Provisional.
          Length = 101

 Score = 29.6 bits (66), Expect = 0.86
 Identities = 13/36 (36%), Positives = 17/36 (47%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPP 199
           V A +D +     +A + A A  P    AA P PPP
Sbjct: 2   VRAAEDAEPATSSSAASPAAAAAPAAAPAAKPKPPP 37



 Score = 28.8 bits (64), Expect = 1.8
 Identities = 10/33 (30%), Positives = 13/33 (39%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           A  A++ +   P  AAAP   P      P   P
Sbjct: 8   AEPATSSSAASPAAAAAPAAAPAAKPKPPPIGP 40


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 31.2 bits (71), Expect = 0.87
 Identities = 7/43 (16%), Positives = 11/43 (25%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
             P    +AP  P         P  P      +P     ++  
Sbjct: 140 VTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEKEKE 182



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 10/41 (24%), Positives = 13/41 (31%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
            P A  + P P  P      P  P   AP  +    P  + 
Sbjct: 129 QPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQA 169



 Score = 28.5 bits (64), Expect = 5.6
 Identities = 11/45 (24%), Positives = 13/45 (28%), Gaps = 4/45 (8%)

Query: 174 APPAAGASAPAPPPPKVAA----APPPPPPKAAPAPSPTPVPSQK 214
            P       P  PP +  A      P P      AP    V + K
Sbjct: 130 PPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPK 174


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 31.4 bits (71), Expect = 0.88
 Identities = 15/52 (28%), Positives = 21/52 (40%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
           AGA AP       + AP    P +A  P  TP P++     T   + P +  
Sbjct: 21  AGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGP 72


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 31.5 bits (72), Expect = 0.89
 Identities = 15/37 (40%), Positives = 15/37 (40%), Gaps = 2/37 (5%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           FK D  P      P P     AA PPPP   AA A  
Sbjct: 11  FKWD--PRPAVPPPPPRLLPPAAVPPPPLAAAAAARK 45


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 31.6 bits (72), Expect = 0.89
 Identities = 17/63 (26%), Positives = 20/63 (31%), Gaps = 9/63 (14%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAA---------PAPSPTPVPSQKTSGGTRVYAS 224
             P   A+ PAP    VAA      P AA         P       P +  S G     +
Sbjct: 261 VIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNAISPATPVRPASNGVTPSQA 320

Query: 225 PLA 227
           P A
Sbjct: 321 PSA 323


>gnl|CDD|237545 PRK13888, PRK13888, conjugal transfer protein TrbN; Provisional.
          Length = 206

 Score = 31.0 bits (70), Expect = 0.90
 Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 2/49 (4%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA--PSPTPVPSQKTSGG 218
           D     A  S P+ P      A P   P  AP             +  G
Sbjct: 146 DVTKAGATPSTPSQPATVAQRATPAARPAPAPKQAAPAAAAARSLSVAG 194


>gnl|CDD|233980 TIGR02712, urea_carbox, urea carboxylase.  Members of this family are
            ATP-dependent urea carboxylase, including characterized
            members from Oleomonas sagaranensis (alpha class
            Proteobacterium) and yeasts such as Saccharomyces
            cerevisiae. The allophanate hydrolase domain of the yeast
            enzyme is not included in this model and is represented
            by an adjacent gene in Oleomonas sagaranensis. The fusion
            of urea carboxylase and allophanate hydrolase is
            designated urea amidolyase. The enzyme from Oleomonas
            sagaranensis was shown to be highly active on acetamide
            and formamide as well as urea [Central intermediary
            metabolism, Nitrogen metabolism].
          Length = 1201

 Score = 31.5 bits (72), Expect = 0.96
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 106  EGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSKDVPIGKLVCII 157
             GD++  G  L  +E  K  M    P  G + KIL   G   V  G +V ++
Sbjct: 1150 VGDRVEAGQPLVILEAMKMEMPVSAPVAGKVTKILCQPGDM-VDAGDIVAVL 1200


>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
           alphaproteobacterial type.  AddAB, also called RexAB,
           substitutes for RecBCD in several bacterial lineages.
           These DNA recombination proteins act before synapse and
           are particularly important for DNA repair of
           double-stranded breaks by homologous recombination. The
           term AddAB is used broadly, with AddA homologous between
           the alphaproteobacteria (as modeled here) and the
           Firmicutes, while the partner AddB proteins show no
           strong homology across the two groups of species [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 1135

 Score = 31.6 bits (72), Expect = 0.99
 Identities = 13/66 (19%), Positives = 17/66 (25%), Gaps = 4/66 (6%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP----TPVPSQKTSGGTRVYASPLAKR 229
            P  A  +      P    AP P  P      +P      + S       R      A  
Sbjct: 918 LPVEAAQTDTLEALPDWLRAPAPAEPALPRPLAPSGLGGAIDSALPGEAVRSSLGTQAPA 977

Query: 230 LAAEKG 235
            A  +G
Sbjct: 978 FALLRG 983


>gnl|CDD|217047 pfam02455, Hex_IIIa, Hexon-associated protein (IIIa).  The major
           capsid protein of the adenovirus strain is also known as
           a hexon. This is a family of hexon-associated proteins
           (protein IIIa).
          Length = 491

 Score = 31.2 bits (71), Expect = 0.99
 Identities = 19/84 (22%), Positives = 30/84 (35%), Gaps = 7/84 (8%)

Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           ++    A  +  K +A      S  APPP    A P  P P  +   S            
Sbjct: 339 DDVDSPAELRLLKPEAGEEEDDSLFAPPPS--VAGPSSPAPSLSRYSSVPRSAFDGRLAD 396

Query: 219 TRVYAS-----PLAKRLAAEKGLD 237
           +R+  +     PLA++ A    L+
Sbjct: 397 SRLPGAGDLLAPLAEKNAYNNPLE 420


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 31.5 bits (71), Expect = 1.0
 Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 5/32 (15%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           AS P   PP     PPPPPP   P   P P P
Sbjct: 2   ASLPPGNPP-----PPPPPPGFEPPSQPPPPP 28



 Score = 31.5 bits (71), Expect = 1.2
 Identities = 14/30 (46%), Positives = 15/30 (50%), Gaps = 8/30 (26%)

Query: 183 PAPPPPKVAAAPPPPP---PKAAPAPSPTP 209
           P  PPP     PPPPP   P + P P P P
Sbjct: 6   PGNPPP-----PPPPPGFEPPSQPPPPPPP 30



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 4/27 (14%)

Query: 175 PPAAGASAPAPPPP--KVAAAPPPPPP 199
           PP  G   P PPPP  +  + PPPPPP
Sbjct: 5   PP--GNPPPPPPPPGFEPPSQPPPPPP 29



 Score = 30.3 bits (68), Expect = 2.4
 Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 8/35 (22%)

Query: 185 PPPPKVAAAPPPP---PPKAAPAPSPTPVPSQKTS 216
           PPPP     PPPP   PP   P P P  V  +K S
Sbjct: 9   PPPP-----PPPPGFEPPSQPPPPPPPGVNVKKRS 38


>gnl|CDD|225044 COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate
           transport and metabolism].
          Length = 399

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 10/43 (23%), Positives = 14/43 (32%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
            A   +   P       AP P  P+  P P    V  +  + G
Sbjct: 23  GAFLLAQLQPATTAPQLAPTPKGPRLPPTPFAKGVSVEVVAQG 65


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 10/33 (30%), Positives = 12/33 (36%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           PP  +     P PPPK    P     P  + S 
Sbjct: 361 PPTKRRLRLLPVPPPKVQALPLTALAPLVRHSP 393


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 23/85 (27%), Positives = 30/85 (35%), Gaps = 16/85 (18%)

Query: 172 DDAPPAAGASAPAPPP--PKVAAAPPPPPPKAAP--------------APSPTPVPSQKT 215
           D  PP A     APP   P+V A P   P   AP               P+  P   Q  
Sbjct: 559 DRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPP 618

Query: 216 SGGTRVYASPLAKRLAAEKGLDLSS 240
           S   R  A  + +    E+ L+ S+
Sbjct: 619 SSAPRDMAPSVVRMFLRERLLEQST 643


>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
           protease mycosin.  Members of this family are
           subtilisin-related serine proteases, found strictly in
           the Actinobacteria and associated with type VII
           secretion operons. The designation mycosin is used for
           members from Mycobacterium [Protein fate, Protein and
           peptide secretion and trafficking, Protein fate, Protein
           modification and repair].
          Length = 350

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 172 DDAPPAAGASAPAPPPPK-VAAAPPPPPPKAAPAPS 206
           +  P       PAP P + VAA  PPPPP   P   
Sbjct: 285 ELPPEDGRPLRPAPAPARPVAAPAPPPPPDDTPRGR 320



 Score = 30.4 bits (69), Expect = 2.0
 Identities = 11/34 (32%), Positives = 12/34 (35%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
            + PP  G      P P    A P PPP     P
Sbjct: 284 GELPPEDGRPLRPAPAPARPVAAPAPPPPPDDTP 317


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 30.6 bits (69), Expect = 1.3
 Identities = 19/108 (17%), Positives = 32/108 (29%), Gaps = 7/108 (6%)

Query: 159 ENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP----PKAAPAPSPTPVPSQK 214
            + ++  A  D    A P   A   A       +APP          A    P    ++ 
Sbjct: 79  GDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138

Query: 215 --TSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKAS 260
             T   T   A+P  +R +  +   +S          +  +  L  A 
Sbjct: 139 GPTPDPTAQPATP-DERRSPRQRPPVSGEPPTPSTPDAHVAGTLQAAR 185


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 30.6 bits (68), Expect = 1.3
 Identities = 17/47 (36%), Positives = 18/47 (38%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTR 220
           A PA    A AP  P  A   P P P   PA +PT  P        R
Sbjct: 89  AAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPAR 135



 Score = 30.6 bits (68), Expect = 1.6
 Identities = 20/53 (37%), Positives = 23/53 (43%), Gaps = 10/53 (18%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPP----------PKAAPAPSPTPVPSQKTS 216
           AP    A+AP  PPP V  AP  P           P A P P+P P P+ K  
Sbjct: 113 APACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPI 165


>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER).  This
           family consists of several eukaryotic immediate early
           response (IER) 2 and 5 proteins. The role of IER5 is
           unclear although it play an important role in mediating
           the cellular response to mitogenic signals. Again,
           little is known about the function of IER2 although it
           is thought to play a role in mediating the cellular
           responses to a variety of extracellular signals.
          Length = 272

 Score = 30.6 bits (69), Expect = 1.4
 Identities = 22/114 (19%), Positives = 29/114 (25%), Gaps = 10/114 (8%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEK 234
           PP     AP   PP  A  P P    +     P P P+      +  + +          
Sbjct: 68  PPEQQPGAPRLHPPSGAEPPLPARAASPEEEQPAPEPAFYRPSVSEGFPA---------S 118

Query: 235 GLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGV 288
           G    +   GSG     T  DL        V +P                   V
Sbjct: 119 GERPDATSTGSGQCSQTTVLDLDTHV-VTTVESPYLHQDCPCQAEDGQPAPPAV 171


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 11/71 (15%), Positives = 15/71 (21%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
                  D  PP              AA    P     P   P    +++        A+
Sbjct: 190 VEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAA 249

Query: 225 PLAKRLAAEKG 235
                  A K 
Sbjct: 250 TGEPAANAAKA 260


>gnl|CDD|217214 pfam02749, QRPTase_N, Quinolinate phosphoribosyl transferase,
           N-terminal domain.  Quinolinate phosphoribosyl
           transferase (QPRTase) or nicotinate-nucleotide
           pyrophosphorylase EC:2.4.2.19 is involved in the de novo
           synthesis of NAD in both prokaryotes and eukaryotes. It
           catalyzes the reaction of quinolinic acid with
           5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
           of Mg2+ to give rise to nicotinic acid mononucleotide
           (NaMN), pyrophosphate and carbon dioxide. The QA
           substrate is bound between the C-terminal domain of one
           subunit, and the N-terminal domain of the other. The
           N-terminal domain has an alpha/beta hammerhead fold.
          Length = 88

 Score = 28.6 bits (65), Expect = 1.4
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
           V W  K+G+++  GD++ EIE
Sbjct: 47  VEWLVKDGERVEAGDVILEIE 67


>gnl|CDD|236449 PRK09282, PRK09282, pyruvate carboxylase subunit B; Validated.
          Length = 592

 Score = 31.0 bits (71), Expect = 1.4
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 89  ALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           A++  M  GT+V    KEGDK+  GD +  +E  K     + P +G + +ILV  G
Sbjct: 524 AVTSPMP-GTVVKVKVKEGDKVKAGDTVLVLEAMKMENEIQAPVDGTVKEILVKEG 578


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 31.2 bits (70), Expect = 1.4
 Identities = 17/81 (20%), Positives = 27/81 (33%), Gaps = 2/81 (2%)

Query: 207 PTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVA 266
           P P  S    GG      P    +  +  +D+      S L     +   +  + +    
Sbjct: 467 PEPDVSCVEKGGLLEKIEPEPSLIIGDSDIDMG--HTDSHLLDQFNTGGTTFQTASTPAP 524

Query: 267 APSKSAKPTANGPFTDLPVSG 287
             S  A PTA G  +  P +G
Sbjct: 525 NASGQATPTATGVASPQPPAG 545



 Score = 29.2 bits (65), Expect = 5.6
 Identities = 25/89 (28%), Positives = 32/89 (35%), Gaps = 7/89 (7%)

Query: 175 PPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
           P  AG S   PP P   A        P   A A  P      K   G R  AS  A + A
Sbjct: 206 PQMAGQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHR--ASQAAGQAA 263

Query: 232 AEKG-LDLSSIGAGSGLFGSITSADLSKA 259
           +    L+   +  G  L   +T  ++SK 
Sbjct: 264 SLDFALEDEDVSRGD-LMDLLTPREVSKM 291


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 31.2 bits (70), Expect = 1.4
 Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 4/75 (5%)

Query: 143 AGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
           AGS  V +G     +VE+ +D   +   ++  PP+     P P P       P P P   
Sbjct: 890 AGSGYVNMGAYDYTLVEDNND--WYLRSQEVTPPSPPDPDPTPDPD--PTPDPDPTPDPE 945

Query: 203 PAPSPTPVPSQKTSG 217
           P P+  PV + K  G
Sbjct: 946 PTPAYQPVLNAKVGG 960


>gnl|CDD|218950 pfam06236, MelC1, Tyrosinase co-factor MelC1.  This family consists
           of several tyrosinase co-factor MELC1 proteins from a
           number of Streptomyces species. The melanin operon
           (melC) of Streptomyces antibioticus contains two genes,
           melC1 and melC2 (apotyrosinase). It is thought that
           MelC1 forms a transient binary complex with the
           downstream apotyrosinase MelC2 to facilitate the
           incorporation of copper ion and the secretion of
           tyrosinase indicating that MelC1 is a chaperone for the
           apotyrosinase MelC2.
          Length = 124

 Score = 29.5 bits (66), Expect = 1.5
 Identities = 15/51 (29%), Positives = 18/51 (35%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
           AAG +A         AA  P P     AP   P    +   G R+   P A
Sbjct: 10  AAGLAAGGEAAAAPDAAAHPGPSTGRGAPGGAPASFDEVYKGRRIQGGPSA 60


>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family.  The herpesvirus
           UL25 gene product is a virion component involved in
           virus penetration and capsid assembly. The product of
           the UL25 gene is required for packaging but not cleavage
           of replicated viral DNA. This family includes a number
           of herpesvirus proteins: EHV-1 36, EBV BVRF1, HCMV UL77,
           ILTV ORF2, and VZV gene 34.
          Length = 538

 Score = 30.8 bits (70), Expect = 1.5
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
            P     +A     P  AA P     +AAP       
Sbjct: 88  DPLDLLETAARAAEPADAAQPDATAGEAAPGEGVGGT 124


>gnl|CDD|225401 COG2845, COG2845, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 354

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPP-PPPPKAAPAPSPT 208
           F+   PP      P+   P+ +AA    P P A   P+ T
Sbjct: 69  FRQGFPPPIAKVGPSLDMPRASAASEVAPRPGATAEPATT 108


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 12/46 (26%), Positives = 18/46 (39%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           +      +  P ++  S P  PP +      PPPP   P P P  +
Sbjct: 153 SDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPPPPPPSL 198


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 30.2 bits (68), Expect = 1.6
 Identities = 11/44 (25%), Positives = 15/44 (34%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
           +  +  A   +      P      P  P   APAP+P PV    
Sbjct: 128 RTQSQKAQSQATTVQTQPVKPKPRPEKPQPVAPAPAPEPVEKAP 171



 Score = 28.7 bits (64), Expect = 6.1
 Identities = 19/59 (32%), Positives = 24/59 (40%), Gaps = 2/59 (3%)

Query: 175 PPAAGASAPAPPPPKVA--AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
            P   A APAP P + A  A   PPP   A   + T    Q  +  T   A  +  +LA
Sbjct: 154 KPQPVAPAPAPEPVEKAPKAEAAPPPKPKAEDAAETRYMLQCGAFKTAEQAESVRAQLA 212


>gnl|CDD|236098 PRK07789, PRK07789, acetolactate synthase 1 catalytic subunit;
           Validated.
          Length = 612

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 15/33 (45%), Positives = 18/33 (54%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           P PP  AAA    PP AAPA  P  V  ++ +G
Sbjct: 1   PSPPTPAAAASAAPPPAAPAARPRIVAPERMTG 33



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 12/32 (37%), Positives = 13/32 (40%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
             PP   AAA   PPP A  A      P + T
Sbjct: 1   PSPPTPAAAASAAPPPAAPAARPRIVAPERMT 32


>gnl|CDD|237873 PRK14969, PRK14969, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 527

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 16/62 (25%), Positives = 19/62 (30%), Gaps = 7/62 (11%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP-------VPSQKTSGGTRV 221
           +    A  A GA         V  A P  P  A P+   +        V   K SG  R 
Sbjct: 362 EQDSGAGRAPGAVPGVNRTRAVPTAAPAAPAAATPSDRQSFNGNWRALVDQLKLSGLARE 421

Query: 222 YA 223
            A
Sbjct: 422 LA 423


>gnl|CDD|223583 COG0509, GcvH, Glycine cleavage system H protein (lipoate-binding)
           [Amino acid transport and metabolism].
          Length = 131

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 78  VREGNDRVALPALSPTM--EMGTIVSW-AKKEGDKLNEGDLLAEIETDKATMGFETPEEG 134
           VR   D  A   ++     ++G IV     + G ++  G+ LA +E+ KA      P  G
Sbjct: 18  VRVEGDGTATVGITDYAQDQLGDIVFVELPEVGAEVKAGESLAVVESVKAASDVYAPVSG 77


>gnl|CDD|235709 PRK06132, PRK06132, hypothetical protein; Provisional.
          Length = 359

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 14/39 (35%), Positives = 15/39 (38%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP 209
              AP  A AS  A      A AP P     A A +PT 
Sbjct: 187 PKAAPAPAPASDAAANGRPAAPAPAPAEAAWADADAPTG 225


>gnl|CDD|236365 PRK09041, motB, flagellar motor protein MotB; Validated.
          Length = 317

 Score = 30.3 bits (69), Expect = 1.9
 Identities = 13/59 (22%), Positives = 20/59 (33%), Gaps = 7/59 (11%)

Query: 149 PIGKLVCIIVEN-------ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPK 200
           P+ + + I+V N         + A          P +   APA  P     A P   P+
Sbjct: 259 PVNRRISIVVLNKKAEEAILHENAESVALSAQNEPVSALEAPAAAPAASVPAAPAAEPR 317


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 30.4 bits (68), Expect = 1.9
 Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 4/55 (7%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAP---SPTPVP-SQKTSGGTRVYASPL 226
           P A     AP PP+     P P           +P+P P S          A P 
Sbjct: 226 PVATNRGEAPSPPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPD 280


>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 755

 Score = 30.7 bits (69), Expect = 1.9
 Identities = 18/46 (39%), Positives = 24/46 (52%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
           +P +A ++ P+ P P  AAA  PPPP  A  P   P P+    G T
Sbjct: 145 SPKSALSAEPSHPVPAPAAASAPPPPPRAARPVRQPAPAPAAPGDT 190


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 29.9 bits (67), Expect = 2.0
 Identities = 32/157 (20%), Positives = 49/157 (31%), Gaps = 12/157 (7%)

Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAP-PPPKVAAAPPPPPPK 200
           P      P+     +    ++  AA      +A       A  P P P V A   P    
Sbjct: 34  PHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVPTPHVPAVALPGDTD 93

Query: 201 AAPAPSPTP-VPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKA 259
           A   P   P V +++ +       S LA           ++  AG        + D +  
Sbjct: 94  APAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADAGD------AAPDATPP 147

Query: 260 SKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRL 296
           +   A    +  A  +A    TDL        IA+RL
Sbjct: 148 AAGDASPPAAAQAAASAAAALTDLDAQ----HIAERL 180


>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
           Provisional.
          Length = 432

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 8/43 (18%), Positives = 12/43 (27%), Gaps = 3/43 (6%)

Query: 165 AAFKDFKDDAPPAAGASAPAPPPPKVAAAPP---PPPPKAAPA 204
               DF +       +     P    A +      PPP+ A  
Sbjct: 7   DFIADFDESLLGPPESETAPAPYATSAPSSAQLALPPPRPASP 49


>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit.  This
           model describes the bacterial oxaloacetate decarboxylase
           alpha subunit and its equivalents in archaea. The
           oxaloacetate decarboxylase Na+ pump is the paradigm of
           the family of Na+ transport decarboxylases that present
           in bacteria and archaea. It a multi subunit enzyme
           consisting of a peripheral alpha-subunit and integral
           membrane subunits beta and gamma. The energy released by
           the decarboxylation reaction of oxaloacetate is coupled
           to Na+ ion pumping across the membrane [Transport and
           binding proteins, Cations and iron carrying compounds,
           Energy metabolism, Other].
          Length = 582

 Score = 30.2 bits (68), Expect = 2.2
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAGSK 146
           G+IV     EG  + EG++L  +E  K     +    G + +ILV  G  
Sbjct: 526 GSIVKVKVSEGQTVAEGEVLLILEAMKMETEIKAAAAGTVREILVKVGDA 575


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 30.1 bits (68), Expect = 2.2
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 195 PPPPPKAAPAPSPTPVPSQKTSG 217
           P P   A P+PSPTPVPS    G
Sbjct: 86  PSPTTVATPSPSPTPVPSSPAVG 108


>gnl|CDD|220732 pfam10401, IRF-3, Interferon-regulatory factor 3.  This is the
           interferon-regulatory factor 3 chain of the
           hetero-dimeric structure which also contains the shorter
           chain CREB-binding protein. These two subunits make up
           the DRAF1 (double-stranded RNA-activated factor 1).
           Viral dsRNA produced during viral transcription or
           replication leads to the activation of DRAF1. The
           DNA-binding specificity of DRAF1 correlates with
           transcriptional induction of ISG
           (interferon-alpha,beta-stimulated gene). IRF-3 preexists
           in the cytoplasm of uninfected cells and translocates to
           the nucleus following viral infection. Translocation of
           IRF-3 is accompanied by an increase in serine and
           threonine phosphorylation, and association with the CREB
           coactivator occurs only after infection.
          Length = 180

 Score = 29.5 bits (67), Expect = 2.3
 Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 10/69 (14%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSS 240
           S    PP      P          PSP  +P +K    TR     L K L  E+GL L  
Sbjct: 29  SYGPSPPQGHLCGPGTLEQ--VLFPSPEGIPDEKQREYTRR----LLKHL--ERGLLLEV 80

Query: 241 IGAGSGLFG 249
              G G++ 
Sbjct: 81  --NGQGIYA 87


>gnl|CDD|217428 pfam03213, Pox_P35, Poxvirus P35 protein. 
          Length = 325

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 143 AGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAA 202
              K +PI  ++ ++    S+V    D K      A        PPK + AP P P K  
Sbjct: 3   PPDKKIPI-YVIPVVGRPPSEV--IPDLKSL-VIEADEVDDVKDPPKPSPAPAPLP-KPK 57

Query: 203 PAPSPTPVPSQKTSGG 218
           P PSP+    Q  SGG
Sbjct: 58  PDPSPSYKYYQWKSGG 73


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 30.3 bits (68), Expect = 2.4
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           S P   P   +   PP  PKAA   S  P P +
Sbjct: 560 SEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRK 592


>gnl|CDD|197828 smart00674, CENPB, Putative DNA-binding domain in centromere
           protein B, mouse jerky and transposases. 
          Length = 66

 Score = 27.6 bits (62), Expect = 2.4
 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 10/60 (16%)

Query: 304 KLREQMNKAL-----EKRGAKLSINDFIIKATALASRRV-----PEANSSWQDTFIREYH 353
           +L   + KAL      +    + I+   I+  AL   +       +A+  W   F + ++
Sbjct: 2   RLYPLLEKALYEWILRQEALGIPISGEDIREKALEILQRLGLENFKASKGWLTRFKKRHN 61


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
           genes is always found in type III secretion operons,
           althought its function in the processes of secretion and
           virulence is unclear. Hpa stands for Hrp-associated
           gene, where Hrp stands for hypersensitivity response and
           virulence.
          Length = 201

 Score = 29.5 bits (66), Expect = 2.4
 Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 13/55 (23%)

Query: 174 APPAAGASAPAPPP-------------PKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           A  A  A A    P               +A APPP P    P     P P   T
Sbjct: 5   ARLARPAPADPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADT 59


>gnl|CDD|146531 pfam03948, Ribosomal_L9_C, Ribosomal protein L9, C-terminal domain.
           
          Length = 86

 Score = 27.8 bits (63), Expect = 2.5
 Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 5/42 (11%)

Query: 227 AKRLAAE-KGLDLS---SIGAGSGLFGSITSADLSKA-SKAG 263
           A+ LA + +GL ++     G    LFGS+T+ D+++A    G
Sbjct: 4   AEALAEKLEGLTVTIKAKAGEDGKLFGSVTTKDIAEALKAQG 45


>gnl|CDD|221009 pfam11162, DUF2946, Protein of unknown function (DUF2946).  This
           family of proteins has no known function.
          Length = 119

 Score = 28.6 bits (64), Expect = 2.6
 Identities = 11/46 (23%), Positives = 14/46 (30%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           A  PA P              A P P P  +P +      +  A P
Sbjct: 74  AHTPALPAALPLLLALVRLAAAVPPPPPASLPPRSRWPSAQPRAPP 119


>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
           Provisional.
          Length = 170

 Score = 29.3 bits (66), Expect = 2.6
 Identities = 8/43 (18%), Positives = 17/43 (39%), Gaps = 2/43 (4%)

Query: 172 DDAPPAAGASAPAPPPPKVAAA--PPPPPPKAAPAPSPTPVPS 212
           + +P ++ +   +      ++A  P        P PS   VP+
Sbjct: 14  NSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVPN 56


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
           Provisional.
          Length = 418

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 21/127 (16%), Positives = 35/127 (27%), Gaps = 7/127 (5%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
               +D+A     +   P    AS P PPP +      P   K  P  +      +    
Sbjct: 287 AGETADLADDDGSEHSDPEPLPASLP-PPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQP 345

Query: 218 GTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTAN 277
            T   +        +    +  S   G G   +  S+ L            + S +   +
Sbjct: 346 ATSTSSK------GSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSPPLDLTTSLRHMPS 399

Query: 278 GPFTDLP 284
              T  P
Sbjct: 400 PSVTSAP 406



 Score = 28.3 bits (63), Expect = 9.1
 Identities = 7/27 (25%), Positives = 8/27 (29%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTP 209
           P           P P   +AP P   P
Sbjct: 386 PPLDLTTSLRHMPSPSVTSAPEPPSIP 412


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 29.2 bits (64), Expect = 2.6
 Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 7/88 (7%)

Query: 124 ATMGFETPEEGYLAKILVPAGSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAP 183
           A +GF    +    K +   G      G  V     N +D        DD P       P
Sbjct: 20  AGVGFAAAADEPADKGITKGGKGKGEKGTKVGADDTNGTD-------PDDEPEEEEEPEP 72

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVP 211
                +          +  P  +    P
Sbjct: 73  EEEGEEEPEPEEEGEEEPEPEETGEEEP 100


>gnl|CDD|234404 TIGR03931, T7SS_Rv3446c, type VII secretion-associated protein,
           Rv3446c family, C-terminal domain.  Members of this
           protein family occur as part of the ESX-4 cluster of
           type VII secretion system (T7SS) proteins in
           Mycobacterium tuberculosis and in similar T7SS clusters
           in other Actinobacteria genera, including
           Corynebacterium, Nocardia, Rhodococcus, and
           Saccharopolyspora. This model describes the
           better-conserved C-terminal region [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 172

 Score = 29.4 bits (66), Expect = 2.8
 Identities = 12/39 (30%), Positives = 13/39 (33%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
            AP      A         A A S TP P     G TR+
Sbjct: 2   GAPSGDTPKAKVRESSTAQATAESTTPAPPPPRPGVTRL 40


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 29.4 bits (66), Expect = 2.8
 Identities = 18/107 (16%), Positives = 28/107 (26%), Gaps = 3/107 (2%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT-RVYASPLAKRLA 231
             P      A    P +    P  PP    PAP+P    +   +    +  A P AK  +
Sbjct: 103 SEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPRAK--S 160

Query: 232 AEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANG 278
                  +     +           +    A   A P+K        
Sbjct: 161 PRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKKKT 207



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 18/77 (23%), Positives = 23/77 (29%), Gaps = 2/77 (2%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP-TPVPSQK-TSGG 218
                  +       P A       P    AAA     PKA   P   +P P+ K     
Sbjct: 112 AEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKP 171

Query: 219 TRVYASPLAKRLAAEKG 235
           T    +  AK+ AA   
Sbjct: 172 TETITAKKAKKTAAAPK 188


>gnl|CDD|216245 pfam01015, Ribosomal_S3Ae, Ribosomal S3Ae family. 
          Length = 195

 Score = 29.4 bits (67), Expect = 2.8
 Identities = 8/36 (22%), Positives = 19/36 (52%)

Query: 291 VIAKRLLQSKQVIKLREQMNKALEKRGAKLSINDFI 326
              K+  Q+ Q   +R++M + + K  ++L   +F+
Sbjct: 123 AFTKKRAQTSQKRAIRKKMTEIITKEASELDFKEFV 158


>gnl|CDD|165416 PHA03145, PHA03145, helicase-primase primase subunit; Provisional.
          Length = 1058

 Score = 30.0 bits (67), Expect = 2.9
 Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVA---AAPPPPPPKAAPAPSPTPVPSQKTSGGTRV 221
           D +DD  P A A A      + A   A    PP +AAPA   +P  +     G RV
Sbjct: 707 DEEDDEDPDAEALAHGGVDEEAAYLDALHEQPPAEAAPAEPESPRCTCADKIGLRV 762


>gnl|CDD|233219 TIGR00976, /NonD, putative hydrolase, CocE/NonD family.  This model
           represents a protein subfamily that includes the cocaine
           esterase CocE, several glutaryl-7-ACA acylases, and the
           putative diester hydrolase NonD of Streptomyces griseus
           (all hydrolases). This family shows extensive, low-level
           similarity to a family of xaa-pro dipeptidyl-peptidases,
           and local similarity by PSI-BLAST to many other
           hydrolases [Unknown function, Enzymes of unknown
           specificity].
          Length = 550

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 13/53 (24%), Positives = 16/53 (30%), Gaps = 1/53 (1%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
           + GA + APP P    A P       P PS     S             +  R
Sbjct: 356 SGGALSTAPPSPT-EGADPYLYDPWDPVPSLGGQLSGAGGPAGPAKQDRIDDR 407


>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
          Length = 226

 Score = 29.5 bits (66), Expect = 3.0
 Identities = 11/28 (39%), Positives = 12/28 (42%)

Query: 184 APPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           A P PK  A PPP    +   P P   P
Sbjct: 187 ASPQPKSLATPPPVAAPSRRTPRPRRKP 214


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 12/65 (18%), Positives = 24/65 (36%), Gaps = 4/65 (6%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT----SGGTRVYASPLAKR 229
            P   G+ +  P      ++P PP    +   + TP P+  +    +   R  A    ++
Sbjct: 299 EPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRK 358

Query: 230 LAAEK 234
              +K
Sbjct: 359 REGDK 363


>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score = 29.7 bits (67), Expect = 3.0
 Identities = 16/65 (24%), Positives = 21/65 (32%), Gaps = 5/65 (7%)

Query: 161 ESDVAAFKDFKDDAPPAAGASAPAPPPPK--VAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
             +  A +D    A PAA   A A        AA   P     +       +P +    G
Sbjct: 1   SMEAPATEDAGRPAAPAASDDAAATDGAGANAAAPAAPAESDPSKKRPGVMLPLEV---G 57

Query: 219 TRVYA 223
           TRV  
Sbjct: 58  TRVMC 62


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 30.0 bits (67), Expect = 3.0
 Identities = 18/47 (38%), Positives = 22/47 (46%), Gaps = 11/47 (23%)

Query: 174 APPAAGAS----APAPPPPKVAAAP-------PPPPPKAAPAPSPTP 209
             PAAG +    APA P P  A +P       PP PP++   P P P
Sbjct: 408 TAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAP 454



 Score = 29.6 bits (66), Expect = 3.2
 Identities = 31/125 (24%), Positives = 43/125 (34%), Gaps = 37/125 (29%)

Query: 176 PAAGASAPAPPPPKV--------------AAAPPPPPPKAAPAPSP--TPVPSQKTSGGT 219
           PA+G +A  P P                 +AAP  P P AAP+P       P      G 
Sbjct: 389 PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGI 448

Query: 220 RVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA---VAAPSKSAKPTA 276
               +P               +   S     +  A  S AS + A   +  PS +A+ T 
Sbjct: 449 PPRPAP--------------RMPEAS----PVPGAPDSVASASDAPPTLGDPSDTAEHTP 490

Query: 277 NGPFT 281
           +GP T
Sbjct: 491 SGPRT 495



 Score = 29.6 bits (66), Expect = 3.6
 Identities = 25/113 (22%), Positives = 34/113 (30%), Gaps = 4/113 (3%)

Query: 174 APPAAGASAPAPPPPKVAAAPPP---PPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
           A   +G++A  P     A  P P    P   APA   TP  +   +       SP     
Sbjct: 378 ASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437

Query: 231 AAEKGLDLSSIGA-GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTD 282
            A      S I    +      +    +  S A A  AP     P+     T 
Sbjct: 438 DAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTP 490


>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
           Provisional.
          Length = 445

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 18/51 (35%), Positives = 22/51 (43%), Gaps = 10/51 (19%)

Query: 173 DAPPAAGASAPAPPPPKVA--AAPPP--PPPKAAPAPSPTPVPSQKTSGGT 219
           D PP      P PPPP VA     P    P  + PA +P    S +T+G  
Sbjct: 128 DVPP------PPPPPPVVAKRVETPAVVAPRVSEPARNPFKTESNRTTGVI 172


>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 702

 Score = 30.0 bits (67), Expect = 3.0
 Identities = 15/57 (26%), Positives = 22/57 (38%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQK 214
            +  + V+A +  +  + PA     P P P       P P P+  P P P P P   
Sbjct: 389 AQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPQPN 445


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 29.5 bits (66), Expect = 3.1
 Identities = 12/33 (36%), Positives = 13/33 (39%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           S P PPP +   A  P PP   P       P Q
Sbjct: 126 SQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ 158



 Score = 28.8 bits (64), Expect = 5.7
 Identities = 13/46 (28%), Positives = 16/46 (34%), Gaps = 2/46 (4%)

Query: 174 APPAAGASAPAPPPPKVAAAP--PPPPPKAAPAPSPTPVPSQKTSG 217
           AP  +G      P    +  P  P P   A   P     PSQ+  G
Sbjct: 173 APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYG 218


>gnl|CDD|234068 TIGR02946, acyl_WS_DGAT, acyltransferase, WS/DGAT/MGAT.  This
           bacteria-specific protein family includes a
           characterized, homodimeric, broad specificity
           acyltransferase from Acinetobacter sp. strain ADP1,
           active as wax ester synthase, as acyl coenzyme
           A:diacylglycerol acyltransferase, and as
           acyl-CoA:monoacylglycerol acyltransferase [Unknown
           function, Enzymes of unknown specificity].
          Length = 446

 Score = 29.6 bits (67), Expect = 3.2
 Identities = 40/144 (27%), Positives = 59/144 (40%), Gaps = 26/144 (18%)

Query: 194 PPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITS 253
            P PPP  AP P P P      SG      S L +R+A             S   G + +
Sbjct: 150 DPDPPPLPAPPPPPQPSTRGLLSGALSGLPSALLRRVA-------------STAPGVVRA 196

Query: 254 ADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKRLLQSKQVIKLREQMNKAL 313
           A         AV   ++SA+P    PFT  P + + G I+++   + Q + L   +    
Sbjct: 197 A-------GRAVEGVARSARPAL--PFTA-PPTPLNGPISRKRRFAAQSLPL-ADVKAVA 245

Query: 314 EKRGAKLSINDFIIKATALASRRV 337
           +  G   +IND ++ A A A RR 
Sbjct: 246 KAFGV--TINDVVLAAVAGALRRY 267


>gnl|CDD|204676 pfam11574, DUF3235, Protein of unknown function (DUF3235).  Some
           members in this family of proteins with unknown function
           are annotated as RpfA however this cannot be confirmed.
          Length = 90

 Score = 27.7 bits (61), Expect = 3.2
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query: 184 APPPPKVAAAPPPPPPKAAPA 204
           AP P  + A P P P  AAPA
Sbjct: 4   APTPRDLVANPKPAPAAAAPA 24


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 29.9 bits (67), Expect = 3.3
 Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 18/86 (20%)

Query: 159 ENESDVAAFKDFKDDAPPAA----GASAPAPPPPKVAAAPP-----------PPPPKAAP 203
              + V        D P       G+S P  P   VAAA P             P   A 
Sbjct: 327 LAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVP---VAAATPVDASLPDVRTGTAPTSLAN 383

Query: 204 APSPTPVPSQKTSGGTRVYASPLAKR 229
                P  +Q T   T   A+P    
Sbjct: 384 VSHADPAVAQPTQAATLAGAAPKGAT 409


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 26/116 (22%), Positives = 42/116 (36%), Gaps = 12/116 (10%)

Query: 159 ENESDVAAFKD-------FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           E++ D+  F D        K    P   +    P  PK ++ P      ++      P  
Sbjct: 29  EDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRK 88

Query: 212 SQKTSGGTRVY-----ASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKA 262
           S +++  +             +    E    ++SIG    L+GSITS   S AS A
Sbjct: 89  SAESTRSSHPKSKAPSTESEEEEEPEETPDPIASIGGWWSLWGSITSTATSTASAA 144


>gnl|CDD|227111 COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit
           [Lipid metabolism].
          Length = 645

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 20/48 (41%), Positives = 28/48 (58%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKILVPAG 144
           GT+VS A KEG +++ GDLL  +E  K       P +G +AK+ V  G
Sbjct: 584 GTVVSVAVKEGQEVSAGDLLVVLEAMKMENTLRAPRDGVVAKLAVAEG 631


>gnl|CDD|225501 COG2951, MltB, Membrane-bound lytic murein transglycosylase B [Cell
           envelope biogenesis, outer membrane].
          Length = 343

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 21/103 (20%), Positives = 31/103 (30%), Gaps = 9/103 (8%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP--SPTPVPSQKT 215
           ++  +  AA          AA A       P+V A   P P    P    S   +   + 
Sbjct: 45  LQGFAPEAAAAGISAAVLDAAFAG--VSYDPRVIARDRPQPEFRQPVAEFSARRISEARV 102

Query: 216 SGGTRVYASPLA--KRLAAEKGLDLSSIGAGSGL---FGSITS 253
             G    A   A   R     G+    + A  G+   FG +  
Sbjct: 103 QQGRGFPAQYAAALARAERRYGVPAPILVAIWGMETGFGRVMG 145


>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is about 100
           amino acids in length. This domain is found associated
           with pfam00096. This domain has two conserved sequence
           motifs: YPSPV and PSP.
          Length = 100

 Score = 27.8 bits (61), Expect = 3.5
 Identities = 15/52 (28%), Positives = 23/52 (44%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
           A  AA  S  +P     +++P    P       P+PVP+  +S  +  Y SP
Sbjct: 1   ADKAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSP 52


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 29.7 bits (66), Expect = 3.5
 Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 4/83 (4%)

Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTP----VPSQKTSG 217
           S  ++   F  D+P    A + +PP   + + P  P    +P+ SP       PS   S 
Sbjct: 576 SAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSS 635

Query: 218 GTRVYASPLAKRLAAEKGLDLSS 240
                     +  + E  + ++S
Sbjct: 636 PESSIKVASTETASPESSIKVAS 658


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 29.6 bits (67), Expect = 3.6
 Identities = 22/75 (29%), Positives = 25/75 (33%), Gaps = 11/75 (14%)

Query: 183 PAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG----------TRVYASP-LAKRLA 231
           PAPP P     PPPPPP       P     Q  +             R    P L  RLA
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIVLEAVRAALPPDLLARLA 318

Query: 232 AEKGLDLSSIGAGSG 246
           A      +    G+G
Sbjct: 319 AGGARARARSAGGAG 333


>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
          Length = 214

 Score = 29.0 bits (64), Expect = 3.8
 Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 2/57 (3%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAA--PAPSPTPVPSQKTSGGTRVYASPLA 227
                A    PAP P    A P  P P     PAP  T VP+   +      A+P++
Sbjct: 133 PGAGQAHTPVPAPAPVIAQAQPIAPAPHQTLVPAPHDTIVPAPHAAHLPSAPATPVS 189



 Score = 28.6 bits (63), Expect = 4.6
 Identities = 16/43 (37%), Positives = 18/43 (41%), Gaps = 3/43 (6%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKA---APAPSPTPVPS 212
               AAGA   A        AP P   +A   APAP  T VP+
Sbjct: 124 QVDAAAGAEPGAGQAHTPVPAPAPVIAQAQPIAPAPHQTLVPA 166


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 29.1 bits (65), Expect = 3.8
 Identities = 13/48 (27%), Positives = 18/48 (37%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
           A    A  P   K A A       A+PA +P   P+ K +  +    S
Sbjct: 159 AVSEPAVEPKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKS 206


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 29.6 bits (66), Expect = 3.8
 Identities = 13/55 (23%), Positives = 15/55 (27%), Gaps = 1/55 (1%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP 225
            D   P      P   P +      PP P+    P     P    S GT     P
Sbjct: 379 ADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV-SPYNPQSPGTSYGPEP 432



 Score = 28.4 bits (63), Expect = 9.3
 Identities = 22/90 (24%), Positives = 25/90 (27%), Gaps = 16/90 (17%)

Query: 142 PAGSKDVPIGKLVCIIVENESDVAAFKDF---------KDDAPPAAGASAPAPPPPKVAA 192
           PA         L     +      A                A      S PAP  P  A+
Sbjct: 568 PAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQASAQPAGL----SLPAPAVPVNAS 623

Query: 193 APPPPPPKAAPAPSPTPV---PSQKTSGGT 219
            P   PP  AP   P P    PS +TS   
Sbjct: 624 TPASTPPPLAPQEPPQPGTSAPSLETSLPQ 653


>gnl|CDD|236063 PRK07586, PRK07586, hypothetical protein; Validated.
          Length = 514

 Score = 29.4 bits (67), Expect = 3.8
 Identities = 14/39 (35%), Positives = 16/39 (41%), Gaps = 4/39 (10%)

Query: 172 DDAPPA----AGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           +DA  A    A A    P  P +AA   PP P  A  P 
Sbjct: 302 EDAAAALEALADALGAKPAAPPLAAPARPPLPTGALTPE 340


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 29.4 bits (66), Expect = 3.8
 Identities = 15/39 (38%), Positives = 16/39 (41%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           +  PPA  ASAPAP     A      P  A P P   P 
Sbjct: 15  EATPPAPAASAPAPAAAAPAPVAAAAPAAAGPRPDDEPF 53



 Score = 28.3 bits (63), Expect = 9.7
 Identities = 14/40 (35%), Positives = 17/40 (42%)

Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           F     + P    A+ PAP     A A   P P AA AP+
Sbjct: 3   FATDAKEEPAKEEATPPAPAASAPAPAAAAPAPVAAAAPA 42


>gnl|CDD|173798 cd05562, Peptidases_S53_like, Peptidase domain in the S53 family.
           Members of the peptidase S53 (sedolisin) family include
           endopeptidases and exopeptidases. The S53 family
           contains a catalytic triad Glu/Asp/Ser with an
           additional acidic residue Asp in the oxyanion hole,
           similar to that of Asn in subtilisin. The stability of
           these enzymes may be enhanced by calcium, some members
           have been shown to bind up to 4 ions via binding sites
           with different affinity. Some members of this clan
           contain disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values. Characterized sedolisins include Kumamolisin,
           an extracellular calcium-dependent thermostable
           endopeptidase from Bacillus. The enzyme is synthesized
           with a 188 amino acid N-terminal preprotein region which
           is cleaved after the extraction into the extracellular
           space with low pH. One kumamolysin paralog,
           kumamolisin-As, is believed to be a collagenase. TPP1 is
           a serine protease that functions as a tripeptidyl
           exopeptidase as well as an endopeptidase. Less is known
           about PSCP from Pseudomonas which is thought to be an
           aspartic proteinase.
          Length = 275

 Score = 29.2 bits (66), Expect = 3.8
 Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 12/93 (12%)

Query: 190 VAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR---LAAEKGLDLSSIGAGSG 246
           V A      P     P+P   PS     G R   +P  ++   + A  G++ +  G G G
Sbjct: 151 VGAVDYGNTPAFGSDPAPGGTPSSFDPVGIR-LPTPEVRQKPDVTAPDGVNGTVDGDGDG 209

Query: 247 L---FGSITSADLSKASKAGAVAAPSKSAKPTA 276
               FG  TSA    A  A  VAA   SA P  
Sbjct: 210 PPNFFG--TSA---AAPHAAGVAALVLSANPGL 237


>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
          Length = 336

 Score = 29.2 bits (65), Expect = 4.0
 Identities = 15/63 (23%), Positives = 19/63 (30%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           A P   A A +   P V AA       A   P    V            A+P A  + A 
Sbjct: 268 AVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVAPVAAPAAPEVPAV 327

Query: 234 KGL 236
             +
Sbjct: 328 PVV 330



 Score = 28.4 bits (63), Expect = 7.9
 Identities = 15/68 (22%), Positives = 19/68 (27%), Gaps = 1/68 (1%)

Query: 144 GSKDVPIGKLVCIIVENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAP 203
           G +D+ I         N   V         A   A  +A A       + P  P     P
Sbjct: 249 GGEDLIIKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLP 308

Query: 204 -APSPTPV 210
             P   PV
Sbjct: 309 DVPQVAPV 316


>gnl|CDD|220585 pfam10123, Mu-like_Pro, Mu-like prophage I protein.  Members of
           this family of proteins comprise various viral Mu-like
           prophage I proteins.
          Length = 326

 Score = 29.3 bits (66), Expect = 4.0
 Identities = 13/46 (28%), Positives = 14/46 (30%), Gaps = 5/46 (10%)

Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
            D AA K F   APP A          +     PP       A S 
Sbjct: 268 QDPAALKAFLAKAPPVAALLG-----TQTGGKIPPGKDTGTAALSA 308


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 29.6 bits (66), Expect = 4.0
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 6/48 (12%)

Query: 175 PPAAGASAPAP------PPPKVAAAPPPPPPKAAPAPSPTPVPSQKTS 216
           P     S P+P      PP  V+    P PP   P+PSP    S  + 
Sbjct: 427 PQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQ 474


>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
          Length = 732

 Score = 29.5 bits (66), Expect = 4.1
 Identities = 12/65 (18%), Positives = 20/65 (30%), Gaps = 4/65 (6%)

Query: 170 FKDDAP---PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
           F+ D P        +    P        P  P +    P    +  +K+  G  V A  +
Sbjct: 603 FEPDVPEVASGEDVTQAEQPQQPQQPQQPQQPQQPQ-QPVSPVINDKKSDAGVNVPAGGI 661

Query: 227 AKRLA 231
            + L 
Sbjct: 662 EQELK 666


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 29.1 bits (65), Expect = 4.2
 Identities = 11/38 (28%), Positives = 15/38 (39%), Gaps = 5/38 (13%)

Query: 180 ASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
           A + +P PP       P PP+  P  SP   P +    
Sbjct: 4   ARSRSPSPP-----RRPSPPRPTPPRSPDASPEETPPS 36



 Score = 27.9 bits (62), Expect = 9.6
 Identities = 17/41 (41%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 174 APPAAGASAPAPPPPKV---AAAPPPPPPKAAPAPSPTPVP 211
           A PAA    P   P  V   ++APP PP     AP+ TP P
Sbjct: 50  AGPAAPRRRPRGCPAGVTFSSSAPPRPPLGLDDAPAATPPP 90


>gnl|CDD|234273 TIGR03593, yidC_nterm, membrane protein insertase, YidC/Oxa1
           family, N-terminal domain.  Essentially all bacteria
           have a member of the YidC family, whose C-terminal
           domain is modeled by TIGR03592. The two copies are found
           in endospore-forming bacteria such as Bacillus subtilis
           appear redundant during vegetative growth, although the
           member designated spoIIIJ (stage III sporulation protein
           J) has a distinct role in spore formation. YidC, its
           mitochondrial homolog Oxa1, and its chloroplast homolog
           direct insertion into the bacterial/organellar inner (or
           only) membrane. This model describes an N-terminal
           sequence region, including a large periplasmic domain
           lacking in YidC members from Gram-positive species. The
           multifunctional YidC protein acts both with and
           independently of the Sec system [Protein fate, Protein
           and peptide secretion and trafficking].
          Length = 366

 Score = 29.2 bits (66), Expect = 4.2
 Identities = 10/46 (21%), Positives = 12/46 (26%)

Query: 182 APAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLA 227
            P PP P  AA         A A      P+           +  A
Sbjct: 26  GPPPPKPPAAAQTASAQSATAAATPAAAPPAADAPAAAAAADAAAA 71


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.4 bits (66), Expect = 4.3
 Identities = 20/131 (15%), Positives = 34/131 (25%), Gaps = 15/131 (11%)

Query: 152 KLVCIIVENESDVAAFKDFKDDAPPAAGASAPAP-------PPPKVAAAPPPPPPKAAPA 204
           +   I   N                +      A        PP  + +            
Sbjct: 126 QAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQ 185

Query: 205 PSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGA 264
            +     ++KT G T V A+ L KR   +K          S  F   T     K   + +
Sbjct: 186 DTSKETTTEKTEGKTSVKAASL-KRNPPKKS------NIMSSFFKKKTKEKKEKKEASES 238

Query: 265 VAAPSKSAKPT 275
                +S + +
Sbjct: 239 -TVKEESEEES 248


>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
          Length = 746

 Score = 29.5 bits (66), Expect = 4.4
 Identities = 14/61 (22%), Positives = 23/61 (37%), Gaps = 3/61 (4%)

Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPL 226
             D     P   GAS+PA      + AP  PP     A +   + +     G++    P+
Sbjct: 672 LGDLFSGLPSTVGASSPAFQAASGSQAPASPP---TAASAIEDLFANGMGSGSQTVPLPI 728

Query: 227 A 227
           +
Sbjct: 729 S 729


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 29.3 bits (66), Expect = 4.7
 Identities = 16/72 (22%), Positives = 26/72 (36%), Gaps = 13/72 (18%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPA---PSPTPVPSQKTSGGTRVYASP----- 225
           A  ++ +S+ + PP   +  P    P A+     PS  P+P   +     V   P     
Sbjct: 14  ASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQ 73

Query: 226 -----LAKRLAA 232
                  +R AA
Sbjct: 74  TPQLNPLQRAAA 85



 Score = 28.5 bits (64), Expect = 8.8
 Identities = 9/41 (21%), Positives = 13/41 (31%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
              P+A +    P    +     P PP     P+ T    Q
Sbjct: 36  RRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQTPQ 76


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 28.4 bits (63), Expect = 4.9
 Identities = 8/31 (25%), Positives = 10/31 (32%)

Query: 181 SAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           SAP     +   A       A   P+P P  
Sbjct: 131 SAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|189051 cd09881, PIN_VapC-FitB, PIN domain of ribonucleases (toxins), VapC
           and FitB, of prokaryotic toxin/antitoxin operons,
           Pyrococcus horikoshii protein PH0500, and other similar
           bacterial and archaeal homologs.  PIN (PilT N terminus)
           domain-containing proteins of prokaryotic
           toxin/antitoxin (TA) operons, such as, Mycobacterium
           tuberculosis VapC of the VapBC (virulence associated
           proteins) TA operon, and Neisseria gonorrhoeae FitB of
           the FitAB (fast intracellular trafficking) TA operon, as
           well as, the archaeal Pyrococcus horikoshii protein
           PH0500 are included in this family. Toxins of TA operons
           are believed to be involved in growth inhibition by
           regulating translation and are nearly always
           co-expressed with an antitoxin, a cognate protein
           inhibitor, forming an inert protein complex.
           Disassociation of the complex activates the ribonuclease
           activity of the toxin. In N. gonorrhoeae, FitA and FitB
           form a heterodimer: FitA is the DNA binding subunit and
           FitB contains a ribonuclease activity that is blocked by
           the presence of FitA. A tetramer of FitAB heterodimers
           binds DNA from the fitAB upstream promoter region with
           high affinity. This results in both sequestration of
           FitAB and repression of fitAB transcription.  It is
           thought that FitAB release from the DNA and subsequent
           dissociation both slows N. gonorrhoeae replication and
           transcytosis by an as yet undefined mechanism.  The
           toxin M. tuberculosis VapC is a structural homolog of N.
           gonorrhoeae FitB, but their antitoxin partners, VapB and
           FitA, respectively, differ structurally.  The M.
           tuberculosis VapC-5 is proposed to be both an
           endoribonuclease and an exoribonuclease that can act on
           free RNA in a similar manner to the endo and exonuclease
           flap endonuclease-1 (FEN1). VapC-like toxins are
           structural homologs of FEN1-like PIN domains, but lack
           the extensive arch/clamp region  and the H3TH
           (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region, seen in FEN1-like PIN
           domains.  PIN domains within this group typically
           contain three or four conserved acidic residues that
           cluster at the C-terminal end of the beta-sheet and form
           a negatively charged pocket near the center of the
           molecule. These putative active site residues are
           thought to bind Mg2+ and/or Mn2+ ions and be essential
           for single-stranded ribonuclease activity. VapC-like PIN
           domains are single domain proteins that form dimers and
           dimerization configures the active sites in a groove
           along the long-axis of the structure.
          Length = 127

 Score = 27.9 bits (63), Expect = 5.0
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 312 ALEKRGAKLSINDFIIKATALA 333
            L ++G  + +ND +I ATALA
Sbjct: 83  RLRRKGRPIGLNDALIAATALA 104


>gnl|CDD|225369 COG2812, DnaX, DNA polymerase III, gamma/tau subunits [DNA
           replication, recombination, and repair].
          Length = 515

 Score = 29.3 bits (66), Expect = 5.0
 Identities = 31/153 (20%), Positives = 44/153 (28%), Gaps = 10/153 (6%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRL 230
                 A            V+ A  P     APA S  P    +    T++  S  A   
Sbjct: 368 SPLLQRAPRLEQELVLKANVSLAAAPAQSPQAPALSKNPELKAREQLATKIKKSEPAPES 427

Query: 231 AAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRG 290
           +      + S       +G +       A K  A A   K + P    P           
Sbjct: 428 SEPSLAPVRSF------WGELLDE--VTAQKLKAEAKVLKDSDPVLAKPGALKRAFEGEI 479

Query: 291 VIAKRLLQSKQVIKLREQ--MNKALEKRGAKLS 321
              K      +V +  EQ   N A+E+R AK  
Sbjct: 480 TAKKLETGRVEVERGGEQLAANAAVEERAAKQP 512


>gnl|CDD|238286 cd00516, PRTase_typeII, Phosphoribosyltransferase (PRTase) type II;
           This family contains two enzymes that play an important
           role in NAD production by either allowing quinolinic
           acid (QA) , quinolinate phosphoribosyl transferase
           (QAPRTase), or nicotinic acid (NA), nicotinate
           phosphoribosyltransferase (NAPRTase), to be used in the
           synthesis of NAD. QAPRTase catalyses the reaction of
           quinolinic acid (QA) with
           5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
           of Mg2+ to produce nicotinic acid mononucleotide (NAMN),
           pyrophosphate and carbon dioxide, an important step in
           the de novo synthesis of NAD. NAPRTase catalyses a
           similar reaction leading to NAMN and pyrophosphate,
           using nicotinic acid an PPRP as substrates, used in the
           NAD salvage pathway.
          Length = 281

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 10/36 (27%), Positives = 15/36 (41%), Gaps = 1/36 (2%)

Query: 97  GTIVSWAKKEGDKLNEGDLLAEIE-TDKATMGFETP 131
           G +V  A  EG  +  G+ L  IE   +  +  E  
Sbjct: 49  GPLVILAVPEGTVVEPGEPLLTIEGPARELLLLERV 84


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 28.7 bits (64), Expect = 5.0
 Identities = 6/31 (19%), Positives = 7/31 (22%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
            P   G     P   +       P P   P 
Sbjct: 122 HPGGGGPQQFRPGAGQPPGMQHMPAPALPPN 152


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 10/40 (25%), Positives = 12/40 (30%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           ++ A     A      P   A      P    P P  TPV
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161


>gnl|CDD|234822 PRK00720, tatA, twin arginine translocase protein A; Provisional.
          Length = 78

 Score = 26.9 bits (60), Expect = 5.2
 Identities = 17/50 (34%), Positives = 17/50 (34%), Gaps = 8/50 (16%)

Query: 163 DVA-AFKDFK-----DDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           DVA   K FK     DDAPPA           K  A  P  P K      
Sbjct: 31  DVAKGIKSFKKGMADDDAPPAEPPEPVRTIDHK--ADEPVAPVKKDVGSK 78


>gnl|CDD|172536 PRK14042, PRK14042, pyruvate carboxylase subunit B; Provisional.
          Length = 596

 Score = 28.9 bits (64), Expect = 5.4
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 81  GNDRVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKATMGFETPEEGYLAKIL 140
           G+  VA+P        G+I++     GD++  G  +  IE  K     + P  G +A+IL
Sbjct: 526 GDITVAIP--------GSIIAIHVSAGDEVKAGQAVLVIEAMKMETEIKAPANGVVAEIL 577

Query: 141 VPAGSKDVP 149
              G K  P
Sbjct: 578 CQKGDKVTP 586


>gnl|CDD|139164 PRK12700, flgH, flagellar basal body L-ring protein; Reviewed.
          Length = 230

 Score = 28.5 bits (63), Expect = 5.5
 Identities = 35/147 (23%), Positives = 52/147 (35%), Gaps = 29/147 (19%)

Query: 177 AAGASAPAPPPPKVAA---APPPPPPKAAPAPSPTPVPSQKTSGGT-------------R 220
            A   A  PP P V     APPPPPP   P+  P     Q ++ G               
Sbjct: 18  LAAGCAMIPPEPVVTGPLTAPPPPPP--QPSARPNGSIYQPSAYGNYPLFEDRRPRNVGD 75

Query: 221 VYASPLAKRLAAEKGLDLSS---------IGAGSGLFGSITSADLSKASKAGAVAAPSKS 271
           +    L ++  A KG+  ++         + A       + +  L      G  A  + +
Sbjct: 76  IVTIVLEEKTNAAKGVATNTSRDGSATLGVAAAPRFMDGVINDKLDTDISGGNTA--NGT 133

Query: 272 AKPTANGPFTDLPVSGVRGVIAKRLLQ 298
            K +AN  FT    + V GV+    LQ
Sbjct: 134 GKSSANNTFTGTITTTVIGVLPNGNLQ 160


>gnl|CDD|236716 PRK10564, PRK10564, maltose regulon periplasmic protein;
           Provisional.
          Length = 303

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 1/35 (2%)

Query: 176 PAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV 210
           P  G+SAPAP      AAP   P  A    S  P+
Sbjct: 219 PLFGSSAPAPVTVGNTAAPVAAPAPAPAKKS-EPM 252



 Score = 28.5 bits (64), Expect = 7.1
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT 215
           P     A AP      AAP  +P P P++K+
Sbjct: 219 PLFGSSAPAPVTVGNTAAPVAAPAPAPAKKS 249


>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668).  Members
           in this family of proteins are annotated as Cysteine and
           tyrosine-rich protein 1, however currently no function
           is known.
          Length = 154

 Score = 27.9 bits (62), Expect = 5.6
 Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 8/38 (21%)

Query: 169 DFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           ++  D PP      P  P P+ +A   PPPP   P  S
Sbjct: 123 EYPTDLPP------PYSPAPQASAQRSPPPP--YPGNS 152


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 29.0 bits (65), Expect = 5.7
 Identities = 10/89 (11%), Positives = 20/89 (22%), Gaps = 18/89 (20%)

Query: 160 NESDVAAFKDFKDDAPPA----AGASAPAPPPPKVAAAP--------------PPPPPKA 201
              +  A                   +P     + +  P                P P+ 
Sbjct: 75  VRYNQNAPGAPSVGPDSDLSQKTSTFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEP 134

Query: 202 APAPSPTPVPSQKTSGGTRVYASPLAKRL 230
            P  +P   P  +T     +    +  +L
Sbjct: 135 QPPQAPESQPQPQTPAQKMLSLEEVEAQL 163


>gnl|CDD|238802 cd01568, QPRTase_NadC, Quinolinate phosphoribosyl transferase
           (QAPRTase or QPRTase), also called nicotinate-nucleotide
           pyrophosphorylase, is involved in the de novo synthesis
           of NAD in both prokaryotes and eukaryotes. It catalyses
           the reaction of quinolinic acid (QA) with
           5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence
           of Mg2+ to produce nicotinic acid mononucleotide (NAMN),
           pyrophosphate and carbon dioxide. QPRTase functions as a
           homodimer with two active sites, each formed by the
           C-terminal region of one subunit and the N-terminal
           region of the other.
          Length = 269

 Score = 28.6 bits (65), Expect = 5.7
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
           V W  K+GD++  G +L E+E
Sbjct: 59  VEWLVKDGDRVEAGQVLLEVE 79


>gnl|CDD|234896 PRK01077, PRK01077, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 451

 Score = 28.9 bits (66), Expect = 6.0
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 173 DAPPAAGASAPAPPPPKVAAAPPPPPPKAAPA 204
           DA  A   +AP PPP      P PP  + A A
Sbjct: 220 DALLALARAAPPPPPAAAPPPPAPPGVRIAVA 251


>gnl|CDD|221294 pfam11891, DUF3411, Domain of unknown function (DUF3411).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 168 to 186 amino acids in length. This domain
           has a conserved RYQ sequence motif.
          Length = 179

 Score = 28.1 bits (63), Expect = 6.1
 Identities = 15/67 (22%), Positives = 23/67 (34%), Gaps = 11/67 (16%)

Query: 222 YASPLAKRLAA--EKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGP 279
            +  L +R      KG  L+++G  +GL G   S  L  A KA             +   
Sbjct: 97  RSYSLQQRFGTVFYKGAKLAAVGFIAGLVGQAISNALMAARKA---------VDKNSEES 147

Query: 280 FTDLPVS 286
               P+ 
Sbjct: 148 VKVPPLF 154


>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 27.1 bits (60), Expect = 6.1
 Identities = 14/46 (30%), Positives = 16/46 (34%), Gaps = 5/46 (10%)

Query: 168 KDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           K   +D      A  PAPP       P PP    AP   P  +  Q
Sbjct: 41  KAMNEDEKEDKPAEQPAPPAQ-----PVPPVQNTAPLNQPHTIDGQ 81



 Score = 26.4 bits (58), Expect = 9.6
 Identities = 10/34 (29%), Positives = 14/34 (41%)

Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPP 199
           A  + + +  PA   + PA P P V    P   P
Sbjct: 42  AMNEDEKEDKPAEQPAPPAQPVPPVQNTAPLNQP 75


>gnl|CDD|233382 TIGR01372, soxA, sarcosine oxidase, alpha subunit family,
           heterotetrameric form.  This model describes the alpha
           subunit of a family of known and putative
           heterotetrameric sarcosine oxidases. Five operons of
           such oxidases are found in Mesorhizobium loti and three
           in Agrobacterium tumefaciens, a high enough copy number
           to suggest that not all members are share the same
           function. The model is designated as subfamily rather
           than equivalog for this reason.Sarcosine oxidase
           catalyzes the oxidative demethylation of sarcosine to
           glycine. The reaction converts tetrahydrofolate to
           5,10-methylene-tetrahydrofolate. The enzyme is known in
           monomeric and heterotetrameric (alpha,beta,gamma,delta)
           forms [Energy metabolism, Amino acids and amines].
          Length = 985

 Score = 28.9 bits (65), Expect = 6.1
 Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 2/61 (3%)

Query: 216 SGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSADLSKASKAGAVAAPSKSAKPT 275
            GG   + + +A  L  +        GA +GLFG   +A L+  + AGA AA +   + T
Sbjct: 419 RGGKLAWDAAIAAFLPGDAVQGCILAGAANGLFG--LAAALADGAAAGAAAARAAGFEGT 476

Query: 276 A 276
           A
Sbjct: 477 A 477


>gnl|CDD|222346 pfam13728, TraF, F plasmid transfer operon protein.  TraF protein
           undergoes proteolytic processing associated with export.
           The 19 amino acids at the amino terminus of the
           polypeptides appear to constitute a typical membrane
           leader peptide - not included in this family, while the
           remainder of the molecule is predicted to be primarily
           hydrophilic in character. F plasmid TraF and TraH are
           required for F pilus assembly and F plasmid transfer,
           and they are both localised to the outer membrane in the
           presence of the complete F transfer region, especially
           TraV, the putative anchor.
          Length = 215

 Score = 28.3 bits (64), Expect = 6.2
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 187 PPKVAAAPPPPPPKAAPAPSPTP 209
            P     P   P  AAPA SP+ 
Sbjct: 7   EPPEPKEPEKAPAPAAPAQSPSA 29


>gnl|CDD|172484 PRK13982, PRK13982, bifunctional
           SbtC-like/phosphopantothenoylcysteine
           decarboxylase/phosphopantothenate synthase; Provisional.
          Length = 475

 Score = 29.0 bits (65), Expect = 6.2
 Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 6/69 (8%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASP-LAKR 229
           +D   P  G +A +  P   AA P      +  +   T +      GG   Y +  L +R
Sbjct: 37  RDATTPRHGPAASSAAPVSAAAPPAAREQASLASKRVTLI----IGGGIAAYKALDLIRR 92

Query: 230 LAAEKGLDL 238
           L  E+G  +
Sbjct: 93  L-KERGAHV 100


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 28.3 bits (63), Expect = 6.2
 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 3/63 (4%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPV-PSQKTSGGTRVYASPLAKRLAAEKG 235
            +GA      P    +    P P  + +PSP     S  ++ G R  +S  A+   +EK 
Sbjct: 69  YSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQSS--AENANSEKR 126

Query: 236 LDL 238
             L
Sbjct: 127 KLL 129


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is
           found in the aflatoxin regulatory protein (AflR) which
           is involved in the regulation of the biosynthesis of
           aflatoxin in the fungal genus Aspergillus. It occurs
           together with the fungal Zn(2)-Cys(6) binuclear cluster
           domain (pfam00172).
          Length = 275

 Score = 28.8 bits (64), Expect = 6.2
 Identities = 32/120 (26%), Positives = 47/120 (39%), Gaps = 16/120 (13%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAP-APSPTPVPSQKTSGG-----TRVYASPLAKRL 230
           A   S+  P PP  +      PP A P  P+ T  PS K S         ++ S L+   
Sbjct: 19  ANTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGSILSPNA 78

Query: 231 AAEKGLDLSSIGAG----SGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVS 286
           + +   DLSS+ +       LF S++ + L   + A         A+ T      DL VS
Sbjct: 79  SNQDQGDLSSLLSVNTDFGQLFASLSPSPLFDGNDA------DLHAEATGELSVADLEVS 132


>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
          Length = 413

 Score = 28.7 bits (64), Expect = 6.2
 Identities = 13/64 (20%), Positives = 18/64 (28%)

Query: 158 VENESDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSG 217
            E     A     +   P ++ A  P       A  P      AAP     P  +   + 
Sbjct: 345 AEAAPRAADKPAGQAADPASSSADKPGASADAAARTPARARDAAAPDADTPPGGASLAAA 404

Query: 218 GTRV 221
             RV
Sbjct: 405 QARV 408


>gnl|CDD|232814 TIGR00078, nadC, nicotinate-nucleotide pyrophosphorylase.  Synonym:
           quinolinate phosphoribosyltransferase (decarboxylating)
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Pyridine nucleotides].
          Length = 265

 Score = 28.4 bits (64), Expect = 6.3
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 95  EMGTIVSWAKKEGDKLNEGDLLAEIE 120
           ++G  V W  K+GD++  G+++AE+E
Sbjct: 51  QLGVQVEWLVKDGDRVEPGEVVAEVE 76


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 28.2 bits (63), Expect = 6.4
 Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 5/69 (7%)

Query: 157 IVENESDVAAFKDFKDDA-----PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVP 211
           + E   +V  F ++  DA     P       P P P  +A   P  PP+   A  P   P
Sbjct: 105 VEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQP 164

Query: 212 SQKTSGGTR 220
                G  R
Sbjct: 165 RPPMMGYGR 173


>gnl|CDD|215520 PLN02964, PLN02964, phosphatidylserine decarboxylase.
          Length = 644

 Score = 28.7 bits (64), Expect = 6.5
 Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 2/31 (6%)

Query: 84  RVALPALSPTMEMGTIVSWAKKEGDKLNEGD 114
           +VA  A+  TM +G+I ++ KKEGD + +GD
Sbjct: 551 KVAFVAIGATM-VGSI-TFVKKEGDHVKKGD 579


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 28.7 bits (64), Expect = 7.0
 Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 7/112 (6%)

Query: 185 PPPPKVAAAPPPPPPKAAPAPSPTPVPSQKT-SGGTRVYASPLAKRLAAEKGLDLSSIGA 243
           PP    AA  P P P     P     P ++       V   PL+   A E   DL    +
Sbjct: 329 PPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYE---DLKPPTS 385

Query: 244 GSGLFGSITSADLSKASKAGAVAAPSKSAKPTANGPFTDLPVSGVRGVIAKR 295
                 +  S+  + +    AVA P++     + G  +++P      V AK+
Sbjct: 386 PI---PTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKK 434


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 28.1 bits (63), Expect = 7.1
 Identities = 20/121 (16%), Positives = 30/121 (24%), Gaps = 19/121 (15%)

Query: 170 FKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKR 229
           F   AP    A+    PP   A         +       P  +Q   G        L   
Sbjct: 87  FGGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPGSF------LGGA 140

Query: 230 LAAEKGLDLSSIGAGSGLFGSITS--------ADLSKASKAGAVAAPSKSAKPTANGPFT 281
                G     +  G  L   + +        A++  A   G   +    A  T    + 
Sbjct: 141 AQTAAG-----VAGGMLLGNGLENLFGGHSQPAEIVDAIGEGGDGSGPAPADDTGINNYG 195

Query: 282 D 282
           D
Sbjct: 196 D 196


>gnl|CDD|223235 COG0157, NadC, Nicotinate-nucleotide pyrophosphorylase [Coenzyme
           metabolism].
          Length = 280

 Score = 28.3 bits (64), Expect = 7.2
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 100 VSWAKKEGDKLNEGDLLAEIE 120
           + W  K+GD++  GD+LAEIE
Sbjct: 66  IQWLVKDGDRVKPGDVLAEIE 86


>gnl|CDD|113156 pfam04375, HemX, HemX.  This family consists of several bacterial
           HemX proteins. The hemX gene is not essential for haem
           synthesis in B. subtilis. HemX is a polytopic membrane
           protein which by an unknown mechanism down-regulates the
           level of HemA.
          Length = 372

 Score = 28.3 bits (63), Expect = 7.2
 Identities = 11/47 (23%), Positives = 16/47 (34%), Gaps = 2/47 (4%)

Query: 192 AAPPPPPPKAAPAPS--PTPVPSQKTSGGTRVYASPLAKRLAAEKGL 236
           ++ P          S    P P+ K+ G T + A  L   L    G 
Sbjct: 1   SSVPAETASQTTVTSSPVAPGPAAKSGGATGLAALALLVALGLGAGG 47


>gnl|CDD|235553 PRK05672, dnaE2, error-prone DNA polymerase; Validated.
          Length = 1046

 Score = 28.7 bits (65), Expect = 7.3
 Identities = 10/30 (33%), Positives = 11/30 (36%), Gaps = 1/30 (3%)

Query: 177 AAGASAPAPPPPKVAAAPPPPPPKAAPAPS 206
           AAGA A       +  AP      A P  S
Sbjct: 880 AAGA-AATERLDLLPGAPASEAAPALPGMS 908


>gnl|CDD|237530 PRK13843, PRK13843, conjugal transfer protein TraH; Provisional.
          Length = 207

 Score = 28.1 bits (63), Expect = 7.5
 Identities = 17/73 (23%), Positives = 19/73 (26%), Gaps = 16/73 (21%)

Query: 173 DAPPAAGASAPAPPPPKVA-AAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
            A          P P     A     PP+ A   S    P      G R+          
Sbjct: 148 RAEDPGDVKLAEPEPGDSDEAEESSDPPENAEDASEPAEPDDPNKAGIRI---------- 197

Query: 232 AEKGLDLSSIGAG 244
                DLS IGA 
Sbjct: 198 -----DLSGIGAQ 205


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 28.4 bits (63), Expect = 7.6
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query: 175 PPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           P +    APA  P + A+ P P     A A +    P Q
Sbjct: 117 PVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQ 155


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 28.6 bits (64), Expect = 7.6
 Identities = 9/48 (18%), Positives = 14/48 (29%)

Query: 166 AFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQ 213
           A+  ++    P A   A      +      PP    A   S  P   +
Sbjct: 145 AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQER 192


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 27.8 bits (62), Expect = 7.7
 Identities = 13/63 (20%), Positives = 19/63 (30%), Gaps = 8/63 (12%)

Query: 155 CIIVENESDVAAFKDFKDDAPPAAGASAP--------APPPPKVAAAPPPPPPKAAPAPS 206
           C+ +E  ++ A   +   DA PAA +            P     A  P    P     P 
Sbjct: 36  CMTMEQANEKAKKLEQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPE 95

Query: 207 PTP 209
              
Sbjct: 96  QKL 98


>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
          Length = 987

 Score = 28.8 bits (64), Expect = 7.7
 Identities = 14/60 (23%), Positives = 17/60 (28%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAE 233
           A     A   A P  K AA        AA  P                 ++ LA  + AE
Sbjct: 870 AFGGTVAVKAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAAGLKPSAALAAVIGAE 929


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 28.3 bits (63), Expect = 7.8
 Identities = 10/35 (28%), Positives = 14/35 (40%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPS 212
              + PA  PP    A PP    +   P+  P P+
Sbjct: 95  QEPAPPANAPPPKEPAAPPSWRSSPAGPTTQPSPA 129


>gnl|CDD|215430 PLN02800, PLN02800, imidazoleglycerol-phosphate dehydratase.
          Length = 261

 Score = 28.3 bits (63), Expect = 8.0
 Identities = 6/51 (11%), Positives = 12/51 (23%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYAS 224
             P        P    + ++P    P+     S         +  +   A 
Sbjct: 13  LRPKLSFIDLLPRRAAIVSSPSSSLPRFLRMESQLRQSISCAASSSSSNAL 63


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 26.8 bits (60), Expect = 8.1
 Identities = 10/42 (23%), Positives = 14/42 (33%)

Query: 164 VAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAP 205
           VAA +D   +      A+AP       AAA      +     
Sbjct: 42  VAALEDVNIEEAIKKAAAAPVAAAAAAAAAAAAAAAEEKKEE 83


>gnl|CDD|133414 cd04787, HTH_HMRTR_unk, Helix-Turn-Helix DNA binding domain of
           putative Heavy Metal Resistance transcription
           regulators.  Putative helix-turn-helix (HTH) heavy metal
           resistance transcription regulators (HMRTR), unknown
           subgroup. Based on sequence similarity, these proteins
           are predicted to function as transcription regulators
           that mediate responses to heavy metal stress in
           eubacteria. They belong to the MerR superfamily of
           transcription regulators that promote transcription of
           various stress regulons by reconfiguring the operator
           sequence located between the -35 and -10 promoter
           elements. A typical MerR regulator is comprised of two
           distinct domains that harbor the regulatory
           (effector-binding) site and the active (DNA-binding)
           site. Their N-terminal domains are homologous and
           contain a DNA-binding winged HTH motif, while the
           C-terminal domains are often dissimilar and bind
           specific coactivator molecules, such as, metal ions,
           drugs, and organic substrates. This subgroup lacks one
           of the conserved, metal-binding cysteines seen in the
           MerR1 group.
          Length = 133

 Score = 27.3 bits (61), Expect = 8.5
 Identities = 12/39 (30%), Positives = 24/39 (61%), Gaps = 4/39 (10%)

Query: 281 TDLPVSGVRGVIAKRLLQSKQVI----KLREQMNKALEK 315
            + P   VR +I +RL ++++ I    KLR++M +A+ +
Sbjct: 73  GESPCPMVRRLIEQRLAETERRIKELLKLRDRMQQAVSQ 111


>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus.  This family represents the
           N-terminus (approximately 200 residues) of the
           proline-rich protein BAT2. BAT2 is similar to other
           proteins with large proline-rich domains, such as some
           nuclear proteins, collagens, elastin, and synapsin.
          Length = 189

 Score = 28.0 bits (62), Expect = 8.5
 Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 5/57 (8%)

Query: 162 SDVAAFKDFKDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           SD    +  +   P  +  SA +PPPP     P P  P          V S K    
Sbjct: 99  SDALPPQTPESQPPAVSQTSASSPPPP-----PQPATPLVPGGAKSWAVASAKPGAQ 150


>gnl|CDD|129483 TIGR00388, glyQ, glycyl-tRNA synthetase, tetrameric type, alpha
           subunit.  This tetrameric form of glycyl-tRNA synthetase
           (2 alpha, 2 beta) is found in the majority of completed
           eubacterial genomes, with the two genes fused in a few
           species. A substantially different homodimeric form (not
           recognized by this model) replaces this form in the
           Archaea, animals, yeasts, and some eubacteria [Protein
           synthesis, tRNA aminoacylation].
          Length = 293

 Score = 28.3 bits (63), Expect = 8.6
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 100 VSWAKKEGDKLNEGDLLAEIETDKATMGFET 130
           + W+     K   GD+  + E +++T  FET
Sbjct: 177 LEWSDGPLGKTTYGDVFHQNEVEQSTYNFET 207


>gnl|CDD|234700 PRK00249, flgH, flagellar basal body L-ring protein; Reviewed.
          Length = 222

 Score = 27.9 bits (63), Expect = 8.6
 Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 6/41 (14%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGG 218
           AG ++    P           P AAP  +PTP+P+   +G 
Sbjct: 8   AGCASIPRLPLV------GGAPAAAPVEAPTPLPNPAPNGS 42


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 28.0 bits (63), Expect = 9.1
 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 10/56 (17%)

Query: 69  AVGSLSQRLVREGNDRVALPALSPTMEMGTIVSWAKKEGDKLNEGDLLAEIETDKA 124
             GSL      E  D   L A       G I   + +EG K+ +G +LA ++ D  
Sbjct: 17  FPGSL------EAVDEADLAAEVA----GKITKISVREGQKVKKGQVLARLDDDDY 62


>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter.  The Epithelial Na+
           Channel (ENaC) Family (TC 1.A.06)The ENaC family
           consists of sodium channels from animals and has no
           recognizable homologues in other eukaryotes or bacteria.
           The vertebrate ENaC proteins from epithelial cells
           cluster tightly together on the phylogenetic tree:
           voltage-insensitive ENaC homologues are also found in
           the brain. Eleven sequenced C. elegans proteins,
           including the degenerins, are distantly related to the
           vertebrate proteins as well as to each other. At least
           some ofthese proteins form part of a mechano-transducing
           complex for touch sensitivity. Other members of the ENaC
           family, the acid-sensing ion channels, ASIC1-3,are homo-
           or hetero-oligomeric neuronal H+-gated channels that
           mediate pain sensation in response to tissue acidosis.
           The homologous Helix aspersa(FMRF-amide)-activated Na+
           channel is the first peptide neurotransmitter-gated
           ionotropic receptor to be sequenced.Mammalian ENaC is
           important for the maintenance of Na+ balance and the
           regulation of blood pressure. Three homologous ENaC
           subunits, a, b and g, havebeen shown to assemble to form
           the highly Na+-selective channel.This model is designed
           from the vertebrate members of the ENaC family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 595

 Score = 28.2 bits (63), Expect = 9.1
 Identities = 8/32 (25%), Positives = 13/32 (40%)

Query: 167 FKDFKDDAPPAAGASAPAPPPPKVAAAPPPPP 198
               + D PP   ++ P P    ++    PPP
Sbjct: 559 PPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590


>gnl|CDD|219130 pfam06674, DUF1176, Protein of unknown function (DUF1176).  This
           family consists of several hypothetical bacterial
           proteins of around 340 residues in length. Members of
           this family contain six highly conserved cysteine
           residues. The function of this family is unknown.
          Length = 338

 Score = 28.1 bits (63), Expect = 9.3
 Identities = 13/37 (35%), Positives = 15/37 (40%), Gaps = 3/37 (8%)

Query: 171 KDDAPPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSP 207
           K   P +   S P  PP  V  A P PP  A   P+ 
Sbjct: 158 KGTKPAS---SVPPAPPLPVIRAAPAPPAAAPLDPAE 191


>gnl|CDD|223064 PHA03375, PHA03375, hypothetical protein; Provisional.
          Length = 844

 Score = 28.3 bits (63), Expect = 9.4
 Identities = 11/46 (23%), Positives = 16/46 (34%), Gaps = 2/46 (4%)

Query: 174 APPAAGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGT 219
            P  A  S  APPP      P      ++ + S +  P   T   +
Sbjct: 376 RPRTARLS--APPPSAAPPVPRSGSSPSSLSSSSSSAPPPATDPSS 419


>gnl|CDD|219241 pfam06958, Pyocin_S, S-type Pyocin.  This family represents a
           conserved region approximately 180 residues long within
           bacterial S-type pyocins. Pyocins are polypeptide toxins
           produced by, and active against, bacteria. S-type
           pyocins cause cell death by DNA breakdown due to
           endonuclease activity.
          Length = 139

 Score = 27.3 bits (61), Expect = 9.5
 Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)

Query: 172 DDAPPAAGASAPAPPPPKVAAAPP--PPPPKAAPAPSPTPVPSQK 214
              P    +S  +P PP         P  P   P  +P P P +K
Sbjct: 74  ASPPGDIPSSTSSPAPPPDTPVITGTPITPVVDPVITPYPAPDEK 118


>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
          Length = 537

 Score = 28.3 bits (63), Expect = 9.8
 Identities = 21/67 (31%), Positives = 27/67 (40%), Gaps = 10/67 (14%)

Query: 195 PPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAAEKGLDLSSIGAGSGLFGSITSA 254
           P PPP     PS   +PS       R  +    +RL  E  L L+ I     +FG   S+
Sbjct: 83  PLPPP-----PSSPSLPSSLLLDHFRNRSLSEIERLKVEAELSLNHI-----MFGIAGSS 132

Query: 255 DLSKASK 261
            L K  K
Sbjct: 133 QLWKRRK 139


>gnl|CDD|215258 PLN02465, PLN02465, L-galactono-1,4-lactone dehydrogenase.
          Length = 573

 Score = 28.3 bits (63), Expect = 9.8
 Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 5/55 (9%)

Query: 178 AGASAPAPPPPKVAAAPPPPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLAA 232
           +GA+A     P       PPPPK    P      +  ++GG    A+ + + L  
Sbjct: 1   SGAAAGNFAGP-----LAPPPPKPTQGPRSFARGALSSAGGEPASAATVRRYLGY 50


>gnl|CDD|183147 PRK11465, PRK11465, putative mechanosensitive channel protein;
           Provisional.
          Length = 741

 Score = 28.2 bits (63), Expect = 9.9
 Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 2/45 (4%)

Query: 189 KVAAAPP--PPPPKAAPAPSPTPVPSQKTSGGTRVYASPLAKRLA 231
             AA PP  P P    P         Q  +  +R Y   L+ R  
Sbjct: 71  TAAATPPAEPVPKIVPPTLVEEKTVLQNVTEVSRHYGEALSARFG 115


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.131    0.376 

Gapped
Lambda     K      H
   0.267   0.0700    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,982,181
Number of extensions: 2202076
Number of successful extensions: 10721
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6507
Number of HSP's successfully gapped: 1009
Length of query: 426
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 326
Effective length of database: 6,502,202
Effective search space: 2119717852
Effective search space used: 2119717852
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (26.9 bits)