RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17541
         (717 letters)



>gnl|CDD|216240 pfam01008, IF-2B, Initiation factor 2 subunit family.  This family
           includes initiation factor 2B alpha, beta and delta
           subunits from eukaryotes, initiation factor 2B subunits
           1 and 2 from archaebacteria and some proteins of unknown
           function from prokaryotes. Initiation factor 2 binds to
           Met-tRNA, GTP and the small ribosomal subunit. Members
           of this family have also been characterized as
           5-methylthioribose- 1-phosphate isomerases, an enzyme of
           the methionine salvage pathway. The crystal structure of
           Ypr118w, a non-essential, low-copy number gene product
           from Saccharomyces cerevisiae, reveals a dimeric protein
           with two domains and a putative active site cleft.
          Length = 281

 Score =  252 bits (646), Expect = 1e-78
 Identities = 92/289 (31%), Positives = 150/289 (51%), Gaps = 8/289 (2%)

Query: 414 VVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAV 473
            VRGS A  +A L  +  ++ D  TP         +E+L  A+ +L   RP +VS+ NA+
Sbjct: 1   KVRGSPAIAIAALEGLLLVIQDARTP----TVAELKEQLRSAIEFLISARPTAVSLGNAI 56

Query: 474 KHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYG 533
                 +++L +     +A+  L E    +I E    A   I       + + D ILT+ 
Sbjct: 57  DRLLRIISELHSSSDVEEAKESLIEAADEFIDEIE-EARRKIGAIAAELIKDGDTILTHC 115

Query: 534 CSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMRE 593
            S  V  +L  AH++G +FRVI+ +  P  +G+   + LV+  +  + +  SAV Y+M+E
Sbjct: 116 NSGTVLGVLRAAHKEGKRFRVIVTESRPRLQGRLTAKELVQAGIPVTLITDSAVGYVMQE 175

Query: 594 VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNEL 653
           V KVI+GA  +L+NG + ++ GT Q++L+A+A NVP     ET+KF  R   D  +F E 
Sbjct: 176 VDKVIVGADRILANGGIANKIGTYQLALLAKAHNVPFYVVAETYKFDPRFPLDEDIFIEE 235

Query: 654 GDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIVPC 702
            DP E++         ++L    P    +D TP  L+T +ITE+ ++P 
Sbjct: 236 RDPEEVLYRTGVRIAPENLKVRNP---AFDYTPPELITLIITEVGVLPP 281


>gnl|CDD|224105 COG1184, GCD2, Translation initiation factor 2B subunit, eIF-2B
           alpha/beta/delta family [Translation, ribosomal
           structure and biogenesis].
          Length = 301

 Score =  221 bits (564), Expect = 2e-66
 Identities = 88/312 (28%), Positives = 155/312 (49%), Gaps = 16/312 (5%)

Query: 399 VHPAIYRLGVQYATGVVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSY 458
           + P +     +  +  +RG++   +A   A++ +  D   P+ +E      E        
Sbjct: 2   IMPEVDETAEKLKSMEIRGASWIAIAAAEALEILASDSQAPTVEELIDAIRE----LSET 57

Query: 459 LNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGNAICMF 518
           L K RP +VS+ N ++          +       R  L +    +I   V+ A   I   
Sbjct: 58  LVKARPTAVSLGNLIRFVLR----DSSGGDKENRRQSLIKAAQEFIDR-VEKAKERIAEI 112

Query: 519 FHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVD 578
              ++ + DVILT+  S  V ++L TA ++G +F+VI+ +  P  EG+ M + L +  + 
Sbjct: 113 GAERIHDGDVILTHSFSKTVLEVLKTAADRGKRFKVIVTESRPRGEGRIMAKELRQSGIP 172

Query: 579 CSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHK 638
            + ++ SAV   M  V KV++GA A+L+NGA++++ GT+ ++L AR   VP     E++K
Sbjct: 173 VTVIVDSAVGAFMSRVDKVLVGADAILANGALVNKIGTSPLALAARELRVPFYVVAESYK 232

Query: 639 FCERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELA 698
           F  +   D LV  EL DP E+  +       + L +L   +  +D+TP   + A+ITEL 
Sbjct: 233 FVPKTLLDTLVEIELRDPLEVARE-------EPLGNLKVRNPAFDVTPPEYIDAIITELG 285

Query: 699 IVPCTSVPVVLR 710
           I+P +S+  +LR
Sbjct: 286 IIPPSSIYRILR 297


>gnl|CDD|236282 PRK08535, PRK08535, translation initiation factor IF-2B subunit
           delta; Provisional.
          Length = 310

 Score =  136 bits (345), Expect = 1e-35
 Identities = 89/318 (27%), Positives = 153/318 (48%), Gaps = 23/318 (7%)

Query: 398 EVHPAIYRLGVQYATGVVRGSN--ARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPA 455
           EV P +     +  T  +RG+   AR  A   A+K         S +E    F+  +  A
Sbjct: 2   EVMPEVLETAEKIKTMEIRGAGRIARAAA--EALKDQAEKSDAESPEE----FKAEMRAA 55

Query: 456 MSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIA---TYIHEQVDMAG 512
            + L   RP +VS+ NAV++   + +       +T    R + VI     +I E  + A 
Sbjct: 56  ANILISTRPTAVSLPNAVRYVMRYYSG------ETVEEAR-ESVIERAEEFI-ESSENAV 107

Query: 513 NAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRL 572
             I      ++ + DVI+T+  SS    ++ TAHE+G    VI  +  P  +G    + L
Sbjct: 108 EKIGEIGAKRIRDGDVIMTHCNSSAALSVIKTAHEQGKDIEVIATETRPRNQGHITAKEL 167

Query: 573 VKHQVDCSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLA 632
            ++ +  + ++ SAV Y M++V KV++GA A+ +NGAV+++ GT+Q++L A    VP + 
Sbjct: 168 AEYGIPVTLIVDSAVRYFMKDVDKVVVGADAITANGAVINKIGTSQIALAAHEARVPFMV 227

Query: 633 ACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTA 692
           A ET+KF  +     LV  E  DP E++  +        +    P    +D+TP   + A
Sbjct: 228 AAETYKFSPKTLLGELVEIEERDPTEVL-PEEILAKLPGVKVRNP---AFDVTPPEYIDA 283

Query: 693 VITELAIVPCTSVPVVLR 710
           +ITE+  +P      +++
Sbjct: 284 IITEIGAIPPEMAYTIIK 301


>gnl|CDD|188057 TIGR00511, ribulose_e2b2, ribose-1,5-bisphosphate isomerase, e2b2
           family.  The delineation of this family was based
           originally, in part, on a discussion and
           neighbor-joining phylogenetic study by Kyrpides and
           Woese of archaeal and other proteins homologous to the
           alpha, beta, and delta subunits of eukaryotic initiation
           factor 2B (eIF-2B), a five-subunit molecule that
           catalyzes GTP recycling for eIF-2. Recently, Sato, et
           al. assigned the function ribulose-1,5 bisphosphate
           isomerase [Energy metabolism, Other].
          Length = 301

 Score =  126 bits (319), Expect = 3e-32
 Identities = 73/276 (26%), Positives = 136/276 (49%), Gaps = 9/276 (3%)

Query: 435 DYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARL 494
           +    S+      F   +  A + L   RP +VS+ NA+++   +++           R 
Sbjct: 30  EQAEKSDAASPEEFRAEMRAAANILISTRPTAVSLPNALRYVLKYMSGA----DVETLRQ 85

Query: 495 RLKEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRV 554
            + E    +I+ + + A   I      ++ + DVI+T+  S     ++ TA E+G    V
Sbjct: 86  SVIERADEFIN-RSEKAQERIGEIGAKRIRDGDVIMTHCNSEAALSVIKTAFEQGKDIEV 144

Query: 555 IIVDGSPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRA 614
           I+ +  P  +G    + L  + +  + ++ SA  Y M+EV  V++GA A+ +NGAV+++ 
Sbjct: 145 IVTETRPRNQGHITAKELRDYGIPVTLIVDSAARYYMKEVDHVVVGADAITANGAVINKI 204

Query: 615 GTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLAH 674
           GT+Q++L AR   VP + A ET+KF  +  T  LV  E  DP E++ ++   +    L +
Sbjct: 205 GTSQLALAAREARVPFMVAAETYKFHPKTITGELVEIEERDPTEVLDEEDLKQ----LGN 260

Query: 675 LTPLSLTYDITPSHLVTAVITELAIVPCTSVPVVLR 710
           +   +  +D+TP   V A+ITE   +P      +++
Sbjct: 261 VKVRNPAFDVTPPEYVDAIITERGQIPPEMAYTIIK 296


>gnl|CDD|233008 TIGR00524, eIF-2B_rel, eIF-2B alpha/beta/delta-related
           uncharacterized proteins.  This model, eIF-2B_rel,
           describes half of a superfamily, where the other half
           consists of eukaryotic translation initiation factor 2B
           (eIF-2B) subunits alpha, beta, and delta. It is unclear
           whether the eIF-2B_rel set is monophyletic, or whether
           they are all more closely related to each other than to
           any eIF-2B subunit because the eIF-2B clade is highly
           derived. Members of this branch of the family are all
           uncharacterized with respect to function and are found
           in the Archaea, Bacteria, and Eukarya, although a number
           are described as putative translation intiation factor
           components. Proteins found by eIF-2B_rel include at
           least three clades, including a set of uncharacterized
           eukaryotic proteins, a set found in some but not all
           Archaea, and a set universal so far among the Archaea
           and closely related to several uncharacterized bacterial
           proteins [Unknown function, General].
          Length = 303

 Score = 99.8 bits (249), Expect = 4e-23
 Identities = 76/306 (24%), Positives = 131/306 (42%), Gaps = 36/306 (11%)

Query: 414 VVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAV 473
           VVRG+ A  VA    +        T + +E    F+E L  A  +L   RP +V++  A+
Sbjct: 15  VVRGAPAIGVAAAYGLALAARKIETDNVEE----FKEDLEKAADFLLSTRPTAVNLFWAL 70

Query: 474 KHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGN-AICMFFHNKLANDDVILTY 532
           +     L    N  +  +A+  L       I E  D+  N  I       + + D +LT+
Sbjct: 71  ERV---LNSAENGESVEEAKESLLREAIEIIEE--DLETNRKIGENGAKLIKDGDTVLTH 125

Query: 533 GCSSLVEK---------ILLTAHEKGTKFRVIIVDGSPWYEGKEM-LRRLVKHQVDCSYV 582
            C++             ++ +A E G + RVI  +  P  +G  +    L++  +D + +
Sbjct: 126 -CNAGALATSDYGTALGVIRSAWEDGKRIRVIACETRPRNQGSRLTAWELMQDGIDVTLI 184

Query: 583 LLSAVSYIMR--EVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFC 640
             S  +Y M+  E+  VI+GA  +  NG V ++ GT Q++++A+ F +P   A     F 
Sbjct: 185 TDSMAAYFMQKGEIDAVIVGADRIARNGDVANKIGTYQLAVLAKEFRIPFFVAAPLSTFD 244

Query: 641 ERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSL-----TYDITPSHLVTAVIT 695
            +      +  E  DP E+               + PL +      +DITP  L+ A+IT
Sbjct: 245 TKTSCGEDIVIEERDPEEVA--------QVGGVRIAPLGVKVYNPAFDITPHDLIDAIIT 296

Query: 696 ELAIVP 701
           E  I+ 
Sbjct: 297 EKGIIT 302


>gnl|CDD|169387 PRK08335, PRK08335, translation initiation factor IF-2B subunit
           alpha; Validated.
          Length = 275

 Score = 79.8 bits (197), Expect = 2e-16
 Identities = 48/176 (27%), Positives = 84/176 (47%), Gaps = 15/176 (8%)

Query: 525 NDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLL 584
           + DVI+T+  SS V +IL TA  KG +F+VI+ + +P YEG  +   L    ++   +  
Sbjct: 109 DGDVIITHSFSSAVLEILKTAKRKGKRFKVILTESAPDYEGLALANELEFLGIEFEVITD 168

Query: 585 SAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQ 644
           + +    +E +  ++GA  +  +G V+++AGT  ++L      VP   A ET KF   ++
Sbjct: 169 AQLGLFAKEATLALVGADNVTRDGYVVNKAGTYLLALACHDNGVPFYVAAETFKFHPELK 228

Query: 645 TDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
           ++ +   E              +N             +D+TP   V  +ITEL I+
Sbjct: 229 SEEVELVE----RPYARQGHRVRNV-----------LFDVTPWKYVRGIITELGIL 269


>gnl|CDD|233003 TIGR00512, salvage_mtnA, S-methyl-5-thioribose-1-phosphate
           isomerase.  The delineation of this family was based in
           part on a discussion and neighbor-joining phylogenetic
           study, by Kyrpides and Woese, of archaeal and other
           proteins homologous to the alpha, beta, and delta
           subunits of eukaryotic initiation factor 2B (eIF-2B), a
           five-subunit molecule that catalyzes GTP recycling for
           eIF-2. This clade is now recognized to include the
           methionine salvage pathway enzyme MtnA [Amino acid
           biosynthesis, Aspartate family].
          Length = 331

 Score = 73.0 bits (180), Expect = 7e-14
 Identities = 68/287 (23%), Positives = 116/287 (40%), Gaps = 46/287 (16%)

Query: 441 EKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVI 500
           E +    F+  L   + YL   RP +V++  A+   ++ L          +A L      
Sbjct: 62  EADEREEFKALLEEKLQYLVSSRPTAVNLSWALDRMRAALEAAKTVADIKEALLAE---- 117

Query: 501 ATYIHEQVDMAGN-AI----CMFFHNKLANDDVILT-----------YGCSSLVEKILLT 544
           A  I E+ D+  N AI           +A    +LT           YG +  V   + +
Sbjct: 118 AERILEE-DLEDNRAIGENGAALIKKGVAAPLRVLTHCNTGSLATAGYGTALGV---IRS 173

Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLR-RLVKHQVDCSYVLLSAVSYIM--REVSKVIIGA 601
           AHEKG    V   +  P  +G  +    LV+  +  + +  S  +++M   EV  VI+GA
Sbjct: 174 AHEKGRLEHVYADETRPRLQGARLTAWELVQEGIPATLITDSMAAHLMKHGEVDAVIVGA 233

Query: 602 HALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNEL-- 659
             + +NG   ++ GT Q++++A+   VP   A  T       +  A +  E   P E+  
Sbjct: 234 DRIAANGDTANKIGTYQLAVLAKHHGVPFYVAAPTSTIDLETKDGAEIPIEERPPEEVTH 293

Query: 660 -----ISDK-SAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
                I+       N        P    +D+TP+ L+T +ITE  ++
Sbjct: 294 VGGVRIAPPGIDVWN--------P---AFDVTPAELITGIITEKGVI 329


>gnl|CDD|223260 COG0182, COG0182, Predicted translation initiation factor 2B
           subunit, eIF-2B alpha/beta/delta family [Translation,
           ribosomal structure and biogenesis].
          Length = 346

 Score = 68.8 bits (169), Expect = 2e-12
 Identities = 67/308 (21%), Positives = 126/308 (40%), Gaps = 41/308 (13%)

Query: 414 VVRGSNA-RCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNA 472
           VVRG+ A    A                 +E+    E+    A   L   RP +V++  A
Sbjct: 45  VVRGAPAIGVAAAYGLALAARESKNDSKGEEFIEALEK----AAETLKSTRPTAVNLFWA 100

Query: 473 VKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGN-AICMFFHNKLANDDVILT 531
           +    +   +      + +         A  I E+ D+  N AI       L + D +LT
Sbjct: 101 LDRMLNAAKEAI----EVKEPKESILQEAEEIAEE-DLEANRAIGENGAELLPDGDTVLT 155

Query: 532 YGCS--SLVEKILLTA-------HEKGTKFRVIIVDGSPWYEGKEMLRRL-----VKHQV 577
           + C+  +L      TA       HE+G   RV   +  P+ +G     RL     V+  +
Sbjct: 156 H-CNAGALATVGYGTALGVIRSAHEEGKDIRVFADETRPYLQGA----RLTAWELVQDGI 210

Query: 578 DCSYVLLSAVSYIMR--EVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACE 635
             + +  +A  ++M+   +  VI+GA  + +NG V ++ GT Q++++A+   +P   A  
Sbjct: 211 PVTLITDNAAGHLMQQGMIDAVIVGADRIAANGDVANKIGTYQLAVLAKHHGIPFYVAAP 270

Query: 636 THKFCERVQTDALVFNELGDPNELI---SDKSAAKNWKSLAHLTPLSLTYDITPSHLVTA 692
                  +++   +  E  DP E++     + A +  ++       +  +D+TP  L+T 
Sbjct: 271 LSTIDFELKSGEDIPIEERDPEEVLEVGGVRIAPEGVEA------YNPAFDVTPPELITG 324

Query: 693 VITELAIV 700
           +ITE  + 
Sbjct: 325 IITEKGVF 332


>gnl|CDD|168237 PRK05772, PRK05772, translation initiation factor IF-2B subunit
           alpha; Provisional.
          Length = 363

 Score = 59.4 bits (144), Expect = 3e-09
 Identities = 62/267 (23%), Positives = 119/267 (44%), Gaps = 26/267 (9%)

Query: 452 LGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQAR-----LRLKEVIATYIHE 506
           L  A + L+  RP +V+++ A       L +  N +    A+     + L +V A  I E
Sbjct: 91  LTRAKTILDSARPTAVNLVWATSRM---LNKAKNTVESGNAKSVNELIELLKVEAKKIFE 147

Query: 507 QVDMAGNAICMFFHNKLANDDVILTY----------GCSSLVEKILLTAHEKGTKFRVII 556
           +   A   + ++   KL + D +LT           G  + +  + L A   G    VI 
Sbjct: 148 EEYDAEIQMGLYGLEKLNDGDTVLTQCNAGGLATGTGLGTALAPVKL-AKALGMSVSVIA 206

Query: 557 VDGSPWYEGKEM-LRRLVKHQVDCSYVLLSAVSYIM--REVSKVIIGAHALLSNGAVMSR 613
            +  PW +G  + +  L++  +  + +  +AV  +M    V+ V++GA  +L +G V ++
Sbjct: 207 PETRPWLQGSRLTVYELMEEGIKVTLITDTAVGLVMYKDMVNNVMVGADRILRDGHVFNK 266

Query: 614 AGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLA 673
            GT + +++A    +P  A   T  F  +   + +   E  DPNE+ + +      + + 
Sbjct: 267 IGTFKEAVIAHELGIPFYALAPTSTFDLKSDVNDVKIEER-DPNEVRTIRGVPITPEDVN 325

Query: 674 HLTPLSLTYDITPSHLVTAVITELAIV 700
              P+   +D+TP   +T +ITE  I+
Sbjct: 326 VYNPV---FDVTPPKYITGIITEKGII 349


>gnl|CDD|235578 PRK05720, mtnA, methylthioribose-1-phosphate isomerase; Reviewed.
          Length = 344

 Score = 57.1 bits (139), Expect = 1e-08
 Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 30/171 (17%)

Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLRRL-----VKHQVDCSYVLLSAVSYIMRE--VSKV 597
           A EKG    V   +  P  +G     RL      +  +D + +  +  +++M+   +  V
Sbjct: 174 AKEKGIDIHVYADETRPRLQGA----RLTAWELYQAGIDVTVITDNMAAHLMQTGKIDAV 229

Query: 598 IIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPN 657
           I+GA  + +NG V ++ GT Q+++ A+   VP   A  +      +     +  E  DP 
Sbjct: 230 IVGADRIAANGDVANKIGTYQLAIAAKYHGVPFYVAAPSSTIDLTLADGKEIPIEERDPE 289

Query: 658 ELIS--------DKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
           E+          +     N        P    +D+TP+ L+T +ITE  IV
Sbjct: 290 EVTEVGGVRIAPEGVKVYN--------PA---FDVTPAELITGIITEKGIV 329


>gnl|CDD|180547 PRK06371, PRK06371, translation initiation factor IF-2B subunit
           alpha; Provisional.
          Length = 329

 Score = 54.1 bits (130), Expect = 9e-08
 Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 22/166 (13%)

Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLR-RLVKHQVDCSYVLLSAVSYIMR--EVSKVIIGA 601
           AH  G    V + +  P  +G  +    L +  +D + +  +A  Y MR  E+  VI+GA
Sbjct: 164 AHRNGKNIFVFVDETRPRLQGARLTAWELAQEGIDHAIIADNAAGYFMRKKEIDLVIVGA 223

Query: 602 HALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELIS 661
             + SNG   ++ GT + +++A+   +P   A     F   +++   +  E  D NE++ 
Sbjct: 224 DRIASNGDFANKIGTYEKAVLAKVNGIPFYVAAPGSTFDFSIKSGDEIPIEERDENEVLE 283

Query: 662 --------DKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAI 699
                    +S A+N        P    +D+TP+  VT  ITE  I
Sbjct: 284 INGCRIGPQESHARN--------P---AFDVTPNEYVTGFITEYGI 318


>gnl|CDD|180362 PRK06036, PRK06036, translation initiation factor IF-2B subunit
           alpha; Provisional.
          Length = 339

 Score = 52.4 bits (126), Expect = 3e-07
 Identities = 57/206 (27%), Positives = 95/206 (46%), Gaps = 28/206 (13%)

Query: 522 KLAND-DVILTY---GCSSLVE-----KILLTAHEKGTKFRVIIVDGSPWYEGKEMLR-R 571
           KL  D D +LT+   G  + V+      ++ +A E+G + +VI  +  P  +G  +    
Sbjct: 143 KLLEDGDTVLTHCNAGRLACVDWGTALGVIRSAVEQGKEIKVIACETRPLNQGSRLTTWE 202

Query: 572 LVKHQVDCSYVLLSAVSYIMRE--VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVP 629
           L++  +  + +  S    +MR+  V KVI+GA  +  + AV ++ GT   S++A+   +P
Sbjct: 203 LMQDNIPVTLITDSMAGIVMRQGMVDKVIVGADRITRD-AVFNKIGTYTHSVLAKEHEIP 261

Query: 630 --VLAACETHKFCERVQTDALVFNELGDPNELI---SDKSAAKNWKSLAHLTPLSLTYDI 684
             V A   T  F E  +    +  E  DP+EL      + A K+        P    +D 
Sbjct: 262 FYVAAPLSTFDF-EGWEGSVKI--EERDPDELRYCGKTQIAPKDVPVY---NP---AFDA 312

Query: 685 TPSHLVTAVITELAIV-PCTSVPVVL 709
           TP   VTA+ITE  +  P   +  VL
Sbjct: 313 TPMENVTAIITEKGVFYPPFLLDEVL 338


>gnl|CDD|235788 PRK06372, PRK06372, translation initiation factor IF-2B subunit
           delta; Provisional.
          Length = 253

 Score = 46.1 bits (109), Expect = 3e-05
 Identities = 43/184 (23%), Positives = 74/184 (40%), Gaps = 17/184 (9%)

Query: 525 NDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLL 584
           ND VI T   S +++   +++ EK     V I++  P  EG +M + LVK  +D   +L 
Sbjct: 86  NDSVIGTISSSQVLKA-FISSSEKIK--SVYILESRPMLEGIDMAKLLVKSGID-VVLLT 141

Query: 585 SAVSYIMRE-VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERV 643
            A        V  VI+G+ ++L +G ++ + GT  ++L AR    P  +   + K     
Sbjct: 142 DASMCEAVLNVDAVIVGSDSVLYDGGLIHKNGTFPLALCARYLKKPFYSLTISMKIERNF 201

Query: 644 QTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIVPCT 703
                   +    +E   D      +            +D TP  L+   I E   V  +
Sbjct: 202 LYSTYPNFKNHPCSEWNIDIPCINRY------------FDKTPPDLIDYYINENGFVKPS 249

Query: 704 SVPV 707
            V +
Sbjct: 250 DVNI 253


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 47.4 bits (112), Expect = 3e-05
 Identities = 37/161 (22%), Positives = 67/161 (41%), Gaps = 6/161 (3%)

Query: 196  DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK----AKSKSEPSKE 251
               K   EAKKAA+ A  +      +  A  +     E K+ E  K    AK K+E  K+
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392

Query: 252  ANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA 311
            A+   K A+  +D  K        +     +  K + +E K   + K +    ++     
Sbjct: 1393 ADEAKKKAE--EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK 1450

Query: 312  KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
            K A   +K + + +K+    K +++K  +E++ K +  K K
Sbjct: 1451 KKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 44.7 bits (105), Expect = 2e-04
 Identities = 47/177 (26%), Positives = 83/177 (46%), Gaps = 18/177 (10%)

Query: 186  EKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
            EK  + E  +   KA+ + KKA +L     AK K D     +A  + E K+ +  +AK K
Sbjct: 1389 EKKKADEAKK---KAEEDKKKADELKKAAAAKKKAD-----EAKKKAEEKK-KADEAKKK 1439

Query: 246  SEPSKEANVQNKPAQ---GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQ---LKA 299
            +E +K+A+   K A+    +++  K             K     + DE K KA+    KA
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499

Query: 300  ER-RAKQEQQRQAKAAALLEKTKTSNE--KSMTKSKTEDSKPASEKSSKTEVLKSKD 353
            +  +   E +++A  A   E+ K ++E  K+    K +++K A EK    E+ K+++
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556



 Score = 44.4 bits (104), Expect = 3e-04
 Identities = 47/186 (25%), Positives = 75/186 (40%), Gaps = 7/186 (3%)

Query: 174  ASLSQVTTAIMSEKNNSAELSR--DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQ 231
            A  ++   AI +E+   A+  +  +  K   EAKKA +     +AK K +     D   +
Sbjct: 1263 AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322

Query: 232  P--ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGD 289
               E+K+   +  K   E  K A      A+ + D  +     A  +    KE  K + D
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK-KKEEAKKKAD 1381

Query: 290  EVKSKAQ--LKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
              K KA+   KA+   K+ ++ + KA  L +      +    K K E+ K A E   K E
Sbjct: 1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAE 1441

Query: 348  VLKSKD 353
              K  D
Sbjct: 1442 EAKKAD 1447



 Score = 40.1 bits (93), Expect = 0.004
 Identities = 38/174 (21%), Positives = 71/174 (40%), Gaps = 15/174 (8%)

Query: 187  KNNSAELSRDLVKAQREAKKAAKL----------AAKQKAKVKGDVTAPPDAVTQPESKQ 236
                AE + D  +A  E  +AA+           AAK+KA+ K           + + K 
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407

Query: 237  SEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQ 296
             E  KA +  + + EA  + +  + + +  K             K     + +E K KA+
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467

Query: 297  LKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
               E +   E +++A+ A   ++ K   E++  K K +++K A+E   K +  K
Sbjct: 1468 ---EAKKADEAKKKAEEAKKADEAKKKAEEA--KKKADEAKKAAEAKKKADEAK 1516



 Score = 40.1 bits (93), Expect = 0.005
 Identities = 42/178 (23%), Positives = 77/178 (43%), Gaps = 14/178 (7%)

Query: 184  MSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK 243
              +K   A+ + +  K   EAKKA +   K +   K D     +A  + E  + +  +AK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD-----EAKKKAEEAKKKADEAK 1503

Query: 244  SKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQL-KAERR 302
              +E  K+A+   K  +  +         A  ++   K   K + DE+K   +L KAE +
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563

Query: 303  AKQEQQRQA--------KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
             K E+ ++A        + A   +K + +  + + K   E+ K  +E++ K E  K K
Sbjct: 1564 KKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621



 Score = 34.7 bits (79), Expect = 0.24
 Identities = 39/173 (22%), Positives = 74/173 (42%), Gaps = 7/173 (4%)

Query: 185  SEKNNSAELSR--DLVKAQREAKKAAKLAAK-QKAKVKGDVTAPPDAVTQPESKQSEPSK 241
            + K  + E  +  +  K   EAKK A+ A K  +AK K +     +   +   +  +  +
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADE 1474

Query: 242  AKSKSEPSKEANVQNKPAQGSQ---DTPKPGGPPATLSNVDNKENVKSEGDEV-KSKAQL 297
            AK K+E +K+A+   K A+ ++   D  K        ++   K     + DE  K++   
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK 1534

Query: 298  KAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
            KA+   K E++++A      E+ K + EK   +   +  +  +    K E  K
Sbjct: 1535 KADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587



 Score = 32.4 bits (73), Expect = 1.2
 Identities = 33/167 (19%), Positives = 55/167 (32%), Gaps = 30/167 (17%)

Query: 199  KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS--KSEPSKEANVQN 256
            K   EAKKA +    ++ +   +      A  Q   K  E  KA    K+E  K+A+   
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK 1296

Query: 257  KPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLK--AERRAK---------Q 305
            K  +                     +  K + +E K   + K  AE   K         +
Sbjct: 1297 KAEE-----------------KKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE 1339

Query: 306  EQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
            E ++ A+AA    +      ++  +      K   E   K +  K K
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 32.0 bits (72), Expect = 1.4
 Identities = 41/196 (20%), Positives = 75/196 (38%), Gaps = 12/196 (6%)

Query: 170  QTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKL--AAKQKAKVKGDVTAPPD 227
            +   A + +V      EK   AE ++   KA+    KA +L  A ++K KV+       +
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAK---KAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644

Query: 228  AVTQPES--KQSEPSKAKSKSEPSKEANVQNKPAQGSQ-DTPKPGGPPATLSNVDNKENV 284
               + E   K  E +K K+  E  K    + K  +  + +  +     A     +  +  
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704

Query: 285  ----KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS 340
                K E +E K   +LK      + +  +AK  A  +K K    K   + K + +    
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764

Query: 341  EKSSKTEVLKSKDPNV 356
            E+  K E ++ +   V
Sbjct: 1765 EEEKKAEEIRKEKEAV 1780



 Score = 31.3 bits (70), Expect = 2.7
 Identities = 35/193 (18%), Positives = 76/193 (39%), Gaps = 8/193 (4%)

Query: 185  SEKNNSAELSRDLVKAQR--EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK- 241
            +E    AE +R   + ++  E +KA      + A+   +     +A    ++K++E  K 
Sbjct: 1172 AEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231

Query: 242  ---AKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLK 298
               AK  +E +K+A  +    +  +         A        E  +   +  K++ + K
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291

Query: 299  AERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPS 358
            A+   K E+++  KA    +K + + +    K K E++K  ++ + K      K      
Sbjct: 1292 ADEAKKAEEKK--KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349

Query: 359  TKKYSGVDGVKAT 371
             +  +  D  +A 
Sbjct: 1350 AEAEAAADEAEAA 1362


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 46.1 bits (109), Expect = 4e-05
 Identities = 37/185 (20%), Positives = 80/185 (43%), Gaps = 15/185 (8%)

Query: 154 FISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVK-AQREAKKAAKLAA 212
            I   + ++  + S V T   S  +V  A+  +        RD+    +RE+++ AK   
Sbjct: 157 VIPLKKNILSGNVSDVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAK--R 214

Query: 213 KQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPP 272
            Q+ K + D         Q ++  ++ +  K + E      V+ K  Q +++ PKP    
Sbjct: 215 AQQLKEELDKKQIDADKAQQKADFAQDNADKQRDE------VRQK-QQEAKNLPKPADTS 267

Query: 273 ATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSK 332
           +   +    EN K E +    KAQ++ ++  ++  + +   A  L++   ++EK   + K
Sbjct: 268 SPKEDKQVAENQKREIE----KAQIEIKKNDEEALKAKDHKAFDLKQESKASEK-EAEDK 322

Query: 333 TEDSK 337
             +++
Sbjct: 323 ELEAQ 327



 Score = 30.4 bits (68), Expect = 4.0
 Identities = 11/79 (13%), Positives = 23/79 (29%), Gaps = 4/79 (5%)

Query: 278 VDNKENVKSEG----DEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKT 333
            D KE    E      ++K +   K     K +Q+         ++     +K       
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNL 260

Query: 334 EDSKPASEKSSKTEVLKSK 352
                 S      +V +++
Sbjct: 261 PKPADTSSPKEDKQVAENQ 279


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 44.5 bits (105), Expect = 1e-04
 Identities = 36/143 (25%), Positives = 54/143 (37%), Gaps = 13/143 (9%)

Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
            AKKAAK  A +    K    A   A     SK+     A S  + +K A    K A+ +
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKSAAKKV--SKKKAAKTAVSAKKAAKTA---AKAAKKA 252

Query: 263 QDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKT 322
           + T K     A        + VK    +   KA  KA + A +  + +AKA     K   
Sbjct: 253 KKTAKKALKKA-------AKAVKKAAKKAA-KAAAKAAKGAAKATKGKAKAKKKAGKKAA 304

Query: 323 SNEKSMTKSKTEDSKPASEKSSK 345
           +  K+   +K        +K+ K
Sbjct: 305 AGSKAKATAKAPKRGAKGKKAKK 327



 Score = 35.2 bits (81), Expect = 0.10
 Identities = 30/136 (22%), Positives = 52/136 (38%), Gaps = 13/136 (9%)

Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTA-PPDAVTQPESKQSEPSKAKSKSEPSKEANVQN 256
               ++A   AK AAK+ +K K   TA       +  +K ++ +K  +K    K A    
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVK 268

Query: 257 KPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL 316
           K A+ +          A           K+   + K+K +   +  A  + +  AKA   
Sbjct: 269 KAAKKAAKAAAKAAKGAA----------KATKGKAKAKKKAGKKAAAGSKAKATAKAPKR 318

Query: 317 LEKTKTSNEKSMTKSK 332
             K K +  K +TK +
Sbjct: 319 GAKGKKA--KKVTKKR 332



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 22/118 (18%), Positives = 42/118 (35%), Gaps = 3/118 (2%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
           KA+  AKK +K  A + A              +   K ++ +  K+     K A    K 
Sbjct: 218 KAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKA 277

Query: 259 A-QGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
           A + ++   K     A       K+   + G + K+ A+        ++ ++  K  A
Sbjct: 278 AAKAAKGAAKATKGKAKAKKKAGKK--AAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333



 Score = 31.4 bits (71), Expect = 1.5
 Identities = 21/86 (24%), Positives = 36/86 (41%)

Query: 165 SPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTA 224
           + ++V    A+ +    A  ++K     L +     ++ AKKAAK AAK           
Sbjct: 232 AKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKG 291

Query: 225 PPDAVTQPESKQSEPSKAKSKSEPSK 250
              A  +   K +  SKAK+ ++  K
Sbjct: 292 KAKAKKKAGKKAAAGSKAKATAKAPK 317



 Score = 30.6 bits (69), Expect = 3.0
 Identities = 30/119 (25%), Positives = 43/119 (36%), Gaps = 7/119 (5%)

Query: 259 AQGSQDTPKPG-GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
           A G+ +  K    P A           KS   +V  K   K    AK+  +  AKAA   
Sbjct: 193 AVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKA 252

Query: 318 EKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVP--STKKYSGVDGVKATPGT 374
           +KT     K   K+     K A++K++K     +K          K     G KA  G+
Sbjct: 253 KKTAKKALKKAAKA----VKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGS 307



 Score = 29.5 bits (66), Expect = 5.9
 Identities = 34/140 (24%), Positives = 57/140 (40%), Gaps = 4/140 (2%)

Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
           +A    + K P   S A++    + S+ +      +  T    S K  +   ++   KA+
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV--SAKKAAKTAAKAAKKAK 253

Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
           + AKKA K AAK   K             +  +K ++  KAK+K +  K+A   +K A+ 
Sbjct: 254 KTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKG-KAKAKKKAGKKAAAGSK-AKA 311

Query: 262 SQDTPKPGGPPATLSNVDNK 281
           +   PK G        V  K
Sbjct: 312 TAKAPKRGAKGKKAKKVTKK 331


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 40.7 bits (96), Expect = 0.002
 Identities = 19/130 (14%), Positives = 42/130 (32%), Gaps = 2/130 (1%)

Query: 234 SKQSEPSKAKSKSEPSKEANVQNKPAQG--SQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
           +       A+ +++   +       ++G  +++  K        +     + +      V
Sbjct: 4   ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63

Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
           K           K++ +  AKAAA     K   +  +  SK  + K A +K      +K 
Sbjct: 64  KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123

Query: 352 KDPNVPSTKK 361
            D    +   
Sbjct: 124 IDVLNQADDD 133


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 40.2 bits (94), Expect = 0.003
 Identities = 32/146 (21%), Positives = 63/146 (43%), Gaps = 20/146 (13%)

Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPA 259
           A ++A    K A +  AK      A  +A    E+K++  +  K+ +E  K+A  +    
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEA----EAKRAAAAAKKAAAEAKKKAEAEAA-K 177

Query: 260 QGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEK 319
           + + +  K     A                E K KA+ +A+++A  E +++A A A    
Sbjct: 178 KAAAEAKKKAEAEAA----------AKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA---- 223

Query: 320 TKTSNEKSMTKSKTEDSKPASEKSSK 345
            K +  K+  ++K    K A+ K+++
Sbjct: 224 -KAAAAKAAAEAKAAAEKAAAAKAAE 248



 Score = 35.6 bits (82), Expect = 0.089
 Identities = 33/145 (22%), Positives = 60/145 (41%), Gaps = 7/145 (4%)

Query: 186 EKNNSAELSRDLVKAQREAKKAA-KLAAKQKAKVKGDVTAPPDAVTQPES----KQSEPS 240
           +K  + E ++     Q++A++AA K AA  KAK + +      A  +  +    K    +
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175

Query: 241 KAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAE 300
             K+ +E  K+A  +      ++   K        +  + K+   +E     +KA   AE
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA--AAE 233

Query: 301 RRAKQEQQRQAKAAALLEKTKTSNE 325
            +A  E+   AKAA      K + E
Sbjct: 234 AKAAAEKAAAAKAAEKAAAAKAAAE 258



 Score = 32.5 bits (74), Expect = 0.82
 Identities = 28/122 (22%), Positives = 43/122 (35%), Gaps = 28/122 (22%)

Query: 199 KAQREAKK----AAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
           KA  EAKK     A   A  +AK K +  A   A  + + K    +K K+ +E  K+A  
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAA 221

Query: 255 QNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA 314
           + K A                              E K+ A+  A  +A ++      AA
Sbjct: 222 EAKAAA------------------------AKAAAEAKAAAEKAAAAKAAEKAAAAKAAA 257

Query: 315 AL 316
            +
Sbjct: 258 EV 259



 Score = 30.9 bits (70), Expect = 2.7
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 281 KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS 340
           ++   +E + +K   QL+ ER A QEQ++QA+ AA     K    +         +K  +
Sbjct: 93  QQKQAAEQERLK---QLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149

Query: 341 EKSSK 345
           E  +K
Sbjct: 150 EAEAK 154


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 39.5 bits (92), Expect = 0.007
 Identities = 46/205 (22%), Positives = 73/205 (35%), Gaps = 5/205 (2%)

Query: 171  TTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVT 230
            T  AS S+ T           E   ++VK +  A    K  A  K K + D         
Sbjct: 1211 TKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRL 1270

Query: 231  QPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDE 290
               +  S P+++    E  K    +   A+      KP    + +S+ D+ ++  +    
Sbjct: 1271 AAYNLDSAPAQSAKMEETVKAVPARRAAAR-----KKPLASVSVISDSDDDDDDFAVEVS 1325

Query: 291  VKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
            +  + + K  R+     ++ AK  A  +K   +  +S  K  TE  KPA       E   
Sbjct: 1326 LAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKV 1385

Query: 351  SKDPNVPSTKKYSGVDGVKATPGTT 375
             K    P  KK   V G  AT   T
Sbjct: 1386 RKMRASPFNKKSGSVLGRAATNKET 1410



 Score = 36.0 bits (83), Expect = 0.088
 Identities = 30/163 (18%), Positives = 48/163 (29%), Gaps = 13/163 (7%)

Query: 201  QREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
            + +AK        Q+A  +G+  A      Q   K +     K  SE           A 
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAM 1229

Query: 261  GSQDTP---KPGGPPATLSNVDNKENVKSEGDEVKS-KAQLKAERRAKQEQQRQAKAAAL 316
             +++     KP G              K E DE+   K +L A        Q       +
Sbjct: 1230 ETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETV 1289

Query: 317  LE---------KTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
                       K   ++   ++ S  +D   A E S    + K
Sbjct: 1290 KAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKK 1332


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 38.4 bits (89), Expect = 0.011
 Identities = 29/132 (21%), Positives = 41/132 (31%), Gaps = 5/132 (3%)

Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
           SA GP     P       + +    + V    A+ S V  A       S     +  +A 
Sbjct: 313 SAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA--DRDGESTPAVEETSEAD 370

Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
            E ++   LA +  A  + D  A   A  +P +  SE         P K A     P   
Sbjct: 371 IEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAA---PIPDPA 427

Query: 262 SQDTPKPGGPPA 273
             D     GP  
Sbjct: 428 KPDELAVAGPGD 439


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 38.8 bits (91), Expect = 0.012
 Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 7/149 (4%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
           KA REA+   K A  + AK K  V A    V   ++  ++P   K+ + P   A +  + 
Sbjct: 466 KAAREAR-HKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAARE 524

Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
           A+ +Q   +     A  +    K  V +      + A+ KA++ A+Q    +A+     +
Sbjct: 525 ARKAQARARQAEKQAAAAADPKKAAVAA------AIARAKAKKAAQQAANAEAEEEVDPK 578

Query: 319 KTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
           K   +   +  K+K    + AS +  +  
Sbjct: 579 KAAVAAAIARAKAKKAAQQAASAEPEEQV 607



 Score = 34.5 bits (80), Expect = 0.20
 Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 36/164 (21%)

Query: 186 EKNNSAELSR---DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKA 242
           ++   AE ++   +  +A+ E +KAA+ A  +KA                        +A
Sbjct: 443 QEKKKAEEAKARFEARQARLEREKAAREARHKKAAEA---------------------RA 481

Query: 243 KSKSEPSKEAN--VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAE 300
               +    A   V+ K A  +Q      G        DN   + +           +AE
Sbjct: 482 AKDKDAVAAALARVKAKKAAATQPIVIKAGARP-----DNSAVIAAREARKAQARARQAE 536

Query: 301 RRAKQ-EQQRQAKAAALLEKTK----TSNEKSMTKSKTEDSKPA 339
           ++A      ++A  AA + + K         +    +  D K A
Sbjct: 537 KQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKA 580



 Score = 33.8 bits (78), Expect = 0.33
 Identities = 27/141 (19%), Positives = 44/141 (31%), Gaps = 13/141 (9%)

Query: 182 AIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK 241
           A     N  AE   D  KA   A  A   A K   +          A   P+      + 
Sbjct: 562 AAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAI 621

Query: 242 AKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAER 301
           A++K   +K+A  Q          P+     A ++    +          K+  Q     
Sbjct: 622 ARAK---AKKAEQQANAEPEEPVDPRKAAVAAAIARAKAR----------KAAQQQANAE 668

Query: 302 RAKQEQQRQAKAAALLEKTKT 322
             + E  ++A  AA + + K 
Sbjct: 669 PEEAEDPKKAAVAAAIARAKA 689


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 36.7 bits (85), Expect = 0.031
 Identities = 36/146 (24%), Positives = 58/146 (39%), Gaps = 27/146 (18%)

Query: 186 EKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
           EK  +AE +R     QR A + A   A+Q AK   +           + KQ+E +KAK  
Sbjct: 81  EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEE-----------KQKQAEEAKAKQA 129

Query: 246 SEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKA-----QLKAE 300
           +E   +A  + +     +   +             +E       E K KA     + +AE
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQA-----------EEEAKAKAAAEAKKKAAEAKKKAEAE 178

Query: 301 RRAKQEQQRQAKAAALLEKTKTSNEK 326
            +AK E + +AKA     K + +  K
Sbjct: 179 AKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 36.0 bits (83), Expect = 0.069
 Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 24/120 (20%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPES---KQSEPSKAKSKSEPSKEANVQ 255
           K   EAK  A+  A++KAK +    A  +A  +  +   K++  +K K+++E   +A  +
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186

Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
            K                        E  K++ +  K+KA  +A  +A+ E    A A A
Sbjct: 187 AKAKA---------------------EEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEA 225



 Score = 29.4 bits (66), Expect = 6.2
 Identities = 23/145 (15%), Positives = 56/145 (38%), Gaps = 14/145 (9%)

Query: 205 KKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQD 264
              A+ A + + + K       +   + E +  E  K ++  +  ++   Q   A+  + 
Sbjct: 46  GAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAE--KA 103

Query: 265 TPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE----QQRQAKAAALLEKT 320
             +             K+  + +    ++KA+  AE +AK E    ++ + +A    E+ 
Sbjct: 104 AKQA--------EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEE 155

Query: 321 KTSNEKSMTKSKTEDSKPASEKSSK 345
             +   +  K K  ++K  +E  +K
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAK 180


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 36.4 bits (84), Expect = 0.051
 Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 9/159 (5%)

Query: 206 KAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDT 265
           +A K   K  +K     T P     + ES + E  + +   E  K+     KP +     
Sbjct: 75  EAVKRVEKGGSKGPAAKTKPAKE-PKNESGKEEEKEKEQVKEEKKKKKE--KPKE-EPKD 130

Query: 266 PKPG-----GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKT 320
            KP        P      + ++ V+   D  + K + +   +++ ++  + K     ++ 
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190

Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
               ++     +    KP     ++    +  D     T
Sbjct: 191 PEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRET 229



 Score = 29.9 bits (67), Expect = 5.2
 Identities = 18/122 (14%), Positives = 38/122 (31%), Gaps = 19/122 (15%)

Query: 240 SKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKA 299
            K  SK   +K           +++     G        + ++  + +  + +   +   
Sbjct: 81  EKGGSKGPAAKTK--------PAKEPKNESGKE---EEKEKEQVKEEKKKKKEKPKEEPK 129

Query: 300 ERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
           +R+ K+E +         ++     EK   K   E      EK  +    KS+    P  
Sbjct: 130 DRKPKEEAKE--------KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181

Query: 360 KK 361
           K 
Sbjct: 182 KP 183



 Score = 29.9 bits (67), Expect = 5.7
 Identities = 32/175 (18%), Positives = 64/175 (36%), Gaps = 3/175 (1%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
             + E +K      K+K K K     P D   + E+K+  P K K K +  K    +++ 
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEE-PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161

Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGD-EVKSKAQLKAERRAKQEQQRQAKAAALL 317
            +  ++  +    P            K   + E + +A  +A +   +E     +     
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221

Query: 318 EKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATP 372
           +  K   E + +  + ++S+ +SE S ++     K    PS       +  K T 
Sbjct: 222 DDGKDR-ETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 34.6 bits (80), Expect = 0.054
 Identities = 15/68 (22%), Positives = 32/68 (47%), Gaps = 1/68 (1%)

Query: 289 DEVKSKAQLK-AERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
           +E K K + K A++RAK+++++Q K      K     E+      +E+S    E+  + +
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDK 132

Query: 348 VLKSKDPN 355
             +  +  
Sbjct: 133 QEEPVEIM 140



 Score = 28.1 bits (63), Expect = 9.4
 Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 1/65 (1%)

Query: 294 KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS-EKSSKTEVLKSK 352
           K +   E+ AK+  +RQ K     +K K        + +   S   S ++  + E  K +
Sbjct: 75  KKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134

Query: 353 DPNVP 357
           +P   
Sbjct: 135 EPVEI 139


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 36.1 bits (83), Expect = 0.062
 Identities = 34/154 (22%), Positives = 59/154 (38%), Gaps = 6/154 (3%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
           KA   AK  A   AKQK +   +VT    A  + ++  +  +KA + ++  +E   +   
Sbjct: 96  KAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTE 155

Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEG-----DEVKSKAQLKAERRAKQEQQRQAKA 313
            +   D  K     A  +        K +         +   + KA+ +AK     +AKA
Sbjct: 156 EEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKA 215

Query: 314 AALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
           AA L K K S     +  +   +K  +   +K  
Sbjct: 216 AA-LAKQKASQGNGDSGDEDAKAKAIAAAKAKAA 248



 Score = 35.3 bits (81), Expect = 0.097
 Identities = 27/162 (16%), Positives = 65/162 (40%), Gaps = 26/162 (16%)

Query: 187 KNNSAELSRDLVKAQREAKKAAKLA-AKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
           K  +A  +++  + +  AK  A+++  +++ + K       D +T  E+K+   + AK+K
Sbjct: 11  KKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAK 70

Query: 246 SEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQ 305
           +     A +  +  +G+++                     +E ++ K+KA+  A  +AK 
Sbjct: 71  A-----AALAKQKREGTEEV--------------------TEEEKAKAKAKAAAAAKAKA 105

Query: 306 EQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
               + K     E T+     +  K+       A+  + +  
Sbjct: 106 AALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKR 147



 Score = 31.5 bits (71), Expect = 1.8
 Identities = 35/143 (24%), Positives = 50/143 (34%), Gaps = 24/143 (16%)

Query: 192 ELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKE 251
           E   D  KA+ +A  AAK  A   AK K             E K    +KA + ++    
Sbjct: 157 EEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAA 216

Query: 252 ANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA 311
           A  + K +QG+ D+                      GDE  +KA+  A  +AK     +A
Sbjct: 217 ALAKQKASQGNGDS----------------------GDE-DAKAKAIAAAKAKAAAAARA 253

Query: 312 KAAALLEKTKTSNEKSMTKSKTE 334
           K     E  K    K    S  +
Sbjct: 254 KTKG-AEGKKEEEPKQEEPSVNQ 275



 Score = 29.6 bits (66), Expect = 7.4
 Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 5/90 (5%)

Query: 187 KNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK--- 243
           K  +AE      +   E K  AK  A   AK K    A   A         E +KAK   
Sbjct: 182 KQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKAIA 241

Query: 244 -SKSEPSKEANVQNKPAQGS-QDTPKPGGP 271
            +K++ +  A  + K A+G  ++ PK   P
Sbjct: 242 AAKAKAAAAARAKTKGAEGKKEEEPKQEEP 271


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.1 bits (84), Expect = 0.063
 Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 271 PPATLSNVDNKENVKSEGDEVKS--KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSM 328
           P       +  EN      E K   K Q KAE++A++E+  +A A    E      +   
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450

Query: 329 TKSKTEDSKPASEKSSKTEVLKSKDP 354
            ++K  D  P  EK     + +++DP
Sbjct: 451 GETKKVDPDPLGEK-----LARTEDP 471



 Score = 32.2 bits (74), Expect = 1.0
 Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 5/75 (6%)

Query: 183 IMSEKNNSAELSRDLVKAQREAKKAAKLAA-----KQKAKVKGDVTAPPDAVTQPESKQS 237
             +   + AE  +   K ++  KKA K  A     K+KA+        PD  T+      
Sbjct: 401 GENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460

Query: 238 EPSKAKSKSEPSKEA 252
              K     +P +EA
Sbjct: 461 LGEKLARTEDPLEEA 475


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.2 bits (84), Expect = 0.071
 Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 16/194 (8%)

Query: 173  PASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAK-VKGDVTAPPDAVTQ 231
            P    +      S  + S + S      + ++ +  KL  K   K      +   D   Q
Sbjct: 1172 PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQ 1231

Query: 232  PESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
                +    K     + +   + ++     S D  K G P      V             
Sbjct: 1232 KTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVS-----------A 1280

Query: 292  KSKAQLKAERRAKQEQQRQAKA----AALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
               +     +R   E    +K        ++K    +  ++ K K  + K A +K SKT 
Sbjct: 1281 VQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340

Query: 348  VLKSKDPNVPSTKK 361
            V ++         +
Sbjct: 1341 VKQASASQSSRLLR 1354


>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
           protein (MOBP).  MOBP is abundantly expressed in central
           nervous system myelin, and shares several
           characteristics with myelin basic protein (MBP), in
           terms of regional distribution and function. This family
           is the middle and C-terminal regions of MOBP which has
           been shown to be essential for normal arrangement of the
           radial component in central nervous system myelin. Most
           member-proteins carry a FVHE-PHD type zinc-finger at
           their N-terminus.
          Length = 710

 Score = 35.9 bits (82), Expect = 0.083
 Identities = 37/155 (23%), Positives = 70/155 (45%), Gaps = 9/155 (5%)

Query: 205 KKAAKLAAKQKAKVKG--DVTAPPDAVTQPESKQSEPSKAKSKSE--PSKEANVQNKPAQ 260
           +KA K  A +K  + G  DV   P A     S  S+P +A   ++    +    +  P +
Sbjct: 328 RKAPKNPAAEKMPLHGELDVNFNPQAAGGETSDSSDPEEALHTADRRARRWRRARLGPEE 387

Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKT 320
            +++ P PG     L      +++ SE D + ++AQ         E++ + +   L E  
Sbjct: 388 PNKELPSPGAHLRALDTAQVSDDL-SETD-ISNEAQDPQSLTDSTEEKLRNR---LYELA 442

Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPN 355
              +EK  +  + ++S+P +E  ++ E L S+D N
Sbjct: 443 MKMSEKETSSGEDQESEPKAEPENQKESLSSEDNN 477


>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 755

 Score = 35.7 bits (82), Expect = 0.098
 Identities = 22/140 (15%), Positives = 43/140 (30%), Gaps = 33/140 (23%)

Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
            +L     +A+      A   A  +  + AP +  T P             S P+ + + 
Sbjct: 305 AELQAQLNKAQHELAQKAAPLAAAQAALDAPAETATAP-------------SAPAPQVSA 351

Query: 255 QNKPAQG-----SQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQR 309
           ++ PAQ      +     P G  A                +   +   +    A + + R
Sbjct: 352 ESSPAQPGSYLLAAPGDAPLGELAQA--------------QSARERLAEESVPAAEPRSR 397

Query: 310 QAKAAALLEKTKTSNEKSMT 329
            A   A +E+     E  ++
Sbjct: 398 LA-PVAAVEQPFAEVESPLS 416


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 35.3 bits (81), Expect = 0.10
 Identities = 25/119 (21%), Positives = 40/119 (33%), Gaps = 4/119 (3%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
           K + EA KA   A   K K   +     +   +      E       +   K+A  + K 
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAE---EAKAKAEAAAAKKKAEAEAKA 202

Query: 259 AQGSQDTPKPGGPPATLSNVDN-KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL 316
           A             A        +E   +E  +  +KA+      A +  +R+A AAAL
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261



 Score = 34.5 bits (79), Expect = 0.17
 Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 7/154 (4%)

Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNK 257
           +++Q+ + K  +   K+K +   +   P  A  Q   KQ E  + K++ E  K+A    K
Sbjct: 67  IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQ-EQQKQAEEAEK 125

Query: 258 PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA------ 311
            AQ  Q   +     A        E  K++     +K +  AE + K E+  +A      
Sbjct: 126 QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKA 185

Query: 312 KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSK 345
           KA A   K K   E      K +    A  K+ K
Sbjct: 186 KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEK 219



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 32/173 (18%), Positives = 73/173 (42%), Gaps = 8/173 (4%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK---EANVQ 255
           + Q++A++A K A  ++ + +    A   A  Q   K++E +KAK+ +E +K    A  +
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEE--QARKAAAEQ--KKKAEAAKAKAAAEAAKLKAAAEAK 170

Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKE-NVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA 314
            K  + ++   +        +     E   K+  ++ K++A+ KA+   K E   + KAA
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAA 230

Query: 315 ALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDG 367
           A  +K     +     +  + ++  +  ++  ++         + K   G   
Sbjct: 231 AEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTGGGAKN 283


>gnl|CDD|197239 cd09141, PLDc_vPLD1_2_yPLD_like_2, Catalytic domain, repeat 2, of
           vertebrate phospholipases, PLD1 and PLD2, yeast PLDs,
           and similar proteins.  Catalytic domain, repeat 2, of
           vertebrate phospholipases D (PLD1 and PLD2), yeast
           phospholipase D (PLD SPO14/PLD1), and other similar
           eukaryotic proteins. These PLD enzymes play a pivotal
           role in transmembrane signaling and cellular regulation.
           They hydrolyze the terminal phosphodiester bond of
           phospholipids resulting in the formation of phosphatidic
           acid and alcohols. Phosphatidic acid is an essential
           compound involved in signal transduction. PLDs also
           catalyze the transphosphatidylation of phospholipids to
           acceptor alcohols, by which various phospholipids can be
           synthesized. The vertebrate PLD1 and PLD2 are membrane
           associated phosphatidylinositol 4,5-bisphosphate
           (PIP2)-dependent enzymes that selectively hydrolyze
           phosphatidylcholine (PC). Protein cofactors and calcium
           may be required for their activation. Yeast SPO14/PLD1
           is a calcium-independent PLD, which needs PIP2 for its
           activity. Instead of the regulatory calcium-dependent
           phospholipid-binding C2 domain in plants, most mammalian
           and yeast PLDs have adjacent Phox (PX) and the
           Pleckstrin homology (PH) domains at the N-terminus,
           which have been shown to mediate membrane targeting of
           the protein and are closely linked to
           polyphosphoinositide signaling. The PX and PH domains
           are also present in zeta-type PLD from Arabidopsis,
           which is more closely related to vertebrate PLDs than to
           other plant PLD types. In addition, this subfamily also
           includes some related proteins which have either PX-like
           or PH domains in their N-termini. Like other members of
           the PLD superfamily, the monomer of mammalian and yeast
           PLDs consists of two catalytic domains, each containing
           one copy of the conserved HKD motif (H-x-K-x(4)-D, where
           x represents any amino acid residue). Two HKD motifs
           from the two domains form a single active site. These
           PLDs utilize a common two-step ping-pong catalytic
           mechanism involving an enzyme-substrate intermediate to
           cleave phosphodiester bonds. The two histidine residues
           from the two HKD motifs play key roles in the catalysis.
           Upon substrate binding, a histidine residue from one HKD
           motif could function as the nucleophile, attacking the
           phosphodiester bond to create a covalent
           phosphohistidine intermediate, while the other histidine
           residue from the second HKD motif could serve as a
           general acid, stabilizing the leaving group.
          Length = 183

 Score = 34.5 bits (80), Expect = 0.11
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 11/55 (20%)

Query: 503 YIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIV 557
           YI  Q          FF +    +D +      +LV++I+  AH++G KFRV IV
Sbjct: 25  YIENQ----------FFISSTGGEDPVKNRIGEALVDRII-RAHKEGEKFRVYIV 68


>gnl|CDD|238319 cd00570, GST_N_family, Glutathione S-transferase (GST) family,
           N-terminal domain; a large, diverse group of cytosolic
           dimeric proteins involved in cellular detoxification by
           catalyzing the conjugation of  glutathione (GSH) with a
           wide range of endogenous and xenobiotic alkylating
           agents, including carcinogens, therapeutic drugs,
           environmental toxins and products of oxidative stress.
           In addition, GSTs also show GSH peroxidase activity and
           are involved in the synthesis of prostaglandins and
           leukotrienes. This family, also referred to as soluble
           GSTs, is the largest family of GSH transferases and is
           only distantly related to the mitochondrial GSTs (GSTK
           subfamily, a member of the DsbA family). Soluble GSTs
           bear no structural similarity to microsomal GSTs (MAPEG
           family) and display additional activities unique to
           their group, such as catalyzing thiolysis, reduction
           and isomerization of certain compounds. The GST fold
           contains an N-terminal TRX-fold domain and a C-terminal
           alpha helical domain, with an active site located in a
           cleft between the two domains. Based on sequence
           similarity, different classes of GSTs have been
           identified, which display varying tissue distribution,
           substrate specificities and additional specific
           activities. In humans, GSTs display polymorphisms which
           may influence individual susceptibility to diseases such
           as cancer, arthritis, allergy and sclerosis. Some GST
           family members with non-GST functions include
           glutaredoxin 2, the CLIC subfamily of anion channels,
           prion protein Ure2p, crystallins, metaxin 2 and
           stringent starvation protein A.
          Length = 71

 Score = 31.4 bits (72), Expect = 0.17
 Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 1/47 (2%)

Query: 531 TYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQV 577
            +  S    ++ L   EKG  + ++ VD     E +E L      +V
Sbjct: 5   YFPGSPRSLRVRLALEEKGLPYELVPVDLGEG-EQEEFLALNPLGKV 50


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 34.8 bits (80), Expect = 0.17
 Identities = 16/96 (16%), Positives = 39/96 (40%), Gaps = 4/96 (4%)

Query: 265 TPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSN 324
           T KP    A+  + ++ E++    DE++   + K      ++ +  ++        K+S 
Sbjct: 15  TKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKP----PKKPKEASRPGTPRNPKKSSK 70

Query: 325 EKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTK 360
               + + +E+      KS+++          PST+
Sbjct: 71  PTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTE 106



 Score = 33.6 bits (77), Expect = 0.37
 Identities = 25/123 (20%), Positives = 45/123 (36%), Gaps = 14/123 (11%)

Query: 207 AAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK----------EANVQN 256
                AK   K K           +  SK +E S A S+ +P+K                
Sbjct: 42  EQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSK 101

Query: 257 KPAQGSQDTPKPGGPPATLSNVDNKEN----VKSEGDEVKSKAQLKAERRAKQEQQRQAK 312
            P+  S++  +P   P  ++++    +    + S      S A  +AE+   + QQ +A+
Sbjct: 102 APSTESEEEEEPEETPDPIASIGGWWSLWGSITSTATSTASAAVKQAEQAVNEIQQEEAQ 161

Query: 313 AAA 315
             A
Sbjct: 162 LWA 164


>gnl|CDD|107248 cd01391, Periplasmic_Binding_Protein_Type_1, Type 1 periplasmic
           binding fold superfamily.  Type 1 periplasmic binding
           fold superfamily. This model and hierarchy represent the
           ligand binding domains of the LacI family of
           transcriptional regulators, periplasmic binding proteins
           of the ABC-type transport systems, the family C
           G-protein couples receptors (GPCRs), membrane bound
           guanylyl cyclases including the family of natriuretic
           peptide receptors (NPRs), and the N-terminal
           leucine/isoleucine/valine- binding protein  (LIVBP)-like
           domains of the ionotropic glutamate receptors (iGluRs).
           In LacI-like transcriptional regulator and the bacterial
           periplasmic binding proteins the ligands are
           monosaccharides including lactose, ribose, fructose,
           xylose, arabinose, galactose/glucose, and other sugars,
           with a few exceptions.  Periplasmic sugar binding
           proteins are one of the components of ABC transporters
           and are involved in the active transport of
           water-soluble ligands. The LacI family of proteins
           consists of transcriptional regulators related to the
           lac repressor. In this case, the sugar binding domain
           binds a sugar which changes the DNA binding activity of
           the repressor domain. The periplasmic binding proteins
           are the primary receptors for chemotaxis and transport
           of many sugar based solutes. The core structures of
           periplasmic binding proteins are classified into two
           types, and they differ in number and order of beta
           strands: type 1 has  six beta strands, while type 2 has
           five beta strands per sub-domain. These two structural
           folds are thought to be distantly related via a common
           ancestor. Notably, while the N-terminal LIVBP-like
           domain of iGluRs belongs to the type 1
           periplasmic-binding fold protein superfamily, the
           glutamate-binding domain of the iGluR is structurally
           similar to the type 2 periplasmic-binding fold.
          Length = 269

 Score = 33.7 bits (77), Expect = 0.24
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 3/58 (5%)

Query: 541 ILLTAHEKGTKFRVIIVDG-SPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMREVSKV 597
           I L A E G    VI+ D  S      E LR L++  VD   ++    S     V ++
Sbjct: 22  IELAAEEIGRGLEVILADSQSDPERALEALRDLIQQGVD--GIIGPPSSSSALAVVEL 77


>gnl|CDD|222808 PHA00666, PHA00666, putative protease.
          Length = 233

 Score = 33.5 bits (76), Expect = 0.32
 Identities = 27/118 (22%), Positives = 48/118 (40%), Gaps = 8/118 (6%)

Query: 201 QREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
           ++     ++ AA + A   GD  AP    TQ E  + +P+    K E  K+A       +
Sbjct: 15  EQPGDGGSQPAASEPAAGAGDNPAPQGDPTQEEGDKPQPAAGADKPEGDKKA--DGDKPE 72

Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
             +   KP G P      +  E   +EG E+ + A    E  A++      +A  +++
Sbjct: 73  EKKPGEKPEGAP------EKYEFQAAEGVELDTGALGAFEPVARELNLTNEQAQKVVD 124


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 33.7 bits (77), Expect = 0.33
 Identities = 30/171 (17%), Positives = 52/171 (30%), Gaps = 12/171 (7%)

Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEA--NVQNKPAQ 260
             K+A  +      +  G    P      P  K +   K  S   P       V+ K A+
Sbjct: 123 YGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAK 182

Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKS---------KAQLKAERRAKQEQQRQA 311
            +QDT K      T      K          KS         K + K E++   E   + 
Sbjct: 183 KTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKE 242

Query: 312 KAAALLEKT-KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKK 361
           ++     K      ++S   +  ++ +   E     E   S++      K+
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKE 293



 Score = 30.2 bits (68), Expect = 4.3
 Identities = 37/217 (17%), Positives = 69/217 (31%), Gaps = 14/217 (6%)

Query: 155 ISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQ 214
               +    T+P+  + T   L  V  A  S         +    + +  K       K 
Sbjct: 122 EYGKQAGPITNPNVKRRTGVGLPPVAPA-ASPALKPTANGK--RPSSKPPKSIMSPEVKV 178

Query: 215 KAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQN-----KPAQGSQDTPKPG 269
           K+  K   T+     T+    ++    A  K  P K++N+ +     K  +  +      
Sbjct: 179 KSAKKTQDTSKE-TTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASE 237

Query: 270 GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQ---QRQAKAAALLEKTKTSNE- 325
                 S  ++ +      DE      L  +    + +   +R        EK K   + 
Sbjct: 238 STVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297

Query: 326 -KSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKK 361
            K M + + ED +      S  E  +S++P  P   K
Sbjct: 298 LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPK 334


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 33.7 bits (78), Expect = 0.35
 Identities = 20/132 (15%), Positives = 40/132 (30%), Gaps = 12/132 (9%)

Query: 196 DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQ 255
           +L+ A +EA+     AA                      +  +P+   + ++       +
Sbjct: 50  ELIAAIKEARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAAR 109

Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
              A  ++    P    A             E  E    A+  A R+A +  ++ A  A 
Sbjct: 110 AAAAAAAEAASAPEAAQA------------RERRERGEAARRGAARKAGEGGEQPATEAR 157

Query: 316 LLEKTKTSNEKS 327
                +T  E+ 
Sbjct: 158 ADAAERTEEEER 169


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 33.5 bits (77), Expect = 0.39
 Identities = 21/98 (21%), Positives = 31/98 (31%)

Query: 177 SQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQ 236
           +    A  +EK   A        A   A+ AA  A            A P A   P    
Sbjct: 368 AAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVA 427

Query: 237 SEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPAT 274
           +  + A + +  +  A V   PA  +Q  P+    P  
Sbjct: 428 APAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVR 465


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 33.2 bits (76), Expect = 0.44
 Identities = 23/120 (19%), Positives = 43/120 (35%), Gaps = 6/120 (5%)

Query: 192 ELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS--KSEPS 249
           +  R+L K  ++  K+ K     +   K   T       +P+    +  + K   +S  S
Sbjct: 92  QQLRELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATS 151

Query: 250 KEANVQNKPAQGSQDTPKPGGPPATLSNVDNK----ENVKSEGDEVKSKAQLKAERRAKQ 305
            E++  + P      TP         S    K     +  S GDE   K++   ++  K+
Sbjct: 152 PESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKK 211


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 33.2 bits (76), Expect = 0.53
 Identities = 38/175 (21%), Positives = 59/175 (33%), Gaps = 30/175 (17%)

Query: 226 PDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQ-----------DTPKPGGPPAT 274
           P     P  K+   +   SK EP      Q  PA  SQ              +      T
Sbjct: 233 PSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSRRT 292

Query: 275 ------LSNVDNKENVKSEGDE-------VKSKAQLKAERRAKQEQ-QRQAKAAALLEKT 320
                  +N    E +K E  E        ++ + L   +R ++ Q      +    EK 
Sbjct: 293 PSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSDSSRQHEEKL 352

Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDG---VKATP 372
           K     +  K + + +  A +KS++   L SKD    + K+ S       V ATP
Sbjct: 353 KKKQMLNEQKKELKRAISALKKSNRE--LSSKDIVETAEKRSSQFGQGVQVTATP 405


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 32.9 bits (75), Expect = 0.64
 Identities = 17/100 (17%), Positives = 29/100 (29%), Gaps = 6/100 (6%)

Query: 227  DAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKS 286
            D +T    +       +    P +      KP     D  KP          D K+N   
Sbjct: 995  DPLTAAVVRVGGIEAPEKTPPPPEHEPQAPKPPTQDPDGSKPKDKV------DPKDNKDP 1048

Query: 287  EGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEK 326
                       +  +  +++++ Q    A L  T T N  
Sbjct: 1049 LTPPGSDDENGETPKGNEEKKEEQPDKGANLPNTGTKNSN 1088


>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
           [Intracellular trafficking and secretion].
          Length = 211

 Score = 32.3 bits (73), Expect = 0.66
 Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 5/132 (3%)

Query: 200 AQREAKKAAKLAAKQKAKVKG-DVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
            +R+ K+A     + + K +  DV +  DA    ++   E    K +  P  E  V    
Sbjct: 3   VERKKKQAKIEKEENEQKERSLDVVSKGDAFGDGKAGGEE---KKVQQSPFLENAVPEGD 59

Query: 259 AQGSQDTPKPGGPPAT-LSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
                ++P     P   L  V  K N + E    K++A+L       +E + +     ++
Sbjct: 60  EGKGPESPNIHFEPVVELQRVHLKTNEEDETVLFKARAKLFRFDEEAKEWKERGTGDMII 119

Query: 318 EKTKTSNEKSMT 329
            K K +N+  +T
Sbjct: 120 LKHKKTNKARIT 131


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 32.7 bits (75), Expect = 0.72
 Identities = 13/104 (12%), Positives = 26/104 (25%), Gaps = 7/104 (6%)

Query: 212 AKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP 271
           +KQ          PP    +        + A +    +        P    +        
Sbjct: 187 SKQVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPL-----PQAQPKKAATEEEL 241

Query: 272 PATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
            A            +      +K +   + + K+ +Q QA  A 
Sbjct: 242 IADAK--KAATGEPAANAAKAAKPEPMPDDQQKEAEQLQAAIAQ 283



 Score = 30.1 bits (68), Expect = 4.4
 Identities = 15/104 (14%), Positives = 32/104 (30%), Gaps = 5/104 (4%)

Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTA----PPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
           V+            A+++A+     TA     P A   P+++  + +  +     +K+A 
Sbjct: 190 VEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAA 249

Query: 254 VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQL 297
                A  ++   KP   P        +              +L
Sbjct: 250 TGEPAANAAK-AAKPEPMPDDQQKEAEQLQAAIAQAIGGVAGKL 292


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 32.9 bits (74), Expect = 0.74
 Identities = 39/173 (22%), Positives = 69/173 (39%), Gaps = 12/173 (6%)

Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN---- 253
           VK +   ++AA  AA+ K +VK    A   A    E+K++      + +   K  +    
Sbjct: 154 VKKETTTQQAAP-AAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWA 212

Query: 254 VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQ------ 307
           +  K     QD        ++   V  K  +K   +    KA++K E  A ++Q      
Sbjct: 213 LSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVK 272

Query: 308 -QRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
                  A   +K  T+ +++  K+ TE +KPA   S+ T   K+      +T
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNT 325


>gnl|CDD|169386 PRK08334, PRK08334, translation initiation factor IF-2B subunit
           beta; Validated.
          Length = 356

 Score = 32.2 bits (73), Expect = 0.88
 Identities = 57/274 (20%), Positives = 119/274 (43%), Gaps = 26/274 (9%)

Query: 440 SEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEV 499
           ++ E+  GF +    A   L   RP +V++  A+   K  + +   D  D   RL ++E 
Sbjct: 81  TKDEFMDGFYK----AYETLKNTRPTAVNLFWALNRIKKLVEEHLEDPLDEIKRLIVEE- 135

Query: 500 IATYIHEQVDMAGNAICMFFHNKLANDDVILTY-GCSSL-------VEKILLTAHEKGTK 551
            A  I ++ D+  N     +  ++  +  +LT+    SL       V  +L   H+ GT 
Sbjct: 136 -AQKIADE-DVEANLRMGHYGAEVLPEGNVLTHCNAGSLATVHLGTVGAVLRVMHKDGT- 192

Query: 552 FRVIIVDGS-PWYEGKEMLRRLVKHQVDCSYVLLS--AVSYIMRE--VSKVIIGAHALLS 606
            +++ VD + P  +G   L     H       L+S     ++M++  V  +I+GA  +++
Sbjct: 193 LKLLWVDETRPVLQGAR-LSAWEYHYDGIPLKLISDNMAGFVMQQGKVDAIIVGADRIVA 251

Query: 607 NGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAA 666
           NG   ++ GT  ++++A+   +P             +++   +  E   P E+++     
Sbjct: 252 NGDFANKIGTYTLAVLAKEHGIPFFTVAPLSTIDMSLKSGKEIPIEERSPEEVLT----C 307

Query: 667 KNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
              +    +   +  +D+TP   +T +IT+  +V
Sbjct: 308 GGCRIAPDVDVYNPAFDVTPHKYLTGIITDRGVV 341


>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           It is often found on plasmids. This protein family the
           member related to ParB, and is designated PRTRC system
           ParB family protein.
          Length = 554

 Score = 32.4 bits (74), Expect = 1.0
 Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 19/129 (14%)

Query: 130 KPLSGNKEIPLYSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVT------TAI 183
           +P      IPL +A G       Q  +C  +      ++V   P SL +VT      T  
Sbjct: 257 RPGDNFTVIPL-AADGTLGVGEEQATAC--RSCEDFGAAVSALPGSLGKVTKGLCFDTVC 313

Query: 184 MSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK 243
            + K          V A+++A++AA  AA++ A        P    +  E+  S  +K  
Sbjct: 314 NTTK----------VAARKKAERAAAAAAQKPAAPAAGPGTPAKEKSPAETATSGAAKPA 363

Query: 244 SKSEPSKEA 252
           +K       
Sbjct: 364 AKKAVPSSQ 372



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 20/64 (31%), Positives = 30/64 (46%), Gaps = 5/64 (7%)

Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
           K  A+ KAER A    Q+ A  AA       + EKS  ++ T     A++ ++K  V  S
Sbjct: 317 KVAARKKAERAAAAAAQKPAAPAA--GPGTPAKEKSPAETAT---SGAAKPAAKKAVPSS 371

Query: 352 KDPN 355
           +  N
Sbjct: 372 QPSN 375


>gnl|CDD|106976 PHA00667, PHA00667, hypothetical protein.
          Length = 158

 Score = 30.8 bits (68), Expect = 1.4
 Identities = 6/91 (6%), Positives = 20/91 (21%), Gaps = 5/91 (5%)

Query: 179 VTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSE 238
            T A ++     A  ++     +++ +  +       A              +     S+
Sbjct: 27  TTAAQVAAVAAEAPAAQQEASTEKKPRATSAKTTAATAPTPPTAEVGAADAPE-----SK 81

Query: 239 PSKAKSKSEPSKEANVQNKPAQGSQDTPKPG 269
              +          +     A  +       
Sbjct: 82  AENSAPVQAGDAGQDAAAGDAAPTYQDTADA 112


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 32.0 bits (72), Expect = 1.4
 Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 11/79 (13%)

Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
           EA K+A+    ++A      TA P+  T+P  +  +    +    P ++     +     
Sbjct: 60  EASKSAEANTTKEA----PATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVE--- 112

Query: 263 QDTPKPGGPPATLSNVDNK 281
               K    PAT+SN DN 
Sbjct: 113 ----KAPAEPATVSNPDNA 127


>gnl|CDD|215467 PLN02866, PLN02866, phospholipase D.
          Length = 1068

 Score = 32.0 bits (73), Expect = 1.5
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 518 FFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIV 557
           FF + L+ DD I      +L  +IL  AH++   FRVIIV
Sbjct: 753 FFISGLSGDDTIQNRVLEALYRRIL-RAHKEKKCFRVIIV 791


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 31.8 bits (71), Expect = 1.5
 Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 5/120 (4%)

Query: 135 NKEIPLYSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELS 194
           NK+ P       +S ++ +  + A +       S  TTP  +   T  +      S+E +
Sbjct: 502 NKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETT 561

Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPE-SKQSEPSKAKSKSEPSKEAN 253
           +D+V+    + +A   A  QKA +K       D  TQ + +K ++ +KAKS  +  +E+N
Sbjct: 562 KDVVQTSAGSSEAKDSAPLQKANIKNT----NDGHTQSQNNKNTQENKAKSLPQTGEESN 617


>gnl|CDD|222110 pfam13409, GST_N_2, Glutathione S-transferase, N-terminal domain.
           This family is closely related to pfam02798.
          Length = 68

 Score = 28.7 bits (65), Expect = 1.6
 Identities = 9/36 (25%), Positives = 15/36 (41%)

Query: 535 SSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLR 570
           S    ++ L    KG  + +  V   PW +  E+L 
Sbjct: 2   SPFARRVRLALELKGLPYEIEEVPLDPWDKPPELLA 37


>gnl|CDD|181561 PRK08818, PRK08818, prephenate dehydrogenase; Provisional.
          Length = 370

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 24/92 (26%), Positives = 46/92 (50%), Gaps = 9/92 (9%)

Query: 141 YSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKA 200
           +    P S      +  A+ LI+++P  ++ T A + +     ++    + +L  D V +
Sbjct: 35  HDPADPGSLDPATLLQRADVLIFSAP--IRHTAALIEEYVA--LAGGRAAGQLWLD-VTS 89

Query: 201 QREAKKAAKLAAKQKAKVKG--DVTAPPDAVT 230
            ++A  AA LA+  +A+V G   +TAPP + T
Sbjct: 90  IKQAPVAAMLAS--QAEVVGLHPMTAPPKSPT 119


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 31.6 bits (72), Expect = 1.8
 Identities = 23/130 (17%), Positives = 47/130 (36%), Gaps = 6/130 (4%)

Query: 232 PESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
            E    E ++  SK    +    +  P  G ++            N   ++    E +E+
Sbjct: 404 EEESDEEENEEPSKKNVGRR---KFGPENGEKEAESKK-LKKENKNEFKEKKESDEEEEL 459

Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
           + + + K E+ A +  +R  KA    E+ +   ++     KT  S   S K   ++   S
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEE--LDEENPWLKTTSSVGKSAKKQDSKKKSS 517

Query: 352 KDPNVPSTKK 361
              +  + K 
Sbjct: 518 SKLDKAANKI 527


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 31.2 bits (71), Expect = 1.8
 Identities = 19/110 (17%), Positives = 33/110 (30%), Gaps = 7/110 (6%)

Query: 228 AVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP----PATLSNVDNKEN 283
           A+ +  +  +    AKS  E  +        A  +        P     A    +D    
Sbjct: 72  AILEEGNDATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLSAV 131

Query: 284 VKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL---LEKTKTSNEKSMTK 330
             +      +K  +  +  A    Q+ A AAA       T+      MT+
Sbjct: 132 PGTGVTGRVTKEDIIKKTEAPASAQQPAPAAAAKAPANFTRPEERVKMTR 181


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 31.1 bits (69), Expect = 1.9
 Identities = 38/148 (25%), Positives = 55/148 (37%), Gaps = 7/148 (4%)

Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNK-- 257
           A    KK+AK AA  KA       A P A       ++  + AK+ + P+K A    K  
Sbjct: 209 AAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAA 268

Query: 258 --PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
             PA+ +    K   PPA  +    K    +      + A  KA     +     AKAAA
Sbjct: 269 APPAKAAAPPAKAAAPPAKAAAPPAK---AAAAPAKAAAAPAKAAAAPAKAAAPPAKAAA 325

Query: 316 LLEKTKTSNEKSMTKSKTEDSKPASEKS 343
              K  T   K+        + P  +K+
Sbjct: 326 PPAKAATPPAKAAAPPAKAAAAPVGKKA 353



 Score = 30.7 bits (68), Expect = 3.1
 Identities = 40/154 (25%), Positives = 56/154 (36%), Gaps = 12/154 (7%)

Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK-----AKSKSEPSKEANV 254
           A  + K AAK AA    K      AP  A   P    + P+K     AK+ + P+K A  
Sbjct: 197 AAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP 256

Query: 255 QNK----PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQ 310
             K    PA+ +    K   PPA  +    K    +      + A  KA     +     
Sbjct: 257 PAKAAAPPAKAAAPPAKAAAPPAKAAAPPAK---AAAPPAKAAAAPAKAAAAPAKAAAAP 313

Query: 311 AKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSS 344
           AKAAA   K      K+ T      + PA   ++
Sbjct: 314 AKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAA 347


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 31.2 bits (70), Expect = 2.1
 Identities = 16/61 (26%), Positives = 26/61 (42%)

Query: 305 QEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSG 364
           + + R A AAA+LE+  + ++     +  E  KP   K ++ E L S           SG
Sbjct: 99  KSRVRSASAAAILEEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSG 158

Query: 365 V 365
            
Sbjct: 159 S 159


>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
           protein.  [Transport and binding proteins, Other].
          Length = 1394

 Score = 31.2 bits (71), Expect = 2.3
 Identities = 13/60 (21%), Positives = 28/60 (46%)

Query: 179 VTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSE 238
               +++E N  A+   +++  +R + K AK A +  A  K D+ A     +   + +S+
Sbjct: 684 FVYILLTEFNKGAKQKGEILVFRRGSLKRAKKAGETSASNKNDIEAGEVLGSTDLTDESD 743


>gnl|CDD|188123 TIGR01265, tyr_nico_aTase, tyrosine/nicotianamine family
           aminotransferase.  This subfamily of pyridoxal
           phosphate-dependent enzymes includes known examples of
           both tyrosine aminotransferase from animals and
           nicotianamine aminotransferase from barley.
          Length = 403

 Score = 30.8 bits (70), Expect = 2.6
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 12/49 (24%)

Query: 497 KEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTA 545
           +E +A Y+   +             KL  DDV+LT GCS  +E I + A
Sbjct: 79  REAVAEYLSSDLP-----------GKLTADDVVLTSGCSQAIE-ICIEA 115


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 30.7 bits (69), Expect = 2.8
 Identities = 21/111 (18%), Positives = 44/111 (39%), Gaps = 8/111 (7%)

Query: 239 PSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVD-----NKENVKSEGDEVKS 293
             + + +S   +E   + +    S+D  +P   P      D      +E   ++   ++ 
Sbjct: 6   ELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEE 65

Query: 294 KAQLKAERRAKQEQQ---RQAKAAALLEKTKTSNEKSMTKSKTEDSKPASE 341
           +A+ KAE R ++  +    + K    L+K  T  E ++    T+D     E
Sbjct: 66  EAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEE 116


>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
          Length = 413

 Score = 30.6 bits (69), Expect = 2.9
 Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 9/156 (5%)

Query: 218 VKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSN 277
           VKG V    D  +    +++   +A++ +           PA+       P   P     
Sbjct: 251 VKG-VQEALDLRSPERKERAAAREARAAAAAPAATAAAAAPAEE------PAVAPMAAIR 303

Query: 278 VDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSK 337
             + E    +G+     A   A++ A +   R   A    EK   +  ++  K   + + 
Sbjct: 304 AAHVETAADKGERAAKPA--AADKAADKPADRPDAAEKAAEKPAEAAPRAADKPAGQAAD 361

Query: 338 PASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATPG 373
           PAS  + K           P+  + +        PG
Sbjct: 362 PASSSADKPGASADAAARTPARARDAAAPDADTPPG 397



 Score = 29.1 bits (65), Expect = 8.5
 Identities = 22/113 (19%), Positives = 34/113 (30%), Gaps = 8/113 (7%)

Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
            A  AA  A    A    +    P A  +    ++   K +  ++P+      +KPA   
Sbjct: 275 RAAAAAPAATAAAAAPAEEPAVAPMAAIRAAHVETAADKGERAAKPAAADKAADKPADRP 334

Query: 263 QDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
               K    PA        E      D+   +A   A   A +       AA 
Sbjct: 335 DAAEKAAEKPA--------EAAPRAADKPAGQAADPASSSADKPGASADAAAR 379


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 29.9 bits (68), Expect = 3.3
 Identities = 19/108 (17%), Positives = 42/108 (38%), Gaps = 9/108 (8%)

Query: 194 SRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
           ++  V+AQR  ++A K  A  + +        P      + ++    K + +  P  +  
Sbjct: 106 AKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI- 164

Query: 254 VQNKPAQGSQDTPKPGGPP--ATLSNVDNKENVK---SEGDEVKSKAQ 296
             ++   G     K G     AT+  +  K++V+     G  +  +A+
Sbjct: 165 --SELTVGQAVKVKAGKSAMDATVLEI-TKDDVRVQLDSGLSMIVRAE 209


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.4 bits (69), Expect = 3.5
 Identities = 20/102 (19%), Positives = 37/102 (36%), Gaps = 23/102 (22%)

Query: 279 DNKENVKSEGDEV--------KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSN------ 324
              E  +SE + +        K+KAQ   E+R K+ ++   +   L +K           
Sbjct: 257 SAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIA 316

Query: 325 ------EKSMTKSKTEDSKPASEKSSKTEVL---KSKDPNVP 357
                 EK+  + K +  +   +K  K   L   K  +P + 
Sbjct: 317 KEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPEPPLE 358


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 29.6 bits (67), Expect = 3.9
 Identities = 29/124 (23%), Positives = 45/124 (36%), Gaps = 12/124 (9%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQN-K 257
           + +RE +KA K   +++AK K D     +A      +     KA               +
Sbjct: 12  RRKREQRKARKKQKRKEAKKKEDAQK-SEAEEVKNEENKSKKKAAPIENAEGNIVFSKVE 70

Query: 258 PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
            A G Q           L     K+  K + D  K   +    R+ K E+  + KAA + 
Sbjct: 71  FADGEQAKKD-------LKLKKKKK--KKKTD-YKQLLKKLEARKKKLEELDEDKAAEIE 120

Query: 318 EKTK 321
           EK K
Sbjct: 121 EKEK 124


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 30.3 bits (68), Expect = 4.0
 Identities = 21/107 (19%), Positives = 39/107 (36%), Gaps = 19/107 (17%)

Query: 166  PSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAA---------KLAAKQKA 216
            P+       +       +        E +   V A  EA + A         K+A   + 
Sbjct: 924  PTQDGPAAEAQPAQENTV-------VESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRK 976

Query: 217  KVKGDV---TAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
             V+  V    A   A+ +  +K++   KA +K   +K+A  + K A+
Sbjct: 977  TVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAK 1023


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 29.8 bits (66), Expect = 4.0
 Identities = 37/158 (23%), Positives = 65/158 (41%), Gaps = 15/158 (9%)

Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS----------KSEP 248
           K   EA+ AA+    ++AK +       D   + E K+ E   A +          K+E 
Sbjct: 37  KENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAPKAEPEKTEG 96

Query: 249 SKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE-- 306
           + EA  +   A   +  P     PA            S+  E  S A  K E ++K+E  
Sbjct: 97  AAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAE--SAAPAKEEEKSKEEGE 154

Query: 307 -QQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKS 343
            ++ +A AAA  E    +   S +K  + ++ P+S+++
Sbjct: 155 AKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKET 192


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 30.4 bits (68), Expect = 4.3
 Identities = 23/117 (19%), Positives = 43/117 (36%), Gaps = 6/117 (5%)

Query: 159 EQLIYTSPSSVQT-TPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAK 217
           E L+  S   V T  P+ ++     I + ++++ E+   L     +A  A    A+++A 
Sbjct: 158 EPLLEVSTDKVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAIIG-DANAAPAEPAEEEAP 216

Query: 218 VKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPAT 274
              +  + P     P+     P  A     P+        PA  +  +    GP  T
Sbjct: 217 APSEAGSEP----APDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT 269


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 30.3 bits (68), Expect = 4.3
 Identities = 29/133 (21%), Positives = 44/133 (33%), Gaps = 12/133 (9%)

Query: 196 DLVKAQREAKKAAKLAAKQKA--KVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
           DL   +   K+A+    K+++           P    Q       P+   + +     A+
Sbjct: 367 DLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPAS 426

Query: 254 VQNKPAQ-------GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE 306
               PA        GS D P P  PP         E    + D+   KA L A R  +  
Sbjct: 427 APPPPATPLPSAEPGSDDGPAP--PPERQPPAPATEPAPDDPDDATRKA-LDALRERRPP 483

Query: 307 QQRQAKAAALLEK 319
           +   A  A LL +
Sbjct: 484 EPPGADLAELLGR 496


>gnl|CDD|129310 TIGR00206, fliF, flagellar basal-body M-ring protein/flagellar
           hook-basal body protein (fliF).  Component of the M
           (cytoplasmic associated) ring, one of four rings
           (L,P,S,M) which make up the flagellar hook-basal body
           which is a major portion of the flagellar organelle.
           Although the basic structure of the flagella appears to
           be similar for all bacteria, additional rings and
           structures surrounding the basal body have been observed
           for some bacteria (eg Vibrio cholerae and Treponema
           pallidum) [Cellular processes, Chemotaxis and motility].
          Length = 555

 Score = 30.2 bits (68), Expect = 4.5
 Identities = 33/172 (19%), Positives = 53/172 (30%), Gaps = 39/172 (22%)

Query: 183 IMSEKNNSAELSRDLVKAQREAK---------KAAKLAAK---QKAKVKGDVTAPPD-AV 229
           I+++ N       DL + Q + K         +   + +        V   V A  D + 
Sbjct: 219 ILNDSNLDGIDRIDLAEKQLKYKSKVEAMLQGEIDSILSPIVGVDNFVVARVNAKLDFSK 278

Query: 230 TQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGD 289
                ++  P++ +S     + +  Q +  QG      PGG P   SN            
Sbjct: 279 ETTTEEEYAPNELESSDSTIRSSQTQEEEYQG--QGYSPGGVPGQESNT---------PP 327

Query: 290 EVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASE 341
            V+    LK      +E Q         EK    NE    K   E  +   E
Sbjct: 328 PVQD---LKDNTGNYEESQY--------EKNVQLNE----KKNYEVKRTIRE 364


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 29.7 bits (67), Expect = 4.6
 Identities = 35/165 (21%), Positives = 66/165 (40%), Gaps = 23/165 (13%)

Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
           ++L + +R  KK        K   K      P A   P++    P   K     S    +
Sbjct: 76  KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPK--KKSERISWAPTL 133

Query: 255 QNKPAQGSQDTPKPGGPPATLSNVDNKE--NVKSEGDEVKSKAQLKAERRAKQEQQR--- 309
            + P + S  +          S V NKE  + + +  E++ K      R+ K++++    
Sbjct: 134 LDSPRRKSSRS----------STVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKEL 183

Query: 310 -QAKAAALLEKTKTSNEKSMTKSKT-EDSKPASEKSSKTEVLKSK 352
            Q +  A  ++T+  N KS+ + +  E+ K    K +K + LK +
Sbjct: 184 TQEERLAEAKETERINLKSLERYEEQEEEK----KKAKIQALKKR 224


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 30.2 bits (68), Expect = 4.6
 Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)

Query: 281 KENVKSEGDEVKSKAQLKA--ERRAKQEQQRQAKA 313
           +E ++ E  E K +A+LK   ER+AK+E  +Q +A
Sbjct: 9   REKLEREQRERKQRAKLKLERERKAKEEAAKQREA 43


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 30.1 bits (68), Expect = 4.8
 Identities = 12/51 (23%), Positives = 23/51 (45%)

Query: 298 KAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEV 348
           +A+R A+ E     K A  L+K   S+     + K       S+ +++ E+
Sbjct: 588 QADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638


>gnl|CDD|234399 TIGR03923, T7SS_EccE, type VII secretion protein EccE.  This model
           represents the transmembrane protein EccB of the
           actinobacterial flavor of type VII secretion systems.
           Species such as Mycobacterium tuberculosis have several
           instances of this system per genome, designated EccE1,
           EccE2, etc. This model represents a conserved core
           region, and many members have 200 or more additional
           C-terminal residues [Protein fate, Protein and peptide
           secretion and trafficking].
          Length = 341

 Score = 29.7 bits (67), Expect = 4.9
 Identities = 9/48 (18%), Positives = 20/48 (41%)

Query: 669 WKSLAHLTPLSLTYDITPSHLVTAVITELAIVPCTSVPVVLRVKPTET 716
           W+ L         Y + P  +++  + +L  +P     V +R++P   
Sbjct: 249 WRGLRGDNGYLAAYRVDPDPILSDTLAQLWALPADETTVTVRLRPGAG 296


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.0 bits (67), Expect = 5.2
 Identities = 21/119 (17%), Positives = 38/119 (31%)

Query: 227 DAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKS 286
           +    PE       + KSK    KE   + K     +     G     L+  D+  +  S
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAAS 241

Query: 287 EGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSK 345
             +  ++         A   +  + K A   E  K+   K   + K ++ K   +K   
Sbjct: 242 VAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHH 300


>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region.  Remorins are
           plant-specific plasma membrane-associated proteins. In
           tobacco remorin co-purifies with lipid rafts. Most
           remorins have a variable, proline-rich C-half and a more
           conserved N-half that is predicted to form coiled coils.
           Consistent with this, circular dichroism studies have
           demonstrated that much of the protein is alpha-helical.
           Remorins exist in plasma membrane preparations as
           oligomeric structures and form filaments in vitro. The
           proteins can bind polyanions including the extracellular
           matrix component oligogalacturonic acid (OGA). In vitro,
           remorin in plasma membrane preparations is
           phosphorylated (principally on threonine residues) in
           the presence of OGA and thus co-purifies with a protein
           kinases(s). The biological functions of remorins are
           unknown but roles as components of the
           membrane/cytoskeleton are possible.
          Length = 112

 Score = 28.5 bits (64), Expect = 5.4
 Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 280 NKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPA 339
            +E  K +  E K KA+ +AE + K E++ + K A   EK K  N+ +    K E+ +  
Sbjct: 26  QREEAKIQAWENKKKAKAEAELK-KIERKLEKKKAEAAEKLK--NKLAAAHKKAEEKRAT 82

Query: 340 SEKSSKTEVLK 350
           +E     E  K
Sbjct: 83  AEAKRGEEEAK 93


>gnl|CDD|215074 PLN00145, PLN00145, tyrosine/nicotianamine aminotransferase;
           Provisional.
          Length = 430

 Score = 29.7 bits (67), Expect = 5.6
 Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 28/127 (22%)

Query: 522 KLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVD--GSPWYEGKEMLRRL-VKH--- 575
           +L+ DD+ LT GC+  +E I+    + G     I++   G P YE + +   L V+H   
Sbjct: 114 ELSTDDIYLTAGCAQAIEIIMSVLAQPGAN---ILLPRPGYPLYEARAVFSGLEVRHFDL 170

Query: 576 ------QVDCSYVLLSAVSYIMRE--VSKVIIGAHALLSN--GAVMSRAGTAQVSLVARA 625
                 +VD     L  V  +  E  V+ VII      +N  G+V S    A+++  AR 
Sbjct: 171 LPERGWEVD-----LEGVEALADENTVAMVIINP----NNPCGSVYSYEHLAKIAETARK 221

Query: 626 FNVPVLA 632
             + V+A
Sbjct: 222 LGILVIA 228


>gnl|CDD|119332 cd06562, GH20_HexA_HexB-like, Beta-N-acetylhexosaminidases catalyze
           the removal of beta-1,4-linked N-acetyl-D-hexosamine
           residues from the non-reducing ends of
           N-acetyl-beta-D-hexosaminides including
           N-acetylglucosides and N-acetylgalactosides. The hexA
           and hexB genes encode the alpha- and beta-subunits of
           the two major beta-N-acetylhexosaminidase isoenzymes,
           N-acetyl-beta-D-hexosaminidase A (HexA) and
           beta-N-acetylhexosaminidase B  (HexB). Both the alpha
           and the beta catalytic subunits have a TIM-barrel fold
           and belong to the glycosyl hydrolase family 20 (GH20).
           The HexA enzyme is a heterodimer containing one alpha
           and one beta subunit while the HexB enzyme is a
           homodimer containing two beta-subunits.  Hexosaminidase
           mutations cause an inability to properly hydrolyze
           certain sphingolipids which accumulate in lysosomes
           within the brain, resulting in the lipid storage
           disorders Tay-Sachs and Sandhoff.  Mutations in the
           alpha subunit cause in a deficiency in the HexA enzyme
           and result in Tay-Sachs, mutations in the beta-subunit
           cause in a deficiency in both HexA and HexB enzymes and
           result in Sandhoff disease.  In both disorders GM(2)
           gangliosides accumulate in lysosomes. The GH20
           hexosaminidases are thought to act via a catalytic
           mechanism in which the catalytic nucleophile is not
           provided by solvent or the enzyme, but by the substrate
           itself.
          Length = 348

 Score = 29.5 bits (67), Expect = 5.8
 Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 6/42 (14%)

Query: 522 KLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWY 563
            L  D ++  +G S  ++ +L      G  ++VI+     WY
Sbjct: 218 LLPKDTIVQVWGGSDELKNVL----AAG--YKVILSSYDFWY 253


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 29.3 bits (66), Expect = 6.1
 Identities = 19/91 (20%), Positives = 37/91 (40%), Gaps = 12/91 (13%)

Query: 269 GGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSM 328
              P+  +  D +E  KS+  E     ++K E   ++E      AA   +K    + +  
Sbjct: 37  PSSPSDQAAADEQEAKKSDDQETAEIEEVKEE---EKE------AANSEDKEDKGDAEKE 87

Query: 329 TKSKTEDSKPASEKSSK---TEVLKSKDPNV 356
            +   E+++   E+SS     E  +  + NV
Sbjct: 88  DEESEEENEEEDEESSDENEKETEEKTESNV 118


>gnl|CDD|234681 PRK00182, tatB, sec-independent translocase; Provisional.
          Length = 160

 Score = 28.6 bits (64), Expect = 6.2
 Identities = 10/39 (25%), Positives = 15/39 (38%)

Query: 235 KQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPA 273
            +++  +  S       ANV  +PA GSQ       P  
Sbjct: 108 GEAQRQQQASAGRADNNANVVTRPADGSQPRSSQNDPKG 146


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 29.9 bits (67), Expect = 6.2
 Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 157 CAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKA 216
           C   +  T     Q     L  +  A+   + + A L++   +AQ E  K  +L  + +A
Sbjct: 209 CTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKR-EAQEEQLKKQQLLKQLRA 267

Query: 217 KVK 219
           +++
Sbjct: 268 RIE 270


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 29.8 bits (67), Expect = 6.3
 Identities = 23/131 (17%), Positives = 38/131 (29%), Gaps = 5/131 (3%)

Query: 176 LSQVTTAIMSEKNN-SAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPES 234
           L+ +T      + N S  L+ D+    +                   V  PP    +P+S
Sbjct: 194 LTSLTEEAGKARYNASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPV--PPSEAARPKS 251

Query: 235 KQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP-PATLSNVDNKENVKSEGDEVKS 293
             +E + A      + +A  +N  A  +Q         PA                E K 
Sbjct: 252 ADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPALEDKP 311

Query: 294 KAQLKAERRAK 304
           +  L   R   
Sbjct: 312 RP-LGIARPGG 321


>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
           [Intracellular trafficking and secretion].
          Length = 94

 Score = 27.7 bits (62), Expect = 6.3
 Identities = 18/68 (26%), Positives = 30/68 (44%), Gaps = 1/68 (1%)

Query: 192 ELSRDLVKAQREAKKAAK-LAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK 250
           E  RDL KA RE KKAA  +  +   ++K +           E + ++    +  SE  +
Sbjct: 27  EAGRDLGKAIREFKKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKE 86

Query: 251 EANVQNKP 258
           +   + KP
Sbjct: 87  DLKKKAKP 94


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 29.2 bits (66), Expect = 6.7
 Identities = 14/47 (29%), Positives = 16/47 (34%)

Query: 222 VTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKP 268
           V   P  V  P+ K  E  K K K +   EA    KP        K 
Sbjct: 97  VEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 8/114 (7%)

Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK----SEPSKEANVQNKP 258
           EAKK  K  A++  KVK +         + +  + +  K K K     +   E   + + 
Sbjct: 59  EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118

Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSE--GDEVKSKAQLKAERRAKQEQQRQ 310
               +D  K      TLS +   +  K     D  +S+   K      +++  +
Sbjct: 119 EDKLEDLTKS--YSETLSTLSELKPRKYALHKDIYQSRLDRKRRAEVAKKEAEE 170


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 28.6 bits (65), Expect = 7.2
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 294 KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKP 338
           K  L+A    K+E +   KA    EK K   EK   K+K E+ KP
Sbjct: 55  KESLEAALLDKKELKAWEKAEKKAEKAKAKAEK--KKAKKEEPKP 97


>gnl|CDD|181437 PRK08470, PRK08470, adenylosuccinate lyase; Provisional.
          Length = 442

 Score = 29.3 bits (66), Expect = 7.2
 Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 6/44 (13%)

Query: 89  PINNKG--LLLHAALNDYQSACQVSEQEVKQFFDSKGIYTYCSK 130
            IN KG  L L A LND      +SE+E++  FD    Y+Y +K
Sbjct: 392 AINEKGESLFLQALLNDEDLRKSLSEEEIRACFD----YSYYTK 431


>gnl|CDD|221445 pfam12157, DUF3591, Protein of unknown function (DUF3591).  This
           domain is found in eukaryotes and is typically between
           445 to 462 amino acids in length. Most members are
           annotated as being transcription initiation factor TFIID
           subunit 1, and this region is the conserved central
           portion of these proteins.
          Length = 457

 Score = 29.6 bits (67), Expect = 7.3
 Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 9/51 (17%)

Query: 459 LNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVD 509
           L K     +S+ +  KHF            + Q R RLKE    Y  E  D
Sbjct: 251 LRKSGRRRISVDDITKHFPDQ--------NEMQNRQRLKE-FMKYQREGGD 292


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 28.6 bits (64), Expect = 7.3
 Identities = 11/67 (16%), Positives = 29/67 (43%), Gaps = 2/67 (2%)

Query: 279 DNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKP 338
           ++++  K    E + K     + + +++++++ K A  L K K++     T  K      
Sbjct: 94  ESEKEQKEVSKETEEKE--AIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 339 ASEKSSK 345
             ++  K
Sbjct: 152 TKKEEGK 158


>gnl|CDD|232831 TIGR00115, tig, trigger factor.  Trigger factor is a
           ribosome-associated molecular chaperone and is the first
           chaperone to interact with nascent polypeptide. Trigger
           factor can bind at the same time as the signal
           recognition particle (SRP), but is excluded by the SRP
           receptor (FtsY). The central domain of trigger factor
           has peptidyl-prolyl cis/trans isomerase activity. This
           protein is found in a single copy in virtually every
           bacterial genome [Protein fate, Protein folding and
           stabilization].
          Length = 408

 Score = 29.4 bits (67), Expect = 7.8
 Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 9/51 (17%)

Query: 280 NKENVKSEGDEVKSKAQLKA------ERRAKQEQQRQAKAA---ALLEKTK 321
           + E  KS G+E ++  +LKA      ER  K+  + + K      L+E  +
Sbjct: 238 DDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQLLDKLVENNE 288


>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein.  This family includes the M1
           phosphoprotein non-structural RNA polymerase alpha
           subunit, which is thought to be a component of the
           active polymerase, and may be involved in template
           binding.
          Length = 298

 Score = 28.9 bits (65), Expect = 8.0
 Identities = 18/66 (27%), Positives = 36/66 (54%), Gaps = 3/66 (4%)

Query: 225 PPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPA---TLSNVDNK 281
           PP   T+ +S Q++P K+K  S+ S     + + +    D+ +  GPPA   + +N ++ 
Sbjct: 136 PPGKSTEDKSTQTDPEKSKPASQSSAPVKKEEQSSSEKVDSQEASGPPALEWSATNDEDD 195

Query: 282 ENVKSE 287
            +V++E
Sbjct: 196 ASVEAE 201


>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
          Length = 188

 Score = 28.8 bits (65), Expect = 8.1
 Identities = 14/71 (19%), Positives = 36/71 (50%), Gaps = 4/71 (5%)

Query: 285 KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEK----SMTKSKTEDSKPAS 340
           +++ +E+    + +AER  +Q ++++  +A L  K +  N +       +++ ED   + 
Sbjct: 37  EADAEEILEDREAEAEREIEQLREQELSSAKLEAKRERLNARKEVLEDVRNQVEDEIASL 96

Query: 341 EKSSKTEVLKS 351
           +   + E+ KS
Sbjct: 97  DGDKREELTKS 107


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.4 bits (65), Expect = 8.6
 Identities = 38/238 (15%), Positives = 71/238 (29%), Gaps = 11/238 (4%)

Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
           S T  ++  A            TSP++ +T+    +   T I+     +A  S       
Sbjct: 490 SRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTAT-SPPTGTTS 548

Query: 202 REAKKAAKLA--AKQKAKVKGDVTAPPD-AVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
                + ++   +         VT+ P    +   + Q     + +  +P   ++  + P
Sbjct: 549 VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTP 608

Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
              S  T     P  T ++    EN+  E   V S   +                 +   
Sbjct: 609 RSNSTSTT----PLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPG--TTSQVSGPG 662

Query: 319 KTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATPGTTL 376
            + TS              P +   S     K+  P V ++         K T G+TL
Sbjct: 663 NSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTV-TSTGGKANSTTKETSGSTL 719


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 29.4 bits (66), Expect = 8.6
 Identities = 28/135 (20%), Positives = 48/135 (35%), Gaps = 4/135 (2%)

Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
                 +  AP+  + A      +P +    PA+  +   A   +    A+ +       
Sbjct: 402 VTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADS 461

Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
           R  ++ A+  A   +       APPDA  +P  + + PS A   + P   A      A  
Sbjct: 462 RCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARA----PAAAS 517

Query: 262 SQDTPKPGGPPATLS 276
            +D P    PPA  +
Sbjct: 518 REDAPAAAAPPAPEA 532


>gnl|CDD|239173 cd02772, MopB_NDH-1_NuoG2, MopB_NDH-1_NuoG2: The second domain of
           the NuoG subunit of the NADH-quinone oxidoreductase/NADH
           dehydrogenase-1 (NDH-1), found in beta- and
           gammaproteobacteria. The NDH-1 is the first
           energy-transducting complex in the respiratory chain and
           functions as a redox pump that uses the redox energy to
           translocate H+ ions across the membrane, resulting in a
           significant contribution to energy production. In
           Escherichia coli NDH-1, the largest subunit is encoded
           by the nuoG gene, and is part of the 14 distinct
           subunits constituting the functional enzyme. The NuoG
           subunit is made of two domains: the first contains three
           binding sites for FeS clusters (the fer2 domain), the
           second domain (this CD), is of unknown function or, as
           postulated, has lost an ancestral formate dehydrogenase
           activity that became redundant during the evolution of
           the complex I enzyme. Although only vestigial sequence
           evidence remains of a molybdopterin binding site, this
           protein domain belongs to the molybdopterin_binding
           (MopB) superfamily of proteins.
          Length = 414

 Score = 29.2 bits (66), Expect = 9.0
 Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 149 CKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKK-A 207
                    + + I  +PS++     +L+QV  A+  EK  +       V+A  EA+K A
Sbjct: 190 ADDDFLFPLSGKAI-VAPSALA---NALAQVAKALAEEKGLAVPDEDAKVEASEEARKIA 245

Query: 208 AKLAAKQKAKV 218
           A L + ++A V
Sbjct: 246 ASLVSAERAAV 256


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 29.3 bits (65), Expect = 9.0
 Identities = 31/143 (21%), Positives = 51/143 (35%), Gaps = 20/143 (13%)

Query: 233 ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTP--------KPGGPPATLSNVDNKENV 284
           +SK+S+  K   K   +KE  V  KP    +  P        KP  P       D +E  
Sbjct: 536 DSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEE-- 593

Query: 285 KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS-EKS 343
                  K K    A+R  + +  +  +   + +  K        K      +P+S E+ 
Sbjct: 594 -----PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERP 648

Query: 344 SKTEVLKS----KDPNVPSTKKY 362
              +++KS    K P  P   K+
Sbjct: 649 EGPKIIKSPKPPKSPKPPFDPKF 671


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 29.1 bits (65), Expect = 9.0
 Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 17/101 (16%)

Query: 264 DTPKPGGPPATLSNV------DNKENVKSEGDEVKSKAQLK------AERRAKQEQQRQA 311
           D  KPG  PA L          N+E   S  +EV SK  +K       ERR +QE++RQ 
Sbjct: 354 DDYKPGTDPALLDPSLARKLRSNREVAVSRLEEVISKYAMKQDDTEEEERRKRQERERQG 413

Query: 312 KAA-----ALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
            ++     +    T   +    ++   E+     E+  + E
Sbjct: 414 TSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 28.8 bits (64), Expect = 9.0
 Identities = 29/116 (25%), Positives = 47/116 (40%), Gaps = 9/116 (7%)

Query: 174 ASLSQVTTAIMSEKNNSAELSRDLVKAQREA-KKAAKLAAKQKAKVKGDVTAPPDAVTQP 232
           +SL  +   IMS +N  + ++R L +   E   K  +L+   + K     T   +     
Sbjct: 98  SSLRNIPVVIMSSENVPSRITRCLEEGAEEFFLKPVQLSDLNRLKPHMMKTKSKN----- 152

Query: 233 ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEG 288
             KQ    K +   E   ++  Q +P+Q  Q  P+P   P   +N  NK     EG
Sbjct: 153 -QKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNN--NKRKAMEEG 205


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.1 bits (65), Expect = 9.5
 Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 3/84 (3%)

Query: 281 KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA---ALLEKTKTSNEKSMTKSKTEDSK 337
            E  K    E + + + K E +AK+++ ++ KAA   A  +             K  + K
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 338 PASEKSSKTEVLKSKDPNVPSTKK 361
                          DP+ P  +K
Sbjct: 66  SRKRDVEDENPEDFIDPDTPFGQK 89


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 28.9 bits (64), Expect = 9.5
 Identities = 23/120 (19%), Positives = 44/120 (36%), Gaps = 8/120 (6%)

Query: 161 LIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKG 220
           +IY    + ++   + S+        K   ++  +     Q+ ++   + A   K     
Sbjct: 8   VIYIPGPANRSQSTTPSKKGP---PLKTQPSDPPKSPSPGQQRSRSLHRPA---KPSELA 61

Query: 221 DVTAPPDAVTQPESKQSEPSKAKSK-SEPSKEANVQNKPAQGSQDTPKP-GGPPATLSNV 278
           +++ PP + T P      PS + S+ S PS+           S        GP  +LS V
Sbjct: 62  ELSPPPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQV 121


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.124    0.346 

Gapped
Lambda     K      H
   0.267   0.0699    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,314,120
Number of extensions: 3078797
Number of successful extensions: 4070
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3474
Number of HSP's successfully gapped: 310
Length of query: 717
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 613
Effective length of database: 6,324,786
Effective search space: 3877093818
Effective search space used: 3877093818
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)