RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16012
         (712 letters)



>gnl|CDD|215740 pfam00134, Cyclin_N, Cyclin, N-terminal domain.  Cyclins regulate
           cyclin dependent kinases (CDKs). Human cyclin-O is a
           Uracil-DNA glycosylase that is related to other cyclins.
           Cyclins contain two domains of similar all-alpha fold,
           of which this family corresponds with the N-terminal
           domain.
          Length = 127

 Score =  145 bits (369), Expect = 4e-41
 Identities = 49/99 (49%), Positives = 70/99 (70%), Gaps = 2/99 (2%)

Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
           R +L+DWL EV E + L  ET YL ++Y+DR+LS     VP+ +LQL+GV  L+IAAK E
Sbjct: 31  RAILIDWLVEVHEEFKLLPETLYLAVNYLDRFLSKQ--PVPRTKLQLVGVTCLLIAAKYE 88

Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNILAVLEWNIT 342
           EIYPP V D+ Y+TD A + ++IL  E+ IL+ L W+++
Sbjct: 89  EIYPPSVEDFVYITDNAYTKEEILRMELLILSTLNWDLS 127


>gnl|CDD|227357 COG5024, COG5024, Cyclin [Cell division and chromosome
           partitioning].
          Length = 440

 Score = 99.5 bits (248), Expect = 4e-22
 Identities = 56/167 (33%), Positives = 88/167 (52%), Gaps = 17/167 (10%)

Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
           R++L+DWL EV   + L  ET +L I+ IDR+LS  + +V  ++ QL+G++AL IA+K E
Sbjct: 213 RSILVDWLVEVHGKFGLLPETLFLAINIIDRFLS--SRVVSLEKYQLVGISALFIASKYE 270

Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNILAVLEWNITPVTAHYWLNVFLQVVYHRSSR 363
           E+  P + D  Y TDGA +  DI+  E  +L VL++NI+  +   +L    +        
Sbjct: 271 EVNCPSIKDLVYATDGAFTRDDIIRAERYMLEVLDFNISWPSPMSFLRRISKAS------ 324

Query: 364 CHNLGFIYPAFSRTLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYL 410
                  Y  FSRT         ++  +D + +  S S  A  A+YL
Sbjct: 325 ------DYDIFSRTPAKF---SSEISPVDYKFIQISPSWCAAAAMYL 362


>gnl|CDD|238003 cd00043, CYCLIN, Cyclin box fold. Protein binding domain
           functioning in cell-cycle and transcription control.
           Present in cyclins, TFIIB and Retinoblastoma (RB).The
           cyclins consist of 8 classes of cell cycle regulators
           that regulate cyclin dependent kinases (CDKs). TFIIB is
           a transcription factor that binds the TATA box. Cyclins,
           TFIIB and RB contain 2 copies of the domain.
          Length = 88

 Score = 78.0 bits (193), Expect = 1e-17
 Identities = 34/91 (37%), Positives = 49/91 (53%), Gaps = 4/91 (4%)

Query: 244 RTVLLDWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVE 303
           R   LD+L  V +   L  ET  L ++ +DR+L      V  +   L+  AAL +AAKVE
Sbjct: 2   RPTPLDFLRRVAKALGLSPETLTLAVNLLDRFLLD--YSVLGRSPSLVAAAALYLAAKVE 59

Query: 304 EIYPPKVSDYAYVTDGACSSQDILSTEMNIL 334
           EI PP + D  +VT  A + ++IL  E  +L
Sbjct: 60  EI-PPWLKDLVHVTGYA-TEEEILRMEKLLL 88



 Score = 29.9 bits (68), Expect = 0.99
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 377 TLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYLVC 412
           TL    + LLD   LD   L  S S++A  A+YL  
Sbjct: 22  TLTL-AVNLLDRFLLDYSVLGRSPSLVAAAALYLAA 56


>gnl|CDD|214641 smart00385, CYCLIN, domain present in cyclins, TFIIB and
           Retinoblastoma.  A helical domain present in cyclins and
           TFIIB (twice) and Retinoblastoma (once). A protein
           recognition domain functioning in cell-cycle and
           transcription control.
          Length = 83

 Score = 74.2 bits (183), Expect = 2e-16
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 249 DWLSEVCEVYTLHRETFYLTIDYIDRYLSTNANIVPKQQLQLIGVAALMIAAKVEEIYPP 308
           D+L  VC+   L  ET  L ++ +DR+LS       K    LI  AAL +A+K EE  P 
Sbjct: 1   DFLRRVCKALNLDPETLNLAVNLLDRFLSDY--KFLKYSPSLIAAAALYLASKTEETPPW 58

Query: 309 KVSDYAYVTDGACSSQDILSTEMNIL 334
                 Y      + ++IL  E  +L
Sbjct: 59  TKELVHYTGY--FTEEEILRMERLLL 82



 Score = 30.3 bits (69), Expect = 0.69
 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 377 TLYTQCIRLLDLCHLDIRCLNYSYSVLATTAIYLVC 412
           TL    + LLD    D + L YS S++A  A+YL  
Sbjct: 16  TLNL-AVNLLDRFLSDYKFLKYSPSLIAAAALYLAS 50


>gnl|CDD|217307 pfam02984, Cyclin_C, Cyclin, C-terminal domain.  Cyclins regulate
           cyclin dependent kinases (CDKs). Human CCNO is a
           Uracil-DNA glycosylase that is related to other cyclins.
           Cyclins contain two domains of similar all-alpha fold,
           of which this family corresponds with the C-terminal
           domain.
          Length = 117

 Score = 41.5 bits (98), Expect = 2e-04
 Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 7/59 (11%)

Query: 385 LLDLCHLDIRCLNYSYSVLATTAIYLVCS-------KELACMISGLRLESLEDCIEWMN 436
           LL+L  LD   L Y  S++A  A+YL           E     +G   E L+ C++ + 
Sbjct: 27  LLELSLLDYDFLKYPPSLIAAAAVYLARKTLGSPPWTETLEHYTGYSEEDLKPCVKLLL 85


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 40.3 bits (94), Expect = 0.004
 Identities = 29/91 (31%), Positives = 37/91 (40%), Gaps = 11/91 (12%)

Query: 503  PTTPG---NRTYSVPCDMTP-PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
            P  PG    R  S      P PP+   G  TP   AR AR PT++  P   P       P
Sbjct: 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP-------P 2772

Query: 559  ATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
            A P++    R + P   +   S +SL S  D
Sbjct: 2773 AAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803



 Score = 38.8 bits (90), Expect = 0.014
 Identities = 15/85 (17%), Positives = 23/85 (27%)

Query: 503  PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPS 562
            P  P       P  +T P  +S  ++  S  +        +       +      PA P 
Sbjct: 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPL 2828

Query: 563  SSKASRKSAPPAITTPCSSKSLRSG 587
                S +   P         SL  G
Sbjct: 2829 PPPTSAQPTAPPPPPGPPPPSLPLG 2853



 Score = 36.1 bits (83), Expect = 0.075
 Identities = 17/93 (18%), Positives = 24/93 (25%), Gaps = 14/93 (15%)

Query: 489  AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
             +     L      P TP            P    S     P   A +  +P    AP  
Sbjct: 2691 TVGSLTSLADPPPPPPTPEPA---------PHALVSATPLPPGPAAARQASPALPAAPAP 2741

Query: 549  TPSSSKTILPATPSSSKASRKSAPPAITTPCSS 581
                +    P  P     +R + PP    P + 
Sbjct: 2742 PAVPAGPATPGGP-----ARPARPPTTAGPPAP 2769



 Score = 33.0 bits (75), Expect = 0.81
 Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 2/92 (2%)

Query: 489  AIAESAKLLKEEQQPTTPGNRTYSV--PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
            A+A  ++  +    P  P +   +V  P    PP +S  G   P + A+    P     P
Sbjct: 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846

Query: 547  PTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
            P +     ++ P      +   +S       P
Sbjct: 2847 PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878



 Score = 32.6 bits (74), Expect = 0.92
 Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 4/81 (4%)

Query: 503  PTTPGNRTYSVPCD--MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
            PTT G    + P      PP   ++      S++R++       A P     +     A 
Sbjct: 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA--AAL 2818

Query: 561  PSSSKASRKSAPPAITTPCSS 581
            P ++  +    PP    P + 
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAP 2839



 Score = 30.7 bits (69), Expect = 3.6
 Identities = 18/70 (25%), Positives = 24/70 (34%)

Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
           + P      VP    PPP++      P S    A  P      P T  +      AT  +
Sbjct: 414 SVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKA 473

Query: 564 SKASRKSAPP 573
             A R+  PP
Sbjct: 474 LDALRERRPP 483



 Score = 29.5 bits (66), Expect = 9.6
 Identities = 13/78 (16%), Positives = 25/78 (32%), Gaps = 3/78 (3%)

Query: 501  QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
            +   +    ++++P D    P   +    P  + +    P     PP  P       P  
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP---PLA 2944

Query: 561  PSSSKASRKSAPPAITTP 578
            P++  A       A+  P
Sbjct: 2945 PTTDPAGAGEPSGAVPQP 2962



 Score = 29.1 bits (65), Expect = 9.9
 Identities = 17/88 (19%), Positives = 22/88 (25%), Gaps = 1/88 (1%)

Query: 501  QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA- 559
                 P          +  P  S   ++      +  R P     PP  P       P  
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929

Query: 560  TPSSSKASRKSAPPAITTPCSSKSLRSG 587
             P      R   P A TT  +     SG
Sbjct: 2930 QPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957



 Score = 29.1 bits (65), Expect = 10.0
 Identities = 14/81 (17%), Positives = 19/81 (23%), Gaps = 3/81 (3%)

Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPS 562
           P   G           P P+S          A     P +        S      PA P 
Sbjct: 394 PFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDG---PAPPP 450

Query: 563 SSKASRKSAPPAITTPCSSKS 583
             +    +  PA   P  +  
Sbjct: 451 ERQPPAPATEPAPDDPDDATR 471


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 39.9 bits (92), Expect = 0.005
 Identities = 30/109 (27%), Positives = 41/109 (37%), Gaps = 2/109 (1%)

Query: 466 HNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSK 525
           H    DY   YH   V  E    A A+     KE+Q  +     T + P   T PP   +
Sbjct: 433 HVKTIDYDGQYHVRIVDKEAFTKANADKTNK-KEQQDNSAKKETTPATPSKPTTPPVEKE 491

Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT-PSSSKASRKSAPP 573
            Q   S K    ++P+  K    +  S K   PAT P+  +    S  P
Sbjct: 492 SQKQDSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTP 540


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 39.3 bits (91), Expect = 0.007
 Identities = 24/101 (23%), Positives = 38/101 (37%), Gaps = 1/101 (0%)

Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
           +E   +   N T   P   TP  ++        ++ ++       K P           P
Sbjct: 59  DEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAEP 118

Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ-LQTITA 598
           AT S+   +  S+ PA        +LRSG+  Q  +QTI A
Sbjct: 119 ATVSNPDNATSSSTPATYNLLQKSALRSGATVQSFIQTIQA 159


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 38.5 bits (90), Expect = 0.010
 Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 5/105 (4%)

Query: 484 ELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS 543
           E++ +A   SA+L +   Q   P + + +     TP P++    T  +S+        ++
Sbjct: 142 EITTMADQSSAELSQNSGQSV-PLDTSTTTDPATTPAPAAPVDTTPTNSQT----PAVAT 196

Query: 544 KAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
              P        ++  + ++   +   AP A  TP  +  L +  
Sbjct: 197 APAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQ 241



 Score = 31.9 bits (73), Expect = 0.97
 Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 3/91 (3%)

Query: 497 LKEEQQPTTPGNRTYSVPCDMTPPPSSS---KGQTTPSSKARQARTPTSSKAPPTTPSSS 553
           L              + P D TP  S +        P+   +Q      S+A   T ++ 
Sbjct: 164 LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATP 223

Query: 554 KTILPATPSSSKASRKSAPPAITTPCSSKSL 584
               PATP  +           T      +L
Sbjct: 224 APAAPATPDGAAPLPTDQAGVSTPAADPNAL 254


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 38.1 bits (88), Expect = 0.010
 Identities = 25/92 (27%), Positives = 36/92 (39%), Gaps = 10/92 (10%)

Query: 481 VSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTP 540
           V    +  A A +    K E   +TP       P      P S    TTP S   +A   
Sbjct: 20  VITTTTQAANATTPSSTKVEAPQSTP-------PSTKVEAPQSKPNATTPPSTKVEAPQQ 72

Query: 541 TSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
           T +    TTPSS+K   P +P++ +   +  P
Sbjct: 73  TPN---ATTPSSTKVETPQSPTTKQVPTEINP 101



 Score = 38.1 bits (88), Expect = 0.011
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 8/67 (11%)

Query: 524 SKGQTTPSSKARQARTPTSSK--APPTTPSSSKTILP------ATPSSSKASRKSAPPAI 575
           + G  T +++A  A TP+S+K  AP +TP S+K   P       TP S+K       P  
Sbjct: 17  TTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNA 76

Query: 576 TTPCSSK 582
           TTP S+K
Sbjct: 77  TTPSSTK 83


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 37.8 bits (88), Expect = 0.021
 Identities = 26/91 (28%), Positives = 34/91 (37%), Gaps = 3/91 (3%)

Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
            PT     +        P P+SS       S +    +P S  AP +  +SS +      
Sbjct: 265 LPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRES 324

Query: 562 SSSKASRKSAPP--AITTPCSSKSLRSGSDS 590
           SSS  S  S     A  +P  S S RS S S
Sbjct: 325 SSSSTSSSSESSRGAAVSPGPSPS-RSPSPS 354



 Score = 37.1 bits (86), Expect = 0.045
 Identities = 15/68 (22%), Positives = 28/68 (41%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
               SS   T+ SS++ +    +   +P  +PS S+   PA PSS +   + +    +  
Sbjct: 320 SSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPA 379

Query: 579 CSSKSLRS 586
            S+     
Sbjct: 380 ASAGRPTR 387



 Score = 35.1 bits (81), Expect = 0.17
 Identities = 15/82 (18%), Positives = 22/82 (26%)

Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSS 552
                    +  +    + S     +P    S     PSS      TP  +  PP+    
Sbjct: 74  GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPD 133

Query: 553 SKTILPATPSSSKASRKSAPPA 574
              +L    S       S P A
Sbjct: 134 LSEMLRPVGSPGPPPAASPPAA 155



 Score = 34.4 bits (79), Expect = 0.29
 Identities = 11/61 (18%), Positives = 20/61 (32%)

Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
              ++P PS S+  +          +    +  P+   SS       P+  +A    A  
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397

Query: 574 A 574
           A
Sbjct: 398 A 398



 Score = 34.0 bits (78), Expect = 0.32
 Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 4/100 (4%)

Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPT----SSKAPPTTPSSSKTILP 558
                + +         P S   G    S +A  + + +    SS    ++ SS    + 
Sbjct: 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVS 342

Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITA 598
             PS S++   S PP    P S +     S +      +A
Sbjct: 343 PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASA 382



 Score = 33.2 bits (76), Expect = 0.68
 Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 6/91 (6%)

Query: 504 TTPGNRTYSVPCDMTP---PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
               +R  ++P           SS     P S    A +P       ++P S+    PA 
Sbjct: 165 DAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPP--RRSSPISASASSPA- 221

Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQ 591
           P+  +++   A  + +   SS+S   G   +
Sbjct: 222 PAPGRSAADDAGASSSDSSSSESSGCGWGPE 252



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 17/99 (17%), Positives = 36/99 (36%), Gaps = 7/99 (7%)

Query: 492 ESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS 551
           + +++L+    P  P     S P     P + +    +    A    +P  +   P++P 
Sbjct: 133 DLSEMLRPVGSPGPPPAA--SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190

Query: 552 SSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
           +        PS+  A+    PP  ++P S+ +       
Sbjct: 191 A-----EPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224



 Score = 31.3 bits (71), Expect = 2.6
 Identities = 20/87 (22%), Positives = 29/87 (33%)

Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
           ++  +     P +  P P  +                 SS+  P + SSS      +PS 
Sbjct: 241 SSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP 300

Query: 564 SKASRKSAPPAITTPCSSKSLRSGSDS 590
           S      AP +     SS S R  S S
Sbjct: 301 SSPGSGPAPSSPRASSSSSSSRESSSS 327



 Score = 30.9 bits (70), Expect = 3.0
 Identities = 10/83 (12%), Positives = 25/83 (30%), Gaps = 3/83 (3%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK--TILPATPSSSKASRKSAPPAI 575
           +   S     +   S +R + +P+    P    S  K      A  S + ++ +      
Sbjct: 332 SSESSRGAAVSPGPSPSR-SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA 390

Query: 576 TTPCSSKSLRSGSDSQQLQTITA 598
               + ++ R  +  +       
Sbjct: 391 RAAVAGRARRRDATGRFPAGRPR 413



 Score = 30.5 bits (69), Expect = 3.6
 Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 8/84 (9%)

Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA-- 559
            P++PG+     P   +P  SSS   +  SS +  + +  SS+    +P  S +  P+  
Sbjct: 299 SPSSPGSG----PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS 354

Query: 560 --TPSSSKASRKSAPPAITTPCSS 581
              P +  +S +  P     P S 
Sbjct: 355 RPPPPADPSSPRKRPRPSRAPSSP 378



 Score = 29.4 bits (66), Expect = 8.1
 Identities = 21/99 (21%), Positives = 37/99 (37%), Gaps = 17/99 (17%)

Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSS--------KARQARTPTSSKAPPTT 549
             E  PT PG  +        PPP+       PS         +   +  P  + +PP  
Sbjct: 101 AREGSPTPPGPSS-----PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAA 155

Query: 550 PSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
            +S      A  S + +SR++A P  +   ++++  S  
Sbjct: 156 GASP----AAVASDAASSRQAALPLSSPEETARAPSSPP 190


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 37.2 bits (86), Expect = 0.024
 Identities = 24/109 (22%), Positives = 48/109 (44%), Gaps = 6/109 (5%)

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQART--PTSSKAP 546
           A  ++AK    E+  TT   R  +V   + P    +  +T P   A+  +   P +  A 
Sbjct: 138 ASRQTAKTQTAERPATTRPARKQAV---IEPKKPQATAKTEPKPVAQTPKRTEPAAPVAS 194

Query: 547 PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
              P+++ T  P   +++ A  ++A PA TT   +   ++  +   L++
Sbjct: 195 TKAPAATSTPAP-KETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKS 242


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 37.2 bits (86), Expect = 0.028
 Identities = 16/95 (16%), Positives = 32/95 (33%), Gaps = 6/95 (6%)

Query: 487 DLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
           D  + E    +KE+       ++  +    + PP ++     TP+     +     S AP
Sbjct: 56  DKTVKEYK--IKEKDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAP 113

Query: 547 PTT--PSSS--KTILPATPSSSKASRKSAPPAITT 577
            +     S   ++     P S   S  S+     +
Sbjct: 114 ASAVEEKSPSEESATATAPESPSTSVPSSGSDAAS 148



 Score = 35.3 bits (81), Expect = 0.098
 Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 5/85 (5%)

Query: 513 VPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
           V   ++ P + +     P++    A TPT S         S     A P+S+   +  + 
Sbjct: 70  VVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMS-----AAPASAVEEKSPSE 124

Query: 573 PAITTPCSSKSLRSGSDSQQLQTIT 597
            + T         S   S      T
Sbjct: 125 ESATATAPESPSTSVPSSGSDAAST 149


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 36.4 bits (85), Expect = 0.056
 Identities = 13/88 (14%), Positives = 26/88 (29%), Gaps = 1/88 (1%)

Query: 499 EEQQPT-TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL 557
            +  P  T      +          ++       +    A    ++ A P  P+++    
Sbjct: 34  ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93

Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLR 585
            A  +   A+  +AP A         LR
Sbjct: 94  AAPAAPPAAAAAAAPAAAAVEDEVTPLR 121


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 35.8 bits (83), Expect = 0.063
 Identities = 19/92 (20%), Positives = 32/92 (34%), Gaps = 10/92 (10%)

Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT-TPSSSKTILPAT 560
            PT P             P   +K   T  +     + P + K       +++  + PAT
Sbjct: 174 LPTAPA---------TVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPAT 224

Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
               K+   SA    + P S  +L+  S S+ 
Sbjct: 225 SGKPKSGAASARALSSAPASHYTLQLSSASRS 256



 Score = 31.2 bits (71), Expect = 1.8
 Identities = 24/104 (23%), Positives = 38/104 (36%), Gaps = 13/104 (12%)

Query: 501 QQPTTP-GNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPA 559
             P TP G +   +P +++   S  +GQ   +S+  Q  T T   AP T   S    +PA
Sbjct: 132 APPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPA 191

Query: 560 TPSSSK------------ASRKSAPPAITTPCSSKSLRSGSDSQ 591
           T  +               +          P +S   +SG+ S 
Sbjct: 192 TAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASA 235



 Score = 30.0 bits (68), Expect = 4.3
 Identities = 14/73 (19%), Positives = 25/73 (34%), Gaps = 6/73 (8%)

Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
              T   ++   VP      P+  +   T        +T T +  P T+        P +
Sbjct: 177 APATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGK------PKS 230

Query: 561 PSSSKASRKSAPP 573
            ++S  +  SAP 
Sbjct: 231 GAASARALSSAPA 243


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 36.3 bits (84), Expect = 0.063
 Identities = 31/222 (13%), Positives = 53/222 (23%), Gaps = 16/222 (7%)

Query: 444 EKSPGTDYCDSDSNELP-PGPLRHNNVFDYAYSY----HKHSVSMELSDLAIAESAKLLK 498
            K  G     +  +    P   R  N       +       S +    D  +    +   
Sbjct: 3   GKPRGQPGRWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGS 62

Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
                T+     Y+VP    PP    +    P S       P     P           P
Sbjct: 63  GGGVATST---IYTVP---RPPRGPEQTLDKPDSLPASRELP-PGPTPVPPGGFRGASSP 115

Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAEYIIYHAIISRRLQTKISRT 618
              + S + R           +     + S  +  +            +  +   K+   
Sbjct: 116 RLGADSTSPRFLYQVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPK 175

Query: 619 AFYMMCA-KDQDSLSLRDPHMLDRHPSIV---PRMRTLSVSY 656
            F      K  +   L       R  S V    +++ LS S 
Sbjct: 176 GFSKHKTHKRSEFSGLTKKAARKRKGSFVFKPSQLKELSGSL 217


>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601.  This
           family represents a conserved region found in several
           uncharacterized plant proteins.
          Length = 311

 Score = 35.4 bits (81), Expect = 0.077
 Identities = 30/114 (26%), Positives = 42/114 (36%), Gaps = 12/114 (10%)

Query: 482 SMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPT 541
           ++E S L  AE AK  + E+     G+R         P   + K    P+S++R     T
Sbjct: 4   TLEDSKLRAAEKAKQPQAEEDS---GSR-------QKPSTLAGKNPDAPTSESRTPSKAT 53

Query: 542 SSKAPPTTPSSSKTILPAT--PSSSKASRKSAPPAITTPCSSKSLRSGSDSQQL 593
           SSK P    +  K         S  ++SR         P   K      DSQ L
Sbjct: 54  SSKDPSKRYADKKRKQSEKDARSPPRSSRPRTEEKDAGPSQQKEKGKKGDSQDL 107


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 35.8 bits (82), Expect = 0.080
 Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 7/91 (7%)

Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT------- 560
           N     P   T    ++  +  P  +A  A+TP  + +    PS  KT  P +       
Sbjct: 374 NTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDV 433

Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDSQ 591
           P    A  ++   A T   S+KS+++ S+++
Sbjct: 434 PPWEDAPDEAQTAAGTAQTSAKSIQTASEAE 464


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 35.3 bits (80), Expect = 0.11
 Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 7/97 (7%)

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
           A A  AK      +   P  +  + P     PP+  K    P   A+ A  P  + APP 
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPA--KAAAPP---AKAAAPPAKAAAPPA 293

Query: 549 TPSSS--KTILPATPSSSKASRKSAPPAITTPCSSKS 583
             +++  K       +++  ++ +APPA      +K+
Sbjct: 294 KAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKA 330



 Score = 33.0 bits (74), Expect = 0.56
 Identities = 24/101 (23%), Positives = 40/101 (39%), Gaps = 13/101 (12%)

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTP----SSKARQARTPTSSK 544
           A A  AK      +   P  +  + P     PP+  K    P    ++ A+ A  P  + 
Sbjct: 260 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPA--KAAAAPAKAAAAPAKAAAAPAKAA 317

Query: 545 APPTTPSSSKTILPATPSSSKASRKSAPP--AITTPCSSKS 583
           APP     +K   P   +++  ++ +APP  A   P   K+
Sbjct: 318 APP-----AKAAAPPAKAATPPAKAAAPPAKAAAAPVGKKA 353



 Score = 32.6 bits (73), Expect = 0.72
 Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 4/99 (4%)

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSK--ARQARTPTSSKAP 546
           A A  AK      +   P  +  + P     PP+ +      ++   A+ A  P  + A 
Sbjct: 246 AAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAA 305

Query: 547 PTTPSSSKTILPATPSSSKA--SRKSAPPAITTPCSSKS 583
           P   +++     A P+ + A  ++ + PPA      +K+
Sbjct: 306 PAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKA 344



 Score = 30.7 bits (68), Expect = 2.6
 Identities = 20/99 (20%), Positives = 40/99 (40%), Gaps = 4/99 (4%)

Query: 487 DLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP 546
           D A A +AK     ++   P  +  +      P  +++      +  A+ A  P  + A 
Sbjct: 192 DAAAAAAAKQKAAAKKAAAPSGKKSAKAA--APAKAAAAPAKAAAPPAKAAAAPAKAAAA 249

Query: 547 PTTPSSSKTILPATPSSSKA--SRKSAPPAITTPCSSKS 583
           P   ++      A P+ + A  ++ +APPA      +K+
Sbjct: 250 PAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKA 288


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family consists
           of several bacterial histone H1-like Hc1 proteins. In
           Chlamydia, Hc1 is expressed in the late stages of the
           life cycle, concomitant with the reorganisation of
           chlamydial reticulate bodies into elementary bodies.
           This suggests that Hc1 protein plays a role in the
           condensation of chromatin during intracellular
           differentiation.
          Length = 123

 Score = 33.2 bits (75), Expect = 0.12
 Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 2/65 (3%)

Query: 523 SSKGQTTPSSKARQ-ARTPTSSKAPPTTPSSSKTILPATPSSSK-ASRKSAPPAITTPCS 580
           S   +  P+ KA + A+ P  +   P   +++     A    +K A++K+A   +     
Sbjct: 58  SGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKMSK 117

Query: 581 SKSLR 585
            +S  
Sbjct: 118 KRSTA 122


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 35.5 bits (81), Expect = 0.12
 Identities = 22/85 (25%), Positives = 35/85 (41%), Gaps = 8/85 (9%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI-- 575
           TP  S++   TT SS      +P++  AP T+P +     P    S   S  ++PPA   
Sbjct: 572 TPTGSAASSNTTFSSD-----SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHL 626

Query: 576 -TTPCSSKSLRSGSDSQQLQTITAE 599
            +   +  S  S       +T + E
Sbjct: 627 GSPSTTPSSPESSIKVASTETASPE 651



 Score = 34.3 bits (78), Expect = 0.22
 Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 19/114 (16%)

Query: 503 PTTPG------NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS--KAPPTTPS--- 551
           P  PG        T S P       +++    +PS+    + +P +    +PP TPS   
Sbjct: 556 PYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIV 615

Query: 552 SSKTILPA--------TPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTIT 597
           S  T  PA        TPSS ++S K A     +P SS  + S   S  + +++
Sbjct: 616 SPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMS 669


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 35.1 bits (81), Expect = 0.14
 Identities = 16/102 (15%), Positives = 34/102 (33%), Gaps = 4/102 (3%)

Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
            + A   +  +Q   P    ++ P     P +++    +PS  +  A+      A     
Sbjct: 366 GDDASGGRGPKQHIKP---VFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAG 422

Query: 551 SSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
            +  T+    P++   +  S  P    P   K  +    S+ 
Sbjct: 423 -TPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKV 463


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 34.7 bits (79), Expect = 0.17
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 5/82 (6%)

Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQAR--TPTSSKAPPTTP 550
           SA  +++  QP     ++ ++    TP P   + QTT  S   + +  TP S+   P   
Sbjct: 88  SASKIEKISQPKQEEQKSLNISA--TPAPKQEQSQTTTESTTPKTKVTTPPSTNT-PQPM 144

Query: 551 SSSKTILPATPSSSKASRKSAP 572
            S+K+  P +P+  +A     P
Sbjct: 145 QSTKSDTPQSPTIKQAQTDMTP 166



 Score = 29.7 bits (66), Expect = 5.1
 Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 5/79 (6%)

Query: 522 SSSKGQTTPSSKARQARTPTSSKAPPT----TPSSSKTILPATPSSS-KASRKSAPPAIT 576
           + +   TT ++  RQ RTP   KAP T    T +S    +        K+   SA PA  
Sbjct: 56  AGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPK 115

Query: 577 TPCSSKSLRSGSDSQQLQT 595
              S  +  S +   ++ T
Sbjct: 116 QEQSQTTTESTTPKTKVTT 134



 Score = 29.3 bits (65), Expect = 7.3
 Identities = 24/93 (25%), Positives = 36/93 (38%), Gaps = 11/93 (11%)

Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQT-----TPSSKARQARTPTSSKA 545
               +  K E+ P T   +T +   +    P   + ++     TP+ K  Q++T T S  
Sbjct: 68  TRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTT 127

Query: 546 PP---TTPSSSKTILPATPSSSKASRKSAPPAI 575
           P    TTP S+ T     P  S  S     P I
Sbjct: 128 PKTKVTTPPSTNT---PQPMQSTKSDTPQSPTI 157


>gnl|CDD|178141 PLN02526, PLN02526, acyl-coenzyme A oxidase.
          Length = 412

 Score = 34.4 bits (79), Expect = 0.20
 Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 12/50 (24%)

Query: 305 IYPPKVSDYAYVTDGACSSQDILSTE-----MNILAVLEWNITPVTAHYW 349
           I+PP VSDY Y  D      D+L+ E       +   +E  + P+   YW
Sbjct: 14  IFPPSVSDY-YQFD------DLLTPEEQALRKRVRECMEKEVAPIMTEYW 56


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 32.9 bits (75), Expect = 0.21
 Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 8/86 (9%)

Query: 476 YHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKAR 535
           +++H +SME       E+ K+L     P  P     + P   +P P+      T +    
Sbjct: 64  HNEHILSME------QEARKILAPLTPPAPP--EPVTPPTAQSPAPAVPTPPPTSTPAVP 115

Query: 536 QARTPTSSKAPPTTPSSSKTILPATP 561
            A            P S     P  P
Sbjct: 116 PAPAAAVPAPAAAPPPSDPPQPPRAP 141


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 34.0 bits (78), Expect = 0.26
 Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 17/93 (18%)

Query: 516 DMTPPPSSSKGQTTP---SSKARQARTPTSSKAPPTTP---------SSSKTILPATPSS 563
                 SSS  + T     S A+++  P+SSKA   +          SS      A P S
Sbjct: 227 PSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPES 286

Query: 564 SKASRKSAPPAITTP-----CSSKSLRSGSDSQ 591
           +  S  SA P   +P      S  SL   SDS 
Sbjct: 287 TSQSPSSAEPTSESPQTAGNSSLSSLGDYSDSD 319


>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus.  This family represents the
           N-terminus (approximately 200 residues) of the
           proline-rich protein BAT2. BAT2 is similar to other
           proteins with large proline-rich domains, such as some
           nuclear proteins, collagens, elastin, and synapsin.
          Length = 189

 Score = 33.4 bits (76), Expect = 0.26
 Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 1/71 (1%)

Query: 524 SKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKS 583
            K  T  +SK  Q+   +S   PP TP S    +  T +SS       P     P  +KS
Sbjct: 81  PKDGTGWASKQDQSDPKSSDALPPQTPESQPPAVSQTSASSPPP-PPQPATPLVPGGAKS 139

Query: 584 LRSGSDSQQLQ 594
               S     Q
Sbjct: 140 WAVASAKPGAQ 150


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 34.3 bits (79), Expect = 0.27
 Identities = 14/71 (19%), Positives = 21/71 (29%), Gaps = 1/71 (1%)

Query: 509 RTYSVPCDMTPPPSSSKGQT-TPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
              S         SSS  Q+  PSS + + R    S +      S   +   +  S  + 
Sbjct: 5   NPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSV 64

Query: 568 RKSAPPAITTP 578
                    TP
Sbjct: 65  TLPPAATTQTP 75


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 32.9 bits (75), Expect = 0.36
 Identities = 12/79 (15%), Positives = 25/79 (31%), Gaps = 2/79 (2%)

Query: 513 VPCDMTPPPSSSKGQTTPSS--KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
           VP     P  ++      ++     + R  +   A    P  ++TI       + A+ K 
Sbjct: 130 VPVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKP 189

Query: 571 APPAITTPCSSKSLRSGSD 589
                 TP    + +  + 
Sbjct: 190 TADKTATPAKKTTKKKKTK 208


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 33.7 bits (77), Expect = 0.39
 Identities = 9/74 (12%), Positives = 17/74 (22%), Gaps = 4/74 (5%)

Query: 501 QQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT 560
                P         D       ++      S +     P S   P      +      +
Sbjct: 445 GDAPVPAKANARASADSRCDERDAQPPADSGSAS----APASDAPPDAAFEPAPRAAAPS 500

Query: 561 PSSSKASRKSAPPA 574
            ++  A   +  PA
Sbjct: 501 AATPAAVPDARAPA 514



 Score = 32.5 bits (74), Expect = 0.97
 Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 5/84 (5%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSS---KASRKSAPPAI 575
            P   +     P+  AR A    +S  P  T  +        P ++    A+R  APPA 
Sbjct: 372 VPARVAGA--VPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA 429

Query: 576 TTPCSSKSLRSGSDSQQLQTITAE 599
             P ++      +           
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKA 453



 Score = 31.4 bits (71), Expect = 2.1
 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 8/105 (7%)

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
           A A+S    ++ Q P      + S P    PP ++ +     ++ +            P 
Sbjct: 457 ASADSRCDERDAQPPADS--GSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPA 514

Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQL 593
             S       A P + +A      PA   P    + R+G  +  L
Sbjct: 515 AASREDAPAAAAPPAPEAR--PPTPAAAAP----AARAGGAAAAL 553


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 33.5 bits (76), Expect = 0.41
 Identities = 20/98 (20%), Positives = 34/98 (34%)

Query: 479 HSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQAR 538
           H  +    D A+A ++   ++      P             P  ++     P+   + A 
Sbjct: 47  HQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAA 106

Query: 539 TPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
            P    A   T ++     PA   +S AS+K  P A T
Sbjct: 107 APKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHT 144


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 32.7 bits (74), Expect = 0.43
 Identities = 22/90 (24%), Positives = 27/90 (30%), Gaps = 7/90 (7%)

Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPS-SKARQARTPTSSKAPPTTPSSSKTILPAT 560
           Q   P    Y                TT S S A     PT+S A PT   S       T
Sbjct: 78  QQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSP------T 131

Query: 561 PSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
            S    + +       TP S    +S  D+
Sbjct: 132 TSHHHVTSEPKTNTTVTPTSQPDRKSTFDA 161


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 32.9 bits (75), Expect = 0.49
 Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 1/56 (1%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAP-PTTPSSSKTILPATPSSSKASRKSAPP 573
           PPP +      P            +  P P+ P+SS    PA+P  +KA + S PP
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPP 199


>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
           only].
          Length = 276

 Score = 32.4 bits (73), Expect = 0.61
 Identities = 25/104 (24%), Positives = 40/104 (38%), Gaps = 5/104 (4%)

Query: 490 IAESAKLLKEEQQPTTPGN--RTYSVPCDMTPPPSSSKGQTTPSSKARQARTP--TSSKA 545
           + E  KL+  +   TTP       S      P   S+K  TT        +T   T+S +
Sbjct: 1   VHEDDKLVGSDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDS 60

Query: 546 PPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
            P    +  T  PA P + K++   +    TT  ++ + RS   
Sbjct: 61  KPVVHQTRATRKPAQPKAEKSTTSKSKSHTTTA-TTHTSRSSKS 103



 Score = 32.4 bits (73), Expect = 0.66
 Identities = 14/74 (18%), Positives = 21/74 (28%), Gaps = 8/74 (10%)

Query: 499 EEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
             +    P         D  P    ++   T      +A   T      T+ S S T   
Sbjct: 42  TSKDVVHPVKTELDTTSDSKPVVHQTR--ATRKPAQPKAEKST------TSKSKSHTTTA 93

Query: 559 ATPSSSKASRKSAP 572
            T +S  +  K  P
Sbjct: 94  TTHTSRSSKSKGLP 107


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 31.9 bits (72), Expect = 0.63
 Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 1/101 (0%)

Query: 475 SYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPS-SK 533
           S H    S E S L  A S   L+ E   T             T PPSSS+ QTT +  +
Sbjct: 57  SQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHE 116

Query: 534 ARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
            +  RT + ++ PPTT +++      T +SS     +    
Sbjct: 117 KKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSV 157


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 32.9 bits (75), Expect = 0.69
 Identities = 21/103 (20%), Positives = 38/103 (36%), Gaps = 3/103 (2%)

Query: 478 KHSVS---MELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKA 534
           KH  S   + L+   +  S   L + ++  T    T +     TP  S +  +   +   
Sbjct: 3   KHYFSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETP 62

Query: 535 RQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITT 577
             A T   + +   T  +S     AT +++  +R   P A  T
Sbjct: 63  VAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAATET 105



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 13/73 (17%), Positives = 28/73 (38%), Gaps = 3/73 (4%)

Query: 528 TTPSSKARQARTPTSSKAPPTTPSSSKTILPA-TPSSSKASRKSAPPAITTPCSSKSLRS 586
           T  + K  QA    +    P T + ++   P  + ++ +A     P A TT   + S   
Sbjct: 19  TASNPKLAQAEEIVT--TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76

Query: 587 GSDSQQLQTITAE 599
            +++    +   +
Sbjct: 77  TAETSDPTSEATD 89


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 32.6 bits (74), Expect = 0.70
 Identities = 19/108 (17%), Positives = 35/108 (32%), Gaps = 11/108 (10%)

Query: 474 YSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSK-------- 525
           Y +H+ S+          +      +  +      +T        P   S          
Sbjct: 120 YLHHQFSLRPMPFSTLTVQRLYQHGDRAESVESVRQTPE--PPYLPAAQSESSNAAEEAP 177

Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
            ++ P++  R    P  S A P  P+S  TI P  PS+   +++    
Sbjct: 178 SRSIPTAHVRPTH-PLKSFAVPALPASMSTIEPKLPSTPLLTQQGPTL 224


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 33.0 bits (75), Expect = 0.74
 Identities = 30/91 (32%), Positives = 38/91 (41%), Gaps = 11/91 (12%)

Query: 497 LKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK--APPTTPSSSK 554
           +K    P     R  S   + +P PSS K  + PSS     R   SSK  APP   +S  
Sbjct: 1   MKHFMLPRNAILRETSSGEEQSPNPSSHK--SKPSS-----RKLKSSKENAPPPDLNSLT 53

Query: 555 TILPATPSSSKASRKSAPPAITTPCSSKSLR 585
           + L     S+ A  KS  P    P SS  L+
Sbjct: 54  SDLKPDHRSASAKLKSPLPP--RPPSSNPLK 82


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 32.8 bits (74), Expect = 0.81
 Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 7/81 (8%)

Query: 478 KHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQA 537
             S+ +E+    I E+  L      P  PG+ T+ + C    P  +S G  TP S     
Sbjct: 194 SFSIRLEMGGNGIDETCDLEDNYISPVLPGSNTFFITCSGDKPHFASGGILTPIS----- 248

Query: 538 RTPTSSKAPPTTPSSSKTILP 558
             P  +  P T  + S  + P
Sbjct: 249 --PMKTPTPGTGYNYSLWLGP 267



 Score = 32.4 bits (73), Expect = 0.96
 Identities = 22/96 (22%), Positives = 36/96 (37%), Gaps = 5/96 (5%)

Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP-- 561
            TP   T S   + T P S +    TP++ +      T +   PTT  +S T    +P  
Sbjct: 472 GTPTGTTSSTLPEDTSPTSRTT-SATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTP 530

Query: 562 --SSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
                  +  S P   T+  ++ S +   +S    T
Sbjct: 531 IVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566



 Score = 31.3 bits (70), Expect = 2.4
 Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPS------SKARQARTPTSSKAPPTTPSSS 553
           E   TT    T S+P   T P  S+   T+ +      S   +  +PTS     T  ++S
Sbjct: 442 EPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATS 501

Query: 554 KTILPATPSSSKASRKSAPPAIT 576
            T    TP+++  + +       
Sbjct: 502 PTPAVTTPNATSPTTQKTSDTPN 524



 Score = 30.1 bits (67), Expect = 5.1
 Identities = 16/90 (17%), Positives = 30/90 (33%), Gaps = 3/90 (3%)

Query: 511 YSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
           +  P   T  P++       +        PTS     TT S+       T  ++ A+  +
Sbjct: 440 HVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNA 499

Query: 571 APPAITTPC---SSKSLRSGSDSQQLQTIT 597
             P         +S + +  SD+    + T
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPT 529



 Score = 29.8 bits (66), Expect = 7.5
 Identities = 13/75 (17%), Positives = 34/75 (45%), Gaps = 1/75 (1%)

Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
           T+  N T     + +P  +++    T +     +   T      ++P+S +  +P++  S
Sbjct: 547 TSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHS 606

Query: 564 S-KASRKSAPPAITT 577
           + +++  S  P +T+
Sbjct: 607 TPRSNSTSTTPLLTS 621


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 32.6 bits (74), Expect = 0.84
 Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 3/118 (2%)

Query: 478 KHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQA 537
           K S   E + L        L       TP +R  +       P    + +++    A + 
Sbjct: 546 KASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSSVPPAAEKR 605

Query: 538 RTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQT 595
           +  + SK  P +    +T    + SS     +S PP+  +P +++S +    S +   
Sbjct: 606 KYKSPSKIVPKSREFIET---DSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPV 660


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 31.9 bits (73), Expect = 0.95
 Identities = 19/82 (23%), Positives = 27/82 (32%), Gaps = 13/82 (15%)

Query: 501 QQPTT----PGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTI 556
           QQPT     P N       + TP    S  Q    ++ +Q    T+   P T P  +   
Sbjct: 97  QQPTQLSEVPYN-------EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTA- 148

Query: 557 LPATPSSSKASRKSAPPAITTP 578
            P  P +    R      +T  
Sbjct: 149 -PVQPQTPAPVRTQPAAPVTQA 169



 Score = 29.6 bits (67), Expect = 5.5
 Identities = 13/69 (18%), Positives = 26/69 (37%), Gaps = 6/69 (8%)

Query: 516 DMTPPPSSSK-----GQTTPSSKAR-QARTPTSSKAPPTTPSSSKTILPATPSSSKASRK 569
           DM   P+         QT    ++  Q +     + PP T +  + + P   +++    +
Sbjct: 94  DMRQQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQ 153

Query: 570 SAPPAITTP 578
           +  P  T P
Sbjct: 154 TPAPVRTQP 162


>gnl|CDD|239731 cd03762, proteasome_beta_type_6, proteasome beta type-6 subunit.
           The 20S proteasome, multisubunit proteolytic complex, is
           the central enzyme of nonlysosomal protein degradation
           in both the cytosol and nucleus. It is composed of 28
           subunits arranged as four homoheptameric rings that
           stack on top of one another forming an elongated
           alpha-beta-beta-alpha cylinder with a central cavity.
           The proteasome alpha and beta subunits are members of
           the N-terminal nucleophile (Ntn)-hydrolase superfamily.
           Their N-terminal threonine residues are exposed as a
           nucleophile in peptide bond hydrolysis. Mammals have 7
           alpha and 7 beta proteasome subunits while archaea have
           one of each.
          Length = 188

 Score = 31.4 bits (72), Expect = 0.99
 Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 12/78 (15%)

Query: 567 SRKSAPPAITTPCSSK---------SLRSGS--DSQQLQTITAEYIIYHAII-SRRLQTK 614
           SR S    +    + K           RSGS  D+Q +      Y+  H+I        K
Sbjct: 18  SRTSTGSYVANRVTDKLTQLHDRIYCCRSGSAADTQAIADYVRYYLDMHSIELGEPPLVK 77

Query: 615 ISRTAFYMMCAKDQDSLS 632
            + + F  +C   ++ LS
Sbjct: 78  TAASLFKNLCYNYKEMLS 95


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 32.5 bits (74), Expect = 1.0
 Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 13/120 (10%)

Query: 486  SDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKA 545
            +      +AK  +EE +     +R  +   D  P  S+   +T  +  AR+A       A
Sbjct: 1246 AKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLA 1305

Query: 546  PPTTPSSSKT--------ILPATPSSSKASRKSA---PPAITTPCSSKSLRSGSDSQQLQ 594
              +  S S          +  A     K  RK A     A   P ++K  + G  + Q  
Sbjct: 1306 SVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAK--KRGPATVQSG 1363


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 8/84 (9%)

Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS-----KAPPTTPSS 552
           KEE   + P +   S P    P P S       ++    +++P S      K      SS
Sbjct: 386 KEEPVDSNPSSPGNSGPA--RPSPESKDKGKRKAA-NEVSKSPASVPAKKLKTENAPKSS 442

Query: 553 SKTILPATPSSSKASRKSAPPAIT 576
           S    P T S SK+S  +A   +T
Sbjct: 443 SGKSTPQTFSGSKSSSNAADGGVT 466


>gnl|CDD|215039 PLN00041, PLN00041, photosystem I reaction center subunit II;
           Provisional.
          Length = 196

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)

Query: 520 PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
            P+SS  +  P +         S+   P + S+++ +  A  +  +A+ K AP   T P
Sbjct: 5   TPASSGRKLVPWASFL----SKSTSKAPASLSATRAVRAAAAAE-EAAAKEAPVGFTPP 58


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 18/71 (25%), Positives = 32/71 (45%), Gaps = 5/71 (7%)

Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTT-----PSSSKTILPA 559
           +P  R+ + P  +   PSSS  QT+  S+      P  +++         P +S + +P 
Sbjct: 64  SPPPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPR 123

Query: 560 TPSSSKASRKS 570
           T S ++A   S
Sbjct: 124 TSSPARALLAS 134



 Score = 31.2 bits (70), Expect = 1.8
 Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 4/81 (4%)

Query: 513 VPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK-TILPATPSSSKASRKSA 571
            P   TPP   +K  ++ SS+      P     P  T S+ + +ILP   +S     +++
Sbjct: 66  PPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTS 125

Query: 572 PPAITTPCSSKSLRSGSDSQQ 592
            PA        S  S   +Q+
Sbjct: 126 SPARA---LLASSGSQHKTQK 143



 Score = 31.2 bits (70), Expect = 1.9
 Identities = 23/81 (28%), Positives = 33/81 (40%), Gaps = 5/81 (6%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSA---PPA 574
            P P++    TTPS K    +T  S   PP +PS  +    +    +K S  +    PP 
Sbjct: 11  IPGPANRSQSTTPSKKGPPLKTQPS--DPPKSPSPGQQRSRSLHRPAKPSELAELSPPPR 68

Query: 575 ITTPCSSKSLRSGSDSQQLQT 595
             TP +  +    S S Q  T
Sbjct: 69  SATPPARLAKTPSSSSSQTST 89


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 22/108 (20%), Positives = 37/108 (34%), Gaps = 7/108 (6%)

Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTP-PPSSSKGQTTPSSKAR 535
             H      +D      +    E      P  R ++ P    P P   ++G      + R
Sbjct: 60  RFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAGSRGRGR 119

Query: 536 QARTPTSSKAPP-----TTPSSSKTILPATPSSSKASRKSAPPAITTP 578
           +A +P S + PP     + P   K+    + +   A   +AP    TP
Sbjct: 120 RAESP-SPRDPPNPKGASAPRGRKSACADSAALLDAPAPAAPKRQKTP 166


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 31.9 bits (72), Expect = 1.2
 Identities = 30/161 (18%), Positives = 53/161 (32%), Gaps = 18/161 (11%)

Query: 440 VILCEKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKE 499
           + LCE  P    C  D N     P     +           VS          ++K    
Sbjct: 132 LALCESGP---ICQRDYN-PRDRPKCRCTL-------RGKDVSCCKEPKTAVTTSKTTSW 180

Query: 500 EQQPTTPGNRTYSVPCDMTPPP-SSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK-TIL 557
             + + P     + P  +TP    +++G  T ++  R + T        TT S+ +    
Sbjct: 181 PTEVSHP-----TYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTE 235

Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITA 598
           P       +     PP+ T+   S +      +Q+ +T  A
Sbjct: 236 PPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPA 276



 Score = 30.4 bits (68), Expect = 4.1
 Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 3/91 (3%)

Query: 509 RTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASR 568
           +T S P +++ P   S+  T  S  A Q     ++    ++     T    T S+ +   
Sbjct: 176 KTTSWPTEVSHPTYPSQ-VTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPE--P 232

Query: 569 KSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
           ++ PP      S       S + Q Q+ T +
Sbjct: 233 QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263



 Score = 30.0 bits (67), Expect = 4.3
 Identities = 17/74 (22%), Positives = 26/74 (35%)

Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSS 564
           T G  T S P   T PP S +G +          +   S        + +   P   S+ 
Sbjct: 221 TQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNR 280

Query: 565 KASRKSAPPAITTP 578
           ++   +A P  TT 
Sbjct: 281 RSPHSTATPPPTTK 294



 Score = 30.0 bits (67), Expect = 5.0
 Identities = 26/131 (19%), Positives = 51/131 (38%), Gaps = 23/131 (17%)

Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTT-------------PSSKARQARTPTSSKAP 546
           E Q   P ++         PP ++S+ Q+T             P + + + R+P S+  P
Sbjct: 231 EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNR-RSPHSTATP 289

Query: 547 PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSG--------SDSQQLQTITA 598
           P T    +T  P TP  +  ++  + P  ++P   ++  +          D  +  +I  
Sbjct: 290 PPTTKRQETGRP-TPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSICY 348

Query: 599 EYIIYHAIISR 609
              IY+  + R
Sbjct: 349 GVGIYNEALPR 359



 Score = 30.0 bits (67), Expect = 5.1
 Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 2/88 (2%)

Query: 500 EQQPTTPGNRTYSVPCDMTP-PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILP 558
           + QP T G++T +    ++   P  ++G TT S+   Q   P  S+  P+          
Sbjct: 196 QSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQT-EPPPSQRGPSGSPQHPPSTT 254

Query: 559 ATPSSSKASRKSAPPAITTPCSSKSLRS 586
           +   S+    +       TP ++ + RS
Sbjct: 255 SQDQSTTGDGQEHTQRRKTPPATSNRRS 282


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 32.0 bits (72), Expect = 1.3
 Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 7/88 (7%)

Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
            +  K  +E ++P  P  R+   P   T P S    +     K+   + P S K+P   P
Sbjct: 585 PKHPKDPEEPKKPKRP--RSAQRP---TRPKSPKLPELLDIPKS--PKRPESPKSPKRPP 637

Query: 551 SSSKTILPATPSSSKASRKSAPPAITTP 578
              +   P  P   K  +   PP    P
Sbjct: 638 PPQRPSSPERPEGPKIIKSPKPPKSPKP 665


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 31.8 bits (72), Expect = 1.3
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 6/89 (6%)

Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
           P D+  P +      T +S      TP+++ +PP+T   + +   A P +         P
Sbjct: 199 PADVFVPATPRPTPRTTAS-PETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTP 257

Query: 574 AITTPCS-----SKSLRSGSDSQQLQTIT 597
           A  TP       + +  +   S+   T+T
Sbjct: 258 APPTPGGGEAPPANATPAPEASRYELTVT 286


>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2.  This family
           contains the bacterial histone H1-like nucleoprotein HC2
           (approximately 200 residues long), which seems to be
           found mostly in Chlamydia. HC2 functions in DNA
           condensation, although it has been suggested that it
           also has other roles.
          Length = 187

 Score = 31.3 bits (70), Expect = 1.3
 Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 1/60 (1%)

Query: 532 SKARQARTPTSSKAPPTTPSSSKTIL-PATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
           +K   A+   + KA    P++ K +   A  +  K  + +A        S+     GS S
Sbjct: 110 AKKPAAKKAVAKKAVARKPAAKKAVAKKAASTCHKNHKHTAACKRVASSSATRAACGSKS 169


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 31.7 bits (72), Expect = 1.4
 Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 7/62 (11%)

Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
           + P P+    +  P++ A     PT   AP T P   K    A     +  R++A P   
Sbjct: 360 LVPVPAPQPAK--PTAAAPSPVRPT--PAPSTRP---KAAAAANIPPKEPVRETATPPPV 412

Query: 577 TP 578
            P
Sbjct: 413 PP 414



 Score = 30.5 bits (69), Expect = 3.8
 Identities = 17/95 (17%), Positives = 23/95 (24%), Gaps = 4/95 (4%)

Query: 502 QPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
           +PT   +           PP     +T           P     P      S   L    
Sbjct: 380 RPTPAPSTRPKAAAAANIPPKEPVRETATPPPV----PPRPVAPPVPHTPESAPKLTRAA 435

Query: 562 SSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTI 596
                  K  PPA         +  G   +QL+ I
Sbjct: 436 IPVDEKPKYTPPAPPKEEEKALIADGDVLEQLEAI 470



 Score = 29.8 bits (67), Expect = 6.1
 Identities = 11/74 (14%), Positives = 21/74 (28%), Gaps = 1/74 (1%)

Query: 509 RTYSVPCDMTPP-PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
                P   T   PS  +    PS++ + A         P   +++   +P  P +    
Sbjct: 363 VPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVP 422

Query: 568 RKSAPPAITTPCSS 581
                    T  + 
Sbjct: 423 HTPESAPKLTRAAI 436


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 12/117 (10%)

Query: 475 SYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTY--SVPCDMTPPPSSSKGQTTPSS 532
             H    S++  +     S++  +   +P    + +   S P   +   ++      PS 
Sbjct: 9   HLHHSYSSLDRREKTCLRSSQKTRRFPKPKASLHPSIKRSRPGRCSTNGAAVPESPKPSR 68

Query: 533 KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSD 589
           + R+ +  +S     TT   +K          K  ++  P        +       D
Sbjct: 69  RGRKKKPTSSPPKAKTTRRRTK----------KTDQELDPEGAEEDQEAAEDGEDYD 115


>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
           beta).  This family consists of several telomere-binding
           protein beta subunits which appear to be specific to the
           family Oxytrichidae. Telomeres are specialised
           protein-DNA complexes that compose the ends of
           eukaryotic chromosomes. Telomeres protect chromosome
           termini from degradation and recombination and act
           together with telomerase to ensure complete genome
           replication. TEBP beta forms a complex with TEBP alpha
           and this complex is able to recognise and bind ssDNA to
           form a sequence-specific, telomeric nucleoprotein
           complex that caps the very 3' ends of chromosomes.
          Length = 375

 Score = 31.6 bits (71), Expect = 1.5
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 5/47 (10%)

Query: 529 TPSSKARQARTPTSSKAPPTTPSS----SKTILPATPS-SSKASRKS 570
           TP+  + +  TP  S+A  ++       SK  +PA PS S K S K+
Sbjct: 285 TPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKT 331


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 13/74 (17%), Positives = 25/74 (33%), Gaps = 2/74 (2%)

Query: 516 DMTPPPSSSK--GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
              P  +++    Q  P++ A  A  P  +  P    ++      A   ++  +R+S  P
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430

Query: 574 AITTPCSSKSLRSG 587
                    S R  
Sbjct: 431 EALAAARQASARGP 444



 Score = 31.4 bits (71), Expect = 2.1
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 3/66 (4%)

Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCS 580
           P  S G   P   A  A  P +  AP     ++    PA P ++ A+  +A  A     +
Sbjct: 365 PGQSGGGAGP---ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA 421

Query: 581 SKSLRS 586
           + + RS
Sbjct: 422 APARRS 427



 Score = 31.0 bits (70), Expect = 2.6
 Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 7/67 (10%)

Query: 512 SVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSA 571
           + P   +P P +       +++   AR P  + AP   P+++     A   ++   R  A
Sbjct: 421 AAPARRSPAPEA-----LAAARQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVA 473

Query: 572 PPAITTP 578
             A   P
Sbjct: 474 AAAAAAP 480


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 31.6 bits (71), Expect = 1.6
 Identities = 25/105 (23%), Positives = 37/105 (35%), Gaps = 12/105 (11%)

Query: 503 PTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAP-------PTTPSSSKT 555
           P + G  T   P    P  ++     TPSS A     P+++ +P       P  P  S  
Sbjct: 389 PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSG- 447

Query: 556 ILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAEY 600
            +P  P+         P A   P S  S      +    + TAE+
Sbjct: 448 -IPPRPAPRMPEASPVPGA---PDSVASASDAPPTLGDPSDTAEH 488


>gnl|CDD|218639 pfam05564, Auxin_repressed, Dormancy/auxin associated protein.
           This family contains several plant dormancy-associated
           and auxin-repressed proteins the function of which are
           poorly understood.
          Length = 119

 Score = 29.9 bits (67), Expect = 1.8
 Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 1/58 (1%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKT-ILPATPSSSKASRKSAPPA 574
           T P S        SSK +  R+ +   + PT+P++  T   P +       R    P 
Sbjct: 24  TSPLSIKDSSEGGSSKDKVQRSISMPASTPTSPATPTTPTSPFSARKENVWRSVFHPG 81


>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein. 
          Length = 188

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 6/81 (7%)

Query: 518 TPPPSSSKGQTTPSS-KARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
               + +  +  PS  + +  +  TS+ +  +   S+ T      S S  SR  AP    
Sbjct: 38  NRGGNLAGPENNPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRRRG 97

Query: 577 TPCSSKSLRSGSDSQQLQTIT 597
           T        S  D Q +Q +T
Sbjct: 98  TAFG-----SMEDPQIVQAVT 113


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 31.4 bits (71), Expect = 1.9
 Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 529 TPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
            PSS A  A     +K  P  P   +   PA+    KAS+ SAPP+
Sbjct: 201 KPSSSAAPA--APKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPS 244


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 21/108 (19%), Positives = 33/108 (30%), Gaps = 4/108 (3%)

Query: 492 ESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS 551
           E A     E+    P    +  P +     S S  Q T +        P +S+ P    S
Sbjct: 389 EIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETS 448

Query: 552 SSKTILPATPSSSKASR----KSAPPAITTPCSSKSLRSGSDSQQLQT 595
            S + +   P  S+            +     S        +SQ+L T
Sbjct: 449 RSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVT 496



 Score = 31.1 bits (70), Expect = 2.3
 Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 8/91 (8%)

Query: 486 SDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKA 545
           S++++      LK E++   P             P      +  P S+       T  + 
Sbjct: 452 SEVSLDPHQSELKSEKKKARPE------VSKQRFPSRDVW-EDAPESQELVTTEETPEEV 504

Query: 546 PPTTPSSSKTILPATPSSSK-ASRKSAPPAI 575
             ++P  +K  +P+ P   K  S K  PP +
Sbjct: 505 KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPV 535



 Score = 30.3 bits (68), Expect = 4.3
 Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 521 PSSSKGQTTPSSKAR-QARTPTSSK-APPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
            SSS G T P+  +R +   PTS K  PP  P   K  +PA P+  +  +++   A ++ 
Sbjct: 505 KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPVPKKPKPQIPARPAKLQ-KQQAGEEANSSA 563

Query: 579 CSSK 582
              K
Sbjct: 564 FKPK 567


>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in eukaryotes, and is typically
           between 228 and 257 amino acids in length.
          Length = 242

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 17/83 (20%)

Query: 518 TPPPSSSKGQTTPSSKARQ-------ARTPTSSKAPPTTPSSSKTILPATPSSSKAS--- 567
            PPP S +  +  +S++R        + + +S  +     +S K    A    S  S   
Sbjct: 122 VPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSFWSEFG 181

Query: 568 -------RKSAPPAITTPCSSKS 583
                   KS   A +TP  + +
Sbjct: 182 IDTPGQKSKSPQKASSTPAGNTN 204


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 516 DMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI 575
            +T   +  K  TT SS      T T+S    T+  +S T+     +++  S      A 
Sbjct: 70  PVTTSATPPKLTTTSSSP--SNDTTTAS----TSTKTSPTVSTTVTTTTSPSETDTEEAT 123

Query: 576 TTPCSSKSLRSGSDSQQLQT 595
           TT  +      GS +    +
Sbjct: 124 TTVSTETPTEGGSSAATDPS 143


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 31.2 bits (71), Expect = 2.1
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRK--SAPPAIT 576
           P    +   + P      +R+P+S  +P   PS+ + +LP  P   +ASR     P +  
Sbjct: 245 PLHPHTTRPSPPRP--AFSRSPSSPLSPLPRPSTRRGLLP-NPRLPRASRGHLPPPTSSA 301

Query: 577 TPCSSKSLR 585
            P  +  LR
Sbjct: 302 PPRPNGGLR 310


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 31.2 bits (71), Expect = 2.1
 Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 5/95 (5%)

Query: 489 AIAESAKLLKEEQQP--TTPGNRTYSVPCDMTPPPSSSKGQTTP---SSKARQARTPTSS 543
           A AE     + E       P  +  + P     P +++     P   +  A  A    ++
Sbjct: 374 APAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAA 433

Query: 544 KAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
            A     + +   L   P +  A    A P    P
Sbjct: 434 PAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP 468



 Score = 29.3 bits (66), Expect = 9.3
 Identities = 15/58 (25%), Positives = 26/58 (44%)

Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTP 578
           P+++     P+ K   AR   ++ A      ++    PA   ++ AS  +APPA   P
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 31.0 bits (70), Expect = 2.2
 Identities = 25/140 (17%), Positives = 45/140 (32%), Gaps = 22/140 (15%)

Query: 470 FDYAYSYHKHSVS-MELSDLAI--AESAKLLKEEQQ--------------PTTPGN--RT 510
           F+   S H +SV    L D  +       +LKE                 P T  N  R 
Sbjct: 79  FEEVTSVHVYSVQKAPLKDSNVLYDVDYDILKENLHNCSKNSLEYGKQAGPITNPNVKRR 138

Query: 511 YSVPCDMTPPPSSSKGQTTPSSKARQARTPTSS-KAPPTTPSSSKTILPATPSSSKASRK 569
             V      P +S   + T + K   ++ P S         S+ KT   +  ++++    
Sbjct: 139 TGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEK--T 196

Query: 570 SAPPAITTPCSSKSLRSGSD 589
               ++      ++    S+
Sbjct: 197 EGKTSVKAASLKRNPPKKSN 216


>gnl|CDD|221361 pfam11990, DUF3487, Protein of unknown function (DUF3487).  This
           family of proteins is functionally uncharacterized. This
           protein is found in bacteria. Proteins in this family
           are typically between 121 to 136 amino acids in length.
           This protein has a conserved RLN sequence motif.
          Length = 122

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 6/27 (22%), Positives = 10/27 (37%)

Query: 139 PHMLDRHPSIVPRMRTPVFWIITGLSL 165
           P  L+R P +   +      +  GL  
Sbjct: 8   PDRLNREPVVFRGLTADELGVAAGLGA 34


>gnl|CDD|213867 TIGR03824, FlgM_jcvi, flagellar biosynthesis anti-sigma factor
           FlgM.  FlgM interacts with and inhibits the alternative
           sigma factor sigma(28) FliA. The C-terminus of FlgM
           contains the sigma(28)-binding domain.
          Length = 96

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 12/52 (23%), Positives = 19/52 (36%)

Query: 548 TTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
                       +  S  A++ SA        S  S+   S +QQLQ++ A 
Sbjct: 5   GIGPLPVNAALRSTRSVAAAKASAAAQAAAASSGDSVSLSSLAQQLQSLEAA 56


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 30.7 bits (69), Expect = 2.5
 Identities = 26/106 (24%), Positives = 35/106 (33%), Gaps = 11/106 (10%)

Query: 485 LSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK 544
           LS L        L  E     P  R  SVP    P        T P      +      K
Sbjct: 116 LSQLNNVAVTSTLPTEPATVAP-VRNASVPTAERPAI------TRPVRAQAVSEPAVEPK 168

Query: 545 APPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
           A  T  ++   +  A+P+ + A     PPA     +S  L+S   S
Sbjct: 169 AAKTATATEAKVQTASPAQTPA----TPPAGKGAAASGQLKSAPSS 210


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 31.3 bits (70), Expect = 2.5
 Identities = 18/78 (23%), Positives = 29/78 (37%), Gaps = 1/78 (1%)

Query: 519  PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS-SSKTILPATPSSSKASRKSAPPAITT 577
              P+     +   + +   + P  +  PP     S         +  + S  S+PPA TT
Sbjct: 1180 GIPAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATT 1239

Query: 578  PCSSKSLRSGSDSQQLQT 595
            P    S  S S +Q + T
Sbjct: 1240 PTPPPSSSSSSSAQSIST 1257


>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
          Length = 214

 Score = 30.6 bits (68), Expect = 2.5
 Identities = 14/69 (20%), Positives = 27/69 (39%), Gaps = 7/69 (10%)

Query: 530 PSSKARQARTPTSSKAPPTTPSSSKTILPA-------TPSSSKASRKSAPPAITTPCSSK 582
            +     A  P  ++A P  P+  +T++PA        P ++      A P    P  + 
Sbjct: 137 QAHTPVPAPAPVIAQAQPIAPAPHQTLVPAPHDTIVPAPHAAHLPSAPATPVSVAPVDAG 196

Query: 583 SLRSGSDSQ 591
           +  S + S+
Sbjct: 197 TSASPTPSE 205


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 30.7 bits (69), Expect = 2.6
 Identities = 16/73 (21%), Positives = 36/73 (49%), Gaps = 1/73 (1%)

Query: 527 QTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRS 586
           + T   K  + + P + KA  T+  ++KT   A  +S+KA++K+A    T    + + + 
Sbjct: 5   KKTTKKKTTEEKKPAAKKAT-TSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63

Query: 587 GSDSQQLQTITAE 599
               ++ +++  E
Sbjct: 64  TVKFEKTESVKKE 76



 Score = 30.3 bits (68), Expect = 3.0
 Identities = 9/74 (12%), Positives = 19/74 (25%), Gaps = 5/74 (6%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPT-----TPSSSKTILPATPSSSKASRKSAP 572
            P    +      +   + A+T ++  A        T S   T    T    K       
Sbjct: 17  KPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKE 76

Query: 573 PAITTPCSSKSLRS 586
                    +++ +
Sbjct: 77  SVAKKTVKKEAVSA 90


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 31.0 bits (70), Expect = 2.8
 Identities = 20/90 (22%), Positives = 30/90 (33%), Gaps = 5/90 (5%)

Query: 501 QQPTTPGNRTYSVPCDMTPPP-SSSKGQT--TPSSKARQARTPTSSKAPPTTPSSSKTIL 557
                 G+   SVP   TP    + +  T  TPS K     T +S     +  S S    
Sbjct: 456 VMEPPYGSTESSVP--STPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNT 513

Query: 558 PATPSSSKASRKSAPPAITTPCSSKSLRSG 587
               +  + +     PA +       +RSG
Sbjct: 514 HKHSTPRRLTTLPKLPAASRSSKGNLIRSG 543


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.8 bits (69), Expect = 2.9
 Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 4/67 (5%)

Query: 514 PCDMTPP---PSSSKGQTTPSSKARQAR-TPTSSKAPPTTPSSSKTILPATPSSSKASRK 569
           P    PP    + S    TPS++A   + +P +++  P     S   L + PS       
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238

Query: 570 SAPPAIT 576
           S  P + 
Sbjct: 239 SPHPPLQ 245


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 17/81 (20%), Positives = 27/81 (33%), Gaps = 11/81 (13%)

Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASR-------- 568
           +    +  K  +  S      R  +SS     +P  S     AT + SK S         
Sbjct: 226 VAVLSAPRKPGSRSSDCKSSPRARSSS---AKSPFKSSIQRKATKALSKLSLRASPKDTS 282

Query: 569 KSAPPAITTPCSSKSLRSGSD 589
           KS+   +  P  S++    S 
Sbjct: 283 KSSKSEVAPPKKSEAKVPSSS 303



 Score = 29.0 bits (65), Expect = 9.4
 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 5/70 (7%)

Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSK----TILPATPSSSKASRKSAPPAIT 576
           PS S G + PSS   +  +   S+    + S  K       P  P S  +  KS+P A +
Sbjct: 192 PSPSGGTSCPSSSGGRRSSI-GSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARS 250

Query: 577 TPCSSKSLRS 586
           +   S    S
Sbjct: 251 SSAKSPFKSS 260


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 30.6 bits (69), Expect = 3.0
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 3/54 (5%)

Query: 521 PSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPA 574
           P++  G    S+KA+ A    S K    T  S+K    A  +++KA++K+   A
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK---KAAKTAAKAAKKAKKTA 256



 Score = 30.2 bits (68), Expect = 4.0
 Identities = 10/68 (14%), Positives = 23/68 (33%), Gaps = 3/68 (4%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS---SSKTILPATPSSSKASRKSAPPAI 575
              +    +   +  A  A+    + A     +   + K +  A  +  KA++K+A  A 
Sbjct: 220 KSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAA 279

Query: 576 TTPCSSKS 583
                +  
Sbjct: 280 KAAKGAAK 287


>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model represents one of several closely related clades
           of the dihydrolipoamide acetyltransferase subunit of the
           pyruvate dehydrogenase complex. It includes sequences
           from mitochondria and from alpha and beta branches of
           the proteobacteria, as well as from some other bacteria.
           Sequences from Gram-positive bacteria are not included.
           The non-enzymatic homolog protein X, which serves as an
           E3 component binding protein, falls within the clade
           phylogenetically but is rejected by its low score
           [Energy metabolism, Pyruvate dehydrogenase].
          Length = 436

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 5/52 (9%)

Query: 529 TPSSKARQARTP--TSSKAPPTTPSSSKTILPATPSSS---KASRKSAPPAI 575
              S A  A  P   +  APP+ P  S      +P  S     S K +   I
Sbjct: 89  KLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLSDKESGDRI 140


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 30.8 bits (69), Expect = 3.0
 Identities = 17/76 (22%), Positives = 29/76 (38%)

Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
           N  Y+ P +   P S+      P S    + TP S+      P SS     +   ++   
Sbjct: 431 NTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQH 490

Query: 568 RKSAPPAITTPCSSKS 583
           R +  PA   P +++ 
Sbjct: 491 RAANQPAANLPTANQP 506



 Score = 30.4 bits (68), Expect = 4.8
 Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 2/70 (2%)

Query: 508 NRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKAS 567
           N  YS P +  P  +++    TP +    + TP S+  P + P  S      TP S+ A 
Sbjct: 411 NAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSN-PPNSNPPYSNLPYSNTPYSN-AP 468

Query: 568 RKSAPPAITT 577
             +APP+   
Sbjct: 469 LSNAPPSSAK 478


>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
          Length = 987

 Score = 30.7 bits (69), Expect = 3.0
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 4/66 (6%)

Query: 520 PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL----PATPSSSKASRKSAPPAI 575
           PP  +      ++ A    T     A P   +++K +      A      A +K+APPA 
Sbjct: 855 PPRKTAAAKAGAASAAFGGTVAVKAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAA 914

Query: 576 TTPCSS 581
               S+
Sbjct: 915 GLKPSA 920


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 30.6 bits (69), Expect = 3.1
 Identities = 18/112 (16%), Positives = 31/112 (27%), Gaps = 3/112 (2%)

Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQ 536
            +             E    +   +   T  N +  +  D  P     +    P++    
Sbjct: 438 LRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLL-RDPPPKKCGEESGHLPNNPFFN 496

Query: 537 ARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGS 588
               T S  PP   S  ++I+     S  ASR S+        S   +    
Sbjct: 497 KLKLTLSSIPPL--SPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPP 546


>gnl|CDD|205278 pfam13097, CENP-U, CENP-A nucleosome associated complex (NAC)
           subunit.  CENP-U is one of the components that assembles
           onto the CENP-A-nucleosome associated complex (NAC). The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. FTA4 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 174

 Score = 29.9 bits (67), Expect = 3.1
 Identities = 24/140 (17%), Positives = 47/140 (33%), Gaps = 10/140 (7%)

Query: 498 KEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTIL 557
              ++P++       V  D            TP   +  +  P+S    P   S S+T  
Sbjct: 10  PSAERPSS---ERPEVTSDAASLSEKPASDVTPPKTSPSSSKPSSEDESPANQSQSETQK 66

Query: 558 PATPSSSKASRKSA-----PPAITTPCSSKSLRSGSDSQQLQTITAEYIIYHAIISRRLQ 612
             +P   K  R S        ++   C     +  SD  +L  + +E+        +R++
Sbjct: 67  KLSPGRRKKPRSSHSGSDSSDSVHVWCLEG--KKRSDITELDVVLSEFEKICLEYRQRVE 124

Query: 613 TKISRTAFYMMCAKDQDSLS 632
           +K  + A        ++ L 
Sbjct: 125 SKACKQAIKDFFNNLKEELI 144


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 30.2 bits (67), Expect = 3.2
 Identities = 21/108 (19%), Positives = 39/108 (36%), Gaps = 10/108 (9%)

Query: 491 AESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTP 550
            E      +E+ P     +T         PP +S  +  P++    A      +AP  + 
Sbjct: 75  GEKEAAAAKEEAPKAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPA---AGGEAPKASE 131

Query: 551 SSSKTILPATPSSSK-------ASRKSAPPAITTPCSSKSLRSGSDSQ 591
           +SS+    A P+  +        ++K+  PA     +       SDS+
Sbjct: 132 ASSQPAESAAPAKEEEKSKEEGEAKKTEAPAAAAQETKSDAAPASDSK 179


>gnl|CDD|221472 pfam12228, DUF3604, Protein of unknown function (DUF3604).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 621 and 693 amino
           acids in length.
          Length = 592

 Score = 30.7 bits (70), Expect = 3.2
 Identities = 15/42 (35%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 102 TKLPPRPSPFPTLDWANEEDLWDLL---CAKDQDSLSLRDPH 140
             +P +  PF +LD  N EDLWD +     +   S+ L  PH
Sbjct: 178 GAVPAQVLPFSSLDSPNPEDLWDWMDAYEEQTGGSV-LAIPH 218


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 30.2 bits (68), Expect = 3.4
 Identities = 20/92 (21%), Positives = 35/92 (38%), Gaps = 9/92 (9%)

Query: 499 EEQQPTTPGNRT--YSVPCDMTPPP-SSSKGQTTPSSKARQARTPTSSKAPP------TT 549
           +  + T P +     S   D  P   + +  Q+ P   +  + T  ++  PP        
Sbjct: 80  DGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDG 139

Query: 550 PSSSKTILPATPSSSKASRKSAPPAITTPCSS 581
           P+   T  PATP   ++ R+  P +   P  S
Sbjct: 140 PTPDPTAQPATPDERRSPRQRPPVSGEPPTPS 171



 Score = 29.8 bits (67), Expect = 4.5
 Identities = 18/86 (20%), Positives = 28/86 (32%), Gaps = 12/86 (13%)

Query: 500 EQQPTTPGNRTYSVPCDMTPPPSSSKGQTT-------PSSKARQARTPTSSKAPPTTPSS 552
           + QP        +   D + PP +S    T       P++ A +        A P TP  
Sbjct: 99  DAQPAAE-----AEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDE 153

Query: 553 SKTILPATPSSSKASRKSAPPAITTP 578
            ++     P S +    S P A    
Sbjct: 154 RRSPRQRPPVSGEPPTPSTPDAHVAG 179


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 30.2 bits (68), Expect = 3.8
 Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 1/52 (1%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPS-SSKTILPATPSSSKASRK 569
            P + S   T P   A  A     + AP    + +   +  A P+  + +R+
Sbjct: 206 QPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAKVDHAAPAVRRLARE 257


>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 755

 Score = 30.3 bits (68), Expect = 3.9
 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 509 RTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATP 561
           R Y+V  D   PP  S      +  +     P ++ APP  P +++ +    P
Sbjct: 133 REYTVLLD---PPGYSPKSALSAEPSHPVPAPAAASAPPPPPRAARPVRQPAP 182


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
           subunit [Transcription].
          Length = 548

 Score = 30.4 bits (68), Expect = 4.0
 Identities = 39/164 (23%), Positives = 53/164 (32%), Gaps = 37/164 (22%)

Query: 429 EDCIEWMNIYWVILCEKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDL 488
            D IE+  IY  + CE  P      S SN     P   NN           +    +   
Sbjct: 184 PDFIEYDTIYEDMGCEIQP------SSSNN--EAPKEGNN----------QTSLSSIRSS 225

Query: 489 AIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPT 548
              E +   K  Q+                   S S   TTP +   ++ + + S  P  
Sbjct: 226 KKQERSPKKKAPQRDV-----------------SISDRATTPIAPGVESASQSISSTP-- 266

Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQ 592
           TP S+ T L      S     S     TT  S K   S +DS+Q
Sbjct: 267 TPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQ 310


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 30.2 bits (68), Expect = 4.3
 Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 4/75 (5%)

Query: 505 TPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPAT---- 560
           T    T S                  +    Q +T TSS +P  T S ++T    T    
Sbjct: 781 TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTS 840

Query: 561 PSSSKASRKSAPPAI 575
           PS +        P +
Sbjct: 841 PSQTTTGGGICGPIV 855


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 29.1 bits (65), Expect = 4.4
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 7/57 (12%)

Query: 531 SSKARQARTPTSSKAP--PTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLR 585
           S+   Q  TP  + AP  P+ P+      PA   +S     +    +T P   K LR
Sbjct: 46  SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPAS-----AGENVVTAPMPGKILR 97


>gnl|CDD|115805 pfam07175, Osteoregulin, Osteoregulin.  This family represents a
           conserved region approximately 180 residues long within
           osteoregulin, a bone-remodelling protein expressed
           highly in osteocytes within trabecular and cortical
           bone. A conserved RGD motif is found towards the
           C-terminal end of this region, and this is potentially
           involved in integrin recognition.
          Length = 163

 Score = 29.2 bits (65), Expect = 4.4
 Identities = 14/60 (23%), Positives = 24/60 (40%)

Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS 553
            +LL EE +  TP +    +P D+    + SK +          ++P  SK       S+
Sbjct: 68  IELLGEENKEKTPQSVLSVIPADVNYAKAHSKDKKNQQRDLLAQKSPVKSKHTHRIRRST 127


>gnl|CDD|221743 pfam12735, Trs65, TRAPP trafficking subunit Trs65.  This family is
           one of the subunits of the TRAPP Golgi trafficking
           complex. TRAPP subunits are found in two different sized
           complexes, TRAPP I and TRAPP II. While both complexes
           contain the same seven subunits, Bet3p, Bet5p, Trs20p,
           Trs23p, Trs31p, Trs33p and Trs85p, with TRAPPC human
           equivalents, TRAPP II has the additional three
           subunits,Trs65p, Trs120p and Trs130p. While it has been
           implicated in cell wall biogenesis and stress response,
           the role of Trs65 in TRAPP II is supported by the
           findings that the protein co-localises with Trs130p, and
           deletion of TRS65 in yeast leads to a conditional lethal
           phenotype if either one of the other TRAPP II-specific
           subunits is modified. Furthermore, the trs65 mutant has
           reduced Ypt31/32p guanine nucleotide exchange, GEF,
           activity.
          Length = 271

 Score = 29.8 bits (67), Expect = 4.8
 Identities = 21/75 (28%), Positives = 36/75 (48%)

Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS 553
            K+L E +  +     ++    D + PP  S      +  +  + +P++S AP   PSSS
Sbjct: 62  VKILVESENCSPRIETSWETNVDFSLPPPISTALRRSNRPSSLSISPSTSSAPSVGPSSS 121

Query: 554 KTILPATPSSSKASR 568
           +T L A  S+SK + 
Sbjct: 122 RTSLRAANSNSKLNG 136


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 29.8 bits (66), Expect = 5.2
 Identities = 10/60 (16%), Positives = 24/60 (40%)

Query: 528 TTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSG 587
             P+ + ++  T   +     T +  K    AT  + K +     P +    ++ +++SG
Sbjct: 148 VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSG 207


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 29.8 bits (67), Expect = 5.3
 Identities = 23/111 (20%), Positives = 40/111 (36%), Gaps = 10/111 (9%)

Query: 494 AKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSK-----APPT 548
            K+ +E Q P+   N      C    PP     +   S  A   +  +  K     + P 
Sbjct: 85  TKIKREPQSPSKELNS----SCSQKQPPYPYGEKCLYSYSAYDRKPASGFKPPTPPSTPC 140

Query: 549 TPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDSQQLQTITAE 599
           +P + +  +     S   S  S+PP+  TP  ++S      S    +  +E
Sbjct: 141 SPVNPQETVRQLQPSGPLS-NSSPPSPHTPLPNQSPLPPPMSSPDSSYPSE 190


>gnl|CDD|215362 PLN02676, PLN02676, polyamine oxidase.
          Length = 487

 Score = 29.7 bits (67), Expect = 5.6
 Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 171 YEYRLYCVNNYFLFTIFTPSNIVLPILTLNKVTRQVS 207
           YE  +Y +   FL T      I  P L LNKV R++S
Sbjct: 223 YESLVYYLAEQFLST--KSGKITDPRLKLNKVVREIS 257


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 29.8 bits (67), Expect = 5.7
 Identities = 14/78 (17%), Positives = 20/78 (25%), Gaps = 4/78 (5%)

Query: 516 DMTP--PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
           DMTP         +       RQ+R    +     +P  S  +      S   S      
Sbjct: 38  DMTPDLKEQLKDEKRRRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPS--VLDL 95

Query: 574 AITTPCSSKSLRSGSDSQ 591
           +          R  S S 
Sbjct: 96  SPVLASPQTGKRRRSSSP 113



 Score = 29.0 bits (65), Expect = 9.0
 Identities = 18/81 (22%), Positives = 28/81 (34%), Gaps = 10/81 (12%)

Query: 527 QTTPSSKARQA-----RTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT----- 576
           Q+  S +A  A     R+P  S     +PS S   L    +S +  ++    + +     
Sbjct: 60  QSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSSPSDDEDE 119

Query: 577 TPCSSKSLRSGSDSQQLQTIT 597
               SK  RS S S       
Sbjct: 120 AERPSKRPRSDSISSSSSPAK 140


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 29.5 bits (66), Expect = 5.8
 Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 6/119 (5%)

Query: 477 HKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSK--- 533
                +   +    + S + L      T+  +R   +      PPSSS  + T SS    
Sbjct: 5   VSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLN-APASPPSSSPARNTSSSSSFG 63

Query: 534 --ARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITTPCSSKSLRSGSDS 590
              ++  + +  +      S S+    A  S + +   ++    ++P  S+   S   S
Sbjct: 64  LSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLS 122


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 29.9 bits (66), Expect = 5.9
 Identities = 29/144 (20%), Positives = 48/144 (33%), Gaps = 8/144 (5%)

Query: 444 EKSPGTDYCDSDSNELPPGPLRHNNVFDYAYSYHKHSVSMELSDLAIAESAKLLKEEQQP 503
           E+   +   DSDS E  P                K S        A A + +  ++   P
Sbjct: 21  EEDSESSSEDSDSEEEMPAAKNPPQA--------KPSGKSPQVKAASAPAKESPQKGAPP 72

Query: 504 TTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSS 563
            TPG    +          SS+ ++    +   A T T+S A       +  + PA+  +
Sbjct: 73  VTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTVT 132

Query: 564 SKASRKSAPPAITTPCSSKSLRSG 587
              S K A         S +++ G
Sbjct: 133 PGPSGKGANLPCPQKAGSAAVQVG 156



 Score = 29.5 bits (65), Expect = 8.1
 Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 12/81 (14%)

Query: 514 PCDMTP---PPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKS 570
           P  MT     P+    QT  S +     TPTS+K PP            TP+  KA   +
Sbjct: 233 PAAMTAAQAKPALKTPQTKASPRKGTPITPTSAKVPPVRVG--------TPAPRKAGAVT 284

Query: 571 APPAITTPCSSK-SLRSGSDS 590
           +P   ++P  ++ + R   DS
Sbjct: 285 SPACASSPALARGTQRPDEDS 305


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 29.1 bits (65), Expect = 6.1
 Identities = 10/67 (14%), Positives = 18/67 (26%), Gaps = 3/67 (4%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT- 576
            PP   ++        A  +  P     PP   + +   +P    + K  +    P    
Sbjct: 69  QPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAG--VPVAAQTPKPVKPPKQPPAGA 126

Query: 577 TPCSSKS 583
            P     
Sbjct: 127 VPAKPTP 133


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 29.0 bits (65), Expect = 6.2
 Identities = 17/77 (22%), Positives = 28/77 (36%), Gaps = 3/77 (3%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAI-TT 577
           P  +    +T P           S  A   TP   K + P    ++    K+  P    T
Sbjct: 63  PRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQ-KLLAPRPLFTAAREVKTVVPVSSVT 121

Query: 578 PCS-SKSLRSGSDSQQL 593
           P +  + LR+G  + +L
Sbjct: 122 PVTPPRPLRTGEQTARL 138


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 29.9 bits (67), Expect = 6.3
 Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 3/68 (4%)

Query: 518 TPPPSSSKGQTTPSSKARQARTPTSSKA---PPTTPSSSKTILPATPSSSKASRKSAPPA 574
           T P +++     P     +  +P +      PPT+PS +  +    PSS+    K    A
Sbjct: 477 TAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSA 536

Query: 575 ITTPCSSK 582
             T  + +
Sbjct: 537 PPTALADE 544


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 29.5 bits (66), Expect = 6.4
 Identities = 10/54 (18%), Positives = 18/54 (33%)

Query: 519 PPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAP 572
           P P S      P   A+ A   T + AP      ++      P   +  + ++ 
Sbjct: 19  PAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASL 72


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.5 bits (67), Expect = 6.5
 Identities = 13/78 (16%), Positives = 19/78 (24%), Gaps = 6/78 (7%)

Query: 493 SAKLLKEEQQPTTPGNRTYSVPCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSS 552
                K    P          P       +       P   A QA +   ++ P  +   
Sbjct: 524 PPPPQKSPPPPAPTP------PLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPP 577

Query: 553 SKTILPATPSSSKASRKS 570
                  TPS +K S   
Sbjct: 578 PPIPEEPTPSPTKDSSPE 595


>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
           transduction mechanisms].
          Length = 612

 Score = 29.5 bits (66), Expect = 6.7
 Identities = 16/74 (21%), Positives = 31/74 (41%), Gaps = 2/74 (2%)

Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
           ++  PS       P S++  ++ PTS  +P   P  S      +   ++ +  +  P + 
Sbjct: 175 LSSQPSRKPTSENPKSESDNSKLPTSVNSP--LPDKSLLKRTLSNFWAERNSYNWKPLVY 232

Query: 577 TPCSSKSLRSGSDS 590
             C S+ + S SD 
Sbjct: 233 PSCPSEHIFSDSDV 246


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 29.6 bits (66), Expect = 7.1
 Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 2/52 (3%)

Query: 526 GQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPPAITT 577
           G+   +  +R  R  T+   P +   ++   +PATPSS K+ + S+PP+I +
Sbjct: 86  GRNKATKPSR--RNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPS 135


>gnl|CDD|240626 cd05301, GDH, D-glycerate dehydrogenase/hydroxypyruvate reductase
           (GDH).  D-glycerate dehydrogenase (GDH, also known as
           hydroxypyruvate reductase, HPR) catalyzes the reversible
           reaction of (R)-glycerate + NAD+ to hydroxypyruvate +
           NADH + H+. In humans, HPR deficiency causes primary
           hyperoxaluria type 2, characterized by over-excretion of
           L-glycerate and oxalate in the urine, possibly due to an
           imbalance in competition with L-lactate dehydrogenase,
           another formate dehydrogenase (FDH)-like enzyme. GDH,
           like FDH and other members of the D-specific hydroxyacid
           dehydrogenase family that also includes L-alanine
           dehydrogenase and S-adenosylhomocysteine hydrolase,
           typically have a characteristic arrangement of 2 similar
           subdomains of the alpha/beta Rossmann-fold NAD+ binding
           form, despite often low sequence identity. The NAD+
           binding domain is inserted within the linear sequence of
           the mostly N-terminal catalytic domain, which has a
           similar domain structure to the internal NAD binding
           domain. Structurally, these domains are connected by
           extended alpha helices and create a cleft in which NAD
           is bound, primarily to the C-terminal portion of the 2nd
           (internal) domain. Some related proteins have similar
           structural subdomain but with a tandem arrangement of
           the catalytic and NAD-binding subdomains in the linear
           sequence. While many members of this family are dimeric,
           alanine DH is hexameric and phosphoglycerate DH is
           tetrameric.
          Length = 309

 Score = 29.3 bits (67), Expect = 7.7
 Identities = 12/34 (35%), Positives = 15/34 (44%), Gaps = 3/34 (8%)

Query: 31  KRLRQDEEALAAERKPLSELITR---VSLSSPGS 61
            R  + EE L A    L EL+     VSL  P +
Sbjct: 176 SRKPEAEEELGARYVSLDELLAESDFVSLHCPLT 209


>gnl|CDD|227725 COG5438, COG5438, Predicted multitransmembrane protein [Function
           unknown].
          Length = 385

 Score = 29.2 bits (66), Expect = 7.7
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 11/71 (15%)

Query: 275 YLSTNANIVPKQQL---QLIGV-AALM-----IAAKVEEIYP--PKVSDYAYVTDGACSS 323
           YLS N  I          +IG   A+M     IA+ + E++   P ++       G    
Sbjct: 244 YLSKNVGINFSGLFLAGVIIGSLGAIMDVAISIASSINELHEQNPNITVKELFKSGMNIG 303

Query: 324 QDILSTEMNIL 334
           QDI+ T  N L
Sbjct: 304 QDIMGTMTNTL 314


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 28.9 bits (65), Expect = 7.8
 Identities = 20/125 (16%), Positives = 43/125 (34%), Gaps = 9/125 (7%)

Query: 473 AYSYHKHSVSMELSDLAIAESAKLLKEEQQPTTPGNRTYSVPCD----MTPPPSSSKGQT 528
           ++S ++     +L+   + +  + L+ + +       +               S     T
Sbjct: 34  SFSTNRGLEDTDLTLENLRKFEEELQSQLEKRVRTPASIKTSKRLIEVPEAEESLLDSYT 93

Query: 529 TPSSKARQARTPTSSKAP--PTTPSSSKTI---LPATPSSSKASRKSAPPAITTPCSSKS 583
           TPS K    R  ++ + P    + S+S      L  +P+S   S   + P+  +   S  
Sbjct: 94  TPSDKGGMLRILSTPELPKRKRSFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSR 153

Query: 584 LRSGS 588
              G 
Sbjct: 154 SNPGE 158


>gnl|CDD|236886 PRK11249, katE, hydroperoxidase II; Provisional.
          Length = 752

 Score = 29.2 bits (66), Expect = 8.2
 Identities = 10/34 (29%), Positives = 13/34 (38%)

Query: 516 DMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTT 549
            + P   S +    P+    Q   P S KAP T 
Sbjct: 25  SLAPEDGSHRPAPEPTPPGAQPTAPGSLKAPDTR 58


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 29.1 bits (65), Expect = 9.2
 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 517 MTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSS--KTILPATPSSSKASRKSAPPA 574
           +     + +G  + +S+A  + +P S++ P TTP+ +  K       S  K+    +PPA
Sbjct: 19  LCAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA 78


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 29.2 bits (65), Expect = 9.3
 Identities = 15/62 (24%), Positives = 21/62 (33%)

Query: 514 PCDMTPPPSSSKGQTTPSSKARQARTPTSSKAPPTTPSSSKTILPATPSSSKASRKSAPP 573
             +  P PS +  +  P   AR         AP   P+ +     A P SS  S     P
Sbjct: 211 AEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVTP 270

Query: 574 AI 575
            +
Sbjct: 271 LV 272


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 29.0 bits (65), Expect = 9.7
 Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 3/56 (5%)

Query: 523 SSKGQTTPSSKARQARTPTSSK--APPTTPSSSKTILPATPSSSKASRKSAPPAIT 576
           S+  +   +      R P   +   P    SS    +P+ P  S    K+  PA+T
Sbjct: 99  SAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIP-VSTPEVKAPAPALT 153


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.132    0.395 

Gapped
Lambda     K      H
   0.267   0.0783    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,437,736
Number of extensions: 3390375
Number of successful extensions: 4260
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3834
Number of HSP's successfully gapped: 254
Length of query: 712
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 608
Effective length of database: 6,324,786
Effective search space: 3845469888
Effective search space used: 3845469888
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (27.9 bits)