RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6664
         (624 letters)



>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score =  384 bits (987), Expect = e-128
 Identities = 164/446 (36%), Positives = 240/446 (53%), Gaps = 41/446 (9%)

Query: 35  SELVCRVKYNNTLPDIPFDCKFIAYPF-ESTRFIEYKSTSLERNYKYEVLT--EHDLGVT 91
            + + +VKY N LPD PF  K I YPF +  RF     TSL R  K  VL   +HDLG+ 
Sbjct: 1   QDYIAKVKYTNNLPDPPFPPKLITYPFPDENRFSPQLITSLYRKTKVTVLIQLDHDLGMP 60

Query: 92  IDLIN---------KDTYAPNYNIQLDPADEKLLEEDILTPQDSKRSRHHARSVSWLRRT 142
           +DL+          K  Y  + N++LDP D  LL +         R++  ++ VS+LRRT
Sbjct: 61  LDLLKIPGLLQGDTKVYYGDDVNVKLDPEDRILLRDPGADK----RTKTDSKKVSFLRRT 116

Query: 143 EYISTEQTRFQPQVMDKVEAKVGFSIKKNFKEDNLYMDRDSQVKAIEKTFADSKLPIEKH 202
           EYISTE  RF  Q ++  E+KVG ++K+N +E++ Y+DR++Q++AIEKTF D K  I KH
Sbjct: 117 EYISTESARFAVQALELTESKVGRNVKQNLREEDEYLDREAQIEAIEKTFEDVKKEILKH 176

Query: 203 YSKPNVTPVEIMPVYPDFKYWKYPCAQVIFDSDPAPVGRPIPAQIEEMSQAMIRGVMDES 262
            SK NV  VE+ P+ PD         QV FD D AP G+   AQ+ + S   I G ++  
Sbjct: 177 PSKKNVKAVEVWPLLPDTASMDQQFFQVRFDGDAAPDGKEKHAQLTQASSTAIFGPVELE 236

Query: 263 GEQFVAYFLP-------TAETLDKRKLDQLSNLEYTDEQEYEYKMAREYNWNVKSKSCKG 315
            E+FV+Y+LP         +TL+KR  D     +Y +++EY++K  REY+  VKSK+ K 
Sbjct: 237 EEEFVSYYLPDEKDSEILEDTLEKRSDDL---HDYDEDEEYKFKRVREYDMKVKSKATKL 293

Query: 316 YEENYFLIVRD-DAVYYNELETRVRLSKRR-----QKVGAAPNNTRLVVTHRPLNATEFR 369
            E   F +  +   VYY  L +RV L +RR     + +    NN +L V  R  +  E +
Sbjct: 294 NELALFFVSDENGVVYYKPLRSRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTKESK 353

Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           ++  R   L+P + EE +EDE+EEEE+  +         E +E EG + E  G    ++ 
Sbjct: 354 MRDKRRARLDPIDFEEVDEDEDEEEEQRSD---------EHEEEEGEDSEEEGSQSREDG 404

Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKS 455
              S ++  S + S+A   S S S S
Sbjct: 405 SSESSSDVGSDSESKADKESASDSDS 430


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 67.9 bits (166), Expect = 1e-11
 Identities = 47/261 (18%), Positives = 81/261 (31%), Gaps = 16/261 (6%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
             P  +  E                    A                   +    +    S
Sbjct: 128 PSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS 187

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
           S  +    S   + +  + P + S   +++S  +    RSA  D  A S  S+ S+ S  
Sbjct: 188 SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC 247

Query: 499 ---------------SAASNRSRSGSAASNK-SRSDRSRSGSAGSDRSAASNKSVRSDKS 542
                               R    S  +   SR   + S S+  +RS + + S      
Sbjct: 248 GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307

Query: 543 AASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRS 602
           A S+  A S  S    S+ S + S+S  +   + +   S ++S S SR    +   S R 
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367

Query: 603 RSASKSGSSAASSRSGSPAGS 623
           R       S+ ++ +G P   
Sbjct: 368 RPRPSRAPSSPAASAGRPTRR 388



 Score = 60.2 bits (146), Expect = 3e-09
 Identities = 46/201 (22%), Positives = 79/201 (39%), Gaps = 11/201 (5%)

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
           + A  ++  S  ASSR  +   S          S  +     +  +A S R  +      
Sbjct: 155 AGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPIS 214

Query: 493 SDRSDKSAASNRS---RSGSAASNKSRSDRSRSGSAGSDR------SAASNKSVRSDKSA 543
           +  S  + A  RS    +G+++S+ S S+ S  G    +       +  +  +   + S 
Sbjct: 215 ASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274

Query: 544 ASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSR 603
            +  S+R   +  S+S R RS S S  +  +  A +     S S S SR  S + S  S 
Sbjct: 275 WNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA-SSSSSSSRESSSS-STSSS 332

Query: 604 SASKSGSSAASSRSGSPAGSD 624
           S S  G++ +   S S + S 
Sbjct: 333 SESSRGAAVSPGPSPSRSPSP 353



 Score = 59.8 bits (145), Expect = 3e-09
 Identities = 43/211 (20%), Positives = 72/211 (34%), Gaps = 16/211 (7%)

Query: 414 EGGEREGGGEGEN-DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGS 472
           E      G E E                 +S  +  S     + S +       + S  S
Sbjct: 243 ESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302

Query: 473 NKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAA 532
             S  +  S R++ S  S+R   S  +++S+ S  G+A S      RS S S     +  
Sbjct: 303 PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362

Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
           S+      K    +R+  S  +      R R+R+A        A   R  + +G     R
Sbjct: 363 SSPR----KRPRPSRAPSSPAASAGRPTRRRARAA-------VAGRARRRDATGRFPAGR 411

Query: 593 SVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
                R +   + + SG+  A     +P+G 
Sbjct: 412 P----RPSPLDAGAASGAFYARYPLLTPSGE 438



 Score = 43.6 bits (103), Expect = 3e-04
 Identities = 44/228 (19%), Positives = 64/228 (28%), Gaps = 13/228 (5%)

Query: 361 RPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
            P  +                +       E E            T   E     G     
Sbjct: 223 APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRP 282

Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
           G    + + +  S +   S   S  +  S   S S S ++ S S S SS     S R A 
Sbjct: 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE--SSRGAA 340

Query: 481 SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSD 540
                   RS    R           S       SR+  S + SAG         + R  
Sbjct: 341 VSPGPSPSRSPSPSR---PPPPADPSSPRKRPRPSRAPSSPAASAGR-------PTRRRA 390

Query: 541 KSAASNRSARSDGSDKSASNRSRSR-SASSQASNNSAASNRSSNKSGS 587
           ++A + R+ R D + +  + R R     +  AS    A       SG 
Sbjct: 391 RAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGE 438



 Score = 34.4 bits (79), Expect = 0.22
 Identities = 18/94 (19%), Positives = 36/94 (38%), Gaps = 1/94 (1%)

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASS-RGSNKSDRSAKSDRSAKSDRSA 491
           S +   S +     +   S  K   P++   S +AS+ R + +  R+A + R+ + D + 
Sbjct: 346 SPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATG 405

Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSA 525
           R        +   + + S A        + SG  
Sbjct: 406 RFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP 439



 Score = 34.0 bits (78), Expect = 0.27
 Identities = 26/152 (17%), Positives = 45/152 (29%), Gaps = 5/152 (3%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
              P+    +  E     E  E Q+G  +    +      R G            +    
Sbjct: 758 FSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLR-----RSGPAADAA 812

Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
           S   S R S        S+S       G+A+     +S  S+KS  +A   R+   +   
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRR 872

Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
           +           AA+    +  +    A + R
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPR 904



 Score = 31.7 bits (72), Expect = 1.5
 Identities = 27/144 (18%), Positives = 48/144 (33%), Gaps = 4/144 (2%)

Query: 485 AKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
             S R AR+  S          + S    K     +    A   R A S+  VR++ +  
Sbjct: 738 RYSPRRARARASAWDITDALFSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFR 797

Query: 545 SNRSARSDGSDKSASNR----SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSN 600
                R  G    A++R     +SRS +    + S+   R    +     +RS   ++S 
Sbjct: 798 RPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSK 857

Query: 601 RSRSASKSGSSAASSRSGSPAGSD 624
            + +  ++       R   P    
Sbjct: 858 PAAAGGRARGKNGRRRPRPPEPRA 881



 Score = 31.7 bits (72), Expect = 1.7
 Identities = 24/158 (15%), Positives = 52/158 (32%), Gaps = 4/158 (2%)

Query: 442 SSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAA 501
            +RAS+   + +   +P+    +  A +    +     +   S+   R+  + R      
Sbjct: 745 RARASAWDITDALFSNPSLVP-AKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLR 803

Query: 502 SNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNR 561
            +   +  AAS  +   +SRS +      ++     R   +AA    ARS  S KS    
Sbjct: 804 RSGP-AADAASRTASKRKSRSHTPDGGSESSG--PARPPGAAARPPPARSSESSKSKPAA 860

Query: 562 SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
           +  R+               +    +     + +   +
Sbjct: 861 AGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPA 898


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 66.1 bits (162), Expect = 3e-11
 Identities = 39/214 (18%), Positives = 66/214 (30%), Gaps = 7/214 (3%)

Query: 390 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
                  A   +      A RQ  +        + E     R + A  +   S+  ++++
Sbjct: 68  ATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA 127

Query: 450 RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
           R R +    A+R  +  A   G    ++ A   R+  ++R+   +R ++    +R    +
Sbjct: 128 RERRERGEAARRGAARKAGEGG----EQPATEARADAAERTEEEERDERRRRGDREDRQA 183

Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASS 569
            A    R  R   G  G DR        R        R  R  G  +    R   R A  
Sbjct: 184 EAERGERGRREERGRDGDDRDRRDR---REQGDRREERGRRDGGDRRGRRRRRDRRDARG 240

Query: 570 QASNNSAASNRSSNKSGSRSRSRSVSRARSNRSR 603
             +          +  G   R     R R  R R
Sbjct: 241 DDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGR 274



 Score = 54.5 bits (132), Expect = 1e-07
 Identities = 24/195 (12%), Positives = 59/195 (30%), Gaps = 2/195 (1%)

Query: 409 ERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
           E + G                +R + A  +++ + + ++ + +     +PA R+ + +A+
Sbjct: 57  EARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAA 116

Query: 469 SRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSD 528
              S      A+  R      +AR   + K+       +  A ++ +             
Sbjct: 117 EAASAPEAAQARERRER--GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRR 174

Query: 529 RSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSR 588
           R    ++   +++     R  R    D       R +    +        +R   +    
Sbjct: 175 RGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRD 234

Query: 589 SRSRSVSRARSNRSR 603
            R       R +R  
Sbjct: 235 RRDARGDDNREDRGD 249



 Score = 53.4 bits (129), Expect = 2e-07
 Identities = 30/151 (19%), Positives = 41/151 (27%), Gaps = 3/151 (1%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
               +  E  +  E        + GE  E    E      E   E E D  +R  D E  
Sbjct: 122 PEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDR 181

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
              + R     R                   +G  + +R  + D   +  R  R DR D 
Sbjct: 182 QAEAERGERGRREE--RGRDGDDRDRRDRREQGDRREERG-RRDGGDRRGRRRRRDRRDA 238

Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
               NR   G    +       R G    DR
Sbjct: 239 RGDDNREDRGDRDGDDGEGRGGRRGRRFRDR 269



 Score = 50.3 bits (121), Expect = 2e-06
 Identities = 36/187 (19%), Positives = 48/187 (25%), Gaps = 12/187 (6%)

Query: 349 AAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 408
           AA              A +   +    +    P          E     E  Q  E  E 
Sbjct: 74  AAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRER 133

Query: 409 ERQEGEGGEREGGGEGE-NDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSA 467
                 G  R+ G  GE    E R   AE + +       R   R   ++ A+R   G  
Sbjct: 134 GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRR 193

Query: 468 SSRGSNKSDRSAKSDRSAK-----------SDRSARSDRSDKSAASNRSRSGSAASNKSR 516
             RG +  DR  +  R               DR  R  R D+  A               
Sbjct: 194 EERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGD 253

Query: 517 SDRSRSG 523
               R G
Sbjct: 254 DGEGRGG 260



 Score = 49.9 bits (120), Expect = 4e-06
 Identities = 21/165 (12%), Positives = 39/165 (23%), Gaps = 5/165 (3%)

Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
                R +R          +  E  ++   E  A+  ++ E  E + +   G   +   E
Sbjct: 125 AQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE 184

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
            E     R  +      +  R   R +   +               R   +  R  +  R
Sbjct: 185 AERGERGRREERGRDGDDRDRRDRREQGDRRE-----ERGRRDGGDRRGRRRRRDRRDAR 239

Query: 484 SAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSD 528
              +         D        R         R  R   G    +
Sbjct: 240 GDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNERE 284


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 52.7 bits (127), Expect = 6e-07
 Identities = 38/252 (15%), Positives = 84/252 (33%), Gaps = 29/252 (11%)

Query: 370  IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
            + +  E   E    EE+E  +E+  +   + +  +  + + ++ E              +
Sbjct: 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKE-------------KK 1180

Query: 430  KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
            K+ S A+ S K S   +S+     + +    +  +  ++S GS++ D   +  +  KS  
Sbjct: 1181 KKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV 1240

Query: 490  SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSA 549
                 + + S+ S      S  +++  SD         +     +    S    +     
Sbjct: 1241 KRLKSKKNNSSKS------SEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294

Query: 550  RSDGSDKSASNRSRSR----------SASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
             S+G  K +S   +                + S    A  + S     ++ +   SR   
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLR 1354

Query: 600  NRSRSASKSGSS 611
               +  S S S 
Sbjct: 1355 RPRKKKSDSSSE 1366



 Score = 38.9 bits (91), Expect = 0.009
 Identities = 28/219 (12%), Positives = 72/219 (32%), Gaps = 7/219 (3%)

Query: 339  RLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
            +L K + K              +  +      +   +   +  ++ + ++      ++ +
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQED 1227

Query: 399  ETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSP 458
            + +Q    +    +    ++    +   DN++ +SD          A  R  +   S  P
Sbjct: 1228 DEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPP 1287

Query: 459  AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSD 518
              +   G      SN   + +   +     R   S          +S   +A   KS++ 
Sbjct: 1288 PSKRPDGE-----SNGGSKPSSPTKKKVKKRLEGS--LAALKKKKKSEKKTARKKKSKTR 1340

Query: 519  RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
              ++ ++ S R     +  +SD S+  +  +  D S+  
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDE 1379


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 49.9 bits (119), Expect = 3e-06
 Identities = 33/191 (17%), Positives = 71/191 (37%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E   EEE+E+++ +EE++ ++ +  E  +  + + E  E+    E E + EK+  +    
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
            +   R   R++SR K     K             +   + ++ +    +     +R  +
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220

Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
                   + ++   +  S +S   S  S  S        S  S  +  S++   +    
Sbjct: 221 EDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRT 280

Query: 559 SNRSRSRSASS 569
           S R  S   +S
Sbjct: 281 SLRPPSARPAS 291



 Score = 29.1 bits (65), Expect = 8.7
 Identities = 21/182 (11%), Positives = 54/182 (29%), Gaps = 7/182 (3%)

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
           SK SS  + +   +  SK PA +      +    N+S +  + ++    +   +     K
Sbjct: 69  SKLSSDEAVKRVEKGGSKGPAAK---TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125

Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
               +R     A   +   ++ +      +      +  + ++  A +R  +        
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185

Query: 559 SNRSRSRSAS----SQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
             +           ++ +          N+   +       R  +       +S  S+  
Sbjct: 186 KKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEI 245

Query: 615 SR 616
           SR
Sbjct: 246 SR 247


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 48.3 bits (115), Expect = 8e-06
 Identities = 40/201 (19%), Positives = 66/201 (32%), Gaps = 10/201 (4%)

Query: 423 EGENDNEKRTSDAEHSSKNSSRAS--SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
              +D   +    + +S    R S  S S+S S   SP        + S   N +DR+  
Sbjct: 122 AFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSE-LNLTDRTPA 180

Query: 481 SDRSAKS----DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS 536
             RS++S       +       S+   RS  GS     S S R +     + R   S  S
Sbjct: 181 KVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSS 240

Query: 537 VRSDKSAASNRSARS-DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
                  A + SA+S   S             S +AS     +++SS    +  +     
Sbjct: 241 DCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKD--TSKSSKSEVAPPKKSEAK 298

Query: 596 RARSNRSRSASKSGSSAASSR 616
              S++  +       +  S 
Sbjct: 299 VPSSSKKWTDGNVSWDSLPSS 319



 Score = 37.5 bits (87), Expect = 0.020
 Identities = 41/190 (21%), Positives = 65/190 (34%), Gaps = 24/190 (12%)

Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD 494
           A  S    S A  +   R K+ S  +R    S+S   S  S  +    R         +D
Sbjct: 117 AADSLAFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLTD 176

Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
           R+     S+RS  G+ + +   S  S SG   S        S R   SA+  +      +
Sbjct: 177 RTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIG-----SRRLRGSASLRKKVAVLSA 231

Query: 555 DKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
            +   +RS                  S  KS  R+RS S +++    S     + + +  
Sbjct: 232 PRKPGSRS------------------SDCKSSPRARS-SSAKSPFKSSIQRKATKALSKL 272

Query: 615 SRSGSPAGSD 624
           S   SP  + 
Sbjct: 273 SLRASPKDTS 282


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 44.5 bits (105), Expect = 2e-04
 Identities = 31/110 (28%), Positives = 44/110 (40%), Gaps = 9/110 (8%)

Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS---DRSA 479
           E + + EK        S    R  SR RSR + +    R RS    SR  ++     RS 
Sbjct: 4   EPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSP 63

Query: 480 KSDRSAKSDRS-ARSDRSDKSAASN-----RSRSGSAASNKSRSDRSRSG 523
           +S R +   RS  R  R  +S  S      R R  S ++   + D+ RS 
Sbjct: 64  RSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSL 113



 Score = 41.8 bits (98), Expect = 0.001
 Identities = 32/117 (27%), Positives = 42/117 (35%), Gaps = 6/117 (5%)

Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
           D E      +   ++  R+S R R RS+ +S   R R     SR  +  + S   DR   
Sbjct: 2   DEEPDREREKSRGRDRDRSSERPRRRSRDRS---RFRDRHRRSRERSYREDSRPRDRRRY 58

Query: 487 SDRSARSDRSDKSAAS---NRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSD 540
             RS RS R      S    R RS S  S +    R R  S  +       K    D
Sbjct: 59  DSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWD 115



 Score = 40.3 bits (94), Expect = 0.003
 Identities = 31/131 (23%), Positives = 39/131 (29%), Gaps = 19/131 (14%)

Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNR 547
           D     +R        R R    +S + R  RSR  S   DR   S +    + S   +R
Sbjct: 2   DEEPDREREKS-----RGRDRDRSSERPR-RRSRDRSRFRDRHRRSRERSYREDSRPRDR 55

Query: 548 SARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK 607
                 S +S    S  RS             R   +  SRS        R  R RS S 
Sbjct: 56  RRYDSRSPRSLRYSSVRRS-------------RDRPRRRSRSVRSIEQHRRRLRDRSPSN 102

Query: 608 SGSSAASSRSG 618
                   RS 
Sbjct: 103 QWRKDDKKRSL 113



 Score = 31.8 bits (72), Expect = 1.1
 Identities = 27/112 (24%), Positives = 33/112 (29%), Gaps = 5/112 (4%)

Query: 470 RGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
               +  R    DRS++  R    DRS       RSR  S         R R       R
Sbjct: 6   DREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERS----YREDSRPRDRRRYDSR 61

Query: 530 SAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
           S  S +     +S    R  RS        +R R R  S           RS
Sbjct: 62  SPRSLRYSSVRRSRDRPRR-RSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRS 112



 Score = 31.4 bits (71), Expect = 1.5
 Identities = 21/113 (18%), Positives = 35/113 (30%), Gaps = 4/113 (3%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
           R+   +   E+    D +   E      +  +   +R                D  +  S
Sbjct: 1   RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60

Query: 434 DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
            +  S + SS   SR R R +S+S     RS     R       S +  +  K
Sbjct: 61  RSPRSLRYSSVRRSRDRPRRRSRS----VRSIEQHRRRLRDRSPSNQWRKDDK 109



 Score = 30.2 bits (68), Expect = 3.6
 Identities = 27/119 (22%), Positives = 37/119 (31%), Gaps = 13/119 (10%)

Query: 448 RSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRS 507
               R + KS   R R    SS    +  R     R        RS R D     +R R 
Sbjct: 3   EEPDREREKS---RGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYRED-----SRPR- 53

Query: 508 GSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRS 566
                 +    RS      S    + ++  R  +S  S    R    D+S SN+ R   
Sbjct: 54  ----DRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDD 108


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 43.2 bits (102), Expect = 4e-04
 Identities = 27/76 (35%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
           LE  +   E +DEEEE+EE EE         E  E EG  +E   + E + +   S  E 
Sbjct: 379 LEIEDANTERDDEEEEDEEEEE---------EEDEDEGPSKEHSDDEEFEEDDVESKYED 429

Query: 438 SSKNSSRASSRSRSRS 453
           S  NSS A      RS
Sbjct: 430 SDGNSSLAVGYKNDRS 445


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 43.0 bits (101), Expect = 6e-04
 Identities = 39/181 (21%), Positives = 64/181 (35%), Gaps = 6/181 (3%)

Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
           HS   +S  SS + +     SPA  S S + SS  S K           K         S
Sbjct: 94  HSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSS 153

Query: 497 DKSAASNRSRSGSAASNKSRSD-RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
                     + +  S+    + RSR         +  ++S++        + + SD S+
Sbjct: 154 THGTHPPIVFTDNNGSHAGAPNARSRKEI-----PSLGSQSMQLPSPHFRQKFSSSDTSN 208

Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASS 615
             +    R  S  S  S  S   + ++      S S   S   SN + S+S S + + SS
Sbjct: 209 GFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268

Query: 616 R 616
           +
Sbjct: 269 K 269



 Score = 39.5 bits (92), Expect = 0.006
 Identities = 48/196 (24%), Positives = 69/196 (35%), Gaps = 9/196 (4%)

Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
           H   +S  +S +   R  S  P   S S ++S+   N +D    S  S     +  S +S
Sbjct: 70  HQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQS 129

Query: 497 DKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDK 556
            K +            N      S  G+        +N S     +A S +   S GS  
Sbjct: 130 RKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQS 189

Query: 557 SA--SNRSRSRSASSQASNNSAASN------RSSNKSGSRSR-SRSVSRARSNRSRSASK 607
               S   R + +SS  SN  +  +       SSN   S    S +V   R + S  AS 
Sbjct: 190 MQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASL 249

Query: 608 SGSSAASSRSGSPAGS 623
             S+   S S S A S
Sbjct: 250 ISSNITPSSSNSEAMS 265


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 42.5 bits (100), Expect = 6e-04
 Identities = 26/196 (13%), Positives = 67/196 (34%), Gaps = 2/196 (1%)

Query: 419 EGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRS 478
               + +  N+  TS    S       ++ + ++  +        S ++ S         
Sbjct: 38  TTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK 97

Query: 479 AKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVR 538
                +     +      + S  +      +  S+ S  ++ R+    ++ S  ++ S  
Sbjct: 98  NLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDS-S 156

Query: 539 SDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRAR 598
                 +  S +    ++ A + + ++ ++S    NS       N+S S+  S   +  +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPT-QPNQSNSQPASDDTANQK 215

Query: 599 SNRSRSASKSGSSAAS 614
           S+   + S S S+  S
Sbjct: 216 SSSKDNQSMSDSALDS 231



 Score = 39.4 bits (92), Expect = 0.005
 Identities = 26/184 (14%), Positives = 58/184 (31%)

Query: 432 TSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSA 491
            +D       +   S  S+  +  ++    S   +  +  +N S++     + +  D S 
Sbjct: 26  YADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSST 85

Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
               +            +     +++    + S  +      N +           S +S
Sbjct: 86  SDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKS 145

Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
                  S+ S      +Q+S    A N+ +  S +   S S  +  S +    ++S S 
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQ 205

Query: 612 AASS 615
            AS 
Sbjct: 206 PASD 209



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 26/179 (14%), Positives = 70/179 (39%), Gaps = 14/179 (7%)

Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
           S+   +   SK       S   SS+ ++K+D +  S++     + +  D S   + +   
Sbjct: 34  STAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIID 93

Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
                        ++      +      N S+ +      N ++     ++  +      
Sbjct: 94  ------FIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRN------ 141

Query: 566 SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
             S +++N+S  ++ SS K+ + ++S    +A + ++ S++ +  S ++ +  SP  + 
Sbjct: 142 --SEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQ 198


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 42.0 bits (99), Expect = 6e-04
 Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 7/78 (8%)

Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
             SR+SSRS SRS        SR  S  S  +N  +R++ S     +   A   +     
Sbjct: 154 GRSRSSSRSSSRS-------NSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAALAKLGFGK 206

Query: 501 ASNRSRSGSAASNKSRSD 518
             + S+  S  + KS ++
Sbjct: 207 QKSSSKKPSRVTKKSAAE 224



 Score = 35.5 bits (82), Expect = 0.077
 Identities = 17/44 (38%), Positives = 22/44 (50%)

Query: 570 QASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
                S +S+RSS++S SR  SR  SR  S    S+S     AA
Sbjct: 151 GFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAA 194



 Score = 33.9 bits (78), Expect = 0.23
 Identities = 18/64 (28%), Positives = 30/64 (46%)

Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
           SS++SSR++SR  SR  S++ ++   S S     +      AK     +   S +  R  
Sbjct: 159 SSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAALAKLGFGKQKSSSKKPSRVT 218

Query: 498 KSAA 501
           K +A
Sbjct: 219 KKSA 222



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 586 GSRSRSRSVSRARS-NRSRSASKSGSSAASSRSGSPAGSD 624
           G R RSRS SR+ S + SR  S+  S   S    S +  D
Sbjct: 151 GFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDD 190



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 10/109 (9%)

Query: 502 SNRSRSGSAASNKSRS---DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
             RSRS S +S++S S    R  S +   +R+++S   + +   AA        G  K  
Sbjct: 153 RGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAA----LAKLGFGKQ- 207

Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRS--RSRSVSRARSNRSRSA 605
            + S+  S  ++ S   AA  + +     R+  +  +V++    R    
Sbjct: 208 KSSSKKPSRVTKKSAAEAAKKQLNKPRWKRTPNKGENVTQCFGPRGPGQ 256


>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
           domain. 
          Length = 379

 Score = 42.1 bits (99), Expect = 7e-04
 Identities = 33/143 (23%), Positives = 57/143 (39%), Gaps = 2/143 (1%)

Query: 462 SRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR 521
           ++S    SR  N+  R+  ++ S KSD S       +S    RSR G  A N+ +    +
Sbjct: 206 AQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQ-IQRSRLGLQA-NQGKLAHGQ 263

Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
            G +GS R    + + R      S+    ++ +  S+S   +S    +  S+ S +   S
Sbjct: 264 QGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHS 323

Query: 582 SNKSGSRSRSRSVSRARSNRSRS 604
           S+      RS       S  +  
Sbjct: 324 SSGHAVELRSIPGGSVSSQNAGP 346



 Score = 34.0 bits (78), Expect = 0.22
 Identities = 24/134 (17%), Positives = 44/134 (32%), Gaps = 8/134 (5%)

Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
             D+   +  S   S + N+   + + + S  SD S +     +S    +      + G 
Sbjct: 199 HGDEPFGAQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGK 258

Query: 555 DKSA----SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGS 610
                   S   R R  S+         + S   +   S S S      ++S     + S
Sbjct: 259 LAHGQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSC----FHQSAVRETAYS 314

Query: 611 SAASSRSGSPAGSD 624
           S ++S   S +G  
Sbjct: 315 SLSTSERHSSSGHA 328



 Score = 30.2 bits (68), Expect = 3.4
 Identities = 20/121 (16%), Positives = 38/121 (31%), Gaps = 5/121 (4%)

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSR-----SRSKSKSPAKRSRSGSASSRGSNKSDRS 478
            ++D  +       S    SR   ++         + +S + R R  S + R       S
Sbjct: 229 RKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAHGQQGRSGSIRGRKHSTTRRPFGVEPSS 288

Query: 479 AKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVR 538
           +    +  S  S+   +S     +  S S S   + S          G   S+ +   + 
Sbjct: 289 SGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAVELRSIPGGSVSSQNAGPLL 348

Query: 539 S 539
           S
Sbjct: 349 S 349


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 42.2 bits (99), Expect = 8e-04
 Identities = 49/186 (26%), Positives = 78/186 (41%), Gaps = 13/186 (6%)

Query: 436 EHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDR 495
            H  K          +   S S  K+  S +AS + S+K  R  K+D  A    S +   
Sbjct: 712 GHPYKKGVPPK---PAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRL 768

Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
            +KS  S+ S S S++ + S S++         R ++ NK      S +S  S+ S   +
Sbjct: 769 EEKS--SSCSPSSSSSHHHSSSNK-------ESRKSSRNKEEEMLPSPSSPLSSSSPKPE 819

Query: 556 KSASNRSRSR-SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
             +  R R +   SS +   SA+S +SS+KS S S+ R      S+ S+    S     +
Sbjct: 820 HPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPN 879

Query: 615 SRSGSP 620
             S  P
Sbjct: 880 KASSFP 885



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 39/198 (19%), Positives = 71/198 (35%), Gaps = 10/198 (5%)

Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRT 432
           L  R    P ++       E++  +   +Q   T +E+   +G  +    + + + +K  
Sbjct: 706 LLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKH---KNDEEADKIE 762

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSR-------GSNKSDRSAKSDRSA 485
           S  +   + SS  S  S S     S  K SR  S +          S  S  S K +  +
Sbjct: 763 SKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSSPKPEHPS 822

Query: 486 KSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
           +     + D S  S   + S + S++ + S S   ++   GS  S     S     + AS
Sbjct: 823 RKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKAS 882

Query: 546 NRSARSDGSDKSASNRSR 563
           +       +  S   R +
Sbjct: 883 SFPVPPLSNGSSKPRRPK 900



 Score = 35.7 bits (82), Expect = 0.089
 Identities = 37/188 (19%), Positives = 68/188 (36%), Gaps = 13/188 (6%)

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSK-----SKSPAKRSRSGSASSRGSNKSDRS 478
               +       + S   S ++SS+ + + K      K  +K+ R    SS  S  S  S
Sbjct: 723 PAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSS 782

Query: 479 AKSDRSAKSDRSARSDRSDK-------SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSA 531
                S K  R +  ++ ++         +S+  +    +  + R     S S+G   SA
Sbjct: 783 HHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGP-FSA 841

Query: 532 ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
           +S KS     S + +R     GS  S  ++  S    ++AS+       + +    R + 
Sbjct: 842 SSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGSSKPRRPKL 901

Query: 592 RSVSRARS 599
               R  S
Sbjct: 902 VFDDRVHS 909



 Score = 31.0 bits (70), Expect = 2.4
 Identities = 42/202 (20%), Positives = 75/202 (37%), Gaps = 3/202 (1%)

Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
           G    + ++      E   K+SS+   R R+  K     +  +   A S       R+  
Sbjct: 479 GKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPP-QRRTVG 537

Query: 481 SDRSAKSDR-SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
             +  K ++ SA  +R+     S        +S ++  DR ++ + GS + +   +   S
Sbjct: 538 KKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSS 597

Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR-SVSRAR 598
              AA  R  +S       S       +SS  S    +   SS   G+   S+ S +  R
Sbjct: 598 VPPAAEKRKYKSPSKIVPKSREFIETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLR 657

Query: 599 SNRSRSASKSGSSAASSRSGSP 620
           +   RS+  S +  +  R  SP
Sbjct: 658 TPVCRSSVGSQNDLSKDRLLSP 679



 Score = 30.7 bits (69), Expect = 3.3
 Identities = 45/209 (21%), Positives = 82/209 (39%), Gaps = 23/209 (11%)

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
           S +E S  +  +A+ +  SR+   S    +   +ASS GS+ S  S+ S+ S+ SD  + 
Sbjct: 363 SSSEDS--DEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSS--SSGSESSSGSDSESE 418

Query: 493 SDRSDKSAASNRSRSGS-----AASNKSRSDR------------SRSGSAGSDRSAASNK 535
           S  SD S  +   R+ S      ++NK + D             + S S+         K
Sbjct: 419 SSSSD-SEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEK 477

Query: 536 SVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
             +   S +       +   KS+S   R    + +   +     +S  +S +  + R+V 
Sbjct: 478 EGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVG 537

Query: 596 RARSNRSRSASKSGSSAASSRSGSPAGSD 624
           + +  +   AS +G      R  S  G+ 
Sbjct: 538 KKQPKKPEKAS-AGDERTGLRPESEPGTL 565


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 41.8 bits (98), Expect = 0.001
 Identities = 66/300 (22%), Positives = 100/300 (33%), Gaps = 20/300 (6%)

Query: 333 ELETRVRLSKRRQKV---GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEED 389
           E E R R  + RQ      + P+         P  A++       E   E   EEEEEE+
Sbjct: 400 EEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQ--ESEEEESVEEEEEEEEEEEE 457

Query: 390 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
           EE+E EE E   + E  E E   G   E EG  EG+ D E+   DAE  +   +  S  S
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517

Query: 450 RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
             +        R  S    S         +    S   +        +   +S+    G 
Sbjct: 518 EGQQ------PRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGV 571

Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKS-VRSDKSAASNRSARSDGSDKSASNRSRSRSAS 568
           A   +++   SR        ++  N S   +  +   N S  +   ++  S R R R   
Sbjct: 572 ATPVETKISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKEE 631

Query: 569 SQASNNSAASNRSSNKSGSRSR--------SRSVSRARSNRSRSASKSGSSAASSRSGSP 620
            ++        R  N S             S S     S R  + S    +++    G P
Sbjct: 632 VESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDP 691


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 41.1 bits (96), Expect = 0.002
 Identities = 37/170 (21%), Positives = 68/170 (40%), Gaps = 5/170 (2%)

Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
           N  E R  +L       P  E++E+ EE EEE+ EE  +G  ++  ++  +   ++ G +
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGKKLKKLKGKKNGLD 352

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
            ++ +    SD    S      S    +  K K P K     S  S   N        + 
Sbjct: 353 KDDSDSGDDSD---DSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPES 409

Query: 484 SAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
             K  R A ++ S   A+    +  +  + KS S +S   +    +S+++
Sbjct: 410 KDKGKRKAANEVSKSPASVPAKKLKTENAPKSSSGKSTPQTFSGSKSSSN 459



 Score = 33.4 bits (76), Expect = 0.39
 Identities = 34/199 (17%), Positives = 72/199 (36%), Gaps = 12/199 (6%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEG---EGGEREGGGEGENDNEKRTSDAEHSS 439
           + ++ +DE  EE+   ++        ER++    E   +    + E+  E      E   
Sbjct: 273 DSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332

Query: 440 KNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKS 499
             S +     + + K     K        S  S+     + S  +AK  +  + +    S
Sbjct: 333 GLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDS 392

Query: 500 AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSAS 559
             S+   SG A  +    D+ +  +A       S             +  +++ + KS+S
Sbjct: 393 NPSSPGNSGPARPSPESKDKGKRKAANEV----SKSPASVPA-----KKLKTENAPKSSS 443

Query: 560 NRSRSRSASSQASNNSAAS 578
            +S  ++ S   S+++AA 
Sbjct: 444 GKSTPQTFSGSKSSSNAAD 462



 Score = 33.0 bits (75), Expect = 0.47
 Identities = 36/226 (15%), Positives = 65/226 (28%), Gaps = 10/226 (4%)

Query: 402 QGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS----SRSRSRSKSKS 457
            G     +  E   GE+ GGG G++   K     +    + S          +S+ K K 
Sbjct: 188 NGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKK 247

Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD--RSDKSAASNRSRSGSAASNKS 515
            AK  +      +G    D  A    S   D   R +   SD SA+ N         +  
Sbjct: 248 LAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPE 307

Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
              +           +   K+      +   +  +     K+  ++  S S      ++ 
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDI 367

Query: 576 AASNRSS----NKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
              +  S     K     +   V    S+   S     S  +  + 
Sbjct: 368 DGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKG 413


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 54/193 (27%), Positives = 73/193 (37%), Gaps = 28/193 (14%)

Query: 447 SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK--SDRSARSDRSDKSAASNR 504
           S SRS S+S     R      SSR S  SD   KS  SA     +S RS R  KS  ++ 
Sbjct: 57  SHSRSPSRS-----RLHKRKKSSRRSPMSDTLLKSKSSAHLLHHQSTRSHRRSKSGTTSP 111

Query: 505 SRSGSAASNKSRSD----RSRSGSAGSDR---------SAASNKSVRSDKSAASNRSARS 551
            +  S+A  +        R+ +  A S R            S+ S+ S+      R A  
Sbjct: 112 RKPSSSAHRRRNDSEWLLRAGAALASSTREEKGQSWLVKRESSTSLVSEDEEDFEREAAR 171

Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
           +    S     R RS  S  S        + ++ GSRS SR  SRA  + +       + 
Sbjct: 172 EREHSSRRASRRGRSGYSTPS-------AALSRRGSRSASRRGSRADLSMT-PLEARRAD 223

Query: 612 AASSRSGSPAGSD 624
           A  SR     G D
Sbjct: 224 AEDSRDTVLLGPD 236


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 31/152 (20%), Positives = 51/152 (33%), Gaps = 2/152 (1%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
               L   F     EEEEEE+E+E   ++         +  R + E  E +        +
Sbjct: 169 VNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPS 228

Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR--SAK 486
            K  S +     +  + S+  RS + S S AK++  G    R +  S    K     S  
Sbjct: 229 PKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTS 288

Query: 487 SDRSARSDRSDKSAASNRSRSGSAASNKSRSD 518
              S+    S+    +  S   S        +
Sbjct: 289 QSPSSAEPTSESPQTAGNSSLSSLGDYSDSDE 320



 Score = 29.3 bits (66), Expect = 6.7
 Identities = 31/186 (16%), Positives = 60/186 (32%), Gaps = 33/186 (17%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
               LE     E++E+EEEEEE+                    +    G    ++ +R  
Sbjct: 169 VNSMLEALFRREKKEEEEEEEEDEALI----------------KSLSFGPETEEDRRRAD 212

Query: 434 DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARS 493
           D +                 +  +    S    +SS     S     + + +++  S+++
Sbjct: 213 DEDSEDD-------------EEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKA 259

Query: 494 DRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDG 553
            ++ +     R    S    K    ++   S     S+A   S     +  S+ S+  D 
Sbjct: 260 KKNSRGIPKPRDALSSLVVRK----KAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGDY 315

Query: 554 SDKSAS 559
           SD   S
Sbjct: 316 SDSDES 321


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 40.8 bits (95), Expect = 0.003
 Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA---ERQEGEGGEREGGGEGEND 427
           R+ F+   ++E+E+ED+++EE++ EE ++ E  +    E +E EGGE     + E D
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVD 204



 Score = 29.6 bits (66), Expect = 5.9
 Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 7/64 (10%)

Query: 370 IQRLRERFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQE---GEGGEREGGG 422
           I+R R R L      P +    +D++E+E+E ++ ++ +  E E +E   G   E E   
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191

Query: 423 EGEN 426
            GE+
Sbjct: 192 GGED 195


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 40.2 bits (95), Expect = 0.003
 Identities = 30/138 (21%), Positives = 54/138 (39%), Gaps = 11/138 (7%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
           EEEE  +EE + EAEE  + E  EA   E E        E         + AE +++ ++
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAE--------EEAKAEAAAAAPAEEAAEAAA 90

Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASN 503
            A + +R    ++  A R    +A    + K+ +  K  +  K     ++ R  K     
Sbjct: 91  AAEAAAR---PAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGG 147

Query: 504 RSRSGSAASNKSRSDRSR 521
           + R       + R  + +
Sbjct: 148 KGRRRRRGRRRRRKKKKK 165


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 40.4 bits (94), Expect = 0.003
 Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 474 KSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
           ++  S        +D S   D S K    +R RS SAA+     D        ++ + A 
Sbjct: 71  ENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEAG 130

Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
                         S+ SDG  K+++ +S S+SAS    +NS  S+ S+  SG  S S
Sbjct: 131 KPGKPKGNQGEGLASS-SDGKSKASA-KSGSKSASKHGESNS--SDESATDSGKASAS 184



 Score = 40.1 bits (93), Expect = 0.004
 Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 2/99 (2%)

Query: 527 SDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
           SD S   NK+ + D +     ++ +  + K    +SR RSAS+ A      S        
Sbjct: 65  SDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKA 124

Query: 587 SRSRSRSVSRARSNRSRS--ASKSGSSAASSRSGSPAGS 623
           + + +    + + N+     +S  G S AS++SGS + S
Sbjct: 125 NPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSAS 163



 Score = 33.1 bits (75), Expect = 0.51
 Identities = 23/130 (17%), Positives = 50/130 (38%), Gaps = 3/130 (2%)

Query: 496 SDKSAASNRSRSGSAASNKS-RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
           SD+S   N++       N +  S+   +   G  +S   + S  +      ++      +
Sbjct: 65  SDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKA 124

Query: 555 DKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
           + + + +      +      S++  +S   + S S+S S     ++   SA+ SG   AS
Sbjct: 125 NPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGK--AS 182

Query: 615 SRSGSPAGSD 624
           +      G+D
Sbjct: 183 ASVAGIVGAD 192


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 39.8 bits (93), Expect = 0.003
 Identities = 47/230 (20%), Positives = 119/230 (51%), Gaps = 14/230 (6%)

Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
           EE+ +++E+++AEE QQ +  E ER +    ER    E +   E+    A    K +  A
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEA 137

Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
           ++++ + +K+K+ A+  R+ +A+ + + ++ + A+++ + K    A ++   K+ A   +
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK----AAAEAKKKAEAEAAA 193

Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
           ++ + A  K+ ++  +  +A + + AA+     + K+AA  ++A    ++K+A+ ++  +
Sbjct: 194 KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA----AEKAAAAKAAEK 249

Query: 566 SASSQASN------NSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSG 609
           +A+++A+           S +++ K+G  ++      A +   +    SG
Sbjct: 250 AAAAKAAAEVDDLFGGLDSGKNAPKTGGGAKGNGAQGAGAGNGKKGGASG 299



 Score = 36.3 bits (84), Expect = 0.048
 Identities = 35/213 (16%), Positives = 89/213 (41%), Gaps = 10/213 (4%)

Query: 371 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
           +RL++   E    +E+++  EE  ++A   +Q +  EA  +     + +   E      K
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAAL-KQKQAEEAAAKAAAAAKAKAEAE-----AK 154

Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
           R + A   +   ++  + + +  K+ + AK+     A +  + K+   AK    A++ + 
Sbjct: 155 RAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK----KAEAEAAAKAAAEAKKKAEAEAKKK 210

Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
           A ++   K+AA  ++ +  AA+    +    + +  ++++AA+  +   D       S +
Sbjct: 211 AAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFGGLDSGK 270

Query: 551 SDGSDKSASNRSRSRSASSQASNNSAASNRSSN 583
           +       +  + ++ A +       AS    +
Sbjct: 271 NAPKTGGGAKGNGAQGAGAGNGKKGGASGADID 303


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 39.5 bits (92), Expect = 0.003
 Identities = 31/121 (25%), Positives = 50/121 (41%), Gaps = 1/121 (0%)

Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
           R+ S S+ S    +   RS S     S+  + S  S +    N  A    S  + +  S 
Sbjct: 1   RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASS-RPRRLNAPASPPSSSPARNTSSS 59

Query: 564 SRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
           S    S+   +S +  R S++  S SR    + A  N S + + +  S++ SRS     S
Sbjct: 60  SSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSS 119

Query: 624 D 624
           D
Sbjct: 120 D 120



 Score = 36.1 bits (83), Expect = 0.039
 Identities = 31/150 (20%), Positives = 62/150 (41%), Gaps = 14/150 (9%)

Query: 463 RSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
           R+ S SS  ++             S R   S     ++AS+R R  +A ++   S  +R+
Sbjct: 1   RAASVSSGSTSGDA-----SSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARN 55

Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSAR-SDGSDKSASNRSRSRSASSQASNNSAASNRS 581
            S+ S    +  +     +   S+R    S GS  +A++ + S + +S + ++S + +R 
Sbjct: 56  TSSSSSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRR 115

Query: 582 SNKSGSRSRSRS--------VSRARSNRSR 603
           +  S   S +          V R +   S+
Sbjct: 116 TTSSDLSSGNGPSVLSFMADVKRGKKGPSK 145



 Score = 36.1 bits (83), Expect = 0.041
 Identities = 31/109 (28%), Positives = 45/109 (41%), Gaps = 2/109 (1%)

Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSP--AKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
             +     SS  S+ ASSR R  +   SP  +  +R+ S+SS       R +   R   S
Sbjct: 19  SSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGRLS 78

Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS 536
            R     R   SAA++ + S + AS    S  SRS    S   ++ N  
Sbjct: 79  SRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSSGNGP 127



 Score = 35.7 bits (82), Expect = 0.066
 Identities = 28/144 (19%), Positives = 52/144 (36%), Gaps = 19/144 (13%)

Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
           R +     S +   +S RS SR +  S    + + S   R +  +   + S     S  S
Sbjct: 1   RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSS 60

Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
           +      + ++ +R R             S    + S  S ++  S+    + AS   + 
Sbjct: 61  SFGLSKQRPSSLSRGRL------------SSRFVSPSRGSPSAAASLNGSLATASTSGSS 108

Query: 551 SDGSDKSASNRSRSRSASSQASNN 574
           S        +RSR  ++S  +S N
Sbjct: 109 S-------PSRSRRTTSSDLSSGN 125


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.0 bits (93), Expect = 0.005
 Identities = 43/193 (22%), Positives = 62/193 (32%), Gaps = 10/193 (5%)

Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
           N +E  I   R+   E   E E +E + + E EAEE +    TEAE  E E GE E G E
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDE-GEIETGEE 737

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
           GE   ++   +AE              +    K       + +      ++ +  A  D 
Sbjct: 738 GEEVEDEGEGEAE--------GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789

Query: 484 SAKSDRSA-RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
             K D  A      +    +         S     D       G     A N+       
Sbjct: 790 EMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849

Query: 543 AASNRSARSDGSD 555
              +    SDG D
Sbjct: 850 KGVDGGGGSDGGD 862



 Score = 39.6 bits (92), Expect = 0.005
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           + EEEEE+EEEEEEE EE ++ E  E E +E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 38.8 bits (90), Expect = 0.010
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAE 409
           +   EEEEEE+EEEEEEE EE ++ E    E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 37.7 bits (87), Expect = 0.023
 Identities = 19/34 (55%), Positives = 21/34 (61%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           E   EEEEEE+EEEEEEE EE +  E    E  E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPE 900



 Score = 36.9 bits (85), Expect = 0.042
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           E   EEEEEE+EEEEEEE EE    E  E  +++
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 36.5 bits (84), Expect = 0.048
 Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE-GGEREGGGEGENDNEKRTSDAE 436
           E P E E E  EE   E AE+  + ET      EGE   ER+G  EGE + E + +D +
Sbjct: 650 ERPTEAEGENGEESGGE-AEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707



 Score = 36.1 bits (83), Expect = 0.070
 Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 7/69 (10%)

Query: 374 RERFLEPPNEEE---EEEDEEEEEEEAEETQQGETTEAER---QEGEGGEREGGGEGEND 427
           R    E  N EE   E E E E E + E   +GE   AER   QEGEG       + + +
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGE 709

Query: 428 NEKRTSDAE 436
            E    + E
Sbjct: 710 TEAEEVEHE 718



 Score = 35.7 bits (82), Expect = 0.078
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           + EEE+EEEEEEE EE ++ E  E E +  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 34.6 bits (79), Expect = 0.20
 Identities = 31/195 (15%), Positives = 49/195 (25%), Gaps = 5/195 (2%)

Query: 381 PNEEEEE---EDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
             E E     E E  EE   E  Q+GET      E E GE     +GE + E      E 
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESE-GEIPAERKGEQEGEGEIEAKEA 704

Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
             K  + A              +          G    D   + +   K +     DR +
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG-EGEAEGKHEVETEGDRKE 763

Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
                     G    ++        G    D  A        +  A            ++
Sbjct: 764 TEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQA 823

Query: 558 ASNRSRSRSASSQAS 572
                +  +   + +
Sbjct: 824 DDTEVKDETGEQELN 838



 Score = 32.7 bits (74), Expect = 0.69
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 1/59 (1%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ-EGEGGEREGGGEGENDNEKRTSDAE 436
                ++ E  +E  E+E     QGE  + E+  +G GG   G  E E + E+   + E
Sbjct: 819 SETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877



 Score = 32.3 bits (73), Expect = 1.0
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 387 EEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
           + +EEEEEEE EE ++ E  E E +E E  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 30.7 bits (69), Expect = 3.1
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 389 DEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           D EEEEEE EE ++ E  E E +E E    E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 29.6 bits (66), Expect = 7.1
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
               +  + EEEEE EE ++ E  E E +E E  E E
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 29.2 bits (65), Expect = 7.7
 Identities = 18/98 (18%), Positives = 37/98 (37%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E   E +E+EDE E +   +   +G+     + E EG    G  +      +  +D    
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSD 476
              +      + ++ ++K   K    G  S  G ++ +
Sbjct: 829 KDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEE 866



 Score = 29.2 bits (65), Expect = 8.1
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           +     D  + EEE EE ++ E  E E +E E  E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 29.2 bits (65), Expect = 9.0
 Identities = 21/94 (22%), Positives = 34/94 (36%), Gaps = 7/94 (7%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
           L   ++ +  E E   E   EE ++    E E  E  GGE E  GE E       +  E+
Sbjct: 627 LGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETE-------TKGEN 679

Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRG 471
            S+    A  +     + +  AK +     +   
Sbjct: 680 ESEGEIPAERKGEQEGEGEIEAKEADHKGETEAE 713


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 39.5 bits (92), Expect = 0.005
 Identities = 29/181 (16%), Positives = 48/181 (26%)

Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
            SS  SS          P     S    + GS      A   R      +  S   +K +
Sbjct: 397 PSSSPSSSVIMTPPDSGPGSNVSSRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPS 456

Query: 501 ASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASN 560
             +       + N S   R        + S     +   +K   +  S       +S   
Sbjct: 457 NISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTLSSIPPLSPRQSIIT 516

Query: 561 RSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
                  +S+ S+ S      S    S     ++   +     S ++S S     R G  
Sbjct: 517 LPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNRSVSDIEGERIGRY 576

Query: 621 A 621
            
Sbjct: 577 N 577



 Score = 32.1 bits (73), Expect = 0.92
 Identities = 36/179 (20%), Positives = 51/179 (28%), Gaps = 4/179 (2%)

Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRS--A 485
           N K T  +    K S+ +       S + S   R          S     +   ++    
Sbjct: 442 NIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLT 501

Query: 486 KSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
            S     S R         SR  S  S+ S    S SGS  S     +  S +     + 
Sbjct: 502 LSSIPPLSPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSF 561

Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRS 604
           NRS      ++    R         A    A S  SS +S S      V     +  R 
Sbjct: 562 NRSVSDIEGER--IGRYNLLPTRIPALPFKAESTTSSRRSSSLPSPTGVIGFPGSVPRF 618


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 39.1 bits (91), Expect = 0.005
 Identities = 23/139 (16%), Positives = 62/139 (44%), Gaps = 3/139 (2%)

Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
           N K+ +    +   + +AS++++S +  K   K++   + S++ + K+  +AK+ + AK 
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKS-AAKKVSKKKAAKTAVSAKKAAKT--AAKAAKKAKK 254

Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNR 547
                  ++ K+      ++  AA+  ++     +      +  A  K+    K+ A+ +
Sbjct: 255 TAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314

Query: 548 SARSDGSDKSASNRSRSRS 566
           + +     K A   ++ R+
Sbjct: 315 APKRGAKGKKAKKVTKKRA 333



 Score = 32.9 bits (75), Expect = 0.47
 Identities = 28/162 (17%), Positives = 59/162 (36%), Gaps = 24/162 (14%)

Query: 456 KSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKS 515
            + AK++    A+  G+ K+   AKS     S + A      K+A S +  + +AA    
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKA-----AKTAVSAKKAAKTAA---- 246

Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
                 +  A      A  K+ ++ K AA           K A+  +   +  +  +   
Sbjct: 247 ----KAAKKAKKTAKKALKKAAKAVKKAA-----------KKAAKAAAKAAKGAAKATKG 291

Query: 576 AASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
            A  +      + + S++ + A++ +  +  K        R+
Sbjct: 292 KAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333



 Score = 31.8 bits (72), Expect = 0.93
 Identities = 22/147 (14%), Positives = 51/147 (34%), Gaps = 1/147 (0%)

Query: 405 TTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRS 464
           T E  R     G        +    K  +    +   S+      +  +K+   AK++  
Sbjct: 184 TREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAK 243

Query: 465 GSASSRGSNKSDRSAKSDRSAKS-DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSG 523
            +A +    K        ++AK+  ++A+      + A+  +   +    K++    +  
Sbjct: 244 TAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303

Query: 524 SAGSDRSAASNKSVRSDKSAASNRSAR 550
           +AGS   A +    R  K   + +  +
Sbjct: 304 AAGSKAKATAKAPKRGAKGKKAKKVTK 330



 Score = 30.6 bits (69), Expect = 2.4
 Identities = 23/147 (15%), Positives = 59/147 (40%), Gaps = 14/147 (9%)

Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
             ++A+ ++K  +  S   ++ +K+KS AK+            K+ ++A S + A    +
Sbjct: 195 GAANAKKAAKTPAAKSGAKKASAKAKSAAKKV--------SKKKAAKTAVSAKKAAKTAA 246

Query: 491 ARSDRSDKSAASNRSRSGSAA------SNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
             + ++ K+A     ++  A       + K+ +  ++  +  +   A + K      +A 
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAG 306

Query: 545 SNRSARSDGSDKSASNRSRSRSASSQA 571
           S   A +    + A  +   +    +A
Sbjct: 307 SKAKATAKAPKRGAKGKKAKKVTKKRA 333



 Score = 29.1 bits (65), Expect = 7.7
 Identities = 22/135 (16%), Positives = 50/135 (37%), Gaps = 5/135 (3%)

Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAAS-NRSRSGS 509
            ++     AK     +++   S     S K  ++AK+  SA+   + K+AA   +    +
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKK--KAAKTAVSAKK--AAKTAAKAAKKAKKT 255

Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASS 569
           A     ++ ++   +A     AA+  +  + K+      A+     K+A+      +A +
Sbjct: 256 AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315

Query: 570 QASNNSAASNRSSNK 584
                     +   K
Sbjct: 316 PKRGAKGKKAKKVTK 330


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 38.8 bits (90), Expect = 0.006
 Identities = 46/228 (20%), Positives = 103/228 (45%), Gaps = 5/228 (2%)

Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
           E++ +++EE+ AEE +  +  E ER +    ER    E +   E+    A+   K     
Sbjct: 78  EQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQ 137

Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
           + ++ +  K K+ A ++++ + +++    ++   K++ +AK+   A++     +A     
Sbjct: 138 ARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197

Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
               AA+ K++++      A     AA+ +   ++K  A+ ++     +DK+A+    + 
Sbjct: 198 AEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAK----ADKAAAAAKAAE 253

Query: 566 SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
             ++ A+ +      SS K  +         A S  S +A  SG+S A
Sbjct: 254 RKAAAAALDDIFGGLSSGK-NAPKTGGGAKNASSQGSGAAKNSGASGA 300



 Score = 36.5 bits (84), Expect = 0.040
 Identities = 35/216 (16%), Positives = 89/216 (41%), Gaps = 7/216 (3%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
           + +++ E        E+E   + E+E    + QQ +  EAE+Q     +++     +   
Sbjct: 84  KEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAA 143

Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSD 488
           E++       +K ++ A+    +    K   + +++   +   +  +    K++  AK+ 
Sbjct: 144 EQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA- 202

Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS--- 545
            + ++    ++ A    ++ +AA  K+ +++ ++ +      AA+       K+AA+   
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALD 262

Query: 546 ---NRSARSDGSDKSASNRSRSRSASSQASNNSAAS 578
                 +    + K+      + S  S A+ NS AS
Sbjct: 263 DIFGGLSSGKNAPKTGGGAKNASSQGSGAAKNSGAS 298


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 38.8 bits (90), Expect = 0.009
 Identities = 33/161 (20%), Positives = 62/161 (38%), Gaps = 9/161 (5%)

Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
           D  +R+     + KN++   ++ +++ K K+  K       + R S KS    K   S  
Sbjct: 202 DKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKS----KVQNSHF 257

Query: 487 SDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASN 546
             R  RS   +         SG  +S      + +S     D    SN   ++      N
Sbjct: 258 DVRILRSYTKENKKDEKNVVSGIRSSV---LLKRKSQCLRKDSYVYSNHQKKAKTGDPKN 314

Query: 547 RSARSDGSDKSASNRSRSRS--ASSQASNNSAASNRSSNKS 585
              R++GS  S ++ + S +   S++ SN + +S      +
Sbjct: 315 IIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTT 355



 Score = 37.3 bits (86), Expect = 0.027
 Identities = 31/162 (19%), Positives = 64/162 (39%), Gaps = 5/162 (3%)

Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
           K+T+    +  +  R S R+ S  K+ +   +++  +     + K DR    D   ++ R
Sbjct: 191 KQTNVTHTTCYDKMRRSPRNTSTIKN-NTNDKNKEKNKEKDKNIKKDR----DGDKQTKR 245

Query: 490 SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSA 549
           ++   +   S    R        NK       SG   S      ++ +R D    SN   
Sbjct: 246 NSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQK 305

Query: 550 RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
           ++   D        + S++S   + S++++  SN+  +R+ S
Sbjct: 306 KAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPS 347



 Score = 34.2 bits (78), Expect = 0.25
 Identities = 38/183 (20%), Positives = 63/183 (34%), Gaps = 16/183 (8%)

Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
              N  EK   D    S +SS  SS   + S   SP       S        S  S  S 
Sbjct: 112 IKANKKEKHNLD----SSSSSSISSSLTNISFFSSPTSIYSCLS-------NSLSSKHSP 160

Query: 483 RSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGS--AGSDRSAASNKSVRSD 540
           +  K ++S   + S  ++  N+  S      ++    +        S R+ ++ K+  +D
Sbjct: 161 KVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTND 220

Query: 541 KSAASNRSARSD---GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
           K+   N+    +     D     +  S  +  Q S+      RS  K   +     VS  
Sbjct: 221 KNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGI 280

Query: 598 RSN 600
           RS+
Sbjct: 281 RSS 283


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 38.5 bits (90), Expect = 0.009
 Identities = 42/200 (21%), Positives = 78/200 (39%), Gaps = 20/200 (10%)

Query: 355 RLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
            LV+ HR      F   + ++   E   EE+ EE   + E+  +E +        R    
Sbjct: 302 ELVLPHR-RRRKPFEQPQGKD---EKDLEEKPEEPGPDPEKP-DEGEDDAEQSGPRGHPT 356

Query: 415 GGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNK 474
            G  +        + +  +D + SS + +    R R  +  ++ A R  SG  S   S+ 
Sbjct: 357 PGNDDEKEP----DPQEEADGQGSSTDPAGDIFRIRVLAPPQARA-RGASGRRSRTRSDS 411

Query: 475 SDRSAKSDRSAKSDRSARSDRSDKSAASN-RSRSGSAASNKSR-SDRSRSGSAGS----- 527
             R  ++ R+         D + ++AA + R+R G+ A       ++ R+G AG+     
Sbjct: 412 RGRYVRARRNRGPPDDLAVDATLRAAAPHQRARPGAVAVEPEDLREKIRAGRAGNLVIFV 471

Query: 528 ---DRSAASNKSVRSDKSAA 544
                S A+   + + K A 
Sbjct: 472 VDASGSMAARGRMAAAKGAV 491


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 38.7 bits (90), Expect = 0.010
 Identities = 24/123 (19%), Positives = 48/123 (39%), Gaps = 2/123 (1%)

Query: 323  IVRDDAVYYNELETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPN 382
            IV +     NELE+ V  SK  + +    N  + +   +     + +I+  R+      +
Sbjct: 2529 IVDNSTYIINELESHV--SKLNELLSYIDNEIKELENEKLKLLEKAKIEESRKERERIES 2586

Query: 383  EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
            E +E+  +EE+    ++ +  +  E +    E   RE  G  + +    T   E +    
Sbjct: 2587 ETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTDDTNKNHNTGHDESNYGRY 2646

Query: 443  SRA 445
            S  
Sbjct: 2647 SNK 2649


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 38.1 bits (88), Expect = 0.016
 Identities = 28/140 (20%), Positives = 53/140 (37%), Gaps = 1/140 (0%)

Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSK 440
           P ++E EE E E+ +E++E ++G     E +EGE G++ G  +    ++  T   +    
Sbjct: 524 PGDKEGEEGEHEDSKESDEPKEGGKP-GETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFP 582

Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
              +         K K P    R     S    +     KS +  +S +S +     +  
Sbjct: 583 KDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRP 642

Query: 501 ASNRSRSGSAASNKSRSDRS 520
           +S     G       +  +S
Sbjct: 643 SSPERPEGPKIIKSPKPPKS 662



 Score = 32.7 bits (74), Expect = 0.66
 Identities = 36/170 (21%), Positives = 50/170 (29%), Gaps = 12/170 (7%)

Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSK 440
              EEE+ D+ +E  E  E           +EGE GE E   E +   E      E    
Sbjct: 497 APIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGG-KPGETKEG 555

Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
              +    ++    SK P    +          K  +  +  +  K  RSA+     KS 
Sbjct: 556 EVGKKPGPAKEHKPSKIPTLSKK---PEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSP 612

Query: 501 ASNRS--------RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
                        R  S  S K      R  S          KS +  KS
Sbjct: 613 KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKS 662


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 37.2 bits (86), Expect = 0.024
 Identities = 25/134 (18%), Positives = 57/134 (42%), Gaps = 3/134 (2%)

Query: 428 NEKRTSDAEHSSKNSSRASSRSRSR-SKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
              +      S+ N+   ++RSR+   ++++ A+R  + S +S  + K  RS     +++
Sbjct: 150 PLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSE 209

Query: 487 SDRSARSDRSDKS--AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
           ++R      S  S   A+      S+    +  D +   +A +  + AS  +  +  +AA
Sbjct: 210 ANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAA 269

Query: 545 SNRSARSDGSDKSA 558
           +  + R+      A
Sbjct: 270 ATSANRAKQQADRA 283



 Score = 36.4 bits (84), Expect = 0.043
 Identities = 26/137 (18%), Positives = 60/137 (43%), Gaps = 3/137 (2%)

Query: 473 NKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAA 532
           N+S  +A+++ +   + + R+    ++   N     SA + K     +   ++ ++R   
Sbjct: 158 NQSAWNARNEANRSRNEADRARN--QAERFNNESGASATNTKQWRSEADGSNSEANRFKG 215

Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
              S+ S   AA  ++  S     +A + +   +AS+ A++ S  +  +S  + + S +R
Sbjct: 216 YADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANR 275

Query: 593 SVSRARSNRSRSASKSG 609
           +  +A       A K G
Sbjct: 276 AKQQA-DRAKTEADKLG 291



 Score = 34.9 bits (80), Expect = 0.12
 Identities = 22/112 (19%), Positives = 47/112 (41%), Gaps = 4/112 (3%)

Query: 431 RTSDAEHSSKNSSRASSRSRSRSK-SKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
           +     + S  S+  + + RS +  S S A R +  + S   S ++ +      S +++ 
Sbjct: 181 QAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANT 240

Query: 490 SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDK 541
           +   D + K+AAS  +   S  +  + +  + + SA   +  A      +DK
Sbjct: 241 AG--DYATKAAASASAAHASEVNAANSATAA-ATSANRAKQQADRAKTEADK 289



 Score = 34.9 bits (80), Expect = 0.14
 Identities = 28/135 (20%), Positives = 57/135 (42%), Gaps = 11/135 (8%)

Query: 485 AKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
           A  D  A      K+   +   + + A N+SR++  R+ +     +  S  S     +  
Sbjct: 142 AVDDGDAVPLGQIKTWNQSAWNARNEA-NRSRNEADRARNQAERFNNESGASA----TNT 196

Query: 545 SNRSARSDGSDKSASNRSRSRSASSQASNNSAAS--NRSS---NKSGSRSRSRSVSRARS 599
               + +DGS  S +NR +  + S  +S  +A      SS   N +G  +   + S + +
Sbjct: 197 KQWRSEADGS-NSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAA 255

Query: 600 NRSRSASKSGSSAAS 614
           + S   + + ++AA+
Sbjct: 256 HASEVNAANSATAAA 270



 Score = 33.7 bits (77), Expect = 0.32
 Identities = 15/93 (16%), Positives = 29/93 (31%)

Query: 525 AGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
           A  D  A     +++   +A N    ++ S   A          +  S  SA + +    
Sbjct: 142 AVDDGDAVPLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRS 201

Query: 585 SGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
               S S +        S ++S   +   +  S
Sbjct: 202 EADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 28/108 (25%), Positives = 53/108 (49%), Gaps = 10/108 (9%)

Query: 519 RSRSGSAGSDRSAASNKSVRSDKSAASNRSARS-DGSDKSASNRSRSRSASSQASNNSAA 577
           ++ + SA + R+ A+    R++   A N++ R  + S  SA+N  + RS +   SN+ A 
Sbjct: 155 KTWNQSAWNARNEAN--RSRNEADRARNQAERFNNESGASATNTKQWRSEAD-GSNSEA- 210

Query: 578 SNRSSNKSGSRSRSRSVSRARSNRSR----SASKSGSSAASSRSGSPA 621
            NR    + S + S   ++ ++  S     +A    + AA+S S + A
Sbjct: 211 -NRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHA 257


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 36.9 bits (85), Expect = 0.024
 Identities = 19/52 (36%), Positives = 22/52 (42%), Gaps = 1/52 (1%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGENDNE 429
           E    EE +  EEEE+EE      GE   E   +E E  E    GE   D E
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229



 Score = 32.3 bits (73), Expect = 0.68
 Identities = 15/51 (29%), Positives = 24/51 (47%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
           PP + + + +EEE  EE++  ++ E  E        G RE   E E + E 
Sbjct: 168 PPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEG 218


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 37.2 bits (86), Expect = 0.025
 Identities = 33/161 (20%), Positives = 63/161 (39%), Gaps = 5/161 (3%)

Query: 462 SRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR 521
           S  G  S++       S  +   +KS  S   +        +   S +   +    +   
Sbjct: 220 SSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHH 279

Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
           S  A S           S+KS  S+ S   +GS ++ SN S + + S  + ++ +A + +
Sbjct: 280 SKHADSKLQTIEVIENHSNKSRPSSSST--NGSKETTSNSSSAAAGSIGSKSSKSAKHSN 337

Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
            NKS S  +S S +   +    S+S S + +   R+   + 
Sbjct: 338 RNKSNSSPKSHSSA---NGSVPSSSVSDNESKQKRASKSSS 375



 Score = 36.1 bits (83), Expect = 0.070
 Identities = 55/262 (20%), Positives = 95/262 (36%), Gaps = 33/262 (12%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG---------- 416
           EF +Q L++    P  E       ++E+ EA       T E    +  GG          
Sbjct: 191 EFYMQLLQQAL--PKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNS 248

Query: 417 ---EREGGGEGE-NDNEKRTSDAEHSSKNS---------------SRASSRSRSRSKSKS 457
              E E   +G+ + +            N                   S++SR  S S +
Sbjct: 249 QTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTN 308

Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
            +K + S S+S+   +   +S+KS + +  ++S  S +S  SA  +   S S + N+S+ 
Sbjct: 309 GSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSV-PSSSVSDNESKQ 367

Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
            R+   S+G+ R +  + S  S      N    +  S  SA  R        QA    A 
Sbjct: 368 KRASKSSSGA-RDSKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQAR 426

Query: 578 SNRSSNKSGSRSRSRSVSRARS 599
            N S  ++     +      +S
Sbjct: 427 QNESELRNQISLLTSLERSLKS 448


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 36.9 bits (86), Expect = 0.028
 Identities = 20/43 (46%), Positives = 24/43 (55%)

Query: 372 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           R R R   PP+  E EE+EEEEEE  EE ++ E  E    E E
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 33.9 bits (78), Expect = 0.29
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
           R +R R     PP  EEEEE+EEE  EE EE ++ E    E +
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 33.1 bits (76), Expect = 0.50
 Identities = 20/46 (43%), Positives = 24/46 (52%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           R+   R R   PP    E E+EEEEEEE  E ++ E  E ER   E
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 35.6 bits (82), Expect = 0.030
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
           +E++E+DEEEEEE+ EE ++ E  E E +  E  E E   E +N + K
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLK 92



 Score = 30.2 bits (68), Expect = 2.0
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
           +E+ E++EEEE E+ ++ E  E    E E  E E   E E ++E+   D +   K +   
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDE---EEEEEDEEDNVDLKDIEKKNIND 101

Query: 446 SSRSRSRSKSKS 457
              S     +++
Sbjct: 102 IFNSTQDDNAQN 113



 Score = 28.7 bits (64), Expect = 6.8
 Identities = 19/63 (30%), Positives = 35/63 (55%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E   ++EEEE+E+EEE E  E  + E    E +E E  + E   + ++  +K  +D  +S
Sbjct: 46  EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNS 105

Query: 439 SKN 441
           +++
Sbjct: 106 TQD 108


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 37.0 bits (85), Expect = 0.036
 Identities = 29/206 (14%), Positives = 81/206 (39%)

Query: 386  EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
            EE  + E+ ++AE  ++ E  + + +E +  E E   E     E+         + + +A
Sbjct: 1215 EEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274

Query: 446  SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
                ++    K+  K+    +  +    K+D + K    AK    A+    +    ++ +
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334

Query: 506  RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
            +  +  + K+         A +D + A+ +   + +        ++D + K A  + ++ 
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394

Query: 566  SASSQASNNSAASNRSSNKSGSRSRS 591
             A  +A  +   ++     + ++ ++
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKA 1420


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 36.4 bits (85), Expect = 0.043
 Identities = 29/124 (23%), Positives = 55/124 (44%), Gaps = 20/124 (16%)

Query: 500 AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA- 558
           AA   + +   A+  +R   + +G A S  SA +     +  S A+  +A   G  ++  
Sbjct: 295 AAGGGAAAAGGAAAAARGGAAAAGGASSAYSAGA-----AGGSGAAGVAAGLGGVARAGA 349

Query: 559 -SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
            +  S  R A+S+A    A S +SS ++G+RS                + + ++A ++ +
Sbjct: 350 SAAASPLRRAASRA----AESMKSSFRAGARSTGGGA---------GGAAAAAAAGAAAA 396

Query: 618 GSPA 621
           G PA
Sbjct: 397 GPPA 400



 Score = 30.6 bits (70), Expect = 2.4
 Identities = 21/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)

Query: 519 RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAAS 578
            +  G+A + R  A+        +A    SA S G    A+  S +   ++     + A 
Sbjct: 301 AAAGGAAAAARGGAA--------AAGGASSAYSAG----AAGGSGAAGVAAGLGGVARAG 348

Query: 579 NRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
             ++     R+ SR+    +S+    A  +G  A  + + + AG+
Sbjct: 349 ASAAASPLRRAASRAAESMKSSFRAGARSTGGGAGGAAAAAAAGA 393


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 34.2 bits (79), Expect = 0.044
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
               EE++EE+EEEEE+E  E +   
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 30.7 bits (70), Expect = 0.54
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
                EE++E+EEEEEE+ E  ++ 
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 30.7 bits (70), Expect = 0.72
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
               ++EEEE+EEE+EE  EE   G
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAAG 100



 Score = 29.5 bits (67), Expect = 1.7
 Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 7/37 (18%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG 415
                  EE+ EEEEEEE +E       E+E +   G
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKE-------ESEEEAAAG 100



 Score = 28.8 bits (65), Expect = 3.1
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
                     EE++EEE EE ++ E+ E 
Sbjct: 68  AAAAAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 28.0 bits (63), Expect = 4.9
 Identities = 11/37 (29%), Positives = 16/37 (43%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
                      EE++EE EE ++ E +E E   G G 
Sbjct: 67  AAAAAAAAAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 35.8 bits (83), Expect = 0.045
 Identities = 14/44 (31%), Positives = 19/44 (43%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
           + EE E++EE   E    +      AE +EGE        E EN
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNEWEN 249



 Score = 33.5 bits (77), Expect = 0.24
 Identities = 14/44 (31%), Positives = 19/44 (43%)

Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
           + EE EE+EE  AE     +   A  +  EG E       E +N
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNEWEN 249


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 36.4 bits (84), Expect = 0.048
 Identities = 16/53 (30%), Positives = 25/53 (47%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSD 434
           NE+ E   EE E  +  E  + E T+ E +EGE    E   + E+D     ++
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTE 296



 Score = 32.5 bits (74), Expect = 0.81
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E   ++  EED +EE+ + ++    E +EA R+E EG +     E E D  + T      
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD-----ESEEDEAEATDGEGEE 274

Query: 439 S 439
            
Sbjct: 275 G 275


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 36.3 bits (84), Expect = 0.048
 Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 6/55 (10%)

Query: 365 ATEFRIQRLRE--RFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
           ATE     + E   FL+    P  E   EE+EEEEEEE EE  + E    E   G
Sbjct: 375 ATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429



 Score = 33.6 bits (77), Expect = 0.31
 Identities = 15/34 (44%), Positives = 18/34 (52%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 408
           ER+ E   EEEEEE+EE  E EA   +     E 
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 36.2 bits (84), Expect = 0.056
 Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
           R  ++ P  ++EEEE E+EEEEE EE    E  E E +EG
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEE----EPEEPEPEEG 373



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 18/42 (42%), Positives = 23/42 (54%)

Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
            E E+EEEEEE  E  ++ E  E E +E E   +E  G G N
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTGVN 209



 Score = 30.0 bits (68), Expect = 4.0
 Identities = 20/40 (50%), Positives = 23/40 (57%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 422
            E EEE+EEEE  EA+E  +GE  E E  E    E EG G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 36.4 bits (83), Expect = 0.056
 Identities = 57/203 (28%), Positives = 84/203 (41%), Gaps = 29/203 (14%)

Query: 438  SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
            S+  SS +SS S S S S   ++ SRS + S   S    R A  DRS    RS R    D
Sbjct: 1812 SAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS----RSGRRRERD 1867

Query: 498  KSAASNRSRSGSAASNKSRSDRSRSGSAGSDR------SAASNKSVRSDKSAAS--NRSA 549
            + +A+       A   +SR+D S  G+A  D            + + +  SA +  +RS 
Sbjct: 1868 RPSANPFRW---APRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSG 1924

Query: 550  RSDGSDKSASNRS------------RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
              D  D++ +  S             SR+  S+ S     SN    KS    +SR   R 
Sbjct: 1925 PEDSVDETETEDSAPPARLAPSPLETSRAEDSEDSEYPEYSNPRLGKSPPALKSREARRP 1984

Query: 598  RSNRSR--SASKSGSSAASSRSG 618
             S + R  S+ K+G +  S+ S 
Sbjct: 1985 SSKQPRRPSSGKNGHTDVSAASA 2007



 Score = 31.0 bits (69), Expect = 2.7
 Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 6/121 (4%)

Query: 404  ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSR 463
            E + + R+    G R G  + ++D      D E   ++  R   R +SR  S    +RS 
Sbjct: 1746 EISPSPRRRDAEGRRFGCRQDDDDG----YDYEGGRESPERVLGRRQSRRDSVPVRRRSG 1801

Query: 464  SGSASSRGSNKSDRSAKSDRSAKSDRSAR-SDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
            + +   R    + RS+ S  S+ S  S+  S R  +SA  + S S S    ++  DRSRS
Sbjct: 1802 AANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSP-PRRAPVDRSRS 1860

Query: 523  G 523
            G
Sbjct: 1861 G 1861



 Score = 30.6 bits (68), Expect = 3.2
 Identities = 24/74 (32%), Positives = 36/74 (48%)

Query: 538  RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
            RS  +    R   S G   S+S+ S S S+SS +S  S ++  S + S S  R   V R+
Sbjct: 1799 RSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS 1858

Query: 598  RSNRSRSASKSGSS 611
            RS R R   +  ++
Sbjct: 1859 RSGRRRERDRPSAN 1872



 Score = 29.8 bits (66), Expect = 5.9
 Identities = 30/96 (31%), Positives = 42/96 (43%), Gaps = 2/96 (2%)

Query: 529  RSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR--SRSASSQASNNSAASNRSSNKSG 586
            RS A+N   R   SA  + S+ S  S  S+S+ S   SRSA+   S + +   R+     
Sbjct: 1799 RSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS 1858

Query: 587  SRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
               R R   R  +N  R A +  S A  S  G+  G
Sbjct: 1859 RSGRRRERDRPSANPFRWAPRQRSRADHSPDGTAPG 1894


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 35.9 bits (83), Expect = 0.061
 Identities = 18/41 (43%), Positives = 27/41 (65%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           +E   E P+EEEEEE EE++EEE + T + E  + E ++ E
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 29.4 bits (66), Expect = 6.8
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
           E E+E  +EEEEEE EE ++ E    +++E    E E 
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 36.3 bits (84), Expect = 0.064
 Identities = 40/194 (20%), Positives = 72/194 (37%), Gaps = 11/194 (5%)

Query: 432 TSDAEHSSKNSSRASSRSRSRSKSKSPAK---RSRSGSASSRGSNKSDRSAKSDRSAKSD 488
           +SD+     +S+ ++  S   S+ +S  +   ++ + S        + RS +       D
Sbjct: 567 SSDSVTGGADSTMSAQDSMMASRMRSAVESYAKAHNISNEQATQELASRSTRGSAGGYGD 626

Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS--VRSDKSAASN 546
             A              + G       R+    S S G   S++S  S   R D  A + 
Sbjct: 627 AHAEWG-VGPKILGKGGKLGLGVKGGGRAGIDWSDSDGHSASSSSRSSHDARHDIDAQAT 685

Query: 547 RSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSAS 606
           +  R + SD   S +    S S   ++N+A S      +   S  +S  +  +N +RS  
Sbjct: 686 KDFR-EASDYFTSRKV---SESGSHTDNNADSRVDQLSAALNSAKQSYDQYTTNMTRSHE 741

Query: 607 KSG-SSAASSRSGS 619
            +  +S   S SG 
Sbjct: 742 YAEMASRTESMSGQ 755


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 35.8 bits (82), Expect = 0.066
 Identities = 57/254 (22%), Positives = 109/254 (42%), Gaps = 13/254 (5%)

Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER--EGGGEGE 425
            R +R   R      + +E + E+EE+   +++ + ++ E   ++     R  E  G GE
Sbjct: 223 TRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGE 282

Query: 426 NDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRS----------GSASSRGSNKS 475
            D+         S++N+  + SR  SRS+S+  ++ ++S           S SS+ ++  
Sbjct: 283 LDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLP 342

Query: 476 DRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR-SRSGSAGSDRSAASN 534
            +   S+   +    +R D  D + + +  +  S +S +   D  S S S  ++  A S 
Sbjct: 343 SQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSE 402

Query: 535 KSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
            +     S  S  S   + S      +S S S  S++  + +  +  S +  S S+  S 
Sbjct: 403 SNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSR 462

Query: 595 SRARSNRSRSASKS 608
           S+  SN + SAS S
Sbjct: 463 SKEDSNSTESASSS 476



 Score = 33.1 bits (75), Expect = 0.50
 Identities = 40/211 (18%), Positives = 84/211 (39%), Gaps = 3/211 (1%)

Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
           EE+   E ++      + ++TE               + +++  +   D++     SS +
Sbjct: 276 EEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSES 335

Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR--SDRSDKSAASN 503
           S  +   S+  S ++      + SRG N  + ++ S+    S+ S     D    S + +
Sbjct: 336 SQEADLPSQENS-SESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQS 394

Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
                 + SN+S S    S  +  D +++S + ++S  ++  +RS  S     S S    
Sbjct: 395 TEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDD 454

Query: 564 SRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
           S S  S  S   + S  S++ S    + ++ 
Sbjct: 455 SDSQDSSRSKEDSNSTESASSSEEDGQPKNT 485



 Score = 30.0 bits (67), Expect = 4.5
 Identities = 46/208 (22%), Positives = 79/208 (37%), Gaps = 3/208 (1%)

Query: 337 RVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEE 396
           + R+S+   +     +NT  V +    NA   + +       +  +EE + +++ +E ++
Sbjct: 271 KSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQD 330

Query: 397 AEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
                  E     ++     + E   E   DN   T+   HS       SS   S     
Sbjct: 331 PSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTT--SHSEDQEDSESSEEDSLDTPS 388

Query: 457 SPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRS-GSAASNKS 515
           S   +S    A S  +     S +S  S + + S+  +     +AS  SRS  S +   S
Sbjct: 389 SSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDS 448

Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSA 543
           RS+   S S  S RS   + S  S  S+
Sbjct: 449 RSEEDDSDSQDSSRSKEDSNSTESASSS 476


>gnl|CDD|226365 COG3846, TrbL, Type IV secretory pathway, TrbL components
           [Intracellular trafficking and secretion].
          Length = 452

 Score = 35.9 bits (83), Expect = 0.068
 Identities = 30/137 (21%), Positives = 48/137 (35%), Gaps = 24/137 (17%)

Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSG-------SASSRGSNKSDRSAKSDRSAKS 487
           A  +S  S+ AS+ +  R  S S A  + +G        A +        +  +  S  S
Sbjct: 328 ALGTSMASAAASAFASGRKGSGSGAFGTAAGVGDVKSPGAKAAMRTLGRAAGDTGVSVAS 387

Query: 488 DR-----SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
                  SA    + KSA +  +   +A     R    ++  AG   S A          
Sbjct: 388 GVGQAPKSAGGSAAGKSAVAKATGVQAAPGWARRMASGQNVFAG--ASTA---------- 435

Query: 543 AASNRSARSDGSDKSAS 559
           AA+ R     G+D + S
Sbjct: 436 AAAVRGGDGAGNDANPS 452



 Score = 34.3 bits (79), Expect = 0.19
 Identities = 24/108 (22%), Positives = 36/108 (33%), Gaps = 10/108 (9%)

Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
           +G+ G+   A+   SV +  +     S  S  +   AS R  S S +   +         
Sbjct: 311 AGARGATAGASLASSVTALGT-----SMASAAASAFASGRKGSGSGAFGTAAGVGDVKSP 365

Query: 582 SNKSGSRSRSR-----SVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
             K+  R+  R      VS A        S  GS+A  S      G  
Sbjct: 366 GAKAAMRTLGRAAGDTGVSVASGVGQAPKSAGGSAAGKSAVAKATGVQ 413


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 35.7 bits (82), Expect = 0.069
 Identities = 12/69 (17%), Positives = 20/69 (28%), Gaps = 1/69 (1%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
             PN++     +   E+  EE +   + E E       + E                E  
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPTS-EPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQ 215

Query: 439 SKNSSRASS 447
           S    +A S
Sbjct: 216 SPTPQQAPS 224



 Score = 31.1 bits (70), Expect = 1.9
 Identities = 14/79 (17%), Positives = 30/79 (37%), Gaps = 3/79 (3%)

Query: 375 ERFLEPPNEEEEEEDE---EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
             FL+P +E+  EE E    E E ++    Q ET  +        +  G  +     +  
Sbjct: 164 SSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAP 223

Query: 432 TSDAEHSSKNSSRASSRSR 450
           + + + + ++    +   R
Sbjct: 224 SPNTQQAVEHEDEPTEPER 242



 Score = 30.3 bits (68), Expect = 3.0
 Identities = 3/51 (5%), Positives = 6/51 (11%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
             P                   +            +          E+  E
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176



 Score = 29.6 bits (66), Expect = 5.4
 Identities = 8/54 (14%), Positives = 15/54 (27%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRT 432
             P+E    E       +   +    + E   +E E    E   +     +  T
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197



 Score = 29.6 bits (66), Expect = 5.7
 Identities = 22/108 (20%), Positives = 30/108 (27%), Gaps = 4/108 (3%)

Query: 339 RLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
           R    R++  AA           P   T     R          E + E  EE    E E
Sbjct: 30  REQAGRRRGTAARA----AKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKE 85

Query: 399 ETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS 446
           E  QG  + +  +          G  E      + +    S   S AS
Sbjct: 86  ERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS 133



 Score = 29.6 bits (66), Expect = 6.1
 Identities = 22/135 (16%), Positives = 41/135 (30%), Gaps = 9/135 (6%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
           E+     E + E AEE++ GE    + + G+GG       G       +     S     
Sbjct: 63  EQGHRQTESDTETAEESRHGE----KEERGQGGP-----SGSGSESVGSPTPSPSGSAEE 113

Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASN 503
            AS  S   +   SP   +      S  S+          S     + +     + +  +
Sbjct: 114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHED 173

Query: 504 RSRSGSAASNKSRSD 518
                   +++   D
Sbjct: 174 SPEEPEPPTSEPEPD 188


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 36.1 bits (83), Expect = 0.069
 Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 13/151 (8%)

Query: 369  RIQRLRER---------FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
            R + LRE+          L          D  E + E + TQ G   EA+       E  
Sbjct: 885  RAEELREKRGKLEAHRAMLSGSEANSARRDTMEIQNEQKPTQDGPAAEAQPA----QENT 940

Query: 420  GGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSA 479
                 E      +  A+ ++   +    +     +        ++ +  +    K  + A
Sbjct: 941  VVESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKA 1000

Query: 480  KSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
             + ++A    +A+   + K  A        A
Sbjct: 1001 PAKKAAAKKAAAKKAAAKKKVAKKAPAKKVA 1031


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 35.8 bits (83), Expect = 0.082
 Identities = 35/236 (14%), Positives = 74/236 (31%), Gaps = 25/236 (10%)

Query: 371 QRLRERFL-EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           + LR +   +  +EE+E+ED + EEE+ ++ +  +  E      +   +   GE + +N 
Sbjct: 312 EELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENS 371

Query: 430 -----KRTSDAEHSSKNSSRA-------------SSRSRS-RSKSKSPAKRSRSGSASSR 470
                K    AE   K  + A              S        SK    R + G  +  
Sbjct: 372 GLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGE 431

Query: 471 GSNKSDRSAK----SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAG 526
              +S +  K      +  K          ++ A   +  +     ++            
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491

Query: 527 SDRSA-ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
            +     +  SV        ++   S   DK+A+  S++     +      + +  
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLD 547



 Score = 33.9 bits (78), Expect = 0.33
 Identities = 38/223 (17%), Positives = 71/223 (31%), Gaps = 11/223 (4%)

Query: 342 KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEE---EDEEEEEEEAE 398
             R+K+G           +  L + +F +QR   R  E  + E EE   E E EEE + E
Sbjct: 353 MLRKKLGKLKEGE-DDEENSGLLSMKF-MQRAEARKKEENDAEIEELRRELEGEEESDEE 410

Query: 399 ETQQGETTEAERQE--GEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
           E ++       R++   E GE+E   +      K     +  S           ++ +  
Sbjct: 411 ENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKV 470

Query: 457 SPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSR 516
           +     RS  A      + +   + +   K+  S       + +    S     A+NK  
Sbjct: 471 ANKLLKRSEKAQKE--EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKIS 528

Query: 517 SDRSRSGSA--GSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
               +              +  +  + S   +     D  D+ 
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEE 571


>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655).  This
           domain family is found in viruses, and is approximately
           70 amino acids in length. The family is found in
           association with pfam08716, pfam01661, pfam05409,
           pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
           pfam08715, pfam08710.
          Length = 70

 Score = 32.3 bits (73), Expect = 0.084
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
           PP+EEE+ E+ E EEEE EET + E    +  +G
Sbjct: 7   PPDEEEDCEEYECEEEEIEETCEHEYGTEDDYKG 40


>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in eukaryotes, and is typically
           between 228 and 257 amino acids in length.
          Length = 242

 Score = 34.8 bits (80), Expect = 0.088
 Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
           G EG N  E RT     S ++ SR++SRSR R +  S +    S S++S+G+     S K
Sbjct: 110 GEEGINATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGA----TSGK 165

Query: 481 SDRSAKSDRSARS----DRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGS 527
             +SA  D S  S    D   + + S +  S + A N ++     + S+  
Sbjct: 166 DSKSADIDVSFWSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNL 216



 Score = 29.0 bits (65), Expect = 6.4
 Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 8/98 (8%)

Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSD----- 552
           ++    RSR   + S      R + GS  S RS+ S+ S +   S   ++SA  D     
Sbjct: 120 RTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTS-KGATSGKDSKSADIDVSFWS 178

Query: 553 --GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSR 588
             G D         + ASS  + N+      + +S + 
Sbjct: 179 EFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNL 216



 Score = 29.0 bits (65), Expect = 7.2
 Identities = 21/91 (23%), Positives = 35/91 (38%)

Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
              R   S +++RSR      + S S  SRS ++    S   +KS   D S  S     +
Sbjct: 125 PRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSFWSEFGIDT 184

Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSS 582
            G    +  ++ S  A +     S  +  S+
Sbjct: 185 PGQKSKSPQKASSTPAGNTNQGQSQNAQSSN 215


>gnl|CDD|237620 PRK14136, recX, recombination regulator RecX; Provisional.
          Length = 309

 Score = 35.0 bits (80), Expect = 0.089
 Identities = 42/198 (21%), Positives = 77/198 (38%), Gaps = 9/198 (4%)

Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKR-SRSGSASSRGSNK-SDRSAKSDRSAKS 487
           +R   A+    +    ++R+     S+   +  S  G  + R + + SD +  S   A  
Sbjct: 3   RRRQGADPQEADHPARAARAGRPHASRETDRTVSGEGRPAGRTATRASDDALVSFEIAAP 62

Query: 488 DRSARSDRS-DKSAASNRSRSGSAASNKSRS-DRSRSGSAGSDRSAASNKSVRSDKSAAS 545
           D     D S D    + R  SG    +      R+    A +  SAA       D    +
Sbjct: 63  DEPFDDDESFDAHDRARRRVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRT 122

Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV---SRARSNRS 602
           ++  R   + ++A       S S+ + ++ AA +R+S++     + R++   SR   +R+
Sbjct: 123 SQHPRR--TRRAAGPFHSDSSPSASSEDDGAARSRASSRPARSLKGRALGYLSRREYSRA 180

Query: 603 RSASKSGSSAASSRSGSP 620
             A K    A  S S  P
Sbjct: 181 ELARKLAPYADESDSVEP 198



 Score = 32.3 bits (73), Expect = 0.64
 Identities = 26/150 (17%), Positives = 55/150 (36%)

Query: 474 KSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
           +  + A    +    R+AR+ R   S  ++R+ SG        + R+   +  S   AA 
Sbjct: 3   RRRQGADPQEADHPARAARAGRPHASRETDRTVSGEGRPAGRTATRASDDALVSFEIAAP 62

Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRS 593
           ++    D+S  ++  AR   S     +       ++ A   + +S     ++     +R+
Sbjct: 63  DEPFDDDESFDAHDRARRRVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRT 122

Query: 594 VSRARSNRSRSASKSGSSAASSRSGSPAGS 623
               R  R  +      S+ S+ S     +
Sbjct: 123 SQHPRRTRRAAGPFHSDSSPSASSEDDGAA 152


>gnl|CDD|213932 TIGR04319, SerAla_Lrha_rpt, surface protein repeat Ser-Ala-175.
           This serine and alanine-rich surface protein repeat,
           about 175 amino acids long, occurs up to nine times in
           surface proteins of some Lactobacillus strains,
           particularly in Lactobacillus rhamnosus. Members
           proteins have the N-terminal variant signal sequence
           described by TIGR03715 and C-terminal LPXTG signals for
           surface attachment by sortase.
          Length = 175

 Score = 34.1 bits (78), Expect = 0.12
 Identities = 44/167 (26%), Positives = 80/167 (47%), Gaps = 5/167 (2%)

Query: 453 SKSKSPAKRSRSGSASSRGSNKSDRS---AKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
           S   S A    S +AS    N+S  S     S  ++ +   A S  +D SAAS+ +   S
Sbjct: 5   SSVASSANAVASSAASRFPDNQSLASLAKTASSANSVTSSYAASASADASAASSLATKVS 64

Query: 510 AASNKSRSDRSR--SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSA 567
           +A+  + S  S+  S  A  +  AAS+ + ++ K+A++  S     +  ++   S +  A
Sbjct: 65  SANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASKALSEALQA 124

Query: 568 SSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
           SS A+  S+A++ ++  +GS + +       S  S +AS +   A+S
Sbjct: 125 SSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSASVVASS 171


>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675).  The
           members of this family are sequences derived from
           hypothetical plant proteins of unknown function. One
           member of this family is annotated as a putative
           RNA-binding protein, but no evidence was found to
           support this.
          Length = 283

 Score = 34.6 bits (79), Expect = 0.12
 Identities = 33/185 (17%), Positives = 62/185 (33%), Gaps = 9/185 (4%)

Query: 409 ERQEGEGGEREGGGEGENDNE--KRTSD--AEHSSKNSSRASSRSRSRSKSKSPAKRSRS 464
           +R  G G +        +D     RT+    E   +   R   +S  R ++K   KRS  
Sbjct: 36  DRSNGGGLDGGDVRTIVSDMFGLMRTTSLPVETEEEWRKRKEMQSLRRLEAKR--KRSEK 93

Query: 465 GSA--SSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
                S+     S  +A    S +   S++   +    A++ S  G ++S  S  D    
Sbjct: 94  EYNGVSNGDDMDSINAANGGGSGRDKESSKGVETGSPIATSSSSGGPSSSGLSELDVG-V 152

Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSS 582
               S ++      V +  +   + S  +D  D   +  + +          S  +N   
Sbjct: 153 IGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDAETNKSNTTSPTKRSRRPPSVDTNSGG 212

Query: 583 NKSGS 587
             S +
Sbjct: 213 EASLN 217



 Score = 29.5 bits (66), Expect = 5.0
 Identities = 21/125 (16%), Positives = 45/125 (36%), Gaps = 2/125 (1%)

Query: 391 EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSR 450
           EEE  + +E Q     EA+R+  E  +   G    +D +   +     S     +S    
Sbjct: 69  EEEWRKRKEMQSLRRLEAKRKRSE--KEYNGVSNGDDMDSINAANGGGSGRDKESSKGVE 126

Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
           + S   + +      S+     +      +S ++ +      +  +   + S+ +    A
Sbjct: 127 TGSPIATSSSSGGPSSSGLSELDVGVIGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDA 186

Query: 511 ASNKS 515
            +NKS
Sbjct: 187 ETNKS 191


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 34.2 bits (78), Expect = 0.14
 Identities = 29/101 (28%), Positives = 38/101 (37%), Gaps = 10/101 (9%)

Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
           SGSAGS  S  S       +S+ SNRS       ++A  R   R +    S +   S   
Sbjct: 63  SGSAGSQHSEGS-------RSSGSNRS--DGERSRAADGREGGRKSGGSGSESEHTSRSG 113

Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
           S +SG R      S    +   S   S S  +S  S    G
Sbjct: 114 SRRSGGRRAPSERSGPPPSE-GSVRSSLSHPSSHSSYGAPG 153



 Score = 30.7 bits (69), Expect = 1.5
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 10/93 (10%)

Query: 464 SGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSG 523
           +GS  S GS       +S  S +SD         +        SGS + + SRS   RSG
Sbjct: 66  AGSQHSEGS-------RSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSG 118

Query: 524 S--AGSDRSAASNKSVRSDKSAASNRSARSDGS 554
              A S+RS     S  S +S+ S+ S+ S   
Sbjct: 119 GRRAPSERSGPPP-SEGSVRSSLSHPSSHSSYG 150



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 7/83 (8%)

Query: 435 AEHSSKNSSRASSRSRSRSKS-----KSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
           ++HS  + S  S+RS           +   K   SGS S   S    R +   R+     
Sbjct: 68  SQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS 127

Query: 490 SARSDRSDKSAASNRSRSGSAAS 512
                 S+ S  S+ S   S +S
Sbjct: 128 GPPP--SEGSVRSSLSHPSSHSS 148



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 30/92 (32%), Positives = 43/92 (46%), Gaps = 9/92 (9%)

Query: 499 SAASNRSR-SGSAASNKSRSDRSRS--GSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
           SA S  S  S S+ SN+S  +RSR+  G  G  +S  S     S+    S   +R  G  
Sbjct: 65  SAGSQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSG----SESEHTSRSGSRRSGGR 120

Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGS 587
           ++ S   RS    S+ S  S+ S+ SS+ S  
Sbjct: 121 RAPS--ERSGPPPSEGSVRSSLSHPSSHSSYG 150



 Score = 29.5 bits (66), Expect = 3.5
 Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 443 SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAAS 502
           SR+S  +RS  +    A     G  S    ++S+ +++S       R A S+RS     S
Sbjct: 74  SRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS-GPPPS 132

Query: 503 NRSRSGSAASNKSRSDRSRSG 523
             S   S +   S S     G
Sbjct: 133 EGSVRSSLSHPSSHSSYGAPG 153



 Score = 29.2 bits (65), Expect = 5.2
 Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 1/90 (1%)

Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
            S    S+ S  SRS   ++S  +RSR+      G       ++S+ +++S  S RS   
Sbjct: 62  GSGSAGSQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSG-SRRSGGR 120

Query: 497 DKSAASNRSRSGSAASNKSRSDRSRSGSAG 526
              +  +       +   S S  S   S G
Sbjct: 121 RAPSERSGPPPSEGSVRSSLSHPSSHSSYG 150


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 34.8 bits (80), Expect = 0.14
 Identities = 29/169 (17%), Positives = 61/169 (36%), Gaps = 9/169 (5%)

Query: 445 ASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNR 504
           AS+     + + +        +A++  +  +   A     A    + R D +    A   
Sbjct: 393 ASAVPAVTAVTGAAGAALAPKAAAAAAA--TRAEAPPAAPAPPATADRGDDAADGDAPVP 450

Query: 505 SRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRS 564
           +++ + AS  SR D  R     +D  +AS  +  +   AA   + R+     +       
Sbjct: 451 AKANARASADSRCD-ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPD 509

Query: 565 RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
             A + AS   A +  +     +R  + + +        +A   G++AA
Sbjct: 510 ARAPAAASREDAPAAAAPPAPEARPPTPAAAAP------AARAGGAAAA 552



 Score = 34.1 bits (78), Expect = 0.25
 Identities = 44/213 (20%), Positives = 77/213 (36%), Gaps = 23/213 (10%)

Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
               +  ++    DA   +K ++RAS+ SR   +   P   S S SA           A 
Sbjct: 433 PATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAP----------AS 482

Query: 481 SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR-----SRSGSAGSDRSAASNK 535
                 +   A    +  +A         A +  SR D        +  A     AA+  
Sbjct: 483 DAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAP 542

Query: 536 SVRSDKSAASNRSARSDG----SDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
           + R+  +AA+    R+ G    SD+ A   + ++ A++ A+    A+ R + +  +    
Sbjct: 543 AARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRAR 602

Query: 592 RSVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
            +   A  N +  A +    AA SR   P   D
Sbjct: 603 AATGDAPPNGAARAEQ----AAESRGAPPPWED 631


>gnl|CDD|223046 PHA03328, PHA03328, nuclear egress lamina protein UL31;
           Provisional.
          Length = 316

 Score = 34.3 bits (79), Expect = 0.15
 Identities = 18/45 (40%), Positives = 25/45 (55%)

Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK 607
           R  S+S + S  +A  +R   + GSR RSR  SR RS+R  S  +
Sbjct: 7   RRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRR 51



 Score = 32.0 bits (73), Expect = 0.76
 Identities = 13/63 (20%), Positives = 17/63 (26%)

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
            D      +SS   SR  +R   +     SR  S        S RS+            R
Sbjct: 2   DDLALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERDR 61

Query: 493 SDR 495
              
Sbjct: 62  YRA 64



 Score = 29.7 bits (67), Expect = 4.6
 Identities = 16/52 (30%), Positives = 23/52 (44%)

Query: 565 RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSR 616
           R +SS    +  A+ RS       SR RS  R+R   SR +S   +  A + 
Sbjct: 7   RRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTE 58



 Score = 29.7 bits (67), Expect = 5.1
 Identities = 16/55 (29%), Positives = 24/55 (43%)

Query: 538 RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
            S     S R+AR    D    +R RSR  S + S+  +++ R+      R R R
Sbjct: 9   SSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERDRYR 63


>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
          Length = 609

 Score = 34.9 bits (80), Expect = 0.15
 Identities = 38/140 (27%), Positives = 54/140 (38%), Gaps = 15/140 (10%)

Query: 426 NDNEKRTSDAEHSSKNSS--RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
           N NE R+S     S ++S    S     RS   S  ++     ++SR S+ S     S+R
Sbjct: 460 NVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSSSVSFSQGDSNR 519

Query: 484 SAKSDRSARSDRS---------DKSAASNRSRS-GSAASNKSRSDRSRSGSAGSDRSA-- 531
           S  SD +  SD S         D+    +   S  S +SN S S  SR+      R    
Sbjct: 520 SHYSDETNISDYSYPMADLDLEDEEPMEDHPHSPQSISSNNSMSRTSRALQNSQRRRPPT 579

Query: 532 -ASNKSVRSDKSAASNRSAR 550
              + S R   +A   R  R
Sbjct: 580 MFPSSSERQQNNARLRRRTR 599



 Score = 31.5 bits (71), Expect = 1.8
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 3/61 (4%)

Query: 560 NRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS---NRSRSASKSGSSAASSR 616
           N +  RS+S  + ++S ++   S     RS   S+ R R+   + SRS+S S S   S+R
Sbjct: 460 NVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSSSVSFSQGDSNR 519

Query: 617 S 617
           S
Sbjct: 520 S 520


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 35.0 bits (81), Expect = 0.15
 Identities = 16/67 (23%), Positives = 35/67 (52%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
           EEEE+  ++E+EE+ ++  + E  + +  + E  E +   + E+D E+   + E   K S
Sbjct: 324 EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383

Query: 443 SRASSRS 449
           + ++   
Sbjct: 384 AESTRSE 390



 Score = 30.7 bits (70), Expect = 2.7
 Identities = 16/73 (21%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGGEGENDNEKRTSDAEHSSKN 441
            EE+EE+EE+  ++ +E    +  E E ++ +   E E   + ++D+E    + E   + 
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378

Query: 442 SSRASSRSRSRSK 454
             + S+ S     
Sbjct: 379 KKKKSAESTRSEL 391



 Score = 30.0 bits (68), Expect = 5.3
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           ++  +EEE+EEDE+ ++E+ EE ++ E  + +++  E    E
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 28.8 bits (65), Expect = 9.6
 Identities = 23/95 (24%), Positives = 41/95 (43%), Gaps = 23/95 (24%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEE--------------TQQGETTEAERQEGE 414
           R++R+R        EEE++E+EE+ +E A++                QGE  E E ++G 
Sbjct: 278 RLRRMRG-------EEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGV 330

Query: 415 GGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
               E   + ++D E+   D + S +        S
Sbjct: 331 DD--EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 34.5 bits (80), Expect = 0.16
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
               EEEEEE+EEEEEEE  E
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSE 318



 Score = 32.9 bits (76), Expect = 0.41
 Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 380 PPNEEEEEEDEEEEEEEAEE 399
              EEEEEE+EEEEEE +EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319



 Score = 32.9 bits (76), Expect = 0.43
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 379 EPPNEEEEEEDEEEEEEEA 397
           E   EEEEEE+EE  EEEA
Sbjct: 303 EEEEEEEEEEEEEPSEEEA 321



 Score = 32.1 bits (74), Expect = 0.75
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
               EEEEEE+EEEEE   EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320



 Score = 31.0 bits (71), Expect = 1.7
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 379 EPPNEEEEEEDEEEEEEEAE 398
           E   EEEEEE+E  EEE A 
Sbjct: 304 EEEEEEEEEEEEPSEEEAAA 323



 Score = 29.8 bits (68), Expect = 3.7
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 380 PPNEEEEEEDEEEEEEEAEE 399
                 EEE+EEEEEEE EE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEE 315



 Score = 29.8 bits (68), Expect = 4.6
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
           +    EEEEE+EEEEEEE   
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPS 317



 Score = 29.4 bits (67), Expect = 6.1
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
                 EEEE+EEEEEEE E 
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 34.8 bits (80), Expect = 0.16
 Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN- 441
           EEE+   +  EEE+ EE +    +     + +    E   E E D+E    +AE  +K  
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDE----EAESPAKAE 56

Query: 442 -SSRASSRSRSRSKSKS 457
            S R  +R +   K K 
Sbjct: 57  ISKREKARLKELKKQKK 73



 Score = 33.6 bits (77), Expect = 0.41
 Identities = 16/55 (29%), Positives = 21/55 (38%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
            +  EEEDEEE E  A         +    E E  + E   E E+  +   S  E
Sbjct: 7   TQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKRE 61



 Score = 32.5 bits (74), Expect = 0.76
 Identities = 13/53 (24%), Positives = 21/53 (39%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
           E  N +  EE++EEE E    +   ++ + E    +  E E   E      K 
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKA 55



 Score = 31.7 bits (72), Expect = 1.4
 Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 3/76 (3%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E     +  E+E+EEE EA     G  ++ +    E    E   E +++  +  + AE S
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAED---EDEDEEDDEEAESPAKAEIS 58

Query: 439 SKNSSRASSRSRSRSK 454
            +  +R     + + +
Sbjct: 59  KREKARLKELKKQKKQ 74


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 34.6 bits (80), Expect = 0.18
 Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 3/57 (5%)

Query: 354 TRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 410
            RLV+  R   AT             PP     E+D++  E+E E+    +    E 
Sbjct: 247 ARLVLAPR---ATRLPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300



 Score = 30.0 bits (68), Expect = 4.8
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 1/29 (3%)

Query: 595 SRARSNRSRSASKSGSSAASSRSGSPAGS 623
             AR+ R+RSA  +G++   +R G P GS
Sbjct: 320 GGARA-RARSAGGAGAAQKGNRRGRPLGS 347


>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645).  These
           sequences are derived from a number of hypothetical
           plant proteins. The region in question is approximately
           270 amino acids long. Some members of this family are
           annotated as yeast pheromone receptor proteins AR781 but
           no literature was found to support this.
          Length = 191

 Score = 33.3 bits (76), Expect = 0.20
 Identities = 40/167 (23%), Positives = 65/167 (38%), Gaps = 8/167 (4%)

Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
           S + S SP  R R GS  S  +    R  ++ RS   +R      SD  +       G  
Sbjct: 21  SVASSPSPRDRGRVGSPRSESAFAPARLRRALRSLSPER--GGGSSDSESTDEGELEGVP 78

Query: 511 ASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQ 570
            S+   S    S S  S  + +S +   SD      RSA SDG D    + ++       
Sbjct: 79  PSSYCVSSSPASSSRKSSSTGSSKRWRLSD--LLLFRSA-SDGKDAFVFDAAKDPLLKYS 135

Query: 571 ASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
             ++ A+  +    + ++SR  S S+ +      AS      A++R+
Sbjct: 136 PLSSPASPVK---PASAKSRESSASKGKRRGKTVASAHELLYATNRA 179


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 32.7 bits (75), Expect = 0.22
 Identities = 9/24 (37%), Positives = 22/24 (91%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGET 405
           +EE+EEE+++EE+++ +E+++ E+
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEES 140



 Score = 31.9 bits (73), Expect = 0.39
 Identities = 9/25 (36%), Positives = 20/25 (80%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETT 406
           ++EE+EE+E++EE++ E+  + E +
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 30.8 bits (70), Expect = 0.90
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 383 EEEEEEDEEEEEEEAEETQQ 402
           EE++EED++E+E E EE+  
Sbjct: 123 EEDDEEDDDEDESEEEESPV 142


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 34.1 bits (78), Expect = 0.25
 Identities = 19/73 (26%), Positives = 28/73 (38%)

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
              D E+     +    +  R  SR RSRS+ +S  +R R      RG ++S    +  R
Sbjct: 1   RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYR 60

Query: 484 SAKSDRSARSDRS 496
                   R DR 
Sbjct: 61  PRGDRSYRRDDRR 73



 Score = 34.1 bits (78), Expect = 0.25
 Identities = 31/131 (23%), Positives = 45/131 (34%), Gaps = 8/131 (6%)

Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR-SGSAGSDRSAASNKSVR 538
             DR     R+  + RSDK    +R RS S   ++ R DR    G  G  RS + N+  R
Sbjct: 2   YRDRERGRLRN-DTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYR 60

Query: 539 SDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSN------KSGSRSRSR 592
                +  R  R  G +         R   +      A   R  +      K G     +
Sbjct: 61  PRGDRSYRRDDRRSGRNTKEPLTEAERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQ 120

Query: 593 SVSRARSNRSR 603
            +    S RS+
Sbjct: 121 CIKDRNSRRSK 131



 Score = 33.7 bits (77), Expect = 0.33
 Identities = 14/59 (23%), Positives = 18/59 (30%)

Query: 553 GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
                   R R+ +  S      +     S     R R R   R R  RSRS S +   
Sbjct: 1   RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYY 59



 Score = 32.6 bits (74), Expect = 0.76
 Identities = 22/60 (36%), Positives = 23/60 (38%), Gaps = 3/60 (5%)

Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
           R R        N     R S+K   RSR RS SR RS R R           SRS SP  
Sbjct: 1   RYRDRERGRLRND---TRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNR 57



 Score = 32.2 bits (73), Expect = 0.86
 Identities = 20/77 (25%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK-SDRSAKSDRSARSDRSDKSAAS 502
           R   R R R ++ +  +  +    S R S   DRS +  DR     R  RS RS      
Sbjct: 1   RYRDRERGRLRNDTR-RSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRS-RSRSPNRY 58

Query: 503 NRSRSGSAASNKSRSDR 519
            R R   +     R   
Sbjct: 59  YRPRGDRSYRRDDRRSG 75



 Score = 30.2 bits (68), Expect = 3.3
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 1/60 (1%)

Query: 561 RSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
           R R R    +  N++  S++   +S  RSRSR  SR R +R     + G S + S +   
Sbjct: 1   RYRDRE-RGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYY 59



 Score = 29.9 bits (67), Expect = 4.7
 Identities = 19/80 (23%), Positives = 29/80 (36%), Gaps = 5/80 (6%)

Query: 443 SRASSRSRSRSKS-KSPAKRSRSGSASS---RGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
            R   R R R+ + +S   R RS   S    R   + DR     R  +S RS   +R  +
Sbjct: 2   YRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRS-RSRSPNRYYR 60

Query: 499 SAASNRSRSGSAASNKSRSD 518
                  R     S ++  +
Sbjct: 61  PRGDRSYRRDDRRSGRNTKE 80


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 32.8 bits (75), Expect = 0.29
 Identities = 22/48 (45%), Positives = 31/48 (64%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           E R++ L ++  E   EEEEE+ EEE EEE EE Q+G   E E ++G+
Sbjct: 68  EARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGD 115



 Score = 28.2 bits (63), Expect = 8.7
 Identities = 16/37 (43%), Positives = 22/37 (59%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
            E++EE+E  EEEE +  ++ E  E E Q G  GE E
Sbjct: 74  LEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEE 110


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 31.7 bits (72), Expect = 0.30
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
           E   E+EEE+ +EEE+EE EE
Sbjct: 76  EKAEEKEEEKKKEEEKEEEEE 96



 Score = 30.5 bits (69), Expect = 0.76
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
           +   +EEE++ EEE+EEE EE   G
Sbjct: 77  KAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
                EE+ +E+EEE++ EE ++ E  EA
Sbjct: 70  AAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 29.4 bits (66), Expect = 2.0
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
           E   EE+EEE ++EEE+E EE +   
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 28.6 bits (64), Expect = 2.9
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
                  EE  EE+EEE ++ ++ E  E E   G G 
Sbjct: 68  AAAAAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGA 104



 Score = 27.8 bits (62), Expect = 5.5
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTE 407
               E+ EE++EE+++EE +E ++ E   
Sbjct: 72  AAAEEKAEEKEEEKKKEEEKEEEEEEALA 100


>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
          Length = 2316

 Score = 33.9 bits (78), Expect = 0.31
 Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 3/92 (3%)

Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
           K  R A +    R  R  +   +   R+  A S  +   R R   AG+DR   S +    
Sbjct: 9   KRRRDAAAPADPRPRRRPRLGDAPAPRTARADSGATPRGRPR---AGADREPTSEQLRDY 65

Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQA 571
           ++      + + D   +    R       ++A
Sbjct: 66  ERWLDRAAAGQLDAQREQQCARLWFLVQQARA 97


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.8 bits (77), Expect = 0.33
 Identities = 17/66 (25%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 382  NEEEEEED--EEEEEEEAEETQQGETTEAERQEGEGGEREG-GGEGENDNEKRTSDAEHS 438
            ++E+  ED  EE  +E  E T+ G  ++ E ++GE  E +      + D +   + AE  
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEAD 4117

Query: 439  SKNSSR 444
             +N+ +
Sbjct: 4118 EENTDK 4123



 Score = 33.4 bits (76), Expect = 0.50
 Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 4/130 (3%)

Query: 375  ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSD 434
            +   E  NEE+     + +E EA E +       +       E     E   D ++  +D
Sbjct: 3835 DDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVND 3894

Query: 435  AEHSSKNS-SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARS 493
                  NS +       +        ++S   SA+   +N+SD  +K D +   +   R 
Sbjct: 3895 IPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAA---NNESDLVSKEDDNKALEDKDRQ 3951

Query: 494  DRSDKSAASN 503
            ++ D+   S+
Sbjct: 3952 EKEDEEEMSD 3961


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 31.2 bits (71), Expect = 0.37
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
               EEEEEE+EEEEEEE  E +   
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 30.0 bits (68), Expect = 0.96
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
               EEEEEE+EEEEEEE E 
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES 92



 Score = 29.6 bits (67), Expect = 1.3
 Identities = 12/29 (41%), Positives = 15/29 (51%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
                   E+EEEEEEE EE ++    EA
Sbjct: 68  AAAAAAAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 29.6 bits (67), Expect = 1.4
 Identities = 15/25 (60%), Positives = 16/25 (64%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
               EEEEEE+EEEEEE  EE   G
Sbjct: 75  AEEEEEEEEEEEEEEEESEEEAMAG 99



 Score = 27.3 bits (61), Expect = 8.1
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
                    +EEEEEEE EE ++ E+ E 
Sbjct: 67  AAAAAAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|149635 pfam08646, Rep_fac-A_C, Replication factor-A C terminal domain.
           This domain is found at the C terminal of replication
           factor A. Replication factor A (RPA) binds
           single-stranded DNA and is involved in replication,
           repair, and recombination of DNA.
          Length = 146

 Score = 32.2 bits (74), Expect = 0.37
 Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 21/88 (23%)

Query: 258 VMDESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEY----KMAREYNWNVKSKSC 313
           + D +G+ +V  F   AE +     ++L  L+  D+ E+E      + REY + ++ K  
Sbjct: 60  ISDHTGQIWVTLFNEAAEEILGMSAEELMELKEEDDNEFEKIFQKALFREYVFRLRVK-- 117

Query: 314 KGYEENYFLIVRDDAVYYNELETRVRLS 341
              EE            YN+ E+RVR +
Sbjct: 118 ---EET-----------YND-ESRVRYT 130


>gnl|CDD|140324 PTZ00303, PTZ00303, phosphatidylinositol kinase; Provisional.
          Length = 1374

 Score = 33.5 bits (76), Expect = 0.39
 Identities = 38/168 (22%), Positives = 65/168 (38%), Gaps = 7/168 (4%)

Query: 460 KRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR 519
           +R R   A+ RG+N S  +  S R       A +  +  S  +  + SGS A +   S  
Sbjct: 96  ERLRQAYAARRGANTSGGTVGSGRCGVHGDPAAAAATAGSCGAGGAVSGSGAIHSGVSGI 155

Query: 520 SRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSAS------NRSRSRSASSQASN 573
                  S   A      + +++     S  SD   + +S         +  SA + AS 
Sbjct: 156 PSMCCLNSCSPAGGPVPRKLEETLRETLSENSDFFFRVSSVAYHLDPNPQGASACTAASA 215

Query: 574 NSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK-SGSSAASSRSGSP 620
           ++ +    +   G+RS  R    A +     AS  + + AA+S +G P
Sbjct: 216 SAGSGVMGNAGVGARSPRRGEGLASAAAGGVASLLTNTIAAASATGPP 263


>gnl|CDD|151621 pfam11179, DUF2967, Protein of unknown function (DUF2967).  This
           family of proteins with unknown function appears to be
           restricted to Drosophila.
          Length = 284

 Score = 33.0 bits (74), Expect = 0.39
 Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 1/143 (0%)

Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
           +S +SA+S   ARS  S K+  +  ++    AS K     +  G  G   S      V +
Sbjct: 20  QSPKSARSSEKARSHFSFKNYGNEDAKDAGRASGKGTGSGAGVGGGGGSGSGVGGNGVMA 79

Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSR-SRSVSRAR 598
             +A     A+ + +  + + + R+R     A N +A +N +   + SR+  S SVS + 
Sbjct: 80  AGAADEAADAQEEDAIANRNKKLRARERDPMAGNANANANANGGSNSSRNHISPSVSPSH 139

Query: 599 SNRSRSASKSGSSAASSRSGSPA 621
           SNR+    +   +   + +G+ A
Sbjct: 140 SNRASPKREKKRTTPIASTGAIA 162


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 33.4 bits (77), Expect = 0.41
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
           E R      R LE  ++++EE DEE+EE    E  + +  + +  +    ER
Sbjct: 113 EPRYDDA-YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163



 Score = 30.3 bits (69), Expect = 3.1
 Identities = 9/52 (17%), Positives = 28/52 (53%)

Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
               R  + + +   +++++E+ +EE+EE+ +++  E  + +  + +   RE
Sbjct: 111 QDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 33.3 bits (76), Expect = 0.44
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 6/82 (7%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN 441
           + E++ +DEEEE+ ++E+  Q      +    +  + E G E  + N    +    S   
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQ-----YDGMLVDSSDEEEGEEAPSINYNEDTSESESD-E 239

Query: 442 SSRASSRSRSRSKSKSPAKRSR 463
           S    S SRS S S+  +  S+
Sbjct: 240 SDSEISESRSVSDSEESSPPSK 261


>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
          Length = 619

 Score = 33.4 bits (75), Expect = 0.45
 Identities = 54/244 (22%), Positives = 88/244 (36%), Gaps = 25/244 (10%)

Query: 355 RLVVTHRPLNATEFRIQ-RLRERFLEPPNEEEEEEDEEEE---------EEEAEETQQGE 404
           R   TH PL  +EF  +   RE   EP N         E+         E++    Q+  
Sbjct: 162 REAPTHYPLGVSEFSDRGESREAHQEPLNTPVSLLSATEDVTRTFAPGGEDDYLGGQRKV 221

Query: 405 TTEAERQEGEGGEREGGGE----GENDNEKRTSDAEHSSKNSSR---ASSRSRSRSKSKS 457
             E  ++  E     GGG     G ++ + + +D  H    +     A S  R +   +S
Sbjct: 222 NVETPKRLEEDPAAPGGGSDYLSGVSNYQSKVTDPTHKGGEAGVPEIAESLGRMKVTDES 281

Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
           P ++SR G    R  +   RS + D   +SD +  S     +     S++G      +R 
Sbjct: 282 PDQKSRQG----REEDFPTRSHEFDLKKESDINKNS----PARFGGESKAGMEEDFPTRG 333

Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
           D       G D    ++     + S    R    +  D+S      S      AS  SA 
Sbjct: 334 DVKVESGLGRDLPTGTHDQFSPELSRPKERDDSEETKDESTHETKPSTYTEQLASATSAI 393

Query: 578 SNRS 581
           +N++
Sbjct: 394 TNKA 397


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 33.4 bits (76), Expect = 0.46
 Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 4/67 (5%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN 441
           NE+EE+ + E E +E    Q+ E T+ E + GE    E   + + D+    SD +  +  
Sbjct: 223 NEQEEQGEGEGEGQEGSAPQESEATDRESESGE----EEMVQSDQDDLPDESDDDSETPG 278

Query: 442 SSRASSR 448
                +R
Sbjct: 279 EGARPAR 285



 Score = 31.1 bits (70), Expect = 2.3
 Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN---DNEKRTSDAEHSS 439
           EE  ++ E E+EE+ ++ Q  E  + E+ EGEG  +EG    E+   D E  + + E   
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQ 260

Query: 440 KNSSRASSRSRSRSKSKSPAKR 461
            +       S   S++     R
Sbjct: 261 SDQDDLPDESDDDSETPGEGAR 282



 Score = 30.7 bits (69), Expect = 3.1
 Identities = 12/54 (22%), Positives = 24/54 (44%)

Query: 389 DEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
           +  EE  +  E++  E  + ++      E +G GEGE        ++E + + S
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRES 251



 Score = 29.5 bits (66), Expect = 6.1
 Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
           E  EE  ++ E E+ E+    + TE E QE +G     G EG    E   +D E  S   
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENE-QEEQGEGEGEGQEGSAPQESEATDRESESGEE 256

Query: 443 SRASS 447
               S
Sbjct: 257 EMVQS 261


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 33.2 bits (76), Expect = 0.46
 Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 1/54 (1%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG-EGGEREGGGEGENDNEK 430
           L+PP   E   DE   E E E T   E+     +E  E    E     E +N K
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNK 248



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQ--GETTEAERQEGEGG 416
           RI++LR +  E P EE   E+E  +E++  + +Q  G   +A+   G GG
Sbjct: 122 RIRKLRRKAGEEPEEELPLEEEATKEDDLADPRQVLGVYYKAKDPPGAGG 171


>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
          Length = 456

 Score = 32.9 bits (75), Expect = 0.50
 Identities = 19/69 (27%), Positives = 29/69 (42%), Gaps = 5/69 (7%)

Query: 397 AEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
           AE  Q G      +Q G GG  +GGG G+   + R  +    S ++  A   SR    +K
Sbjct: 384 AEPIQNGR-----QQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSRRLGDAK 438

Query: 457 SPAKRSRSG 465
              ++ R  
Sbjct: 439 PAGEQQRRR 447


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 33.1 bits (76), Expect = 0.51
 Identities = 14/29 (48%), Positives = 15/29 (51%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQE 412
             EEE+EEEEEEE EE    E       E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431



 Score = 32.7 bits (75), Expect = 0.52
 Identities = 13/18 (72%), Positives = 14/18 (77%)

Query: 383 EEEEEEDEEEEEEEAEET 400
             EEEE+EEEEEEE EE 
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420



 Score = 32.7 bits (75), Expect = 0.65
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEA 408
            EEEEE+EEEEEEE E   +     A
Sbjct: 404 AEEEEEEEEEEEEEEEPVAEVMMMPA 429



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
           EEEEEE+EEEEEEE             +    GG +
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIK 441



 Score = 30.8 bits (70), Expect = 2.4
 Identities = 15/28 (53%), Positives = 17/28 (60%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQ 402
           ER+     EEEEEE+EEEEE  AE    
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMM 427


>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain.  This is
           the C-terminus of a family of proteins conserved from
           plants to humans. The plant members are localised to the
           Golgi proteins and appear to regulate membrane
           trafficking, as they are required for rapid vesicle
           accumulation at the tip of the pollen tube. The
           C-terminus probably contains the Golgi localisation
           signal and it is well-conserved.
          Length = 451

 Score = 33.0 bits (76), Expect = 0.52
 Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 417 EREGGGEGENDNEKRTSDAEHSSKNSSRASSRSR-SRSKSKSPAKRSRSGSASSRGSNKS 475
            R+     E+ +  +  +    S +SS  S RS  +   S S +    S S  S+ S+KS
Sbjct: 295 GRDSSLSEEDSDSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSLLSSSSILSKSSDKS 354

Query: 476 DRSAKSDRSAKSDRSARSD 494
                S + +KS++    D
Sbjct: 355 KDKRFSLKLSKSEKEESDD 373



 Score = 32.3 bits (74), Expect = 0.81
 Identities = 29/92 (31%), Positives = 39/92 (42%), Gaps = 6/92 (6%)

Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
             D S   + S  S R + S  S+ S SGS  S+ +    S S S     S++S  S  S
Sbjct: 295 GRDSSLSEEDSDSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSL---LSSSSILSKSS 351

Query: 540 DKSAASNRSARSDGSDKSAS---NRSRSRSAS 568
           DKS     S +   S+K  S       SRS+ 
Sbjct: 352 DKSKDKRFSLKLSKSEKEESDDLEEMISRSSK 383


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 33.2 bits (76), Expect = 0.54
 Identities = 40/222 (18%), Positives = 77/222 (34%), Gaps = 23/222 (10%)

Query: 425 ENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK---- 480
            +      S       +S   S+ SR++S S   + +  + ++S   S+ +  S      
Sbjct: 89  SSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNK 148

Query: 481 -----SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNK 535
                 + S  +  +        + AS+ S+  +     S        S     +   + 
Sbjct: 149 LPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFFNYLIDP 208

Query: 536 SVRSDKSAASNRSARSDGSDKSASN----RSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
           S  S  ++A    + +   + S +N       S    +Q+  N+   N SS+ +   S  
Sbjct: 209 SSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELTSIY 268

Query: 592 RSVSRARSNR----------SRSASKSGSSAASSRSGSPAGS 623
            SV   R+ R          + S+S    SA + +  SP GS
Sbjct: 269 GSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGS 310



 Score = 32.8 bits (75), Expect = 0.59
 Identities = 37/169 (21%), Positives = 63/169 (37%), Gaps = 8/169 (4%)

Query: 436 EHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDR 495
            H SK S  +S+ S+      S         + +  S     +   + S     ++    
Sbjct: 17  LHHSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKITS---- 72

Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
              S++S R  SGS +   S S            S++ N S  S   + S  S+   G+ 
Sbjct: 73  ---SSSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNA 129

Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRS 604
            S+   S S ++S   SN     N + + S + ++S S S   +  S S
Sbjct: 130 NSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGS-SFINTPASSS 177


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 32.2 bits (74), Expect = 0.55
 Identities = 12/47 (25%), Positives = 14/47 (29%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 424
             PP       +  E EE  EE     T E      +    EG   G
Sbjct: 30  PPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAG 76



 Score = 29.9 bits (68), Expect = 3.5
 Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG-EGGEREGGGEG 424
           E    PP   E EE  EEE     E +     +   +EG   G+ EG  EG
Sbjct: 34  EPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAGKAEGRAEG 84


>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 572

 Score = 33.0 bits (75), Expect = 0.56
 Identities = 27/164 (16%), Positives = 49/164 (29%), Gaps = 14/164 (8%)

Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
           P +    G  +   +   D      R A+  R+A   R     +     S S +      
Sbjct: 396 PPRVPVEGEEADDEAG--DSVGTIFREAREQRAAEEQRRGGGRSGPGGGSRSGSVGGG-- 451

Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
              R   AG+D      +  R +  A +  +A    +   A       +A+      +A 
Sbjct: 452 --GRRDGAGADGKPRPRRKPRVEGEADA--AAAGAETPVVA------AAAAQAPGVVAAD 501

Query: 578 SNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPA 621
             R+  K   R   R V  A    +   + +     +    +P 
Sbjct: 502 GERAPRKRRRRRNGRPVEGAEPVSTPVPAPAAPRKPTQVVATPV 545



 Score = 31.5 bits (71), Expect = 1.7
 Identities = 23/131 (17%), Positives = 48/131 (36%), Gaps = 3/131 (2%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
           E EE D+E  +      ++     A  ++  GG R G G G              +    
Sbjct: 402 EGEEADDEAGDSVGTIFREAREQRAAEEQRRGGGRSGPGGGSRSGSVGGGGRRDGAGADG 461

Query: 444 RASSRSRSRSKSKSPAKRSRSGS---ASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
           +   R + R + ++ A  + + +   A++        +A  +R+ +  R  R+ R  + A
Sbjct: 462 KPRPRRKPRVEGEADAAAAGAETPVVAAAAAQAPGVVAADGERAPRKRRRRRNGRPVEGA 521

Query: 501 ASNRSRSGSAA 511
               +   + A
Sbjct: 522 EPVSTPVPAPA 532


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 32.6 bits (74), Expect = 0.62
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
            +EE ++E E +E+   +  E    E Q  +  ER
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 32.5 bits (74), Expect = 0.64
 Identities = 29/187 (15%), Positives = 57/187 (30%), Gaps = 15/187 (8%)

Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
             S   S  + +      + +   G+   R ++ +  S KS R        R    ++ +
Sbjct: 232 KPSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRR----SLQRVLTDERKS 287

Query: 501 ASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASN 560
           +S R+ S      +SR++ S       + S     S+ S  S+   ++ R        S+
Sbjct: 288 SSRRTPSL----LRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSD 343

Query: 561 RSRSRSASSQASNNSAASNRS-------SNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
            SR      +         +          KS     S+ +      RS    +     A
Sbjct: 344 SSRQHEEKLKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQGVQVTA 403

Query: 614 SSRSGSP 620
           +      
Sbjct: 404 TPAGNRK 410



 Score = 32.5 bits (74), Expect = 0.74
 Identities = 31/172 (18%), Positives = 54/172 (31%), Gaps = 9/172 (5%)

Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
           KR +    SS+ S R S +     + KS ++R+ S   S   S+  +   +         
Sbjct: 260 KRQTSPASSSQKSRRRSLQRVLTDERKSSSRRTPSLLRSRTNSSLIEFLKRESSENLLPS 319

Query: 490 SARSDRSD---KSAASNRSRSGSA-ASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
            +    SD         R    S  +       + +       +      S        S
Sbjct: 320 LSSRTSSDLLKNKRLQKRQVDLSDSSRQHEEKLKKKQMLNEQKKELKRAISALK----KS 375

Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
           NR   S    ++A  RS       Q +  + A NR  +   + + S S   +
Sbjct: 376 NRELSSKDIVETAEKRSSQFGQGVQVT-ATPAGNRKKDAGLTEATSSSFPSS 426


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 32.3 bits (73), Expect = 0.70
 Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 1/89 (1%)

Query: 532 ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
            +      D SA S  SA            S S     +  N S + +RS ++SGSR   
Sbjct: 290 DNRAGTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYR 349

Query: 592 RSVSRARSNRSRSASKSGSSAASSRSGSP 620
           R   R    R RS S+  +   SS   S 
Sbjct: 350 RRRGRGVPGR-RSESRQDTVLVSSSEASD 377



 Score = 29.2 bits (65), Expect = 6.7
 Identities = 19/64 (29%), Positives = 27/64 (42%)

Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSG 618
            NR+ +    S  S +SA  +  + +    S S      R NRS S S+S S + S R  
Sbjct: 290 DNRAGTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYR 349

Query: 619 SPAG 622
              G
Sbjct: 350 RRRG 353



 Score = 28.8 bits (64), Expect = 9.0
 Identities = 20/63 (31%), Positives = 26/63 (41%), Gaps = 5/63 (7%)

Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
           HS+    R   R+RS S+S     RSRS S S R   +  R     RS     +     S
Sbjct: 319 HSTSGERRGRRRNRSESRS-----RSRSRSGSRRYRRRRGRGVPGRRSESRQDTVLVSSS 373

Query: 497 DKS 499
           + S
Sbjct: 374 EAS 376


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 31.5 bits (72), Expect = 0.71
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
            E EE DEEE+ +E  E +  +    +R+E E  ++E
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 30.7 bits (70), Expect = 1.1
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
           E   EEE E +E +EEE+ +E  + E  + +R++    ER
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141



 Score = 30.7 bits (70), Expect = 1.1
 Identities = 17/45 (37%), Positives = 24/45 (53%)

Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           +R +  +   L+   +EEEEE+E E EE  EE Q  E  E E  +
Sbjct: 86  WRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAK 130


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
           domain family is found in bacteria and eukaryotes, and
           is approximately 100 amino acids in length.
          Length = 100

 Score = 30.3 bits (69), Expect = 0.74
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGE 404
           L  P E+E  E EEE++++ +E ++ E
Sbjct: 18  LPEPEEDEILELEEEDDDDEDEEEEYE 44



 Score = 30.3 bits (69), Expect = 0.78
 Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 4/37 (10%)

Query: 380 PPNEEEE----EEDEEEEEEEAEETQQGETTEAERQE 412
           P  EE+E    EE+++++E+E EE +   +   E QE
Sbjct: 19  PEPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 30.4 bits (69), Expect = 0.75
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
           E   E +E E+EE+EEE  EE
Sbjct: 78  EAAAEADEAEEEEKEEEAEEE 98



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
                E +E +EEE+EEEAEE
Sbjct: 77  AEAAAEADEAEEEEKEEEAEE 97



 Score = 28.9 bits (65), Expect = 2.5
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
              +E EEEE EEE EEE+++
Sbjct: 81  AEADEAEEEEKEEEAEEESDD 101


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
           family of proteins are involved in maintaining Golgi
           structure. They stimulate the formation of Golgi stacks
           and ribbons, and are involved in intra-Golgi retrograde
           transport. Two main interactions have been
           characterized: one with RAB1A that has been activated by
           GTP-binding and another with isoform CASP of CUTL1.
          Length = 509

 Score = 32.5 bits (74), Expect = 0.75
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 541 KSAASNRSARSD-GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
           K+A   R   ++   D+     S + S+   + + S     SSNK+ SRS   +  +  S
Sbjct: 11  KAAVLQRRKATEEDDDEDLLEVSSTTSSPVGSISWSVRETASSNKARSRSEKWNPDQPGS 70

Query: 600 NRSRSASKSGSSAASSRSGSPA 621
             S  +SK   ++ S  S    
Sbjct: 71  RVSSPSSKKDGTSRSLSSQVDD 92


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 32.0 bits (73), Expect = 0.94
 Identities = 38/212 (17%), Positives = 60/212 (28%), Gaps = 27/212 (12%)

Query: 380 PPNEEEEEEDEEEEEEEAEE---------TQQGETTEAERQEGEG--GEREGGGEGENDN 428
           P     +  DE  E    E          T  G T+EA  ++ +     RE G  G    
Sbjct: 9   PGRWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGGVAT 68

Query: 429 --------EKRTSDAEHSSKNSSRASSRSRSRSKSKSP-AKRSRSGSASSRGSNKSDRSA 479
                     R  +      +S  AS           P   R  S       S       
Sbjct: 69  STIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLY 128

Query: 480 KSDRSAKSDRSARSDRSDKSAAS--NRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV 537
           + +          S+ S +  +     SR   + S K ++         S          
Sbjct: 129 QVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKHK----THK 184

Query: 538 RSDKSAASNRSAR-SDGSDKSASNRSRSRSAS 568
           RS+ S  + ++AR   GS     ++ +  S S
Sbjct: 185 RSEFSGLTKKAARKRKGSFVFKPSQLKELSGS 216


>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
            Mg-chelatases [Coenzyme metabolism].
          Length = 1388

 Score = 32.4 bits (74), Expect = 0.96
 Identities = 21/89 (23%), Positives = 38/89 (42%), Gaps = 1/89 (1%)

Query: 535  KSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
            + V +   AA+  +A +  S    +  S   +A S AS+ S+A+   S+ +GS + S   
Sbjct: 1277 ELVEAATYAATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSD-AGSGADSTGP 1335

Query: 595  SRARSNRSRSASKSGSSAASSRSGSPAGS 623
            S   +     A +       + SGS +  
Sbjct: 1336 SLGAAGSVTGAGEGYEMTKEAVSGSESTG 1364



 Score = 30.8 bits (70), Expect = 2.5
 Identities = 25/134 (18%), Positives = 38/134 (28%), Gaps = 12/134 (8%)

Query: 467  ASSRGSNKSDRSAKSDRSAKSDRSARSDRS-----DKSAASNRSRSGSAASNKSRSDRSR 521
            A     N       +  +   D       S     +   A+  + +  AA   + +    
Sbjct: 1242 AELVAENGVACCHHTCGNPALDEWVLGLVSVPGVPELVEAATYAATRYAAFAPASATPGA 1301

Query: 522  SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
              S G   +A S     +  SA+S     SD    + S      +A S            
Sbjct: 1302 PESVG--TTAVS-----TASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEGYEMTK 1354

Query: 582  SNKSGSRSRSRSVS 595
               SGS S   S S
Sbjct: 1355 EAVSGSESTGMSFS 1368



 Score = 30.1 bits (68), Expect = 4.6
 Identities = 28/202 (13%), Positives = 49/202 (24%), Gaps = 47/202 (23%)

Query: 371  QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
            Q +  R LE       + D+E     AE   +           E G         N    
Sbjct: 1212 QSITARLLEAARRGLWDADDETLRLLAEVYAE--------LVAENGVACCHHTCGNPALD 1263

Query: 431  RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
                        S         + + +  + +    AS+                     
Sbjct: 1264 EWVL-----GLVSVPGVPELVEAATYAATRYAAFAPASATPGA----------------P 1302

Query: 491  ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
                 +  S AS+ S +    S+           +G+D +           S  +  S  
Sbjct: 1303 ESVGTTAVSTASSASSATVTGSD---------AGSGADSTGP---------SLGAAGSVT 1344

Query: 551  SDGSDKSASNRSRSRSASSQAS 572
              G     +  + S S S+  S
Sbjct: 1345 GAGEGYEMTKEAVSGSESTGMS 1366



 Score = 29.7 bits (67), Expect = 6.6
 Identities = 14/83 (16%), Positives = 28/83 (33%), Gaps = 12/83 (14%)

Query: 543  AASNRSARSDGSDKSASNRSRS-RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNR 601
             A +       S         +   A+++ +  + AS        S   +          
Sbjct: 1260 PALDEWVLGLVSVPGVPELVEAATYAATRYAAFAPASATPGA-PESVGTT---------- 1308

Query: 602  SRSASKSGSSAASSRSGSPAGSD 624
            + S + S SSA  + S + +G+D
Sbjct: 1309 AVSTASSASSATVTGSDAGSGAD 1331


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 31.4 bits (71), Expect = 0.99
 Identities = 20/79 (25%), Positives = 28/79 (35%), Gaps = 7/79 (8%)

Query: 527 SDRSAASNKSVRSDKSAASNRSA--RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
           S  S  S+ S  S  S A    +    D S  S    S S S S  +S  S       + 
Sbjct: 55  SHSSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQS-----LPST 109

Query: 585 SGSRSRSRSVSRARSNRSR 603
           +G R +S + +     R  
Sbjct: 110 TGVRPQSSAENANSEKRKL 128


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 31.1 bits (71), Expect = 1.0
 Identities = 17/34 (50%), Positives = 19/34 (55%)

Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
            NE+ EEE EE E EEA E +  E T  E  E E
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAE 37



 Score = 30.4 bits (69), Expect = 2.0
 Identities = 19/55 (34%), Positives = 27/55 (49%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
           +E EEE  EEE E E  + +Q +  E E +  E  ER    + + +N KR    E
Sbjct: 23  DEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQADFENYKRRIQKE 77



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
           E   E E EE  E+E EE    ++ E    + ++ +  E E 
Sbjct: 10  EEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEA 51



 Score = 30.0 bits (68), Expect = 2.9
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           E   + EEE +E E EE  E+  + ET E E + 
Sbjct: 3   EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEA 36



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
                E EE  ++E EEE  EE  + E  + E+ +
Sbjct: 11  EVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 31.7 bits (72), Expect = 1.1
 Identities = 16/57 (28%), Positives = 21/57 (36%), Gaps = 3/57 (5%)

Query: 365 ATEFRIQRLRERFLEPPNEEEEEED---EEEEEEEAEETQQGETTEAERQEGEGGER 418
             E  +QR  E  LE    E  EED    E      +E +  +  E  + E E   R
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 31.7 bits (72), Expect = 1.2
 Identities = 16/73 (21%), Positives = 29/73 (39%)

Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
           ++ E+  +    +  E  ++ EEEE  + E EE  E +Q E  E + +  E         
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431

Query: 424 GENDNEKRTSDAE 436
            + D  +     E
Sbjct: 432 EDEDELEEAQPEE 444



 Score = 30.5 bits (69), Expect = 3.2
 Identities = 16/57 (28%), Positives = 21/57 (36%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
               EEEE  + E EE  E  Q     E  R        E   E E++ E+   + E
Sbjct: 389 ISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445



 Score = 29.0 bits (65), Expect = 9.3
 Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 1/63 (1%)

Query: 365 ATEFRIQRLRERFLEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
             E   QR  E  L+    E  E E  E  EE+A   +     E E ++ +  E     E
Sbjct: 385 VAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444

Query: 424 GEN 426
            E 
Sbjct: 445 EEE 447


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 31.6 bits (71), Expect = 1.1
 Identities = 22/109 (20%), Positives = 47/109 (43%), Gaps = 3/109 (2%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
           E + ++++   ++     E +E E ++E+  E+       E   Q+ +GG+ E   E E+
Sbjct: 14  ELKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGE---QKQKGGKEESDKETED 70

Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS 475
            +  R +  E  S +S++ S     +       K    G  + +GS  +
Sbjct: 71  ASVDRRTHTEVGSGSSAKGSGERADKHADGGDGKTRGGGGDADKGSGAT 119


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 31.5 bits (71), Expect = 1.1
 Identities = 33/142 (23%), Positives = 64/142 (45%), Gaps = 10/142 (7%)

Query: 449 SRSRSKSKSPAKRSRSGSASSRGSNKSDR------SAKSDRSAKSDRSARSDRSDKSAAS 502
           S +R   + PA R+R G+  +RG ++++R       A++D    +  S+  +R      S
Sbjct: 31  SPARVSFEEPADRARRGAGQARGRSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPS 90

Query: 503 NRSRSGSAASNK--SRSDRSRSGSAGSDRSAASN--KSVRSDKSAASNRSARSDGSDKSA 558
            R  +   AS    +R  R  +GS G  R A S   +   + K A++ R  +S  +D +A
Sbjct: 91  RRPHAQPEASGPGPARGARGPAGSRGRGRRAESPSPRDPPNPKGASAPRGRKSACADSAA 150

Query: 559 SNRSRSRSASSQASNNSAASNR 580
              + + +A  +    +  + +
Sbjct: 151 LLDAPAPAAPKRQKTPAGLARK 172


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 10/45 (22%), Positives = 16/45 (35%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           +      + E E E + E+ EE E+    E  E      +   R 
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707



 Score = 29.8 bits (67), Expect = 4.8
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
            +   E EEE++++ EE + +E +     ++E +    GE     E E+D E
Sbjct: 641 HQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 16/54 (29%), Positives = 23/54 (42%)

Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
            K    +  + K   + S    SRS  K  AK+ R  S+SS  S+ S  S+   
Sbjct: 68  RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSES 121


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 28/175 (16%), Positives = 59/175 (33%), Gaps = 3/175 (1%)

Query: 452 RSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAA 511
           + K ++ A++  +   S+        +  S  S  S R    + +  + +  R+ + + +
Sbjct: 438 QEKQRAKARKKLANKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKS 497

Query: 512 SNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQA 571
           S    +      +  + + +   +     K  A++RS       +S +N + S   SS  
Sbjct: 498 SLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRS-SKGNLIRSGANGNASSDLSSPG 556

Query: 572 SNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS--GSPAGSD 624
           S NS +   S             S  +   S  ++         RS    P  S 
Sbjct: 557 SINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESV 611



 Score = 29.4 bits (66), Expect = 6.6
 Identities = 37/135 (27%), Positives = 57/135 (42%), Gaps = 11/135 (8%)

Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV-RSDKSAASNRSARSDG 553
           R +K  A  R +    A+  S       GS  S  S  S  S  R+D++  SN  +    
Sbjct: 437 RQEKQRAKARKK---LANKTSTVMEPPYGSTES--SVPSTPSTRRNDRNITSNTPSLKRT 491

Query: 554 SDKSASNRSRSRSASSQASNN---SAASNRSSN--KSGSRSRSRSVSRARSNRSRSASKS 608
            + + S+ S+  S  S+++ N    +   R +   K  + SRS   +  RS  + +AS  
Sbjct: 492 PNLTKSSLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSD 551

Query: 609 GSSAASSRSGSPAGS 623
            SS  S  S SP  S
Sbjct: 552 LSSPGSINSKSPEHS 566


>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
          Length = 796

 Score = 31.7 bits (72), Expect = 1.3
 Identities = 15/61 (24%), Positives = 27/61 (44%)

Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
            S S SS     S+ S+RS   +    R+ S+  + S  + S++ S S++  +       
Sbjct: 12  ASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCSASASASSSATSSSASLVANGAVALL 71

Query: 623 S 623
           S
Sbjct: 72  S 72


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 31.7 bits (72), Expect = 1.3
 Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 6/64 (9%)

Query: 376 RFLEPPNEEEEEEDEEEEEEEAEETQQGET------TEAERQEGEGGEREGGGEGENDNE 429
           R+ E P  E+EEE+E+E++++     + E       TE      E G      + E D +
Sbjct: 130 RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDED 189

Query: 430 KRTS 433
              S
Sbjct: 190 ADLS 193


>gnl|CDD|219726 pfam08143, CBFNT, CBFNT (NUC161) domain.  This N terminal domain is
           found in proteins of CARG-binding factor A-like
           proteins.
          Length = 65

 Score = 28.6 bits (64), Expect = 1.3
 Identities = 13/47 (27%), Positives = 18/47 (38%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGG 421
           E+F+E     E  E+  +  E A     G       + G GG  EG 
Sbjct: 6   EQFMETKFANENGEEGADGREGAPIEGMGGGPAGAPRPGGGGGAEGD 52


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 31.7 bits (72), Expect = 1.4
 Identities = 30/152 (19%), Positives = 54/152 (35%), Gaps = 6/152 (3%)

Query: 460 KRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR 519
           K   S       + KS  ++K+     +        S  S + +RS S S   +      
Sbjct: 104 KAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFS------ 157

Query: 520 SRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASN 579
             S    S +   SNK      S  S   + S G   S+ NRS  RS+     ++    +
Sbjct: 158 PSSTIQQSPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALRSSGDKKD 217

Query: 580 RSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
            ++++    +    V   +   + SA K+ + 
Sbjct: 218 ITTDEKYLETFLAEVDEEQHMITSSAGKNATP 249


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 31.3 bits (71), Expect = 1.5
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDA 435
           E+E++D+ EEEE        E ++A  +E E GE E   E   D+   + DA
Sbjct: 232 EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME-AAEASADDTPDSDDA 282



 Score = 30.6 bits (69), Expect = 2.4
 Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 1/80 (1%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN-D 427
              R+    L   +  EE  DE E  +  +   + +  E E  +    E  G  +  + D
Sbjct: 194 AFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253

Query: 428 NEKRTSDAEHSSKNSSRASS 447
           ++  + + E     ++ AS+
Sbjct: 254 SDASSEEMESGEMEAAEASA 273


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
           family contains the human KIAA1549 protein which has
           been found to be fused fused to BRAF gene in many cases
           of pilocytic astrocytomas. The fusion is due mainly to a
           tandem duplication of 2 Mb at 7q34. Although nothing is
           known about the function of KIAA1549 protein, the BRAF
           protein is a well characterized oncoprotein. It is a
           serine/threonine protein kinase which is implicated in
           MAP/ERK signalling, a critical pathway for the
           regulation of cell division, differentiation and
           secretion.
          Length = 684

 Score = 31.4 bits (71), Expect = 1.6
 Identities = 34/140 (24%), Positives = 51/140 (36%), Gaps = 9/140 (6%)

Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSR--GSNKSDRSAKSDRS 484
           D E  T  ++ S   SS    R   +S S      S   S  S      KS R + +   
Sbjct: 359 DAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGD---SEGSSVISNRSSREKSGRPSTTPSV 415

Query: 485 AKSDRSARSDRSDKSAASNRSRSG-SAASNKSRSDR-SRSGSAGSDRSAASNKSVRSDKS 542
               +  + +   K A  + +    S+AS     DR SR  S   DRS+   + +     
Sbjct: 416 TAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRSSGKIQLIAMQPM 475

Query: 543 AASNRSARSDGS--DKSASN 560
            A     R + S  D++A N
Sbjct: 476 PAPPVPPRFEPSRDDRAAEN 495



 Score = 30.7 bits (69), Expect = 2.7
 Identities = 25/107 (23%), Positives = 41/107 (38%), Gaps = 4/107 (3%)

Query: 478 SAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV 537
           + KS  S     + +  R  KS +   S   S  SN+S  ++S   S     +A      
Sbjct: 364 TPKSKSSQDGSSNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQQ---- 419

Query: 538 RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
           +  K     + A   G+D+  S+ S        +  +S   +RSS K
Sbjct: 420 KPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRSSGK 466


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 30.8 bits (69), Expect = 1.7
 Identities = 16/75 (21%), Positives = 38/75 (50%)

Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           + RL+   ++  ++ +++E++E++E+  E   Q E  E   Q+ +   +      + +N 
Sbjct: 138 LNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNN 197

Query: 430 KRTSDAEHSSKNSSR 444
           KR +  E  S + +R
Sbjct: 198 KRKAMEEGLSPDRTR 212


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 30.3 bits (67), Expect = 1.7
 Identities = 24/71 (33%), Positives = 32/71 (45%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           +P +E EEEE+ E EEE  EE +  E  E E +  E GE E   E E + E         
Sbjct: 59  DPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEP 118

Query: 439 SKNSSRASSRS 449
                 A+ +S
Sbjct: 119 EPEPGAATLKS 129


>gnl|CDD|215556 PLN03064, PLN03064, alpha,alpha-trehalose-phosphate synthase
           (UDP-forming); Provisional.
          Length = 934

 Score = 31.3 bits (71), Expect = 1.7
 Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 11/102 (10%)

Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
           S A  RSRS      KS  DR  SG   S RS + N   +  +S  S        S KS 
Sbjct: 818 SPAIARSRSPDGL--KSSGDRRPSGKLPSSRSNSKNSQGKKQRSLLS--------SAKSG 867

Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSN 600
            N + S  +  + S      +    K G    S +V R RSN
Sbjct: 868 VNHAASHGSDRRPSPEKIGWSVLDLK-GENYFSCAVGRKRSN 908



 Score = 31.3 bits (71), Expect = 1.7
 Identities = 20/64 (31%), Positives = 26/64 (40%), Gaps = 1/64 (1%)

Query: 443 SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRS-AKSDRSARSDRSDKSAA 501
           S A +RSRS    KS   R  SG   S  SN  +   K  RS   S +S  +  +   + 
Sbjct: 818 SPAIARSRSPDGLKSSGDRRPSGKLPSSRSNSKNSQGKKQRSLLSSAKSGVNHAASHGSD 877

Query: 502 SNRS 505
              S
Sbjct: 878 RRPS 881


>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2.  Hap2 is one
           of three histone acetyltransferases proteins that, in
           yeasts, are found associated with elongating forms of
           RNA polymerase II (Elongator). The Haps can be isolated
           in two forms, as a six-subunit complex with Elongator
           and as a complex of the three proteins on their own. The
           role of the Hap complex in transcription is still
           speculative, being possibly to keep the HAT activity of
           free Elongator in check, allowing histone acetylation
           only in the presence of a transcribing polymerase, or
           the interaction with Haps might render Elongator
           susceptible to modifications thereby altering its
           activity.
          Length = 280

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 29/116 (25%), Positives = 43/116 (37%), Gaps = 30/116 (25%)

Query: 333 ELETRVR-LSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQR---LRERFLEPPNE----- 383
           + E   R LS+    +    N   +V+        E R +    L E F+  P       
Sbjct: 165 DKEALDRSLSEPVFPLNEGLNGVGIVLE------LENRRKSGRSLTETFILSPATHEYES 218

Query: 384 -EEEEEDEEEEEEEAEETQQGETT------EAERQEGEG--------GEREGGGEG 424
            +E+ E + E  EE  E+ +  TT      E +R   E          E+ GGGEG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEG 274


>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
          Length = 287

 Score = 31.1 bits (69), Expect = 1.9
 Identities = 23/106 (21%), Positives = 45/106 (42%), Gaps = 4/106 (3%)

Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
           ++ R D   +S   +R R+ S  S +      +  SA    + AS+ +  ++ + + +  
Sbjct: 17  KTMRGDGHHESCFHHRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGK 76

Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
             S      AS+RS     S+ +S+N  A       +  RS+  +V
Sbjct: 77  QHS----CKASSRSSHNRGSTSSSHNHHAHRGPHQSAHRRSKHDAV 118


>gnl|CDD|227928 COG5641, GAT1, GATA Zn-finger-containing transcription factor
           [Transcription].
          Length = 498

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 29/196 (14%), Positives = 71/196 (36%), Gaps = 2/196 (1%)

Query: 424 GENDNEKRTSDAEHSSKNSSRASSRS--RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKS 481
             ++N     +  ++    + ++S+       + +     S +GS S R +N      + 
Sbjct: 218 SSHNNNDSNGENANTESIGNSSASKLTKSWEERPQGRQLLSDAGSLSPRSNNPKSPLLEG 277

Query: 482 DRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDK 541
              + S +   + +    +   RS   ++ +       S   S   + S ++ K   S +
Sbjct: 278 LMGSTSLQPVSTPKLVLPSDKKRSTLTTSTATPLWRRTSDKSSFSCNASGSALKPPGSKR 337

Query: 542 SAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNR 601
                    S  S+ +  N S + +    +   ++ S  +  K  +++RS   ++ +  R
Sbjct: 338 PLLPKPDPNSKRSNATCMNCSSTPTNKILSPPTTSNSPGAQVKLPNQTRSTGATKKKITR 397

Query: 602 SRSASKSGSSAASSRS 617
            R  S    + +SS  
Sbjct: 398 RRMNSGKIPALSSSMK 413


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 28.7 bits (65), Expect = 2.0
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 379 EPPNEEEEEEDEEEEEEEA 397
               EEE++E+EEEEEE+ 
Sbjct: 63  AAAAEEEKKEEEEEEEEDD 81



 Score = 28.0 bits (63), Expect = 3.6
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 383 EEEEEEDEEEEEEEAEE 399
             EEE+ EEEEEEE ++
Sbjct: 65  AAEEEKKEEEEEEEEDD 81



 Score = 28.0 bits (63), Expect = 4.2
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 380 PPNEEEEEEDEEEEEEEAEE 399
                   E+E++EEEE EE
Sbjct: 59  AAAAAAAAEEEKKEEEEEEE 78



 Score = 27.2 bits (61), Expect = 6.5
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 379 EPPNEEEEEEDEEEEEEE 396
                EEE+++EEEEEEE
Sbjct: 62  AAAAAEEEKKEEEEEEEE 79


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.2 bits (71), Expect = 2.1
 Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 333 ELETRV-RLSKRRQKVGAAPNNTRLVVTHRP-LNATEFRIQRLRERFLEPPNEEEEEEDE 390
           E E R+ RL +RR+ +        L+   R  +     R + LRER  E   E EE+ + 
Sbjct: 506 EAEDRIERLEERREDL------EELIAERRETIEEKRERAEELRERAAELEAEAEEKREA 559

Query: 391 EEEEEEAEETQQGETTEAERQEGEGGER 418
             E EE  E  + E  E   +  E  ER
Sbjct: 560 AAEAEEEAEEAREEVAELNSKLAELKER 587


>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
          Length = 346

 Score = 30.9 bits (70), Expect = 2.2
 Identities = 11/49 (22%), Positives = 16/49 (32%)

Query: 551 SDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
            D SDK   N    +     +  + A   R   +    S+SR      S
Sbjct: 20  DDDSDKRTHNPRTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDS 68


>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
           This family consists of several mammalian protein
           phosphatase inhibitor 1 (IPP-1) and dopamine- and
           cAMP-regulated neuronal phosphoprotein (DARPP-32)
           proteins. Protein phosphatase inhibitor-1 is involved in
           signal transduction and is an endogenous inhibitor of
           protein phosphatase-1. It has been demonstrated that
           DARPP-32, if phosphorylated, can inhibit
           protein-phosphatase-1. DARPP-32 has a key role in many
           neurotransmitter pathways throughout the brain and has
           been shown to be involved in controlling receptors, ion
           channels and other physiological factors including the
           brain's response to drugs of abuse, such as cocaine,
           opiates and nicotine. DARPP-32 is reciprocally regulated
           by the two neurotransmitters that are most often
           implicated in schizophrenia - dopamine and glutamate.
           Dopamine activates DARPP-32 through the D1 receptor
           pathway and disables DARPP-32 through the D2 receptor.
           Glutamate, acting through the N-methyl-d-aspartate
           receptor, renders DARPP-32 inactive. A mutant form of
           DARPP-32 has been linked with gastric cancers.
          Length = 170

 Score = 29.9 bits (67), Expect = 2.2
 Identities = 19/88 (21%), Positives = 27/88 (30%), Gaps = 2/88 (2%)

Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           +QR+ E  L+     EE +  EEE+E     + G  T            E G     D +
Sbjct: 83  LQRIVESHLQSQGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQ 142

Query: 430 KR--TSDAEHSSKNSSRASSRSRSRSKS 455
                    H +   S       S   S
Sbjct: 143 SEVGLEGPWHRTAELSGPEQEGDSEEPS 170


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 31.0 bits (70), Expect = 2.4
 Identities = 43/269 (15%), Positives = 87/269 (32%), Gaps = 31/269 (11%)

Query: 368  FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAE--RQEGEGG--------- 416
             ++ R   +   P    ++  + E  EE    +       AE  + +G  G         
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAA 1254

Query: 417  -EREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS 475
             E+E   E  +  ++  +    S+   S     +     ++  A R +  ++ S  S+  
Sbjct: 1255 KEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSD 1314

Query: 476  DR--------SAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGS 527
            D         S       K  R   +     +     ++    A+ +S           +
Sbjct: 1315 DDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPA 1374

Query: 528  DRSAAS-NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
            +    S  K VR  +++  N+        KS S   R+ +     S+ + + + SS K  
Sbjct: 1375 EAIGISPEKKVRKMRASPFNK--------KSGSVLGRAATNKETESSENVSGSSSSEKDE 1426

Query: 587  --SRSRSRSVSRARSNRSRSASKSGSSAA 613
                ++ R     R   +   S S S +A
Sbjct: 1427 IDVSAKPRPQRANRKQTTYVLSDSESESA 1455


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 30.7 bits (69), Expect = 2.4
 Identities = 31/174 (17%), Positives = 72/174 (41%), Gaps = 11/174 (6%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
           EE  EE++ + + +A    + +     +Q+ EG   E   E E    K  + A   +K +
Sbjct: 83  EEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT--EEVTEEEKAAAKAKAAAAAKAKAA 140

Query: 443 SRASSRSRSRSKSKSP--------AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD 494
           + A  +     +            AK   + +A ++ +  + + A        + +    
Sbjct: 141 ALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEK 200

Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
              K+ A+  +++ +AA  K ++ +  +G +G + + A   +    K+AA+ R+
Sbjct: 201 AKAKAKAAAAAKAKAAALAKQKASQG-NGDSGDEDAKAKAIAAAKAKAAAAARA 253



 Score = 29.9 bits (67), Expect = 4.5
 Identities = 39/219 (17%), Positives = 92/219 (42%), Gaps = 11/219 (5%)

Query: 371 QRLRERFLEPPNEEEEEEDEEE--EEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
           +  R+R L   +  E  + EEE  E+E+A       T E  ++      +          
Sbjct: 20  EEARKR-LVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQK 78

Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSD 488
            + T +     K  ++A + + +++K+ + AK+ R G+       K+   AK+  +AK+ 
Sbjct: 79  REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKA- 137

Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
             A +    K   +         ++K ++ ++++ +A   ++AA  K     + AA    
Sbjct: 138 -KAAALAKQKREGTEEVTEEEEETDKEKA-KAKAAAAAKAKAAALAK-----QKAAEAGE 190

Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGS 587
              + +++  +      +A+++A   + A  ++S  +G 
Sbjct: 191 GTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGD 229


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 30.7 bits (69), Expect = 2.5
 Identities = 9/78 (11%), Positives = 20/78 (25%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
           +     + + E  +   ++     +E  +    E         D  K    A     +  
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDP 441

Query: 444 RASSRSRSRSKSKSPAKR 461
                 R +    S  +R
Sbjct: 442 AEPDGIRRQDDFSSRRRR 459



 Score = 29.6 bits (66), Expect = 5.6
 Identities = 19/177 (10%), Positives = 46/177 (25%), Gaps = 10/177 (5%)

Query: 388 EDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS 446
            DE  E E AE       +  +       GE     +     +   ++        S   
Sbjct: 292 VDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQ 351

Query: 447 SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSR 506
              R    + +  + S +     +  + + ++  + +      SA  +     A+     
Sbjct: 352 VADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDE 411

Query: 507 SGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
           +      K+      +      +    ++   +               D  +S R R
Sbjct: 412 TEPEVPEKA------APIPDPAKP---DELAVAGPGDDPAEPDGIRRQDDFSSRRRR 459



 Score = 29.6 bits (66), Expect = 6.1
 Identities = 8/74 (10%), Positives = 17/74 (22%), Gaps = 7/74 (9%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
           E      EE     +E   + E    E+        +        +E   +         
Sbjct: 392 EAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAK-------PDELAVAGPGDDPAEP 444

Query: 443 SRASSRSRSRSKSK 456
                +    S+ +
Sbjct: 445 DGIRRQDDFSSRRR 458


>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
          Length = 420

 Score = 30.6 bits (69), Expect = 2.5
 Identities = 21/104 (20%), Positives = 33/104 (31%), Gaps = 6/104 (5%)

Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
                  G   ++     N       S+ S R    D S           S+   +  +S
Sbjct: 309 NMGHDSRGPYNANYGGQFNSRSGRTGSSESIRGFTYDSSTTYGG------SSYGTSQTDS 362

Query: 576 AASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGS 619
            ++  S +   S +   S S   S    S++  GSS  SS S  
Sbjct: 363 TSTYGSRSTFDSSTGGGSQSGGGSTYGGSSTFDGSSRGSSDSFG 406



 Score = 29.8 bits (67), Expect = 4.6
 Identities = 18/82 (21%), Positives = 32/82 (39%), Gaps = 2/82 (2%)

Query: 545 SNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRS--RSRSVSRARSNRS 602
            N  +   GS +S    +   S +   S+   +   S++  GSRS   S +   ++S   
Sbjct: 326 FNSRSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGG 385

Query: 603 RSASKSGSSAASSRSGSPAGSD 624
            +   S +   SSR  S +   
Sbjct: 386 STYGGSSTFDGSSRGSSDSFGV 407


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 30.1 bits (68), Expect = 2.5
 Identities = 14/45 (31%), Positives = 21/45 (46%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           E      NE +EE+  +EE+ E EE ++ E  E   +  E    E
Sbjct: 14  EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58


>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
          Length = 433

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 9/62 (14%), Positives = 23/62 (37%)

Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASS 615
           KS+S  +   +  +  S + + +    N S   S+   ++     +   +  + S    +
Sbjct: 2   KSSSADNSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSEVTKT 61

Query: 616 RS 617
            S
Sbjct: 62  AS 63


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 29.7 bits (67), Expect = 2.9
 Identities = 17/76 (22%), Positives = 28/76 (36%), Gaps = 3/76 (3%)

Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAEET--QQGETTEAERQEGEGGEREGGGEGENDNEK 430
           LRER L+   E EEE + +   +          +  EAE+Q      +    +    + K
Sbjct: 66  LRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERK-SAK 124

Query: 431 RTSDAEHSSKNSSRAS 446
                     + S+AS
Sbjct: 125 DPRGGTQDVVDKSQAS 140


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.4 bits (69), Expect = 3.0
 Identities = 18/79 (22%), Positives = 33/79 (41%), Gaps = 6/79 (7%)

Query: 383 EEEEEEDEEEEEEEAEETQQG-----ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
           +E + E + +E E  EE +        +   E  EG   E +  GE E+D+E    +   
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA-WEGFE 263

Query: 438 SSKNSSRASSRSRSRSKSK 456
           S         R + ++K++
Sbjct: 264 SEYEPINKPVRPKRKTKAQ 282


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 30.4 bits (69), Expect = 3.0
 Identities = 19/107 (17%), Positives = 38/107 (35%)

Query: 363 LNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 422
           L      +  L +   +   +E+  E EEE+ E     ++ +  +AE    E    E   
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASS 469
           +     E    + E     +  A  R+  R    + A+  ++ S  +
Sbjct: 61  KPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALSTGT 107


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
              +EEEE E+EE EEE   E +Q E  E E Q
Sbjct: 19  AEKSEEEEAEEEEPEEENELEEEQQEIAELEAQ 51


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 29.8 bits (68), Expect = 3.2
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
           E   E+++++DE+E+E++ EE   G   E  R+
Sbjct: 50  EEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82



 Score = 29.1 bits (66), Expect = 5.0
 Identities = 13/63 (20%), Positives = 21/63 (33%), Gaps = 7/63 (11%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
           E R+  L   F++P            E E  EE       + E  + +  + +   E E 
Sbjct: 19  EGRLSDLISGFIDPNAAAAAATAAAIESELDEE-------DLEDDDDDDEDEDEDDEEEA 71

Query: 427 DNE 429
           D  
Sbjct: 72  DLG 74



 Score = 28.3 bits (64), Expect = 9.1
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
                E E ++E+ E+++ ++  + E  E E   G 
Sbjct: 40  TAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGP 75


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.5 bits (66), Expect = 3.2
 Identities = 26/115 (22%), Positives = 46/115 (40%), Gaps = 10/115 (8%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
           + PP  ++EEED + + EE EE ++      +  + E  E E   E    + K  S +  
Sbjct: 12  VRPPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSK 71

Query: 438 SSKN----------SSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
           SS            + R S R     +  +P   ++    ++R   KS R  ++ 
Sbjct: 72  SSSQETISIPPTPPARRPSRRWDQTGRFPNPTTGAKPTLRAARREYKSWRELRNR 126


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.2 bits (68), Expect = 3.6
 Identities = 8/29 (27%), Positives = 14/29 (48%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTE 407
             P E EEE DE ++ +  +  +  +  E
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 28.3 bits (63), Expect = 3.6
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
            EE EE DEEEEEEE EE +      ++ +E E
Sbjct: 57  AEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89



 Score = 27.5 bits (61), Expect = 6.4
 Identities = 16/36 (44%), Positives = 19/36 (52%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
             EE EE++EEEEEE EE  +GE       E    E
Sbjct: 56  EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEE 91


>gnl|CDD|240237 PTZ00038, PTZ00038, ferredoxin; Provisional.
          Length = 191

 Score = 29.4 bits (66), Expect = 3.7
 Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 4/64 (6%)

Query: 557 SASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSR 616
           SA       S   ++S +  +S+ S N    R+   S+  ++ N SR +S   +  +S  
Sbjct: 28  SADLAFSICSGRRRSSASFISSSASPN----RASCYSLGNSKGNTSRPSSNGVTPGSSPV 83

Query: 617 SGSP 620
               
Sbjct: 84  RSLL 87


>gnl|CDD|221661 pfam12609, DUF3774, Wound-induced protein.  This family of proteins
           is found in eukaryotes. Proteins in this family are
           typically between 81 and 97 amino acids in length. The
           proteins in the family are often annotated as
           wound-induced proteins however there is little
           accompanying literature to confirm this.
          Length = 78

 Score = 27.8 bits (62), Expect = 3.7
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSA 543
           RS   +A +N  RS      S+ S  SA++  +    K A
Sbjct: 24  RSLHQAAKANARRSASQAGKSSSSSSSASAAAAEVKAKQA 63


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 30.0 bits (68), Expect = 3.8
 Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 9/52 (17%)

Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
            +EED EE  E+AEE         E +E +  E E   E + D E+   D E
Sbjct: 274 GDEEDLEELLEKAEE---------EEEEDDYSESEDEDEEDEDEEEEEDDDE 316



 Score = 29.2 bits (66), Expect = 5.8
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
           E+ EEE+EE++  E+E+  + +  E E ++ + G+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 29.2 bits (66), Expect = 6.8
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
            E  LE   EEEEE+D  E E+E EE +  E  E + +  +
Sbjct: 279 LEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 28.8 bits (64), Expect = 3.9
 Identities = 11/51 (21%), Positives = 24/51 (47%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           E  + +E+ ++  E +   E  ++ ET EA  +  +  + E     E ++E
Sbjct: 77  EAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDE 127


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 30.2 bits (68), Expect = 4.0
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTE 407
           E +EEE EEEEE    E  + E  E
Sbjct: 402 EVDEEEIEEEEEAMQPEEMEMEGFE 426


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.1 bits (68), Expect = 4.0
 Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 8/99 (8%)

Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
            +++L  +      E++E+E EE E+   E  Q+GE   A  Q  E G +         N
Sbjct: 294 ELKKLENKL-----EKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGN 348

Query: 429 EKRTSDAEHS---SKNSSRASSRSRSRSKSKSPAKRSRS 464
           E+   + + S   S+N+ R   + +    +K    R  S
Sbjct: 349 EEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLS 387


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 29.8 bits (67), Expect = 4.1
 Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 16/147 (10%)

Query: 463 RSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
           RS S S R S +         S  S+R+     SD + +   S SG  AS  +R   SR 
Sbjct: 20  RSDSVSKRWSAQLPSGLSRGNSFLSNRN-----SDAAPSGTDSLSGRPASRLNREPSSRP 74

Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA-SNRSRSRSASSQASNNSAASNRS 581
           GS+ S+ +     +   ++ A    +   +G  + +  +  RSRSAS    +NS   +R 
Sbjct: 75  GSSHSEATIVRQ-AKEGERPATPPEARPDEGFVRPSLPSHPRSRSASV---SNSKDGDRP 130

Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKS 608
           S          S S+    R  S +K+
Sbjct: 131 S------DLPPSPSKTMDPRRWSPTKA 151


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 30.1 bits (67), Expect = 4.1
 Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 3/172 (1%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
           R+   +P  +   EE   E+E   +++    + E E  +  G +     E E+DNE+   
Sbjct: 407 RKTRRQPTGKAIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGAINNEDESDNEEVAF 466

Query: 434 DAEHS---SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
           D++     S+ + R      S+       KR R+        + S      +   +S +S
Sbjct: 467 DSDSQFDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEEYKGESAKS 526

Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
           + SD   +    +       A+    S+  +   +  +       S+   KS
Sbjct: 527 SESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKS 578


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.0 bits (67), Expect = 4.2
 Identities = 41/243 (16%), Positives = 76/243 (31%), Gaps = 31/243 (12%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
            + EE+D++ EEE+ + T   +      ++ + G       G   +    +  +  +   
Sbjct: 114 TDSEEDDDDSEEEDNKSTSSKD--GKGSKKTQPGVST--SSGSTTSGTDLNTKQSQTGLG 169

Query: 443 SRASSRSRSRSKSKSPAKRSR-----------SGSASSRGSNKSDRSAKSDRSAKSDRSA 491
           +  S   +  + S+S                  G A +       R+  S          
Sbjct: 170 ASGSHAQQDPAVSQSGVVGVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGG 229

Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
                    ASN SR G    ++ R    R    G          VR   S +S  + R 
Sbjct: 230 VPG--VGILASNTSREGQTQDDQERDGDGRVIEPG-----VGLPGVRVGDSTSSPSTTRP 282

Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSR-------SRSVSRARSNRSRS 604
            GS  + +    S   S+     S++ N  +  + S S         R+++     R   
Sbjct: 283 SGS--TTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITGQMGEREMF 340

Query: 605 ASK 607
           A +
Sbjct: 341 AVQ 343


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 26.9 bits (60), Expect = 4.2
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 383 EEEEEEDEEEEEEEAEETQQ 402
           EE+ E++EE E+EE  E Q 
Sbjct: 23  EEKREDEEENEDEEEGEEQS 42



 Score = 26.9 bits (60), Expect = 4.2
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 383 EEEEEEDEEEEEEEAEETQQGE 404
           EEE+ EDEEE E+E E  +Q E
Sbjct: 22  EEEKREDEEENEDEEEGEEQSE 43



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 382 NEEEEEEDEEEEEEEAEETQQ 402
            E+ E+E+E E+EEE EE  +
Sbjct: 23  EEKREDEEENEDEEEGEEQSE 43


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 29.8 bits (66), Expect = 4.3
 Identities = 43/217 (19%), Positives = 86/217 (39%), Gaps = 15/217 (6%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQG-ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           P   EE++ED +++ EE ++   G E  E E+  G+ G+      G    EK+   +E  
Sbjct: 71  PLPTEEDDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSE-- 128

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
            K +  + S + S S S S             G  ++D++  +  +  +  +  +     
Sbjct: 129 -KQTELSISSAESISPSGSRELNVNLTQTEVEGKKETDKNTPAVENPLTTGNGENTLPAG 187

Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
               N S         SR         G   ++   K+V   ++AA+ +S    GS+ + 
Sbjct: 188 IVEGNPSPPPPQDGIHSREQD------GEGTTSEGQKNVPLPETAATPQSHHDKGSEGTG 241

Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
            +     + ++  + N+  +  + N  GS + S + S
Sbjct: 242 ED-----TKATTVTANTTDTTNTQNSDGSTAVSHTTS 273


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
           plant proteins with unknown function.
          Length = 308

 Score = 29.7 bits (67), Expect = 4.4
 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 8/94 (8%)

Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNN------SAASNRSSNKSG 586
           S+ S    + A+S      + SD    NR  ++S SS              ++ S+    
Sbjct: 91  SSSSKGDPEEASS--RKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDD 148

Query: 587 SRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
            RSR  S + +        S   SS++ S S SP
Sbjct: 149 RRSRDSSEAESTELSREEISPPSSSSSPSSSSSP 182


>gnl|CDD|177912 PLN02272, PLN02272, glyceraldehyde-3-phosphate dehydrogenase.
          Length = 421

 Score = 29.8 bits (67), Expect = 4.4
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 5/60 (8%)

Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
           RS  +  +AA+  S   S++S     D S+  S G   S + + S     S+  + SARS
Sbjct: 8   RSAATAPAAAARGSDFSSSSS-----DPSKVSSVGFSSSLSFSGSSSGASSSLQSCSARS 62


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 378 LEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
           LEP P EE EEE  EE EE   E  + E    +   G+ G
Sbjct: 828 LEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 29.7 bits (66), Expect = 4.6
 Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 4/85 (4%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETT----EAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
            E +E  E ++E++ EE +  E      +++R +  G E+E   E E +       A+  
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADEL 343

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSR 463
           ++       + R    + +P  + R
Sbjct: 344 NELEKGIEEKRRQMESATNPILQKR 368


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 29.7 bits (67), Expect = 4.7
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
             +EE E++  +E+EE+    ++ E    E +E E GE
Sbjct: 35  TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 28.9 bits (65), Expect = 4.7
 Identities = 15/56 (26%), Positives = 26/56 (46%)

Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
              E E +D EE+EE  EE +  E+++ E  +    E++   + E +    T   E
Sbjct: 1   SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEE 56


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 28/165 (16%), Positives = 49/165 (29%), Gaps = 10/165 (6%)

Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSG--SASSRGSNKSDRSAKSDRSAKSDRS 490
            + E  SK  +++    R+  +SK   K   S           K  +  K+  SA S  S
Sbjct: 95  RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154

Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS----N 546
             S  S  +++   +     A         ++    S  S+   KS +S  +        
Sbjct: 155 --SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKM 212

Query: 547 R--SARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRS 589
           R      D  D +  + S   +    A              G  +
Sbjct: 213 RKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEEVDQESWGRGT 257


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 27.7 bits (62), Expect = 5.0
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 374 RERFLEPPNEEEEEEDEEEE-EEEAEETQQGETTEAERQEGEGGEREGGG 422
           R  +LE  ++ +E+++E EE +EE  E  + E    E  +       GGG
Sbjct: 42  RTDYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSAGCGGGG 91


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 29.8 bits (68), Expect = 5.0
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 371 QRLRERFLEPPNEEEEEEDEEEEEEEAEE 399
           Q++RE FL+    EEEEE+EEEE     E
Sbjct: 517 QKIREEFLQKHKNEEEEEEEEEELPLIPE 545


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.0 bits (65), Expect = 5.0
 Identities = 13/66 (19%), Positives = 32/66 (48%)

Query: 377 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
           F E   +E++E  +E EE+EA + ++ +  + E++  E   ++   +   +  K+ +   
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150

Query: 437 HSSKNS 442
            + K  
Sbjct: 151 TTKKEE 156


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.0 bits (67), Expect = 5.2
 Identities = 15/57 (26%), Positives = 23/57 (40%)

Query: 342 KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
           ++     A   +  L  +  PL      I    +  +E  N EE E +EE E+ E E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 29.3 bits (65), Expect = 5.3
 Identities = 28/116 (24%), Positives = 44/116 (37%)

Query: 353 NTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
           N  L     P  A     +  +E   +   EEEEEE+EEE E E  E     T+    + 
Sbjct: 110 NIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEV 169

Query: 413 GEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
             G    GG  GE   E+  ++AE      +  ++ S +     +   +   G+  
Sbjct: 170 DHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYGTTD 225


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.1 bits (65), Expect = 5.4
 Identities = 17/49 (34%), Positives = 20/49 (40%), Gaps = 3/49 (6%)

Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           P  + E E  E E+   E+T      E E    EGG   G G G  D E
Sbjct: 116 PEAKGEGEGHEPEDPPPEDTPPPPGGEGEV---EGGPSPGPGPGPLDQE 161


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 29.2 bits (65), Expect = 5.7
 Identities = 27/112 (24%), Positives = 47/112 (41%), Gaps = 5/112 (4%)

Query: 348 GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTE 407
           G  P N R  + ++     E R    +ER++E   EEEE + E E   +  E ++ +  +
Sbjct: 175 GDTPLNVREHLWNKAATEREKRQDE-KERYVE---EEEESDTELEAVTDDSEKEKTKKKD 230

Query: 408 AERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRS-KSKSP 458
            E+  G     E     E ++ +  SD +    N  +   R    + KS+ P
Sbjct: 231 LEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKP 282



 Score = 29.2 bits (65), Expect = 6.9
 Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 6/90 (6%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
           E   +E+E   EEEEE + E     + +E E+ + +  E+  G      +++    +E  
Sbjct: 194 EKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLG------SDQSMETSESE 247

Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
            + SS + S       +K   ++ ++  A 
Sbjct: 248 EEESSESESDEDEDEDNKGKIRKRKTDDAK 277


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 29.7 bits (66), Expect = 5.8
 Identities = 54/237 (22%), Positives = 82/237 (34%), Gaps = 8/237 (3%)

Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAE-ETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
           L ++ +E    EE E  E E  E  E E  + E  E ER E +  ER+     E +   R
Sbjct: 455 LEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDR 514

Query: 432 TS--DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
                 E + +NS          S    P      G+    G   SD   ++    +S  
Sbjct: 515 LERDRLEKARRNSYFLKGMENGLSAGGGPGDGPGVGAGVGAGVGTSD--GRNHSGVRSGI 572

Query: 490 SARSDRSDKSAASNRSRSGSAASNKSR-SDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
                 S +    +  RSG   S +    D  RSG   S R    + S+R   +    RS
Sbjct: 573 HCSIQSSARGGVHDSVRSGVHDSVRGGVHDSMRSGVHDSLRGGVHD-SIRGG-AHDGVRS 630

Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSA 605
              DG  KS+  +      +     N    + + ++    S   S  +  +  SR A
Sbjct: 631 GAHDGVRKSSGVKVEGMGNARAPPPNQGTHDGNFSRGKDSSDHVSSYKPYAYDSRKA 687



 Score = 29.3 bits (65), Expect = 7.0
 Identities = 47/219 (21%), Positives = 72/219 (32%), Gaps = 4/219 (1%)

Query: 369 RIQRL-RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEND 427
           RI+RL RE       E  E  + E  E E  E ++ E    ER   +  ERE     E D
Sbjct: 459 RIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERD 518

Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
             ++     +  K      S                +G  +S G N S   +    S +S
Sbjct: 519 RLEKARRNSYFLKGMENGLSAGGGPGDGPGVGAGVGAGVGTSDGRNHSGVRSGIHCSIQS 578

Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASN- 546
                   S +S   +  R G   S +S    S  G         ++  VRS   A    
Sbjct: 579 SARGGVHDSVRSGVHDSVRGGVHDSMRSGVHDSLRGGVHDSIRGGAHDGVRS--GAHDGV 636

Query: 547 RSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKS 585
           R +     +   + R+   +  +   N S   + S + S
Sbjct: 637 RKSSGVKVEGMGNARAPPPNQGTHDGNFSRGKDSSDHVS 675


>gnl|CDD|240184 cd05384, SCP_PRY1_like, SCP_PRY1_like: SCP-like extracellular
           protein domain, PRY1-like sub-family restricted to
           fungi. The wider family of SCP containing proteins
           includes plant pathogenesis-related protein 1 (PR-1),
           CRISPs, mammalian cysteine-rich secretory proteins, and
           allergen 5 from vespid venom. It has been proposed that
           SCP domains may function as endopeptidases. PRY1 is a
           yeast protein that is up-regulated in core ESCRT
           mutants. This PRY1-like group also contains fruiting
           body proteins SC7/14 from Schizophyllum commune.
          Length = 129

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 7/22 (31%), Positives = 12/22 (54%)

Query: 117 EEDILTPQDSKRSRHHARSVSW 138
              IL   +SKR+ H  + ++W
Sbjct: 3   ASSILDAHNSKRALHGVQPLTW 24


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 28.3 bits (63), Expect = 5.8
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
           EP   E EE  EE+ E    +  +G   E E +E +   +
Sbjct: 12  EPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQ 51


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.2 bits (66), Expect = 5.9
 Identities = 13/51 (25%), Positives = 22/51 (43%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
           +E E  D E+EEE+ E  ++ +    E    E  E     E     +++ S
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKAS 175


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 13/67 (19%)

Query: 527 SDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
             RS +SN SV S             GS  S+ + S S S++S    +S++S+   + SG
Sbjct: 11  MKRSGSSNTSVTS-------------GSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57

Query: 587 SRSRSRS 593
           +R  + S
Sbjct: 58  TRITTES 64


>gnl|CDD|224304 COG1386, scpB, Chromosome segregation and condensation protein B
           [DNA replication, recombination and repair].
          Length = 184

 Score = 28.8 bits (65), Expect = 6.1
 Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 8/60 (13%)

Query: 338 VRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEA 397
           +R   RR   G          T + L+   F +  L E    P  EE ++     EE+  
Sbjct: 133 IREVGRRDTPGRP---YLYGTTEKFLD--YFGLDSLDE---LPDLEELKDAGLLSEEDLL 184


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.0 bits (65), Expect = 6.2
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEE 399
           + + LE  + +EE+E +EEEEEE EE
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEE 184


>gnl|CDD|224532 COG1617, COG1617, Uncharacterized conserved protein [Function
           unknown].
          Length = 158

 Score = 28.6 bits (64), Expect = 6.2
 Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 7/73 (9%)

Query: 334 LETRVRLSKRRQ------KVGAAPNNTRLV-VTHRPLNATEFRIQRLRERFLEPPNEEEE 386
           +E  +RL+  RQ        GA      +V V         F+   LR+  LE       
Sbjct: 71  IEILLRLAGTRQIKDAIKLGGAKKGENFIVAVLVDTPKMEGFKCSELRDFVLEEAEVLRI 130

Query: 387 EEDEEEEEEEAEE 399
            ++E EE  E + 
Sbjct: 131 TDEELEEAGEEDV 143


>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
           factors. 
          Length = 116

 Score = 27.8 bits (63), Expect = 6.3
 Identities = 12/31 (38%), Positives = 15/31 (48%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
            EEE++E  E  EEE EE +     E E   
Sbjct: 4   EEEEDDEMRELAEEELEELEAELEEELEELL 34


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 28.0 bits (62), Expect = 6.3
 Identities = 23/55 (41%), Positives = 29/55 (52%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
           N  +E +DE EEEEE +E  + E  E E +EGE  E   G  G+   E    DAE
Sbjct: 42  NGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96



 Score = 27.2 bits (60), Expect = 9.6
 Identities = 16/51 (31%), Positives = 26/51 (50%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
           E  +E EEEE+ +E++EE E   + E  E E +      +    + E+D E
Sbjct: 46  EGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96


>gnl|CDD|234700 PRK00249, flgH, flagellar basal body L-ring protein; Reviewed.
          Length = 222

 Score = 29.1 bits (66), Expect = 6.4
 Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRS 593
            ++  + KS ++NRS R   ++    +     +     ++   +S+ S    GS SRS +
Sbjct: 75  QENTSASKSNSTNRS-RDSSTNFGLPSLFGYLNGLDANASLETSSDNSFKGKGSASRSNT 133

Query: 594 VS 595
           ++
Sbjct: 134 LT 135


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 28.9 bits (64), Expect = 6.5
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 404
           + +L+E   E   EE++ ++E+EEEE  EE +Q E
Sbjct: 203 LSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237


>gnl|CDD|227824 COG5537, IRR1, Cohesin [Cell division and chromosome partitioning].
          Length = 740

 Score = 29.5 bits (66), Expect = 6.6
 Identities = 12/48 (25%), Positives = 19/48 (39%)

Query: 425 ENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGS 472
           E+D E +  D    +  + R S   +  SK  SP        +S + S
Sbjct: 10  EDDYEDQDDDDYVDTATAKRKSRSRKPMSKENSPDGVLAKDMSSDQKS 57


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.9 bits (64), Expect = 7.2
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
           ER  E  +E E++E++E E EE    +Q + +E E  E +  E E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETE 189


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 27.7 bits (62), Expect = 7.6
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
               E ++EE +EEEEEE+++
Sbjct: 76  AAAAEAKKEEKKEEEEEESDD 96


>gnl|CDD|131764 TIGR02717, AcCoA-syn-alpha, acetyl coenzyme A synthetase (ADP
           forming), alpha domain.  Although technically
           reversible, it is believed that this group of
           ADP-dependent acetyl-CoA synthetases (ACS) act in the
           direction of acetate and ATP production in the organisms
           in which it has been characterized. In most species this
           protein exists as a fused alpha-beta domain polypeptide.
           In Pyrococcus and related species, however the domains
           exist as separate polypeptides. This model represents
           the alpha (N-terminal) domain. In Pyrococcus and related
           species there appears to have been the development of a
           paralogous family such that four other proteins are
           close relatives. In reference , one of these (along with
           its beta-domain partner) was characterized as ACS-II
           showing specificity for phenylacetyl-CoA. This model has
           been constructed to exclude these non-ACS-I paralogs.
           This may result in new, authentic ACS-I sequences
           falling below the trusted cutoff.
          Length = 447

 Score = 29.2 bits (66), Expect = 7.8
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 605 ASKSGSSAASSRSGSPAGSD 624
            S++G+ AASS +G+ AGSD
Sbjct: 240 TSEAGAKAASSHTGALAGSD 259


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 28.6 bits (64), Expect = 8.0
 Identities = 15/48 (31%), Positives = 18/48 (37%)

Query: 365 ATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
             E R + + E+          EE    EEEEAEE    E    E  E
Sbjct: 139 VNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 28.9 bits (65), Expect = 8.0
 Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 9/100 (9%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAE-------RQEGEGGEREGGGEGENDNEKRTSD 434
           + EE +E +E E EE  +         E       R +             N  +   S 
Sbjct: 159 DMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQPMPSA 218

Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNK 474
           +  S+   S+++S SR  S         R  + +   + K
Sbjct: 219 SSESAT--SKSASTSRESSPQPQSPPPRRVPAPTVDSTAK 256


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.2 bits (63), Expect = 8.1
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
            ++E+EE EEE  +ET+Q +  E + +  E  +RE
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 29.2 bits (65), Expect = 8.2
 Identities = 26/135 (19%), Positives = 52/135 (38%), Gaps = 3/135 (2%)

Query: 493 SDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSA---ASNKSVRSDKSAASNRSA 549
           S   D+     R R+ +   ++  +  S   S+  ++S+    S     S+  A   R A
Sbjct: 23  SSLEDEVFLDPRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRRA 82

Query: 550 RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSG 609
              G +K+     R+++   + ++ + A+  +   + S  +S  VS   S  S S    G
Sbjct: 83  GMGGRNKATKPSRRNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPSLSEEDEG 142

Query: 610 SSAASSRSGSPAGSD 624
           +   S    S    +
Sbjct: 143 AERNSGGDDSSHTDN 157


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 28.5 bits (64), Expect = 8.3
 Identities = 17/52 (32%), Positives = 23/52 (44%)

Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
           +  LE    E+  E +EEE E A +  Q    E  R+  E G + G  EG  
Sbjct: 34  KAVLEEEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGFE 85


>gnl|CDD|235218 PRK04081, PRK04081, hypothetical protein; Provisional.
          Length = 207

 Score = 28.5 bits (64), Expect = 8.3
 Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 6/65 (9%)

Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSA 605
           N   +   + KS     RS+++ ++A++ S++S  SS KSG    S S S      S S+
Sbjct: 149 NYQNQRQRTYKSPQAYQRSQNSFNKAASASSSSGSSSKKSGFFGGSSSAS------SSSS 202

Query: 606 SKSGS 610
           S  GS
Sbjct: 203 SSFGS 207


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 29.0 bits (65), Expect = 8.3
 Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 1/57 (1%)

Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEND-NEKRTSDAEH 437
           ++E   E    E+ E EE ++GE  E E  E E  E E  G+   +  E      EH
Sbjct: 66  DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEH 122


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 28.5 bits (64), Expect = 8.4
 Identities = 18/48 (37%), Positives = 26/48 (54%)

Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
           E R +R +         +EEEE EEE E++ EE ++ E  E  R+E E
Sbjct: 26  EEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73


>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6.  This region covers
           TM regions 4-6 of the ryanodine receptor 1 family.
          Length = 217

 Score = 28.5 bits (64), Expect = 8.5
 Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 1/46 (2%)

Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQE-GEGGEREGGGEG 424
           P  E  EE + E    +  ET +GE  E      G   ++EGGG G
Sbjct: 15  PEEEGAEEVEPEGSGRDLPETGEGEEEEVGVDAFGLDMKKEGGGYG 60


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 29.1 bits (65), Expect = 8.8
 Identities = 35/254 (13%), Positives = 74/254 (29%), Gaps = 19/254 (7%)

Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
           LE  +   E  DE+++++     Q  +            + E  G+   ++   +  A  
Sbjct: 275 LEYDDPGLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGD---EDGDGSGFAPE 331

Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK--SDRSAKSDRSARSDR 495
               +   S+ +         + +  S  A    S   +       +   +      ++ 
Sbjct: 332 PLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSGKEMPGILTTEN 391

Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
            D   AS  S        +  + +  +    +        +V  ++S A +    +D  D
Sbjct: 392 LDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDERSIAVSSDGITDVLD 451

Query: 556 -----KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGS 610
                +  S+     S S  +    +A           SR     R     +  +S S S
Sbjct: 452 PPSPLRLHSSDKVIDSVSPPSKRRVSAPA---------SRLDDAKRPEVTATPESSGSDS 502

Query: 611 SAASSRSGSPAGSD 624
              +S       SD
Sbjct: 503 EGGASGREDETSSD 516


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.0 bits (65), Expect = 8.8
 Identities = 8/45 (17%), Positives = 19/45 (42%)

Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
             +      +  +  EE E + +EE+  +E  + +  E   ++ E
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 29.3 bits (65), Expect = 9.0
 Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 9/123 (7%)

Query: 502 SNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNR 561
           S R+R    + +  RS R +  ++   R++ +N+  RS     S R++ S  S  S  ++
Sbjct: 4   SMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDQRS-----SGRNSPSAASTSSNDSK 58

Query: 562 SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPA 621
           + S    ++     A    S  KS  R R +  S       R  +K   +   SR  SP+
Sbjct: 59  AESTKKPNKKIKEEAT---SPLKSTKRQREKPASDTEEP-ERVTAKKSKTQELSRPNSPS 114

Query: 622 GSD 624
             +
Sbjct: 115 EGE 117


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 28.6 bits (64), Expect = 9.1
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 379 EPPNEEEEEEDEEEEEEEAEETQ 401
           E   +E+EEE+E EEEEE E+  
Sbjct: 262 EKELKEDEEEEETEEEEEEEDED 284


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 29.2 bits (65), Expect = 9.1
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 5/80 (6%)

Query: 377  FLEPPNEEEEEEDEEE--EEEEAEETQQGETTEAERQEGEGGEREGGGEGEND--NEKRT 432
            FL   +++E +E EEE  E E + + +  ET E E  + E  E     E END  +E+  
Sbjct: 922  FLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESD-ESSEDLSEDESENDSSDEEDG 980

Query: 433  SDAEHSSKNSSRASSRSRSR 452
             D +     ++  S   + R
Sbjct: 981  EDWDELESKAAYDSRPGKRR 1000


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.9 bits (65), Expect = 9.4
 Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 7/89 (7%)

Query: 355 RLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEE-EEAEETQQGETT------E 407
           RL   H  L A E R+   +E   E  N   ++    E++ +EA +    E +       
Sbjct: 264 RLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLA 323

Query: 408 AERQEGEGGEREGGGEGENDNEKRTSDAE 436
           A+R E E  E + G   + D E+  +D +
Sbjct: 324 ADRSELELLEDQKGAFEDADIEQLQADLD 352


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 29.1 bits (65), Expect = 9.5
 Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 3/142 (2%)

Query: 440 KNSSRASSRSRSRSKSKSP--AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
            N++  + +S+   K+K+   A     G  SS  +NK     ++  S+   +S RS    
Sbjct: 489 ANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEV 548

Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS-NRSARSDGSDK 556
           ++  +N  +      N      S++ +      + S  +V S     S N +  S+ +  
Sbjct: 549 RNPNANVKKLEDTTYNAKFPTVSKNNAYTLVDISTSEDAVNSADDTRSVNITDISESTTI 608

Query: 557 SASNRSRSRSASSQASNNSAAS 578
           S  +     S +S+      +S
Sbjct: 609 SGLSSKFPSSFTSKLQEQIYSS 630


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 28.6 bits (65), Expect = 9.6
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQE 412
             E  E+EEEE EE EE ++ E  E    E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein. 
          Length = 188

 Score = 28.2 bits (63), Expect = 9.8
 Identities = 17/55 (30%), Positives = 23/55 (41%)

Query: 525 AGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASN 579
             SD    + K V S  S+ S R + S    K  S  SRSRS + +    +  S 
Sbjct: 49  NPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRRRGTAFGSM 103


>gnl|CDD|239922 cd04476, RPA1_DBD_C, RPA1_DBD_C: A subfamily of OB folds
           corresponding to the C-terminal OB fold, the
           ssDNA-binding domain (DBD)-C, of human RPA1 (also called
           RPA70). RPA1 is the large subunit of Replication protein
           A (RPA). RPA is a nuclear ssDNA-binding protein (SSB)
           which appears to be involved in all aspects of DNA
           metabolism including replication, recombination, and
           repair. RPA also mediates specific interactions of
           various nuclear proteins. In animals, plants, and fungi,
           RPA is a heterotrimer with subunits of 70KDa (RPA1),
           32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C,
           RPA1 contains three other OB folds: DBD-A, DBD-B, and
           RPA1N. The major DNA binding activity of RPA is
           associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is
           involved in DNA binding and trimerization. It contains
           two structural insertions not found to date in other
           OB-folds: a zinc ribbon and a three-helix bundle. RPA1
           DBD-C also contains a Cys4-type zinc-binding motif,
           which plays a role in the ssDNA binding function of this
           domain. It appears that zinc itself may not be required
           for ssDNA binding.
          Length = 166

 Score = 28.0 bits (63), Expect = 9.9
 Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 258 VMDESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEYKMA----REYNWNVKSKSC 313
           V D +GE ++  F   AE +  +  ++L  L+  D   +   +     + + + V  K  
Sbjct: 74  VADHTGEAWLTLFDEVAEQIFGKSAEELLELKEEDPDAFPDAIQDLVGKTFLFRVSVK-- 131

Query: 314 KGYEENYFLIVRD 326
              EE Y    R 
Sbjct: 132 ---EETYNDEGRI 141


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 28.4 bits (63), Expect = 9.9
 Identities = 13/84 (15%), Positives = 34/84 (40%), Gaps = 2/84 (2%)

Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
           ++EE+   +E +  E+         +RQE     +      ++ + + T+  ++  K   
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQY--KQDM 177

Query: 444 RASSRSRSRSKSKSPAKRSRSGSA 467
           +  +  +    SK P +++    A
Sbjct: 178 KEKASEQENEDSKEPVEKAERTKA 201


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.305    0.121    0.326 

Gapped
Lambda     K      H
   0.267   0.0857    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,379,777
Number of extensions: 2803350
Number of successful extensions: 10457
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6918
Number of HSP's successfully gapped: 1064
Length of query: 624
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 521
Effective length of database: 6,369,140
Effective search space: 3318321940
Effective search space used: 3318321940
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 62 (27.4 bits)