RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7773
         (1281 letters)



>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
            chain; Provisional.
          Length = 1033

 Score =  492 bits (1267), Expect = e-154
 Identities = 254/596 (42%), Positives = 363/596 (60%), Gaps = 36/596 (6%)

Query: 488  SEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATYYSIAHTYHEKVTEQASIMVNGKLK 547
              D+    KK +       K TE++E ++  +E             + + + I   GK++
Sbjct: 114  KGDQSASAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCI--KGKMR 171

Query: 548  EYQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTL 607
            +YQ+ GL W++ LY N +NGILADEMGLGKT+QTI+LL YL E +   GP++V+ P STL
Sbjct: 172  DYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHMVVAPKSTL 231

Query: 608  SNWVLEFEKWAPSANVVAYKGSPLARRALQGR-MKNSKFNVLLTTYEFIIKDKSVLSKIQ 666
             NW+ E  ++ P    V + G+P  R   +   +   KF+V +T++E  IK+K+ L +  
Sbjct: 232  GNWMNEIRRFCPVLRAVKFHGNPEERAHQREELLVAGKFDVCVTSFEMAIKEKTALKRFS 291

Query: 667  WKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALLNFLLPSIFK 726
            W+ +IIDE HR+KN +  L++T+  F ++++R+L+TGTPLQN L ELWALLNFLLP IF 
Sbjct: 292  WRYIIIDEAHRIKNENSLLSKTMRLF-STNYRLLITGTPLQNNLHELWALLNFLLPEIFS 350

Query: 727  SCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEY 786
            S  TF++WF      +GE       +   ++++LHKVLRPFLLRRLK +VE  LP K E 
Sbjct: 351  SAETFDEWF----QISGE------NDQQEVVQQLHKVLRPFLLRRLKSDVEKGLPPKKET 400

Query: 787  IIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGGARA-LKNTIVQLRKLCNHPFMFQ 845
            I+K  MS +QK  Y+ L  + + + +         GG R  L N  +QLRK CNHP++FQ
Sbjct: 401  ILKVGMSQMQKQYYKALLQKDLDVVNA--------GGERKRLLNIAMQLRKCCNHPYLFQ 452

Query: 846  NIEEKYCQHIGQNNTVVSGPDIYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILE 905
              E              +G  +   SGK  LLD++LPK KE   RVL+F QMT+L+ ILE
Sbjct: 453  GAEP--------GPPYTTGEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILE 504

Query: 906  DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 965
            DY ++R + Y R+DG T  EDR   +  FN+P S+ F+FLLSTRAGGLG+NL  AD VI+
Sbjct: 505  DYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFVFLLSTRAGGLGINLATADIVIL 564

Query: 966  FDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAG- 1024
            +DSDWNP  DLQAQDRAHRIGQK EV+V R  T  ++EE+++  A  KL +D  VIQ G 
Sbjct: 565  YDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYKKLALDALVIQQGR 624

Query: 1025 MFDQKSTGSERHEFLQTILHQDDEE-EEENEVPDDETVNQLIARTEDELT-LFQKM 1078
            + +QK+   +  E LQ + +  +     ++    DE ++++IA+ E+    L  KM
Sbjct: 625  LAEQKTVNKD--ELLQMVRYGAEMVFSSKDSTITDEDIDRIIAKGEEATAELDAKM 678



 Score =  315 bits (808), Expect = 3e-90
 Identities = 173/382 (45%), Positives = 228/382 (59%), Gaps = 34/382 (8%)

Query: 1   TPLQNKLPELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKV 60
           TPLQN L ELWALLNFLLP IF S  TF++WF      +GE       +   ++++LHKV
Sbjct: 328 TPLQNNLHELWALLNFLLPEIFSSAETFDEWF----QISGE------NDQQEVVQQLHKV 377

Query: 61  LRPFLLRRLKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGG 120
           LRPFLLRRLK +VE  LP K E I+K  MS +QK  Y+ L  + + + +         GG
Sbjct: 378 LRPFLLRRLKSDVEKGLPPKKETILKVGMSQMQKQYYKALLQKDLDVVNA--------GG 429

Query: 121 ARA-LKNTIVQLRKLCNHPFMFQNIEEKYCQHIGQNNTVVSGPDIYRTSGKFELLDRILP 179
            R  L N  +QLRK CNHP++FQ  E              +G  +   SGK  LLD++LP
Sbjct: 430 ERKRLLNIAMQLRKCCNHPYLFQGAEP--------GPPYTTGEHLVENSGKMVLLDKLLP 481

Query: 180 KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 239
           K KE   RVL+F QMT+L+ ILEDY ++R + Y R+DG T  EDR   +  FN+P S+ F
Sbjct: 482 KLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKF 541

Query: 240 IFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSV 299
           +FLLSTRAGGLG+NL  AD VI++DSDWNP  DLQAQDRAHRIGQK EV+V R  T  ++
Sbjct: 542 VFLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTI 601

Query: 300 EERILAAARYKLNMDEKEYHRNNTARTQKLNK----AIINYHINAEKEQKKEQERIEKER 355
           EE+++  A  KL +D     +   A  + +NK     ++ Y   AE     +   I  E 
Sbjct: 602 EEKVIERAYKKLALDALVIQQGRLAEQKTVNKDELLQMVRY--GAEMVFSSKDSTITDED 659

Query: 356 MRRLMAEDEEGYRKLIDQKKDK 377
           + R++A+ EE   +L D K  K
Sbjct: 660 IDRIIAKGEEATAEL-DAKMKK 680


>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain.  This domain is
           found in proteins involved in a variety of processes
           including transcription regulation (e.g., SNF2, STH1,
           brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
           recombination (e.g. RAD54), and chromatin unwinding
           (e.g. ISWI) as well as a variety of other proteins with
           little functional information (e.g. lodestar, ETL1).
          Length = 301

 Score =  377 bits (971), Expect = e-121
 Identities = 149/302 (49%), Positives = 200/302 (66%), Gaps = 7/302 (2%)

Query: 549 YQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALL-TYLMEKKKNLGPYLVIVPLSTL 607
           YQ++G+ W++SL +N L GILADEMGLGKT+QTIALL TYL E K   GP LV+ PLSTL
Sbjct: 1   YQLEGVNWLISLESNGLGGILADEMGLGKTLQTIALLATYLKEGKDRRGPTLVVCPLSTL 60

Query: 608 SNWVLEFEKWAPSANVVAYKGSPLARRALQGRM--KNSKFNVLLTTYEFIIKDK---SVL 662
            NW+ EFEKWAP+  VV Y G    R  L+  M  +   ++V++TTYE + KDK   S+L
Sbjct: 61  HNWLNEFEKWAPALRVVVYHGDGRERSKLRQSMAKRLDTYDVVITTYEVLRKDKKLLSLL 120

Query: 663 SKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALLNFLLP 722
           +K++W  +++DE HR+KN   KL + L     + +R+LLTGTP+QN L ELWALLNFL P
Sbjct: 121 NKVEWDRVVLDEAHRLKNSKSKLYKALKKL-KTRNRLLLTGTPIQNNLEELWALLNFLRP 179

Query: 723 SIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPE 782
             F S   FE+WFN P A T +    N E+    I RLHK+L+PFLLRR K +VE  LP 
Sbjct: 180 GPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKLLKPFLLRRTKDDVEKSLPP 239

Query: 783 KVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGGARALKNTIVQLRKLCNHPF 842
           K E+++ C++S  Q+ LY+ L  +  L    + +G     G  +L N I+QLRK+CNHP+
Sbjct: 240 KTEHVLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVGIASLLNLIMQLRKICNHPY 299

Query: 843 MF 844
           +F
Sbjct: 300 LF 301



 Score =  178 bits (454), Expect = 7e-50
 Identities = 67/141 (47%), Positives = 89/141 (63%)

Query: 1   TPLQNKLPELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKV 60
           TP+QN L ELWALLNFL P  F S   FE+WFN P A T +    N E+    I RLHK+
Sbjct: 161 TPIQNNLEELWALLNFLRPGPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKL 220

Query: 61  LRPFLLRRLKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGILLTDGSEKGKTGEGG 120
           L+PFLLRR K +VE  LP K E+++ C++S  Q+ LY+ L  +  L    + +G     G
Sbjct: 221 LKPFLLRRTKDDVEKSLPPKTEHVLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVG 280

Query: 121 ARALKNTIVQLRKLCNHPFMF 141
             +L N I+QLRK+CNHP++F
Sbjct: 281 IASLLNLIMQLRKICNHPYLF 301


>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
            [Transcription / DNA replication, recombination, and
            repair].
          Length = 866

 Score =  326 bits (836), Expect = 2e-95
 Identities = 244/776 (31%), Positives = 370/776 (47%), Gaps = 61/776 (7%)

Query: 296  VNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKER 355
            +N +  R+L A    L  D K        +   L   I    I   K  +  Q  +E   
Sbjct: 97   LNELALRVLIADEVGL-GDLKTIEAGAILKELLLRGEIKRVLILVPKTLR-AQWVVELLE 154

Query: 356  MR--RLMAEDEEGYRKLIDQ----KKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMR 409
                RL   D+EG R L+ Q            +L   D    + ++  +   E +  +  
Sbjct: 155  KFNIRLAVLDKEGLRYLLKQYDAYNPFSTEDLVLISLDLAKRSDSKRREALLEAEWGERD 214

Query: 410  EEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
                 +  ++        +  L+  E   L+ L   +     D K+    A P       
Sbjct: 215  LLVIDEAHNLGSSEGTRKLAPLETLEYELLKQLAEKIPSKLLDLKVLLLSATPE------ 268

Query: 470  LASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATYYSIAHT 529
                   ++  ED        + ++            ++T     +D  +E         
Sbjct: 269  -------QLKEEDLFARLRLLDPLRLADLSQILEKFVRETLKLSARDLKDELKELLAELR 321

Query: 530  YHEKVTEQASIMVNGK-LKEYQIKGLEWMVS-LYNNNLNGILADEMGLGKTVQTIALLTY 587
              E +      +     L+ YQ++G+ W+   L +N L GILAD+MGLGKTVQTIALL  
Sbjct: 322  LSEDLLNAPEPVDLSAELRPYQLEGVNWLSELLRSNLLGGILADDMGLGKTVQTIALLLS 381

Query: 588  LMEKKK-NLGPYLVIVPLSTLSNWVLEFEKWAPSAN-VVAYKGSP----LARRALQGRMK 641
            L+E  K  LGP L++VP S LSNW  EFEK+AP    V+ Y G        R AL+  +K
Sbjct: 382  LLESIKVYLGPALIVVPASLLSNWKREFEKFAPDLRLVLVYHGEKSELDKKREALRDLLK 441

Query: 642  NSK---FNVLLTTYEFIIK---DKSVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNS 695
                  F+V++TTYE + +   D   L KI+W  +++DE HR+KN      + L     +
Sbjct: 442  LHLVIIFDVVITTYELLRRFLVDHGGLKKIEWDRVVLDEAHRIKNDQSSEGKALQFL-KA 500

Query: 696  SHRVLLTGTPLQNKLPELWALLN-FLLPSIFKSC-STFEQWFNAPFATTGEKVELNEEET 753
             +R+ LTGTPL+N+L ELW+LL  FL P +  +  + F + F  P     E+     E  
Sbjct: 501  LNRLDLTGTPLENRLGELWSLLQEFLNPGLLGTSFAIFTRLFEKPIQA--EEDIGPLEAR 558

Query: 754  ILIIRRLHKVLRPFLLRRLKKEVE--SQLPEKVEYIIKCDMSGLQKLLYQHLHNRGI--- 808
             L I  L K+L PF+LRR K++VE   +LP K+E +++C++S  Q+ LY+ L        
Sbjct: 559  ELGIELLRKLLSPFILRRTKEDVEVLKELPPKIEKVLECELSEEQRELYEALLEGAEKNQ 618

Query: 809  -----LLTDGSEKGKTGEGGARALKNTIVQLRKLCNHPFMFQN-IEEKYCQHI-----GQ 857
                 L    S++ + G+     L   + +LR++CNHP +    +E  + + +      +
Sbjct: 619  QLLEDLEKADSDENRIGDSELNIL-ALLTRLRQICNHPALVDEGLEATFDRIVLLLREDK 677

Query: 858  NNTVVSGPDIYRTSGKFELLDRIL-PKFKETKH--RVLMFCQMTQLMTILEDYFIWRDFS 914
            +   +  P I  + GK + LD +L  K  E  H  +VL+F Q T ++ +LEDY       
Sbjct: 678  DFDYLKKPLIQLSKGKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIK 737

Query: 915  YLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQ 974
            Y+RLDG+T ++ R EL+ +FN  E +  +FLLS +AGGLGLNL  ADTVI+FD  WNP  
Sbjct: 738  YVRLDGSTPAKRRQELIDRFNADE-EEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAV 796

Query: 975  DLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKS 1030
            +LQA DRAHRIGQK  V+V RLIT  ++EE+IL     K  + + +I A    + S
Sbjct: 797  ELQAIDRAHRIGQKRPVKVYRLITRGTIEEKILELQEKKQELLDSLIDAEGEKELS 852



 Score =  199 bits (508), Expect = 1e-52
 Identities = 128/338 (37%), Positives = 190/338 (56%), Gaps = 25/338 (7%)

Query: 1   TPLQNKLPELWALLN-FLLPSIFKSC-STFEQWFNAPFATTGEKVELNEEETILIIRRLH 58
           TPL+N+L ELW+LL  FL P +  +  + F + F  P     E+     E   L I  L 
Sbjct: 509 TPLENRLGELWSLLQEFLNPGLLGTSFAIFTRLFEKPIQA--EEDIGPLEARELGIELLR 566

Query: 59  KVLRPFLLRRLKKEVE--SQLPEKVEYIIKCDMSGLQKLLYQHLHNRGI--------LLT 108
           K+L PF+LRR K++VE   +LP K+E +++C++S  Q+ LY+ L             L  
Sbjct: 567 KLLSPFILRRTKEDVEVLKELPPKIEKVLECELSEEQRELYEALLEGAEKNQQLLEDLEK 626

Query: 109 DGSEKGKTGEGGARALKNTIVQLRKLCNHPFMFQN-IEEKYCQHI-----GQNNTVVSGP 162
             S++ + G+     L   + +LR++CNHP +    +E  + + +      ++   +  P
Sbjct: 627 ADSDENRIGDSELNIL-ALLTRLRQICNHPALVDEGLEATFDRIVLLLREDKDFDYLKKP 685

Query: 163 DIYRTSGKFELLDRIL-PKFKETKH--RVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTT 219
            I  + GK + LD +L  K  E  H  +VL+F Q T ++ +LEDY       Y+RLDG+T
Sbjct: 686 LIQLSKGKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGST 745

Query: 220 KSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRA 279
            ++ R EL+ +FN  E +  +FLLS +AGGLGLNL  ADTVI+FD  WNP  +LQA DRA
Sbjct: 746 PAKRRQELIDRFNADE-EEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRA 804

Query: 280 HRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKE 317
           HRIGQK  V+V RLIT  ++EE+IL     K  + +  
Sbjct: 805 HRIGQKRPVKVYRLITRGTIEEKILELQEKKQELLDSL 842


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score =  117 bits (294), Expect = 1e-29
 Identities = 50/199 (25%), Positives = 83/199 (41%), Gaps = 14/199 (7%)

Query: 545 KLKEYQIKGLEWMVSLYNNNLNGILADEMGLGKT-VQTIALLTYLMEKKKNLGPYLVIVP 603
            L+ YQ + +E +    +   + ILA   G GKT    +  L  L   K   G  LV+VP
Sbjct: 8   PLRPYQKEAIEAL---LSGLRDVILAAPTGSGKTLAALLPALEALKRGKG--GRVLVLVP 62

Query: 604 LSTL-SNWVLEFEKWAPSANV-VAYKGSPLARRALQGRMKNSKFNVLLTTYEFIIKD--K 659
              L   W  E +K  PS  + V       ++R    ++++ K ++L+TT   ++     
Sbjct: 63  TRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLLDLLEN 122

Query: 660 SVLSKIQWKCMIIDEGHRMKN--HHCKLTQTLNTFYNSSHRVLLTGTPLQNKLPELWALL 717
             LS      +I+DE HR+ +     +L + L     +   +LL+ TP +     L   L
Sbjct: 123 DKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEIENLLELFL 182

Query: 718 N--FLLPSIFKSCSTFEQW 734
           N    +   F      EQ+
Sbjct: 183 NDPVFIDVGFTPLEPIEQF 201


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score =  112 bits (282), Expect = 6e-29
 Identities = 36/128 (28%), Positives = 57/128 (44%), Gaps = 3/128 (2%)

Query: 164 IYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSED 223
           +     K E L  +L +  +   +VL+FC   +++  L +           L G    E+
Sbjct: 7   LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66

Query: 224 RGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIG 283
           R E+L+ F E E    + L++T     G++L     VI +D  W+P   LQ   RA R G
Sbjct: 67  REEVLKDFREGE---IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAG 123

Query: 284 QKNEVRVL 291
           QK    +L
Sbjct: 124 QKGTAILL 131



 Score =  112 bits (282), Expect = 6e-29
 Identities = 36/128 (28%), Positives = 57/128 (44%), Gaps = 3/128 (2%)

Query: 867 IYRTSGKFELLDRILPKFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSED 926
           +     K E L  +L +  +   +VL+FC   +++  L +           L G    E+
Sbjct: 7   LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66

Query: 927 RGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVIIFDSDWNPHQDLQAQDRAHRIG 986
           R E+L+ F E E    + L++T     G++L     VI +D  W+P   LQ   RA R G
Sbjct: 67  REEVLKDFREGE---IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAG 123

Query: 987 QKNEVRVL 994
           QK    +L
Sbjct: 124 QKGTAILL 131


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score =  104 bits (262), Expect = 4e-26
 Identities = 36/145 (24%), Positives = 65/145 (44%), Gaps = 7/145 (4%)

Query: 566 NGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLEFEKWAPSANV-V 624
           + +LA   G GKT+  +  +  L++  K  G  LV+ P   L+N V E  K      + V
Sbjct: 2   DVLLAAPTGSGKTLAALLPILELLDSLKG-GQVLVLAPTRELANQVAERLKELFGEGIKV 60

Query: 625 AYKGSPLARRALQGRMKNSKFNVLLTTYEFIIKD--KSVLSKIQWKCMIIDEGHRMKNHH 682
            Y     + +  Q ++ + K ++++ T   ++ +  +  LS  +   +I+DE HR+ N  
Sbjct: 61  GYLIGGTSIKQ-QEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHRLLNQG 119

Query: 683 CKLTQ--TLNTFYNSSHRVLLTGTP 705
             L     L         +LL+ TP
Sbjct: 120 FGLLGLKILLKLPKDRQVLLLSATP 144


>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain.  The
           Prosite family is restricted to DEAD/H helicases,
           whereas this domain family is found in a wide variety of
           helicases and helicase related proteins. It may be that
           this is not an autonomously folding unit, but an
           integral part of the helicase.
          Length = 78

 Score = 96.8 bits (242), Expect = 4e-24
 Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 3/81 (3%)

Query: 203 DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 262
                      RL G    E+R E+L  F   +SK    L++T   G G++L   + VI 
Sbjct: 1   KLLRKPGIKVARLHGGLSQEEREEILEDFRNGKSK---VLVATDVAGRGIDLPDVNLVIN 57

Query: 263 FDSDWNPHQDLQAQDRAHRIG 283
           +D  WNP   +Q   RA R G
Sbjct: 58  YDLPWNPASYIQRIGRAGRAG 78



 Score = 96.8 bits (242), Expect = 4e-24
 Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 3/81 (3%)

Query: 906 DYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADTVII 965
                      RL G    E+R E+L  F   +SK    L++T   G G++L   + VI 
Sbjct: 1   KLLRKPGIKVARLHGGLSQEEREEILEDFRNGKSK---VLVATDVAGRGIDLPDVNLVIN 57

Query: 966 FDSDWNPHQDLQAQDRAHRIG 986
           +D  WNP   +Q   RA R G
Sbjct: 58  YDLPWNPASYIQRIGRAGRAG 78


>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain. 
          Length = 82

 Score = 95.7 bits (239), Expect = 1e-23
 Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 3/84 (3%)

Query: 200 ILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADT 259
            L +          RL G    E+R E+L KFN  + K    L++T     GL+L   D 
Sbjct: 2   ELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIK---VLVATDVAERGLDLPGVDL 58

Query: 260 VIIFDSDWNPHQDLQAQDRAHRIG 283
           VII+D  W+P   +Q   RA R G
Sbjct: 59  VIIYDLPWSPASYIQRIGRAGRAG 82



 Score = 95.7 bits (239), Expect = 1e-23
 Identities = 29/84 (34%), Positives = 39/84 (46%), Gaps = 3/84 (3%)

Query: 903 ILEDYFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYFIFLLSTRAGGLGLNLQAADT 962
            L +          RL G    E+R E+L KFN  + K    L++T     GL+L   D 
Sbjct: 2   ELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIK---VLVATDVAERGLDLPGVDL 58

Query: 963 VIIFDSDWNPHQDLQAQDRAHRIG 986
           VII+D  W+P   +Q   RA R G
Sbjct: 59  VIIYDLPWSPASYIQRIGRAGRAG 82


>gnl|CDD|214727 smart00573, HSA, domain in helicases and associated with SANT
           domains. 
          Length = 73

 Score = 53.9 bits (130), Expect = 4e-09
 Identities = 21/48 (43%), Positives = 34/48 (70%)

Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
           D KE H+   A  +K+ KA+++YH N EKE+++ +E+ EK R+R+L A
Sbjct: 26  DFKEEHKWKIAAAKKMAKAVMDYHQNKEKEEERREEKNEKRRLRKLAA 73


>gnl|CDD|203672 pfam07533, BRK, BRK domain.  The function of this domain is
           unknown. It is often found associated with helicases and
           transcription factors.
          Length = 45

 Score = 48.4 bits (116), Expect = 2e-07
 Identities = 15/43 (34%), Positives = 24/43 (55%)

Query: 437 SQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
           S   + R+ V    +  +L G+DAP  K ++ WL  +PG+EV 
Sbjct: 1   SLDGEERVPVINRKTGKRLTGDDAPKLKDLERWLQENPGYEVD 43


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 54.7 bits (131), Expect = 2e-07
 Identities = 44/136 (32%), Positives = 55/136 (40%), Gaps = 16/136 (11%)

Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP-----------EDEEEEKP 1197
            K IE   A D E +E       K   S K+  K  V E +            ED   +K 
Sbjct: 50   KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKE 109

Query: 1198 VKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE 1257
             KK  RR + + +    D E+E  +KK RK  K    D+  ED   +  VS      VEE
Sbjct: 110  SKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD-----VEE 164

Query: 1258 EEEVEEAEGEEEEEDD 1273
             E V   E E EEE D
Sbjct: 165  SEFVTSLENESEEELD 180



 Score = 35.0 bits (80), Expect = 0.30
 Identities = 20/78 (25%), Positives = 32/78 (41%), Gaps = 11/78 (14%)

Query: 1203 RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE 1262
             RKK++ +   ++E DE+    K+K T++            K    K   ++ E  EE  
Sbjct: 48   SRKKIESALAVDEEPDENGAVSKKKPTRSV-----------KRATKKTVVEISEPLEEGS 96

Query: 1263 EAEGEEEEEDDKASKPRK 1280
            E    E+   DK SK   
Sbjct: 97   ELVVNEDAALDKESKKTP 114


>gnl|CDD|219455 pfam07529, HSA, HSA.  This domain is predicted to bind DNA and is
           often found associated with helicases.
          Length = 73

 Score = 48.1 bits (115), Expect = 5e-07
 Identities = 17/48 (35%), Positives = 30/48 (62%)

Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
           D +E  +   A+ +KL +A+  YH   EKE+++ +ER  K+R++ L A
Sbjct: 26  DFREERKWKIAKAKKLARAVAQYHKYIEKEEQRRKEREAKQRLKALKA 73


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 54.0 bits (129), Expect = 5e-07
 Identities = 51/205 (24%), Positives = 97/205 (47%), Gaps = 11/205 (5%)

Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREK----EEALMGRGSRQRKQV 1136
            E RK E   +   +K+  +   K D E E    EEER  E+    EEA M   +R++  +
Sbjct: 1216 EARKAEDAKKAEAVKKAEE--AKKDAE-EAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272

Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
               ++    E  KA E   AD+ ++ EE+ K    K+ + +  K    ++ A E +++  
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKAD 1332

Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
              KK+    K   + E    E E+   +   + +     + K+++ +K   + +KK   E
Sbjct: 1333 AAKKKAEEAKK--AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK--AE 1388

Query: 1257 EEEEVEEAEGEEEEEDDKASKPRKS 1281
            E+++ +EA+ + EE+  KA + +K+
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 52.8 bits (126), Expect = 1e-06
 Identities = 54/230 (23%), Positives = 105/230 (45%), Gaps = 17/230 (7%)

Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEV----ENSMYEEERE--- 1118
            A+  DE    +K +  ++  E K +  E K+  D   K  +      E    EE ++   
Sbjct: 1469 AKKADEAK--KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526

Query: 1119 -REKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS-G 1176
             ++ EEA     +++ ++    D L   E LK  E+    +  ++ EE K +  ++    
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEA 1586

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRR------KKVQISDEDEDEEDESPKKKKRKSTK 1230
            KK +++ +EE     EEE+K   +  ++      K  ++   +E+++     KKK    K
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEK 1646

Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
              A++  K ++E K   ++  KK  E++++ EEA+  EE+E   A   +K
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696



 Score = 52.5 bits (125), Expect = 2e-06
 Identities = 51/216 (23%), Positives = 99/216 (45%), Gaps = 18/216 (8%)

Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEV----ENSMYEEEREREKEEALMGRGSRQRKQV 1136
            E +K E K +  E+K+  +  +K  +E     E    EE++     +A   + + + +  
Sbjct: 1538 EAKKAEEKKKADELKKAEE--LKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595

Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE---- 1192
            +      +++ +KA E   A++ + + EE+K   +++   ++ KK   EE    +E    
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655

Query: 1193 EEEKPVKKRR--------RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
            EEE  +K           ++K  +    +EDE+  +   KK       A++  K++ EEK
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715

Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
                + KK   E + + EEA+ E EE+  KA + +K
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751



 Score = 49.8 bits (118), Expect = 1e-05
 Identities = 49/207 (23%), Positives = 91/207 (43%), Gaps = 18/207 (8%)

Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEAL 1125
            A+  +E        AE  K   + R+ E+ +L +   K   E      + E  + K E L
Sbjct: 1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAEEAKIKAEEL 1625

Query: 1126 MGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVE 1185
              +   ++K+V+       +E  KA E        EEE ++K   + + + +  KK+  E
Sbjct: 1626 K-KAEEEKKKVEQLKKKEAEEKKKAEELKK----AEEENKIKAAEEAKKAEEDKKKA--E 1678

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
            E    +E+E        ++    +  E E+ +     KKK    K  A++  K ++E K 
Sbjct: 1679 EAKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEED 1272
               + KK+  E++++ EEA+ +EEE+ 
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 47.4 bits (112), Expect = 5e-05
 Identities = 46/208 (22%), Positives = 107/208 (51%), Gaps = 4/208 (1%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEERERE-KEEALMGRGSRQRK 1134
            +K DA ++K E K +  E K+  +   K  DE++ +   +++  E K++A   + + + K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
            +        D+   KA E   A++ +++ EE K   + +   ++ KK+  +E   + EE 
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA--DEAKKKAEEA 1495

Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
            +K   + ++  + +   ++  + +E+ K  + +K+ +    D+ K+ +E+K     +K +
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555

Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             +++ EE ++AE  ++ E+DK    RK+
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKA 1583



 Score = 47.4 bits (112), Expect = 6e-05
 Identities = 50/218 (22%), Positives = 94/218 (43%), Gaps = 26/218 (11%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
            +K D  ++K E   +  E K+  +   K  D  +    + E  ++  EA          +
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK---KAEEAKKAAEAAKAEAEAAADE 1358

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
             +            A E   A + ++EE + K    K+ + +K K    ++ A ED+++ 
Sbjct: 1359 AE-----------AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407

Query: 1196 KPVKKRRRRKK----VQISDEDEDEEDESPKK----KKRKSTKTPAKDKHKEDDEEKHVV 1247
              +KK    KK     +   E++ + DE+ KK    KK    K  A++  K ++ +K   
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467

Query: 1248 SKRK----KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
              +K    KK  EE ++ +EA+ + EE   KA + +K+
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 46.3 bits (109), Expect = 1e-04
 Identities = 51/241 (21%), Positives = 99/241 (41%), Gaps = 14/241 (5%)

Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
             EE++N         +       E  +    + ++ K E   +  E K   + + K ++E
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631

Query: 1108 ---VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
               VE    +E  E++K E L       +K  +       +E  KA ED    +  ++ E
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEEL-------KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684

Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK 1224
            E +    +    +  +    EE   ++ EE+K  ++ ++ ++      +E +++    KK
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744

Query: 1225 KRKSTKTPAKDK----HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
            K +  K   ++K    H + +EEK     RK+K    EEE++E + +   E DK  K   
Sbjct: 1745 KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804

Query: 1281 S 1281
             
Sbjct: 1805 D 1805



 Score = 45.1 bits (106), Expect = 3e-04
 Identities = 52/239 (21%), Positives = 98/239 (41%), Gaps = 14/239 (5%)

Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
            ++ EE +  ++E  N+ I + E+             K E   +  E+K+  +   K  DE
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE--KKKADE 1294

Query: 1108 VENSMYEEEREREKEEALMGRGSRQ-RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV 1166
             + +  +++ +  K++A   + + + +K+ +      D    KA E   A +  + E E 
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA 1354

Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
                 +    K        E A + +EE K      ++K  +    DE ++     KKK 
Sbjct: 1355 AADEAEAAEEKA-------EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407

Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRK----KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
               K  A  K K D+ +K    K+K    KK  EE ++ +EA+ + EE        +K+
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466



 Score = 42.1 bits (98), Expect = 0.002
 Identities = 50/213 (23%), Positives = 95/213 (44%), Gaps = 23/213 (10%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
            +K DA ++K E   +  E  +          E E +  E E   EK EA   +    +K+
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAK---------AEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGK--KSYVEETAPEDEE 1193
             D      ++   K   D+     EE++++   + K   + KK    K   EE    DE 
Sbjct: 1380 ADAAKKKAEE---KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436

Query: 1194 EEKPVKKRRR---RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
            ++K  + ++    +KK + + + E+ + ++ + KK    K  A++  K D+ +K     +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496

Query: 1251 KK-----KVVEEEEEVEEA-EGEEEEEDDKASK 1277
            KK     K  E +++ +EA + EE ++ D+A K
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529



 Score = 41.3 bits (96), Expect = 0.004
 Identities = 44/213 (20%), Positives = 88/213 (41%), Gaps = 12/213 (5%)

Query: 1081 ERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTD 1140
            E RK E   +  E ++  D    +         + E  R+ E+A     +R+ ++V   +
Sbjct: 1132 EARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191

Query: 1141 SLTDQEWLKAIED----------DGADDYEEEE--EEVKPVTKKRGSGKKGKKSYVEETA 1188
             L   E  +  E           + A   E+ +  E VK   + +   ++ KK+  E   
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251

Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS 1248
             E  + E+       R++  I  E+  + DE  K +++K      K + K+  +E    +
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311

Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
            +  KK  E +++ EEA+ + +    KA + +K+
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 40.1 bits (93), Expect = 0.010
 Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 42/256 (16%)

Query: 1049 EEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEV 1108
            EE +    + + V QL  +  +E     K  AE  K   +   ++  E      +D  + 
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEE-----KKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677

Query: 1109 ENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKP 1168
            E +   EE E++  EAL  + + + K+ +       +E  KA E        EEE ++K 
Sbjct: 1678 EEAKKAEEDEKKAAEAL-KKEAEEAKKAEELKKKEAEEKKKAEELKK----AEEENKIKA 1732

Query: 1169 VTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR----------RRKKVQISDEDEDEED 1218
               K+ + +  KK+   E A +DEEE+K +   +          R++K  + +E+ DEED
Sbjct: 1733 EEAKKEAEEDKKKA---EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEED 1789

Query: 1219 ESPKKKKRKSTK----------------TPAKDKHKE--DDEEKHVVSKRKKKVVEEEEE 1260
            E  + +  K  K                    +  KE  D   K V    K   +EE + 
Sbjct: 1790 EKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS-KNMQLEEADA 1848

Query: 1261 VEEAEGEEEEEDDKAS 1276
             E+ +  +  E+ +  
Sbjct: 1849 FEKHKFNKNNENGEDG 1864



 Score = 39.4 bits (91), Expect = 0.015
 Identities = 51/224 (22%), Positives = 97/224 (43%), Gaps = 19/224 (8%)

Query: 1066 ARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEAL 1125
            A+  DE    +K   E +K E   +  E  +  D   K  +E + +   +E +++ EEA 
Sbjct: 1443 AKKADEA---KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA---DEAKKKAEEAK 1496

Query: 1126 MGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVE 1185
                  ++       +    E  KA E   AD+ ++ EE  K    K+   KK      +
Sbjct: 1497 KKADEAKKAAEAKKKA---DEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553

Query: 1186 ETAPEDEEEEKPVKKRRR--------RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
                +  EE+K  ++ ++         +K + + + E+   E   K   +  K  A++  
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613

Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
            K   EE  + ++  KK  EE+++VE+ + +E EE  KA + +K+
Sbjct: 1614 KA--EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655



 Score = 38.6 bits (89), Expect = 0.029
 Identities = 40/169 (23%), Positives = 75/169 (44%), Gaps = 20/169 (11%)

Query: 340  AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
            AE+ +K E+ +I+ E +++   E ++  +    + ++K+ A  L + +E          +
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668

Query: 400  HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
              EE +KK  E KK ++   K         K + +E+ + E+L+          K   E+
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAA----EALKKEAEEAKKAEELK----------KKEAEE 1714

Query: 460  APPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKK 508
               A+++K+        E     +E+   E EE K+K EE  K   EKK
Sbjct: 1715 KKKAEELKK------AEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 38.6 bits (89), Expect = 0.031
 Identities = 41/177 (23%), Positives = 75/177 (42%), Gaps = 6/177 (3%)

Query: 343  EQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKE 402
            E KK +E  + E  ++     +    K +  +K +          E +  L +  K+ K 
Sbjct: 1550 ELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609

Query: 403  EQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPP 462
            E+ KK  E K K ++  K +     +++L   E+ + +     +K+   +NK++  +   
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE-ELKKAEEENKIKAAEEAK 1668

Query: 463  AKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGE 519
              +  +  A     E   +  EDE    E +K++ EE  KA   KK E +E K A E
Sbjct: 1669 KAEEDKKKA-----EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720



 Score = 38.2 bits (88), Expect = 0.042
 Identities = 42/206 (20%), Positives = 92/206 (44%), Gaps = 8/206 (3%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
            +K D  ++K E   +  E K+      ++  + E +  + E  ++ +EA   + + + K+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKK----KAEEAKKAEEAKKKAEEAKKADEA--KKKAEEAKK 1484

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
             D      ++   KA E   A + +++ +E K   + +    + KK+   + A E ++ E
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK-KADEAKKAEEAKKADEAKKAE 1543

Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
            +  K    +K  ++   +E ++ E  KK +        K +  +  EE  +    K    
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603

Query: 1256 EEEEEVEEAEGEEEEEDDKASKPRKS 1281
            E++ + EEA+ + EE   KA + +K+
Sbjct: 1604 EKKMKAEEAK-KAEEAKIKAEELKKA 1628



 Score = 36.3 bits (83), Expect = 0.13
 Identities = 38/221 (17%), Positives = 90/221 (40%), Gaps = 8/221 (3%)

Query: 1069 EDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGR 1128
            + +    +   A+    E   +  E K+      ++  + E +  + E  R+ EEA    
Sbjct: 1078 DFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAE 1137

Query: 1129 GSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG--------KKGK 1180
             +R+ ++    +     E  +  ED    +   + E+ K     R +         +K +
Sbjct: 1138 DARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197

Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
             +   E A + EEE K  + R+     +     + EE +   ++ +K+ +    ++ ++ 
Sbjct: 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKF 1257

Query: 1241 DEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
            +E +     R++  ++ EE  +  E ++ EE  KA + +K+
Sbjct: 1258 EEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298



 Score = 35.5 bits (81), Expect = 0.27
 Identities = 44/230 (19%), Positives = 96/230 (41%), Gaps = 9/230 (3%)

Query: 275  AQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAII 334
            A  +A    +  E R+  ++ +   E+++ A    +    E+   +    +  +  K  +
Sbjct: 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAEEAKIKAEELKKAEEEKKKV 1635

Query: 335  NYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLT 394
                  E E+KK+ E ++K      +   EE  +   D+KK +          +    L 
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695

Query: 395  QMVKEHKE-EQQKKMREEKKKKKKSIKQ-----KILDGNMDKLDLDESSQLEDLRITVKE 448
            +  +E K+ E+ KK   E+KKK + +K+     KI      K   ++  + E+ +   +E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755

Query: 449  TNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKE 498
                  L+ E+   A++I++   +    E+  ED +     D+++K+  +
Sbjct: 1756 KKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFD 1805



 Score = 34.0 bits (77), Expect = 0.69
 Identities = 42/196 (21%), Positives = 80/196 (40%), Gaps = 14/196 (7%)

Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGR---GSRQRKQVDYTDSLTDQEWLKA 1150
            I+EL ++   DD   E  + +E+ +   E     +   G  +  +  Y D   D      
Sbjct: 1029 IEELTEYGNNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDA----- 1083

Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
             ED+ AD+  EE        KK  +GK  +    EE   + E+  K  + R+     +  
Sbjct: 1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAE 1143

Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDD-----EEKHVVSKRKKKVVEEEEEVEEAE 1265
            +  + E+ +  +  ++      A++  K +D       +     RK + + + E+  +AE
Sbjct: 1144 EARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAE 1203

Query: 1266 -GEEEEEDDKASKPRK 1280
               + EE+ KA + RK
Sbjct: 1204 AARKAEEERKAEEARK 1219



 Score = 34.0 bits (77), Expect = 0.82
 Identities = 39/185 (21%), Positives = 70/185 (37%), Gaps = 17/185 (9%)

Query: 338  INAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKK---DKRLAFLLSQTDEYISNLT 394
            I AE+ +K ++ +  +E+ +   A+  E  +K  + KK   + + A    +  E      
Sbjct: 1272 IKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1331

Query: 395  QMVKEHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNK 454
               K+  EE +K    E  K +        +   +K +  E  + E      K+     K
Sbjct: 1332 DAAKKKAEEAKKA--AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA-----KKKADAAK 1384

Query: 455  LRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEY 514
             + E+   A + K+            +   DE  +    K+K +E  K   EKK  D+  
Sbjct: 1385 KKAEEKKKADEAKKKAEED-------KKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 515  KDAGE 519
            K A E
Sbjct: 1438 KKAEE 1442



 Score = 33.2 bits (75), Expect = 1.2
 Identities = 50/229 (21%), Positives = 92/229 (40%), Gaps = 11/229 (4%)

Query: 302  RILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMA 361
            +  AA + +     +E  +   A+  +  K       +AE+ +K E+ER  +E  +   A
Sbjct: 1201 KAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEA 1260

Query: 362  EDEEGYRKLIDQK-KDKRLAFLLSQTDEY--ISNLTQMVKEHKEEQQKKMREEKKKK--- 415
                  R+    K ++ R A  L + +E        +  ++ K ++ KK  EE KK    
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320

Query: 416  KKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPG 475
            KK  ++     +  K   +E+ +  +      E  +D     E+   A + K+  A    
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380

Query: 476  WEVVMEDSEDEGSED-----EEMKEKKEEPSKAVGEKKTEDDEYKDAGE 519
                 +  E + +++     EE K+K +E  KA   KK  D+  K A E
Sbjct: 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429



 Score = 31.3 bits (70), Expect = 4.2
 Identities = 32/172 (18%), Positives = 74/172 (43%), Gaps = 8/172 (4%)

Query: 340  AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
            A++ +KK +E  + +  ++  AE+ +   +   + ++ + A    +  E      +  K+
Sbjct: 1433 ADEAKKKAEEAKKADEAKK-KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA-KK 1490

Query: 400  HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
              EE +KK  E KK  +   K        +    DE+ + E+ +   +   ++ K + ++
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550

Query: 460  APPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTED 511
               A+++K+        E    +   +  ED+ M  +K E +K   E + E+
Sbjct: 1551 LKKAEELKK------AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEE 1596



 Score = 30.9 bits (69), Expect = 6.7
 Identities = 42/191 (21%), Positives = 80/191 (41%), Gaps = 14/191 (7%)

Query: 340  AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
            AE+ +KK +E  + +  ++   E ++        ++ K+ A    +  E      +   E
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA----KKKADE 1514

Query: 400  HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGED 459
             K+ ++ K  +E KK +++ K        +K   DE  + E+L+   ++  ++   + E+
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574

Query: 460  --------APPAKQIKE--WLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKT 509
                    A  AK+ +E         +E   +   +E  + EE K K EE  KA  EKK 
Sbjct: 1575 DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634

Query: 510  EDDEYKDAGEE 520
             +   K   EE
Sbjct: 1635 VEQLKKKEAEE 1645



 Score = 30.1 bits (67), Expect = 9.5
 Identities = 57/299 (19%), Positives = 118/299 (39%), Gaps = 21/299 (7%)

Query: 977  QAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERH 1036
            +A+ +A  + +  E +           E    A   K   +E  I+A    +K+   ++ 
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK 1676

Query: 1037 EFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKE 1096
                    ++ ++ EE+E    E + +     +    L +K   E++K E   +  E  +
Sbjct: 1677 A-------EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729

Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGA 1156
            +     K + E +    EE ++ E+E+       ++   +   +    +E  K  E    
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEK-------KKIAHLKKEEEKKAEEIRKEKEAVIE 1782

Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKR--RRRKKVQISDEDE 1214
            ++ +EE+E+ +    K+        + + E   E        K+      K+V  S   +
Sbjct: 1783 EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQ 1842

Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
             EE ++ +K K        +D +KE D      +K K    ++EEE+EEA+  E+ + D
Sbjct: 1843 LEEADAFEKHKFNKNNENGEDGNKEAD-----FNKEKDLKEDDEEEIEEADEIEKIDKD 1896


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
            binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 53.5 bits (128), Expect = 7e-07
 Identities = 52/241 (21%), Positives = 102/241 (42%), Gaps = 20/241 (8%)

Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAER--RKGETKPRLVEIK-ELPDWM 1101
            + ++E E     ++E+  ++ A  + E     +++A+    KGET+   VE + E     
Sbjct: 666  EAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEG 725

Query: 1102 VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEE 1161
             +D+ E+E     EE E E E    G+   + +          +   +  ED+   + + 
Sbjct: 726  TEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA 785

Query: 1162 EEE-EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK-----PVKKRRRRKKVQISDEDED 1215
             E+ E+K      G  +   ++   E    + + E       VK     +++   ++ E 
Sbjct: 786  GEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEA 845

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
            ++DE        S    ++++ +E++EE+           EEEEE EE E EEEE ++  
Sbjct: 846  KQDEKGVDGGGGSDGGDSEEEEEEEEEEE-----------EEEEEEEEEEEEEEENEEPL 894

Query: 1276 S 1276
            S
Sbjct: 895  S 895



 Score = 42.3 bits (99), Expect = 0.002
 Identities = 39/165 (23%), Positives = 63/165 (38%), Gaps = 5/165 (3%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
            E ER  E E         + +Q   T++  + E    I  +   + E E E        +
Sbjct: 648  EGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE-----EDESPKKKKRKS 1228
            G  +  +  +  ET  E  E+E  ++     ++V+   E E E     E E  +K+    
Sbjct: 708  GETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHE 767

Query: 1229 TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
             +T A+ K  ED+ E       + K  E  E   E EGE E  + 
Sbjct: 768  GETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEK 812



 Score = 37.3 bits (86), Expect = 0.073
 Identities = 28/114 (24%), Positives = 44/114 (38%), Gaps = 12/114 (10%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            E   EE +        G+ G++S   E   E E E K   +       +   E E E + 
Sbjct: 643  ERTGEEGER--PTEAEGENGEESG-GEAEQEGETETKGENESEGEIPAERKGEQEGEGEI 699

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
              K+         A  K + + EE     + + +  E+E E+E  E  EE ED+
Sbjct: 700  EAKE---------ADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744



 Score = 36.9 bits (85), Expect = 0.077
 Identities = 33/197 (16%), Positives = 67/197 (34%), Gaps = 14/197 (7%)

Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYT 1139
             ER   E +       E  +   +   E E     E +   + E  +    R+ +Q    
Sbjct: 642  GERTGEEGERPTEAEGENGE---ESGGEAEQEGETETKGENESEGEI-PAERKGEQEGEG 697

Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE---EEEK 1196
            +    +   K   +    ++E E E     T+  G  + G++    E   E E   + E 
Sbjct: 698  EIEAKEADHKGETEAEEVEHEGETEAEG--TEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755

Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
              +  R+  + +   E E +EDE          +     + K D+  +  V    +    
Sbjct: 756  ETEGDRKETEHEGETEAEGKEDED-----EGEIQAGEDGEMKGDEGAEGKVEHEGETEAG 810

Query: 1257 EEEEVEEAEGEEEEEDD 1273
            E++E E     + ++ +
Sbjct: 811  EKDEHEGQSETQADDTE 827



 Score = 35.0 bits (80), Expect = 0.37
 Identities = 35/137 (25%), Positives = 56/137 (40%), Gaps = 12/137 (8%)

Query: 1146 EWLKAIEDDGADDYEEEEEEVKPVTKKRGSG--KKGKKSYVEETAPEDEEEEKPVKKRRR 1203
            +W K IE      + +E+   +PV K    G   KG  +  E T     EE +   +   
Sbjct: 603  KWNKQIEL-----WVKEQLSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEG 657

Query: 1204 RKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE- 1262
                +   E E E +   K +     + PA+ K +++ E +    +   K   E EEVE 
Sbjct: 658  ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717

Query: 1263 ----EAEGEEEEEDDKA 1275
                EAEG E+E + + 
Sbjct: 718  EGETEAEGTEDEGEIET 734


>gnl|CDD|221124 pfam11496, HDA2-3, Class II histone deacetylase complex subunits 2
            and 3.  This family of class II histone deacetylase
            complex subunits HDA2 and HDA3 is found in fungi, The
            member from S. pombe is referred to as Ccq1. These
            proteins associate with HDA1 to generate the activity of
            the HDA1 histone deacetylase complex. HDA1 interacts with
            itself and with the HDA2-HDA3 subcomplex to form a
            probable tetramer and these interactions are necessary
            for catalytic activity. The HDA1 histone deacetylase
            complex is responsible for the deacetylation of lysine
            residues on the N-terminal part of the core histones
            (H2A, H2B, H3 and H4). Histone deacetylation gives a tag
            for epigenetic repression and plays an important role in
            transcriptional regulation, cell cycle progression and
            developmental events. HDA2 and HDA3 have a conserved
            coiled-coil domain towards their C-terminus.
          Length = 279

 Score = 51.6 bits (124), Expect = 1e-06
 Identities = 37/178 (20%), Positives = 73/178 (41%), Gaps = 17/178 (9%)

Query: 870  TSGKFELLDRILP--KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDR 927
            TSGKF +L+ ++      E    VL+  +  + + ++E   + +  +Y RL G +  E+ 
Sbjct: 93   TSGKFLVLNDLINLLIRSERDLHVLIISRSVKTLDLVEALLLGKGLNYKRLSGESLYEEN 152

Query: 928  GELLRKFNEPESKYFIFLLSTRA---GGLGLNLQAA-DTVIIFDSDWNPH----QDLQAQ 979
             +      +     +I L ++         L      D +I FD   +      + L+ Q
Sbjct: 153  HK--VSDKKGSLSLWIHLTTSDGLTNTDSSLLSNYKFDLIISFDPSLDTSLPSIESLRTQ 210

Query: 980  DRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHE 1037
            +R       N   ++RL+ VNS+E   L   +   N  + ++QA +  +   G    +
Sbjct: 211  NRRG-----NLTPIIRLVVVNSIEHVELCFPKKYPNRLDYLVQASVVLRDIVGDLPPD 263



 Score = 48.5 bits (116), Expect = 9e-06
 Identities = 33/148 (22%), Positives = 62/148 (41%), Gaps = 17/148 (11%)

Query: 167 TSGKFELLDRILP--KFKETKHRVLMFCQMTQLMTILEDYFIWRDFSYLRLDGTTKSEDR 224
           TSGKF +L+ ++      E    VL+  +  + + ++E   + +  +Y RL G +  E+ 
Sbjct: 93  TSGKFLVLNDLINLLIRSERDLHVLIISRSVKTLDLVEALLLGKGLNYKRLSGESLYEEN 152

Query: 225 GELLRKFNEPESKYFIFLLSTRA---GGLGLNLQAA-DTVIIFDSDWNPH----QDLQAQ 276
            +      +     +I L ++         L      D +I FD   +      + L+ Q
Sbjct: 153 HK--VSDKKGSLSLWIHLTTSDGLTNTDSSLLSNYKFDLIISFDPSLDTSLPSIESLRTQ 210

Query: 277 DRAHRIGQKNEVRVLRLITVNSVEERIL 304
           +R       N   ++RL+ VNS+E   L
Sbjct: 211 NRRG-----NLTPIIRLVVVNSIEHVEL 233


>gnl|CDD|197800 smart00592, BRK, domain in transcription and CHROMO domain
           helicases. 
          Length = 45

 Score = 43.5 bits (103), Expect = 1e-05
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 439 LEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
             + R+ V    +  KL G+DAP AK ++ WL  +P +EV 
Sbjct: 1   DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVA 41


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3 subunit. 
            This is a family of proteins which are subunits of the
            eukaryotic translation initiation factor 3 (eIF3). In
            yeast it is called Hcr1. The Saccharomyces cerevisiae
            protein eIF3j (HCR1) has been shown to be required for
            processing of 20S pre-rRNA and binds to 18S rRNA and eIF3
            subunits Rpg1p and Prt1p.
          Length = 242

 Score = 47.3 bits (113), Expect = 2e-05
 Identities = 33/123 (26%), Positives = 54/123 (43%), Gaps = 26/123 (21%)

Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
             D+++EE E      K     K           +DE+E+  VK           D  ++E
Sbjct: 2    ADWDDEEFEPPAPPAKAVVKDKW----------DDEDEDDDVK-----------DSWDEE 40

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS 1276
            EDE  +++K    K  AK K K+  + K  + +++K   E+EE+      E+  ED+ A 
Sbjct: 41   EDEEKEEEK---AKVAAKAKAKKALKAK--IEEKEKAKREKEEKGLRELEEDTPEDELAE 95

Query: 1277 KPR 1279
            K R
Sbjct: 96   KLR 98



 Score = 38.5 bits (90), Expect = 0.015
 Identities = 19/72 (26%), Positives = 37/72 (51%)

Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
            +DD  D ++EEE+E K   K + + K   K  ++    E E+ ++  +++  R+  + + 
Sbjct: 30   DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89

Query: 1212 EDEDEEDESPKK 1223
            EDE  E    +K
Sbjct: 90   EDELAEKLRLRK 101


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
            subunit (TFIIF-alpha).  Transcription initiation factor
            IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
            II-associating protein 74 (RAP74) is the large subunit of
            transcription factor IIF (TFIIF), which is essential for
            accurate initiation and stimulates elongation by RNA
            polymerase II.
          Length = 528

 Score = 48.8 bits (116), Expect = 2e-05
 Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 22/181 (12%)

Query: 1100 WMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
             M+K       +  +E+ E E E+   GRG    K +   D   D E     + D +D  
Sbjct: 181  MMMKAAKNGPAAFGDEDEETEGEKGGGGRG----KDLKIKDLEGDDE----DDGDESDKG 232

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
             E+ +E K   KK+   K  KK   ++      +++          +    D D++  +E
Sbjct: 233  GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA--------DEYDSDDGDDEGREE 284

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPR 1279
                    S   P + + K   E         K  +E++E+ EE+E E+ EE+   SK  
Sbjct: 285  DYISDSSASGNDPEEREDKLSPEIP------AKPEIEQDEDSEESEEEKNEEEGGLSKKG 338

Query: 1280 K 1280
            K
Sbjct: 339  K 339



 Score = 45.0 bits (106), Expect = 2e-04
 Identities = 36/170 (21%), Positives = 67/170 (39%), Gaps = 10/170 (5%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
             +     + G  + + ++IK+L      D DE +     E+ + EK +    + ++ +K 
Sbjct: 197  DEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKG--GEDGDEEKSKKKKKKLAKNKK- 253

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEE--EEEVKPVTKKRGSGKKGKKSYVEETAPEDEE 1193
                  L D +  K   DD AD+Y+ +  ++E +       S   G      E     E 
Sbjct: 254  -----KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEI 308

Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE 1243
              KP  ++    +    +++E+E   S K KK K  K       K+D + 
Sbjct: 309  PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358



 Score = 34.2 bits (78), Expect = 0.57
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 18/104 (17%)

Query: 1187 TAPEDEEEEKPVKK---------RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
            TA E EEE K  KK          +  K    +  DEDEE E  K    +      KD  
Sbjct: 160  TAEEAEEEMKRRKKTANGFQLMMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLE 219

Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             +D+++             ++   +  E + +++  K +K +K 
Sbjct: 220  GDDEDDGDE---------SDKGGEDGDEEKSKKKKKKLAKNKKK 254


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This protein,
            which interacts with both microtubules and TRAF3 (tumour
            necrosis factor receptor-associated factor 3), is
            conserved from worms to humans. The N-terminal region is
            the microtubule binding domain and is well-conserved; the
            C-terminal 100 residues, also well-conserved, constitute
            the coiled-coil region which binds to TRAF3. The central
            region of the protein is rich in lysine and glutamic acid
            and carries KKE motifs which may also be necessary for
            tubulin-binding, but this region is the least
            well-conserved.
          Length = 506

 Score = 48.0 bits (114), Expect = 3e-05
 Identities = 38/155 (24%), Positives = 73/155 (47%), Gaps = 7/155 (4%)

Query: 1134 KQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEE 1193
            K+V+   S       K  ++   +  +EEE+E + V +++   K+  K   ++  P++E 
Sbjct: 78   KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137

Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKST-------KTPAKDKHKEDDEEKHV 1246
            +EK   K + ++K +  +E  D E+E  +++ R  +       K P   K +  +EEK  
Sbjct: 138  KEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQR 197

Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             + R+    + EE     E E+EE+D K  +   S
Sbjct: 198  QAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTS 232



 Score = 47.6 bits (113), Expect = 4e-05
 Identities = 18/144 (12%), Positives = 58/144 (40%)

Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
            Q     + +     +A++       +    + KP  + +    K ++   E+   E +++
Sbjct: 61   QALAKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK 120

Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
            ++  K+  + +K +   +++    E  K+K++K  +   +++ K+ +  +     +K   
Sbjct: 121  KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180

Query: 1255 VEEEEEVEEAEGEEEEEDDKASKP 1278
             +   + +E   EE++        
Sbjct: 181  KKPPNKKKEPPEEEKQRQAAREAV 204



 Score = 38.3 bits (89), Expect = 0.027
 Identities = 19/106 (17%), Positives = 44/106 (41%), Gaps = 7/106 (6%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKP------VKKRRRRKKVQISDED 1213
            +EE +E +P  +K    +K  +   +    +  E  +         K++   K +   E+
Sbjct: 134  KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193

Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE 1259
            E +   + +  K K  + P  ++ +E +E+     +     +EE+E
Sbjct: 194  EKQRQAAREAVKGK-PEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238



 Score = 34.9 bits (80), Expect = 0.34
 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 4/168 (2%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
            EE+++++++     +  + +++    +    +E  K  E    +  + EEE+ +   + +
Sbjct: 115  EEKKKKKEKPKEEPKDRKPKEEAK--EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAK 172

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
               KK  K        E  EEEK  +  R   K +  + D +EE E  K++     +   
Sbjct: 173  SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE--KEEDDGKDRETT 230

Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                +ED+  +     R+     ++ +   +    E  +       + 
Sbjct: 231  TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRP 278


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
          Length = 482

 Score = 45.7 bits (109), Expect = 1e-04
 Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 3/90 (3%)

Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
            A   E  EEE++ +T   GS K  KK        E + EE+  +K+++    +  +E+E+
Sbjct: 389  AAFLELTEEEIEFLT---GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
            EE E  +++K +  +   ++K +E++++K 
Sbjct: 446  EEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 39.1 bits (92), Expect = 0.013
 Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
             +EE E     ++  KK++   E  +++ E  KK+K+K      K + +E++EEK    +
Sbjct: 395  TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE-EEEEEEKEKKEE 453

Query: 1250 RKKKVVEEEEEVEEAEGEEE 1269
             K++  EE EE +E E E++
Sbjct: 454  EKEEEEEEAEEEKEEEEEKK 473



 Score = 38.7 bits (91), Expect = 0.018
 Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 9/90 (10%)

Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
             EEE + +   ++  K  I    E  E +  ++KK K  K  A  K +E++EE+      
Sbjct: 395  TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE------ 447

Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
              K  +EEE+ EE E  EEE++++  K +K
Sbjct: 448  --KEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 36.8 bits (86), Expect = 0.080
 Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 1/71 (1%)

Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
             E+E E     KK  +K  K   +   K+ +EEK    K+     ++EEE EE + ++EE
Sbjct: 395  TEEEIEFLTGSKKATKKIKKI-VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453

Query: 1271 EDDKASKPRKS 1281
            E ++  +  + 
Sbjct: 454  EKEEEEEEAEE 464



 Score = 36.1 bits (84), Expect = 0.12
 Identities = 28/138 (20%), Positives = 64/138 (46%), Gaps = 28/138 (20%)

Query: 288 VRVLRLITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKE 347
           +  L +I  ++ E     AA  +L  +E E+   +   T+K+ K      I  + E+K+E
Sbjct: 371 LPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKK------IVEKAEKKRE 424

Query: 348 QERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKK 407
           +E+ EK+  ++  A  ++   +  +++K +                    +E +EE+++ 
Sbjct: 425 EEKKEKK--KKAFAGKKKEEEEEEEKEKKE--------------------EEKEEEEEEA 462

Query: 408 MREEKKKKKKSIKQKILD 425
             E++++++K  KQ  L 
Sbjct: 463 EEEKEEEEEKKKKQATLF 480



 Score = 33.7 bits (78), Expect = 0.76
 Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 7/76 (9%)

Query: 1213 DEDEE-----DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE--EAE 1265
             E+E       +   KK +K  +   K + +E  E+K      KKK  EEEEE E  E E
Sbjct: 395  TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 1266 GEEEEEDDKASKPRKS 1281
             EEEEE+ +  K  + 
Sbjct: 455  KEEEEEEAEEEKEEEE 470


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 44.9 bits (107), Expect = 3e-04
 Identities = 36/152 (23%), Positives = 64/152 (42%), Gaps = 34/152 (22%)

Query: 1049 EEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEI-KELPDWMVKDDDE 1107
              E  E+ ++ T ++LI   ++EL   +K      +      L E+ K+  + ++   + 
Sbjct: 201  GFERLELEEEGTPSELIREIKEELEEIEK-----ERESLLEELKELAKKYLEELLALYEY 255

Query: 1108 VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
            +E        E E+ EAL    S+  K  D T ++    W+        D  ++ +E + 
Sbjct: 256  LEI-------ELERAEAL----SKFLKT-DKTFAIEG--WVPE------DRVKKLKELID 295

Query: 1168 PVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK 1199
              T        G  +YVE   P++EEEE P K
Sbjct: 296  KAT--------GGSAYVEFVEPDEEEEEVPTK 319


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 44.3 bits (104), Expect = 4e-04
 Identities = 38/248 (15%), Positives = 82/248 (33%), Gaps = 20/248 (8%)

Query: 1043 LHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWM- 1101
            L Q  EE    +V     VN   +  ++E       +A   +   +      +   + + 
Sbjct: 20   LRQKQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERLARREERRDERFSEALE 79

Query: 1102 --VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
               +      +    E   R +E++    G+      +     + +E  +  E +G    
Sbjct: 80   RQKEFKPTSTDQSLSEPSRRMQEDS----GAENETVEEEEKEESREEREEVEETEGVTKS 135

Query: 1160 EEEEEEVKPVTKKRGSGKK----GKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
            E++ +       ++   +      +K         + E      K       +   E   
Sbjct: 136  EQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQ 195

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE----EEE 1271
             E     +K ++  +  A     E +E K    + ++KV+EEEE+  + E  +    EEE
Sbjct: 196  VEAGKEFEKLKQKQQEAAL----ELEELK-KKREERRKVLEEEEQRRKQEEADRKSREEE 250

Query: 1272 DDKASKPR 1279
            + +  K  
Sbjct: 251  EKRRLKEE 258



 Score = 33.9 bits (77), Expect = 0.68
 Identities = 52/227 (22%), Positives = 82/227 (36%), Gaps = 28/227 (12%)

Query: 1037 EFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKE 1096
            EF  T   Q   E       D    N+ +   E E +  ++ + E  +G TK        
Sbjct: 83   EFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDW- 141

Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGA 1156
                     D  E    E+E E E+EE        +      T  L   E   +      
Sbjct: 142  --------RDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEG 193

Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
               E  +E  K   K+            +E A E EE +K   KR  R+KV   +E   +
Sbjct: 194  AQVEAGKEFEKLKQKQ------------QEAALELEELKK---KREERRKVLEEEEQRRK 238

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
            ++E+     RKS +   K + KE+ E +   +  K++ V E+   E+
Sbjct: 239  QEEA----DRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSED 281



 Score = 33.1 bits (75), Expect = 1.1
 Identities = 42/193 (21%), Positives = 75/193 (38%), Gaps = 39/193 (20%)

Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
             DDE E +   E R RE+EE L     RQ+++      +T Q  + +            +
Sbjct: 1    VDDEEEAA--RERRRREREEQL-----RQKQEEGSLGQVTTQVEVNS-------QNSVPD 46

Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK---KRRRRKKVQISD--------- 1211
            EE K  T    +        +E  A  +E  ++      +R++  K   +D         
Sbjct: 47   EESKTSTDDEAAL-------LERLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRR 99

Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEE---VEEAEGEE 1268
              ED   E+   ++ +  ++    + +E+ EE   V+K ++K    + E    EE E E 
Sbjct: 100  MQEDSGAENETVEEEEKEESR---EEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156

Query: 1269 EEEDDKASKPRKS 1281
            EEE+       + 
Sbjct: 157  EEEEKPKRGSLEE 169


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
            the C subunit of DNA polymerase delta. It carries the
            essential residues for binding to the Pol1 subunit of
            polymerase alpha, from residues 293-332, which are
            characterized by the motif D--G--VT, referred to as the
            DPIM motif. The first 160 residues of the protein form
            the minimal domain for binding to the B subunit, Cdc1, of
            polymerase delta, the final 10 C-terminal residues,
            362-372, being the DNA sliding clamp, PCNA, binding
            motif.
          Length = 427

 Score = 44.1 bits (104), Expect = 4e-04
 Identities = 42/151 (27%), Positives = 67/151 (44%), Gaps = 30/151 (19%)

Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
              +ED+ A+    +E+E +   K  G           E +  +EE E+  K++R+R K  
Sbjct: 253  VILEDESAEPTGLDEDEDEDEPKPSG-----------ERSDSEEETEEKEKEKRKRLKKM 301

Query: 1209 ISDEDEDEED----ESPK-KKKRKSTKTPAKDKHKEDDEEKHV----VSKRK-----KKV 1254
            + DEDEDEE     ESP  +++ +  + P   K +E+ EE  V      +R      KK 
Sbjct: 302  MEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKK 361

Query: 1255 VEEEEE-----VEEAEGEEEEEDDKASKPRK 1280
              ++EE      +  E E   ED+    P K
Sbjct: 362  TFKDEEGYLVTKKVYEWESFSEDEAEPPPTK 392



 Score = 40.2 bits (94), Expect = 0.006
 Identities = 40/190 (21%), Positives = 66/190 (34%), Gaps = 25/190 (13%)

Query: 1076 QKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQ 1135
            +K + +     T     E +E     V  +DE       +E E E E    G  S   ++
Sbjct: 228  EKKEKKEASESTVKEESE-EESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
             +  +    ++ LK + +D  +D +EE E V     +    ++ +   + +   E EE  
Sbjct: 287  TEEKEK-EKRKRLKKMMED--EDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVT 343

Query: 1196 KPVKKRRRR------KKVQISDED---------------EDEEDESPKKKKRKSTKTPAK 1234
                  RRR      KK    DE+               EDE +  P K K K +     
Sbjct: 344  VSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVP 403

Query: 1235 DKHKEDDEEK 1244
               K+    K
Sbjct: 404  AAAKKPKAPK 413



 Score = 39.8 bits (93), Expect = 0.007
 Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 9/112 (8%)

Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
            KK+   KK KK   E T  E+ EEE   +      +       +++EDE   K   + + 
Sbjct: 223  KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282

Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVE-EEEEVEEAEGEE---EEEDDKASKP 1278
            +  + + KE ++ K     R KK++E E+E+ E     E   EEE+ +  +P
Sbjct: 283  SEEETEEKEKEKRK-----RLKKMMEDEDEDEEMEIVPESPVEEEESEEPEP 329



 Score = 37.9 bits (88), Expect = 0.033
 Identities = 18/116 (15%), Positives = 43/116 (37%), Gaps = 10/116 (8%)

Query: 1168 PVTKKRGSGKKGKKSYVEETAPED--EEEEKPVKKRRRRKKVQISDEDEDEEDESP---K 1222
               K+    K    S   +   ++  E++E      +   + +    D   EDES     
Sbjct: 205  ASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTG 264

Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
              + +    P     + D EE+    +++K      + +++   +E+E+++    P
Sbjct: 265  LDEDEDEDEPKPSGERSDSEEETEEKEKEK-----RKRLKKMMEDEDEDEEMEIVP 315


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 44.6 bits (105), Expect = 5e-04
 Identities = 42/248 (16%), Positives = 95/248 (38%), Gaps = 2/248 (0%)

Query: 1033 SERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLV 1092
             E     +     + E  +   +   E + +   R ++EL   +    +      K +  
Sbjct: 674  KELLAEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELK--KLKLEKEELLADKVQEA 731

Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
            + K   +  + +    E    EE+   +KEE    +     K+ +  +     E LK  E
Sbjct: 732  QDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE 791

Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
            +       +EEE      + +   +  ++  +     E  +EE+  +     K+ Q  ++
Sbjct: 792  EKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEK 851

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
              +EE E  +++  K          +E+ EE+ +  + + K  +E+EE +E E E ++++
Sbjct: 852  LAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDN 911

Query: 1273 DKASKPRK 1280
                K  +
Sbjct: 912  LLEEKENE 919



 Score = 38.8 bits (90), Expect = 0.026
 Identities = 45/207 (21%), Positives = 82/207 (39%), Gaps = 6/207 (2%)

Query: 1065 IARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEA 1124
              + E    L ++ +           L   +       K   E      + E E E    
Sbjct: 170  RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLY 229

Query: 1125 LMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYV 1184
            L      + +     + L D++           + E+EEE +  V K+    +K KK   
Sbjct: 230  LDYLKLNEERIDLLQELLRDEQEEIESS---KQELEKEEEILAQVLKENKEEEKEKKLQE 286

Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
            EE     +EEE+   +  + ++ ++ DE++ +E E   KK  K  K   K+K + ++ EK
Sbjct: 287  EELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELK---KEKEEIEELEK 343

Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
             +     K+  EEEEE +  + +E+ E
Sbjct: 344  ELKELEIKREAEEEEEEQLEKLQEKLE 370



 Score = 34.2 bits (78), Expect = 0.60
 Identities = 38/222 (17%), Positives = 82/222 (36%), Gaps = 14/222 (6%)

Query: 312 NMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLM----AEDEEGY 367
               K+  R      +  N A +   +   K Q+ + +   K+ +         E EE  
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 368 RKLIDQKK--------DKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSI 419
              +D  K         + L     +  E      +  +E   +  K+ +EE+K+KK  +
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKK--L 284

Query: 420 KQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVV 479
           +++ L     + +  +S  L+  R  V +     +   E     K++K+        E  
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKE 344

Query: 480 MEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEA 521
           +++ E +   +EE +E+ E+  + + + + E    K    E 
Sbjct: 345 LKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESER 386



 Score = 30.7 bits (69), Expect = 6.9
 Identities = 30/216 (13%), Positives = 81/216 (37%), Gaps = 9/216 (4%)

Query: 315 EKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQK 374
                       +++  +        E   +  +E  E+E+ ++L  E+ +   K  ++ 
Sbjct: 240 IDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEEL 299

Query: 375 KDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKK---KKSIKQKILDGNMDKL 431
           K + L   L +         +  ++  ++ +K++++EK++    +K +K+  +    ++ 
Sbjct: 300 KSELL--KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357

Query: 432 DLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSED----EG 487
           + ++  +L++    ++E     K    +   +    +        E   E        E 
Sbjct: 358 EEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQ 417

Query: 488 SEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATY 523
            ED   +EKKEE       +++ + +     EE   
Sbjct: 418 EEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEE 453



 Score = 30.3 bits (68), Expect = 9.8
 Identities = 33/251 (13%), Positives = 88/251 (35%), Gaps = 23/251 (9%)

Query: 299 VEERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQK-----KEQERIEK 353
             E +    R   + ++ +       + +K  K         EKE K     +E E  E+
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEE 359

Query: 354 ERMRRLMAEDEEGYRKLIDQKKDKRLAF---------------LLSQTDEYISNLTQMVK 398
           E++ +L  + E+   +L+ +KK +                      +  + +  L++  +
Sbjct: 360 EQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEE 419

Query: 399 EHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGE 458
           +  +E++K+  +  ++ ++S++ K      +K +L++ +          + + D     +
Sbjct: 420 DLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETK 479

Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAG 518
                +Q++  L      E   ++S+   + +                  +      D G
Sbjct: 480 LVKLLEQLELLLLRQKLEEASQKESK---AREGLAVLLALIKDGVGLRIISAAGRGGDLG 536

Query: 519 EEATYYSIAHT 529
                Y +A +
Sbjct: 537 VAVANYKVAIS 547


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
            related to Mpp10 (M phase phosphoprotein 10). The U3
            small nucleolar ribonucleoprotein (snoRNP) is required
            for three cleavage events that generate the mature 18S
            rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
            depletion of Mpp10, a U3 snoRNP-specific protein, halts
            18S rRNA production and impairs cleavage at the three U3
            snoRNP-dependent sites.
          Length = 613

 Score = 43.4 bits (102), Expect = 7e-04
 Identities = 50/243 (20%), Positives = 94/243 (38%), Gaps = 20/243 (8%)

Query: 1057 DDETV-NQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMY-- 1113
            D E +  QL  + +  L    +   E    +    +  + +L D    +D    + M   
Sbjct: 55   DAEQIWQQLKLQNKPILDDLNQKYVEFLINKE--HIRVLAKLQDSESHEDGSDGSDMDSE 112

Query: 1114 ------EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
                  EEE E E  E  M     +    + ++S  +       EDD     EEEE   +
Sbjct: 113  DSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEE 172

Query: 1168 PVTK-KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
              +  +    KK  KS V++   + +E  + ++     ++  + DED+ E+      +  
Sbjct: 173  KESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDG 232

Query: 1227 KSTKTPAKDKHKEDDEEKHV--------VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
            K  +     + +EDDEE ++          K KKK   ++ E+E+ E ++E    +A   
Sbjct: 233  KDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSK 292

Query: 1279 RKS 1281
             + 
Sbjct: 293  PEE 295



 Score = 38.4 bits (89), Expect = 0.027
 Identities = 49/246 (19%), Positives = 93/246 (37%), Gaps = 14/246 (5%)

Query: 1044 HQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVK 1103
            H+D  +  + +  D     +     EDE    + +D E            +++L D   +
Sbjct: 100  HEDGSDGSDMDSEDSADDEE--EEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETE 157

Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD----QEWLKAIEDD----G 1155
            DD+E    M EEE   EKE        ++  +    D         E+L+A E +     
Sbjct: 158  DDEE--KKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAAL 215

Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
             D+ + E+         +     G     E+    + E E     + + KK    D+ E 
Sbjct: 216  GDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAEL 275

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
            E+DE  K+  +K   +  +++ +EDDE++    + +++  E   +  + +    E  D  
Sbjct: 276  EDDEPDKEAVKKEADSKPEEEDEEDDEQED--DQDEEEPPEAAMDKVKLDEPVLEGVDLE 333

Query: 1276 SKPRKS 1281
            S    S
Sbjct: 334  SPKELS 339



 Score = 37.7 bits (87), Expect = 0.042
 Identities = 41/230 (17%), Positives = 87/230 (37%), Gaps = 17/230 (7%)

Query: 1046 DDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDD 1105
            ++ E    ++ DDET +    + E+E    +K   E+   E K      K   D      
Sbjct: 142  NESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFD----KSGVDDKFFKL 197

Query: 1106 DEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEE 1165
            DE+   +   E E E              + D+ D   D       ++D     +EE++E
Sbjct: 198  DEMNEFLEATEAEEEA---------ALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDE 248

Query: 1166 VKPVTKKRGSGK----KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESP 1221
               +  +         K K +  +    +DE +++ VKK    K  +  +ED+++ED+  
Sbjct: 249  EGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQD 308

Query: 1222 KKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
            +++  ++     K      +       K      + + ++++   + E+E
Sbjct: 309  EEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKE 358


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 43.3 bits (102), Expect = 0.001
 Identities = 76/359 (21%), Positives = 128/359 (35%), Gaps = 72/359 (20%)

Query: 989  NEVRVLRLITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHEF-LQTILHQDD 1047
            N+VR +  +    V+  I+   R K ++ E++ Q G F      ++  E  +        
Sbjct: 1039 NKVRFILAV----VDGEIIVNNRKKADLVEELRQKG-FTPFPKKAKSVEAAVAGATDDAA 1093

Query: 1048 EEEEENEVPDDETV-----NQLIARTEDELTL--FQKMDAERRKGE--------TKPRLV 1092
            EEEEE +V     V     + L++     LTL   Q++ A+R K          T P+ +
Sbjct: 1094 EEEEEIDVSSSSGVRGSDYDYLLSMAIGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSL 1153

Query: 1093 EIKELPDW-MVKDDDEVENSMYEEEREREKEEALMGRG------SRQRKQ----VDYTDS 1141
             +K+L       D  + E++  EE RE+ +  A  G        SRQ  +       T  
Sbjct: 1154 WLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213

Query: 1142 LTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKK-------------------- 1181
             ++ E  +      A + E   E VKP   + G+ KK                       
Sbjct: 1214 ASESETTEETYGSSAMETENVAEVVKP-KGRAGAKKKAPAAAKEKEEEDEILDLKDRLAA 1272

Query: 1182 ----------SYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-----KR 1226
                      + +EET           +K+       ISD D+D++D + +       K+
Sbjct: 1273 YNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKK 1332

Query: 1227 KSTKTPAKDKHKE----DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
            K  + PA    K        +K   +  +       E ++ AE      + K  K R S
Sbjct: 1333 KGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRAS 1391



 Score = 40.2 bits (94), Expect = 0.009
 Identities = 32/133 (24%), Positives = 49/133 (36%), Gaps = 10/133 (7%)

Query: 1150 AIEDDGADDYEEEEEEV-KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
             I D   DD +   E       KK+G  K    +      P   ++  P   +  +K + 
Sbjct: 1309 VISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLT 1368

Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE--EEVEEAEG 1266
               +  +    SP+KK RK   +P   K          V  R     E E  E V  +  
Sbjct: 1369 EMLKPAEAIGISPEKKVRKMRASPFNKK-------SGSVLGRAATNKETESSENVSGSSS 1421

Query: 1267 EEEEEDDKASKPR 1279
             E++E D ++KPR
Sbjct: 1422 SEKDEIDVSAKPR 1434



 Score = 33.7 bits (77), Expect = 0.75
 Identities = 37/165 (22%), Positives = 65/165 (39%), Gaps = 22/165 (13%)

Query: 381  FLLSQTDEYISNLTQ-MVKEHKEEQQKKMREEKKKKKKSIKQ------KILDGNMDKLDL 433
            +LLS     I  LT   V+E   ++ K   E +  KK + K         L+  +DKLD 
Sbjct: 1114 YLLSMA---IGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDK 1170

Query: 434  DESSQLEDLRITVK-----ETNSDNKLRGE---DAPPAKQIKEWLASHPGWEVVMEDSED 485
            +++   E      +     E+ +  K+  +      P K  K+   S    E     + +
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230

Query: 486  EGSEDEEMKEKKEEPSK----AVGEKKTEDDEYKDAGEEATYYSI 526
              +  E +K K    +K    A  ++K E+DE  D  +    Y++
Sbjct: 1231 TENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNL 1275



 Score = 30.2 bits (68), Expect = 8.5
 Identities = 18/90 (20%), Positives = 33/90 (36%), Gaps = 15/90 (16%)

Query: 1146 EWLKAIEDDG-ADDYEEEEEEVKPVTKKRGS-----GKKGKKSYVEETAPEDEEEE---- 1195
            E LK  E  G + + +  +    P  KK GS         +    E  +     E+    
Sbjct: 1369 EMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEID 1428

Query: 1196 -----KPVKKRRRRKKVQISDEDEDEEDES 1220
                 +P +  R++    +SD + +  D+S
Sbjct: 1429 VSAKPRPQRANRKQTTYVLSDSESESADDS 1458


>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase.  Members of this family
           include the DEAD and DEAH box helicases. Helicases are
           involved in unwinding nucleic acids. The DEAD box
           helicases are involved in various aspects of RNA
           metabolism, including nuclear transcription, pre mRNA
           splicing, ribosome biogenesis, nucleocytoplasmic
           transport, translation, RNA decay and organellar gene
           expression.
          Length = 169

 Score = 40.7 bits (96), Expect = 0.001
 Identities = 31/143 (21%), Positives = 56/143 (39%), Gaps = 15/143 (10%)

Query: 574 GLGKTVQTIALLTYLMEKKKNLGPY--LVIVPLSTLSNWVLE-FEKWAPSANV---VAYK 627
           G GKT     LL  L       G    LV+ P   L+  + E  +K      +   +   
Sbjct: 24  GSGKT--LAFLLPILQALLPKKGGPQALVLAPTRELAEQIYEELKKLFKILGLRVALLTG 81

Query: 628 GSPLARRALQGRMKNSKFNVLLTTYEFI---IKDKSVLSKIQWKCMIIDEGHRM--KNHH 682
           G+ L  +A +  +K  K ++L+ T   +   ++   +      K +++DE HR+      
Sbjct: 82  GTSLKEQARK--LKKGKADILVGTPGRLLDLLRRGKLKLLKNLKLLVLDEAHRLLDMGFG 139

Query: 683 CKLTQTLNTFYNSSHRVLLTGTP 705
             L + L+        +LL+ T 
Sbjct: 140 DDLEEILSRLPPDRQILLLSATL 162


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
            family of proteins are involved in 60S ribosomal
            biogenesis. They are specifically involved in the
            processing beyond the 27S stage of 25S rRNA maturation.
            This family contains sequences that bear similarity to
            the glioma tumour suppressor candidate region gene 2
            protein (p60). This protein has been found to interact
            with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 42.0 bits (99), Expect = 0.002
 Identities = 33/178 (18%), Positives = 68/178 (38%), Gaps = 24/178 (13%)

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
            +D  E+    YE+E + EK+   + R   ++ +    ++    E  + + ++  DD EEE
Sbjct: 193  EDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEE 252

Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
             ++         S  +G +S  E        + K   +R + K+               +
Sbjct: 253  SDD--------ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKR---------------R 289

Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
            K+  +  K   + K K     + +    K+   +E+    + E  +E  + K  K RK
Sbjct: 290  KELEREAKEEKQLKKKLAQLAR-LKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
            Rpc31.  RNA polymerase III contains seventeen subunits in
            yeasts and in human cells. Twelve of these are akin to
            RNA polymerase I or II and the other five are RNA pol
            III-specific, and form the functionally distinct groups
            (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
            and Rpc82 form a cluster of enzyme-specific subunits that
            contribute to transcription initiation in S.cerevisiae
            and H.sapiens. There is evidence that these subunits are
            anchored at or near the N-terminal Zn-fold of Rpc1,
            itself prolonged by a highly conserved but RNA polymerase
            III-specific domain.
          Length = 221

 Score = 40.9 bits (96), Expect = 0.002
 Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 12/85 (14%)

Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPK--KKKRKSTKTPAKDKHKEDDEEKHVVS 1248
            +++  K +   + ++KV +  E+E++ DE     +KK K  +    D+  E DEE+    
Sbjct: 123  NKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE--- 179

Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDD 1273
                   EEEEE +E   +++++DD
Sbjct: 180  -------EEEEEEDEDFDDDDDDDD 197



 Score = 30.5 bits (69), Expect = 4.7
 Identities = 16/60 (26%), Positives = 32/60 (53%)

Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
            +++        +   K     + +ED +EK  + ++K K +E E+  EE E +EEEE+++
Sbjct: 123  NKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part of
            a large ribonucleoprotein complex containing the U3
            snoRNA. Depletion of the Utp proteins impedes production
            of the 18S rRNA, indicating that they are part of the
            active pre-rRNA processing complex. This large RNP
            complex has been termed the small subunit (SSU)
            processome.
          Length = 728

 Score = 42.4 bits (100), Expect = 0.002
 Identities = 48/251 (19%), Positives = 99/251 (39%), Gaps = 23/251 (9%)

Query: 1046 DDEEEEENEVPDDETVNQLIAR-------------TEDELTLFQKMDA--ERRKGETKPR 1090
             +EE+E+++  DD+  N  + R                 L   + M     R+K E    
Sbjct: 334  SEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAE 393

Query: 1091 LVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDS-LTDQEWLK 1149
            + E++   +   + D+E      ++   R K     G  + +++            E+ +
Sbjct: 394  IEELRRELEGEEESDEEENEEPSKKNVGRRK----FGPENGEKEAESKKLKKENKNEFKE 449

Query: 1150 AIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI 1209
              E D  ++  E+EEE K         K+ +K+  EE   E +EE   +K      K   
Sbjct: 450  KKESDEEEE-LEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKS-- 506

Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
            + + + ++  S K  K  +  + A  K K+  +++  +      + EE+    + + EE+
Sbjct: 507  AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566

Query: 1270 EEDDKASKPRK 1280
            E+D++     K
Sbjct: 567  EDDEELPFLFK 577



 Score = 41.2 bits (97), Expect = 0.004
 Identities = 36/191 (18%), Positives = 73/191 (38%), Gaps = 30/191 (15%)

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDG------- 1155
            +DDDE ++        R+K   L     ++ +  +    L   ++++  E          
Sbjct: 339  EDDDEDDDDGENPWMLRKKLGKL-----KEGEDDEENSGLLSMKFMQRAEARKKEENDAE 393

Query: 1156 ----ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP-EDEEEEKPVKKRRRRKKVQIS 1210
                  + E EEE  +   ++      G++ +  E    E E ++   + +   K+ + S
Sbjct: 394  IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKES 453

Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
            DE+E+ EDE   K ++ + K   +             S++ +K  EEEE  EE    +  
Sbjct: 454  DEEEELEDEEEAKVEKVANKLLKR-------------SEKAQKEEEEEELDEENPWLKTT 500

Query: 1271 EDDKASKPRKS 1281
                 S  ++ 
Sbjct: 501  SSVGKSAKKQD 511



 Score = 38.9 bits (91), Expect = 0.019
 Identities = 43/197 (21%), Positives = 83/197 (42%), Gaps = 25/197 (12%)

Query: 1101 MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE 1160
              K D+E   ++ E+ R   + E L       RK    + S  D++     E++  DD E
Sbjct: 293  RAKYDEEARKALEEQLR---QGEELR------RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343

Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAP-------------EDEEEEKPVKKRRRRKKV 1207
            ++++   P   ++  GK  +    EE +              + EE +  +++ RR  + 
Sbjct: 344  DDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEG 403

Query: 1208 QISDEDEDEEDESPKKKKRKS---TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
            +   ++E+ E+ S K   R+         + + K+  +E     K KK+  EEEE  +E 
Sbjct: 404  EEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEE 463

Query: 1265 EGEEEEEDDKASKPRKS 1281
            E + E+  +K  K  + 
Sbjct: 464  EAKVEKVANKLLKRSEK 480



 Score = 37.7 bits (88), Expect = 0.044
 Identities = 60/257 (23%), Positives = 93/257 (36%), Gaps = 59/257 (22%)

Query: 1047 DEEEEENEVPDDETVNQLIARTEDELTLFQKMDAE-----------RRK---------GE 1086
              E  + E  D E     I     EL   ++ D E           RRK          E
Sbjct: 381  RAEARKKEENDAE-----IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAE 435

Query: 1087 TKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQE 1146
            +K    E K       + D+E E    EE +  +    L+ R  + +K+ +  +   +  
Sbjct: 436  SKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENP 495

Query: 1147 WLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKK 1206
            WLK     G           K  +KK+ S K  K          ++  +  VK ++++KK
Sbjct: 496  WLKTTSSVGKS-------AKKQDSKKKSSSKLDKA--------ANKISKAAVKVKKKKKK 540

Query: 1207 VQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE---------- 1256
             +  D D+D  DE        S K    D+  EDDEE   + K+K  + E          
Sbjct: 541  EKSIDLDDDLIDEE------DSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFAGDDVVAE 594

Query: 1257 ---EEEEVEEAEGEEEE 1270
               E++EV E E  +E 
Sbjct: 595  FEKEKKEVIEEEDPKEI 611


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 41.1 bits (96), Expect = 0.003
 Identities = 34/179 (18%), Positives = 50/179 (27%), Gaps = 15/179 (8%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE-EEEEEVKPVTKK 1172
              E  R  E+   G+G       +   S T         ++ A     E      P +  
Sbjct: 75   TAEESRHGEKEERGQGGPSGSGSESVGSPTPSP--SGSAEELASGLSPENTSGSSPESPA 132

Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-------- 1224
              S      S+     P   E   P   ++    +Q S ED  EE E P  +        
Sbjct: 133  SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192

Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE----EVEEAEGEEEEEDDKASKPR 1279
             +  T T +       DE     S   ++          E E+   E E E       R
Sbjct: 193  PQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHR 251



 Score = 35.7 bits (82), Expect = 0.17
 Identities = 24/162 (14%), Positives = 42/162 (25%), Gaps = 10/162 (6%)

Query: 1127 GRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV-------KPVTKKRGSGKKG 1179
               + +  +       T    ++A+ + G    E + E         K    + G    G
Sbjct: 37   RGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSG 96

Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
             +S    T       E+             S E        P          PA  +   
Sbjct: 97   SESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHN 156

Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                +   S  +      E+  EE E    E +  +  P +S
Sbjct: 157  PSPNQQPSSFLQP---SHEDSPEEPEPPTSEPEPDSPGPPQS 195



 Score = 34.2 bits (78), Expect = 0.41
 Identities = 11/63 (17%), Positives = 22/63 (34%), Gaps = 4/63 (6%)

Query: 1215 DEEDESPKKKKRKS--TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAE--GEEEE 1270
                E   +++  +     PA         +   V+++  +  E + E  E    GE+EE
Sbjct: 27   GGTREQAGRRRGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEE 86

Query: 1271 EDD 1273
               
Sbjct: 87   RGQ 89



 Score = 32.6 bits (74), Expect = 1.4
 Identities = 12/83 (14%), Positives = 24/83 (28%), Gaps = 3/83 (3%)

Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
            K+ R   +       +       +  +R+ T   A                + + V E+ 
Sbjct: 9    KRSRHTLRSSCRGHCKRHGGTREQAGRRRGTAARAAKPAPPAPTTS---GPQVRAVAEQG 65

Query: 1259 EEVEEAEGEEEEEDDKASKPRKS 1281
                E++ E  EE     K  + 
Sbjct: 66   HRQTESDTETAEESRHGEKEERG 88


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
            Fas-binding protein) is thought to play a role in
            apoptosis, but precise role played by Daxx remains to be
            determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 41.4 bits (97), Expect = 0.004
 Identities = 37/179 (20%), Positives = 68/179 (37%), Gaps = 16/179 (8%)

Query: 1101 MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYE 1160
            M +DD E E     +ERER+   +     S+       + S+  QE  +    +  ++ E
Sbjct: 393  MKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452

Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDES 1220
            EEEEE +    +   G+  ++    E     EEE +                + D + E 
Sbjct: 453  EEEEEEEQ-ESEEEEGEDEEEEEEVEADNGSEEEMEG-------------SSEGDGDGEE 498

Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE--EEEVEEAEGEEEEEDDKASK 1277
            P++   +     A      + ++    S + +   EE  + E  +AE   EE D++   
Sbjct: 499  PEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLA 557



 Score = 32.2 bits (73), Expect = 2.4
 Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
            +D+ EE+E  K+++R+   T ++             S       +E EE E  E EEEEE
Sbjct: 395  QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMAS--QESEEEESVEEEEEEE 452

Query: 1272 DDKASKPRKS 1281
            +++  + ++S
Sbjct: 453  EEEEEEEQES 462


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
            This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 38.9 bits (91), Expect = 0.005
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 5/95 (5%)

Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDED-EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVV 1247
            P  + E    KK+++    +I    ++ EE +  K KK+KS K   KDK K+DD++    
Sbjct: 51   PIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD-- 108

Query: 1248 SKRKKKVVEEEEEVEEAEGEEEEEDDKAS--KPRK 1280
               KK   E E+++E+      E     S  KPRK
Sbjct: 109  KSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 33.1 bits (76), Expect = 0.57
 Identities = 14/79 (17%), Positives = 40/79 (50%), Gaps = 1/79 (1%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            +E  EE++ V K+    +K K    +    +D++++K   K+  + + +  +++ +++ E
Sbjct: 65   KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLE 123

Query: 1220 SPKKKKRKSTKTPAKDKHK 1238
               K   ++  T ++ K +
Sbjct: 124  DLTKSYSETLSTLSELKPR 142


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 40.6 bits (95), Expect = 0.006
 Identities = 28/123 (22%), Positives = 46/123 (37%), Gaps = 16/123 (13%)

Query: 1155 GADDYEEEEEEVKPV---TKKRGSGKKGK----KSYVEETAPEDEEEEKPVKKRRRRKKV 1207
            G+ +      E  P    + K    KKGK    K    E+ P+DEE+     K+ ++K  
Sbjct: 398  GSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGR 457

Query: 1208 QISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
              S +   +     KK            K +ED+          KK++E   ++EE   E
Sbjct: 458  DKSSKVPSDSKAGGKK---------ESVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGTE 508

Query: 1268 EEE 1270
            +  
Sbjct: 509  DPG 511


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 40.0 bits (94), Expect = 0.009
 Identities = 18/130 (13%), Positives = 43/130 (33%), Gaps = 2/130 (1%)

Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
              +     D  +  E ++     K  +     K+  ++   ++ +  K  +K+    K  
Sbjct: 57   IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116

Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
              +  +D +  +              D    DD++            + ++E EE   +E
Sbjct: 117  DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEE--KKE 174

Query: 1269 EEEDDKASKP 1278
             +E +K S  
Sbjct: 175  AKELEKLSDD 184



 Score = 38.1 bits (89), Expect = 0.036
 Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 6/153 (3%)

Query: 1130 SRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAP 1189
              Q  QV    S   ++   A E D      + +   K    K  + KK K         
Sbjct: 49   PEQIDQVLIFLSGMVKDTDDATESDIPKK--KTKTAAKAAAAKAPAKKKLKDELDSSKKA 106

Query: 1190 EDEEEEKPVKKRRRRKKVQISDE----DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
            E +            K + + ++    D+D++D+               D   +DD++  
Sbjct: 107  EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
               + KK+  E E+  ++ +   +E+D +A + 
Sbjct: 167  DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199



 Score = 33.4 bits (77), Expect = 0.77
 Identities = 19/139 (13%), Positives = 46/139 (33%), Gaps = 7/139 (5%)

Query: 1143 TDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR 1202
               + +         D ++  E   P  K + + K        +   +DE +     +  
Sbjct: 50   EQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE-- 107

Query: 1203 RRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE 1262
              KK  +  +D+    +      +        D    DD++            ++E++ +
Sbjct: 108  --KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDD---DDDEDDDEDDDD 162

Query: 1263 EAEGEEEEEDDKASKPRKS 1281
            +   +E+EE  +A +  K 
Sbjct: 163  DDVDDEDEEKKEAKELEKL 181



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 26/187 (13%), Positives = 69/187 (36%), Gaps = 16/187 (8%)

Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKD 1104
            ++  E ++      + V   ++    +     + D  ++K +T  +    K       K 
Sbjct: 39   KEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAK---KK 95

Query: 1105 DDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
              +  +S  + E++   ++       +    ++  D   D +    ++DD  DD +++E+
Sbjct: 96   LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155

Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK 1224
            + +                 ++   EDEE+++  +  +         +++D E     +K
Sbjct: 156  DDE-------------DDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202

Query: 1225 KRKSTKT 1231
              K T T
Sbjct: 203  DAKLTAT 209



 Score = 30.3 bits (69), Expect = 8.1
 Identities = 37/198 (18%), Positives = 72/198 (36%), Gaps = 24/198 (12%)

Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
           E+ +KK ++   K + +  + ++E   ++ ++ KK         Q D+ +  L+ MVK+ 
Sbjct: 14  EEAKKKLKKLAAKSKSKGFITKEEI--KEALESKKKTP-----EQIDQVLIFLSGMVKDT 66

Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
            +  +      KKK K + K         K   DE    +           D+    +D 
Sbjct: 67  DDATESD--IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124

Query: 461 PPAKQIKEWLASHPGWEVVMEDSEDEG---------------SEDEEMKEKKEEPSKAVG 505
               Q  +        ++  +D +D+                 EDEE KE KE    +  
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184

Query: 506 EKKTEDDEYKDAGEEATY 523
           +    D++  +A  +A  
Sbjct: 185 DDFVWDEDDSEALRQARK 202


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
            describes Pseudomonas denitrificans CobT gene product,
            which is a cobalt chelatase subunit that functions in
            cobalamin biosynthesis. Cobalamin (vitamin B12) can be
            synthesized via several pathways, including an aerobic
            pathway (found in Pseudomonas denitrificans) and an
            anaerobic pathway (found in P. shermanii and Salmonella
            typhimurium). These pathways differ in the point of
            cobalt insertion during corrin ring formation. There are
            apparently a number of variations on these two pathways,
            where the major differences seem to be concerned with the
            process of ring contraction. Confusion regarding the
            functions of enzymes found in the aerobic vs. anaerobic
            pathways has arisen because nonhomologous genes in these
            different pathways were given the same gene symbols.
            Thus, cobT in the aerobic pathway (P. denitrificans) is
            not a homolog of cobT in the anaerobic pathway (S.
            typhimurium). It should be noted that E. coli synthesizes
            cobalamin only when it is supplied with the precursor
            cobinamide, which is a complex intermediate.
            Additionally, all E. coli cobalamin synthesis genes
            (cobU, cobS and cobT) were named after their Salmonella
            typhimurium homologs which function in the anaerobic
            cobalamin synthesis pathway. This model describes the
            aerobic cobalamin pathway Pseudomonas denitrificans CobT
            gene product, which is a cobalt chelatase subunit, with a
            MW ~70 kDa. The aerobic pathway cobalt chelatase is a
            heterotrimeric, ATP-dependent enzyme that catalyzes
            cobalt insertion during cobalamin biosynthesis. The other
            two subunits are the P. denitrificans CobS (TIGR01650)
            and CobN (pfam02514 CobN/Magnesium Chelatase) proteins.
            To avoid potential confusion with the nonhomologous
            Salmonella typhimurium/E.coli cobT gene product, the P.
            denitrificans gene symbol is not used in the name of this
            model [Biosynthesis of cofactors, prosthetic groups, and
            carriers, Heme, porphyrin, and cobalamin].
          Length = 600

 Score = 39.9 bits (93), Expect = 0.009
 Identities = 27/128 (21%), Positives = 53/128 (41%), Gaps = 4/128 (3%)

Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK 1199
            + L   E  + + DD   + EE+ ++ +P   ++    +G+    E +AP++ E      
Sbjct: 192  EMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRES 251

Query: 1200 KRRRRKKVQISDEDEDEE--DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEE 1257
            +    + VQ   +D  +E  D+S    +      P      E D +  V +    + V+ 
Sbjct: 252  ESGEEEMVQSDQDDLPDESDDDSETPGEGARPARPFTSTGGEPDYK--VFTTAFDETVDA 309

Query: 1258 EEEVEEAE 1265
            EE  +E E
Sbjct: 310  EELCDEEE 317


>gnl|CDD|234468 TIGR04095, dnd_restrict_1, DNA phosphorothioation system
           restriction enzyme.  The DNA phosphorothioate
           modification system dnd (DNA instability during
           electrophoresis) recently has been shown to provide a
           modification essential to a restriction system. This
           protein family was detected by Partial Phylogenetic
           Profiling as linked to dnd, and its members usually are
           clustered with the dndABCDE genes.
          Length = 451

 Score = 39.2 bits (92), Expect = 0.011
 Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 20/144 (13%)

Query: 545 KLKEYQIKGL-EWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVP 603
           +L++YQ + +  W    + NN  GIL    G GKT+  +A  + L EK   L   LV+ P
Sbjct: 8   ELRDYQKEAIRAW----FKNNGRGILKMATGTGKTLTALAAASKLYEKIGLL-VLLVVCP 62

Query: 604 LSTL-SNWVLEFEKWA--PSANVVAYKGSP-----LARRALQGRMKNSKFNVLLTTYEFI 655
              L   W  E EK+   P   ++ Y+        L+         N KF  ++TT    
Sbjct: 63  YQHLVDQWAREAEKFGLNP---ILCYESVSNWQSELSTGLYNLNSGNQKFLAIITTNATF 119

Query: 656 IKDK--SVLSKIQWKCMII-DEGH 676
           I     S L +   K ++I DE H
Sbjct: 120 IGKNFQSQLRRFPGKTLLIGDEAH 143


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
            Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina
            virus topoisomerase, Variola virus topoisomerase, Shope
            fibroma virus topoisomeras.
          Length = 391

 Score = 38.9 bits (91), Expect = 0.014
 Identities = 19/82 (23%), Positives = 40/82 (48%)

Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
            +E+ K +K + +R K  I   +   + +   K K +        + KE  +EK    K+K
Sbjct: 283  QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342

Query: 1252 KKVVEEEEEVEEAEGEEEEEDD 1273
            K++   EE +E+ E +  ++++
Sbjct: 343  KQIERLEERIEKLEVQATDKEE 364


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 39.0 bits (91), Expect = 0.016
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 1207 VQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV 1261
             ++ DE+E+EE E  K+++ K+T    +   +E+ EEK   +K+ K+   E E +
Sbjct: 33   KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87



 Score = 38.6 bits (90), Expect = 0.021
 Identities = 17/52 (32%), Positives = 28/52 (53%)

Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
            K+   E    E+EEE++  K+   +   +  + DE+EE E  KKK +K  +T
Sbjct: 29   KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 30.9 bits (70), Expect = 4.8
 Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 1228 STKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
              K   K+   E++EE+    + KK+  E+  + EE   EEEE+++K  K +K
Sbjct: 27   VEKEVEKEVPDEEEEEE---KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 38.9 bits (91), Expect = 0.020
 Identities = 34/204 (16%), Positives = 76/204 (37%), Gaps = 8/204 (3%)

Query: 1077 KMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSM-YEEEREREKEEALMGRGSRQRKQ 1135
               +++       + V+  E      K D++  NS   ++E + E++        ++ K 
Sbjct: 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKS 1245

Query: 1136 VDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE--TAPEDEE 1193
                 S + ++  +   DD + +  + +   K V+  + S     K    E     +   
Sbjct: 1246 KKNNSSKSSEDNDEFSSDDLSKE-GKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSS 1304

Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
              K   K+R    +    + +  E ++ +KKK K+    A          +     RKKK
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRR----PRKKK 1360

Query: 1254 VVEEEEEVEEAEGEEEEEDDKASK 1277
                 E+ +++E ++ E++D    
Sbjct: 1361 SDSSSEDDDDSEVDDSEDEDDEDD 1384



 Score = 34.3 bits (79), Expect = 0.58
 Identities = 34/191 (17%), Positives = 66/191 (34%), Gaps = 16/191 (8%)

Query: 365  EGYRKLIDQ--KKDKRLAFLLSQT------------DEYISNLTQMVKEHKEEQQKKMRE 410
            E   KL  +  KK+K L  L + T            +E +    ++ ++   ++Q+   +
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161

Query: 411  EKKKKKKSIKQKILDGNMDKL-DLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
             K K  K  K K+      K     + S+   +    K  +SD K + +D P  K+    
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSS 1221

Query: 470  LASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDA-GEEATYYSIAH 528
             +     E      +    +  + K+     S    ++ + DD  K+   + A     A 
Sbjct: 1222 GSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAV 1281

Query: 529  TYHEKVTEQAS 539
             Y      +  
Sbjct: 1282 QYSPPPPSKRP 1292



 Score = 30.4 bits (69), Expect = 8.1
 Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 5/101 (4%)

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
            E A E++EE +  +  + ++    +     +  +   KKK K  K  + DK K+     +
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197

Query: 1246 -----VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                    KRK     + ++   +  ++E+++++ +KP+KS
Sbjct: 1198 SKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKS 1238


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 38.3 bits (89), Expect = 0.031
 Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 6/94 (6%)

Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDES 1220
            E E E K +T++    KK K    EE A   E+E K +K  ++  K ++  +   +    
Sbjct: 5    ESEAEKKILTEEELERKKKK----EEKA--KEKELKKLKAAQKEAKAKLQAQQASDGTNV 58

Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
            PKK ++KS K   +D++ ED  +      +KK++
Sbjct: 59   PKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRL 92


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family are
            designated YL1. These proteins have been shown to be
            DNA-binding and may be a transcription factor.
          Length = 238

 Score = 37.4 bits (87), Expect = 0.034
 Identities = 37/179 (20%), Positives = 66/179 (36%), Gaps = 15/179 (8%)

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
            +D+      ++EEE +   EE  +     + +     D   D E     E++G  + + E
Sbjct: 24   EDEFFWTYLLFEEEED--DEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQRE 81

Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
            E   K    K  + K+  K   ++     +  +      R +KK +         D   +
Sbjct: 82   ERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKA--AAPRPKKKSERISWAPTLLDSPRR 139

Query: 1223 KKKRKST-----KTPAKDKHKEDDEEKHVVSKRKKKVVEE------EEEVEEAEGEEEE 1270
            K  R ST      T  + K +E   +K     RK+K  ++      EE + EA+  E  
Sbjct: 140  KSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERI 198


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 38.4 bits (89), Expect = 0.034
 Identities = 44/222 (19%), Positives = 89/222 (40%), Gaps = 18/222 (8%)

Query: 1059 ETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDD--DEVENSMYEEE 1116
            E +   +   +D   L  + D   +    +    E++   + + KD    E ENS  EEE
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884

Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
             +   EE             D ++SL ++ W +  E+D  +  ++  E+     +     
Sbjct: 3885 NQDLDEEV-------NDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVS 3937

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
            K+      +  A ED++ ++   +      V I DE + +  E+  +   ++      + 
Sbjct: 3938 KED-----DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPED 3992

Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
             K D++E  V     K    E+ ++E A+  +EE D +  +P
Sbjct: 3993 LKLDEKEGDV----SKDSDLEDMDMEAADENKEEADAEKDEP 4030



 Score = 37.3 bits (86), Expect = 0.074
 Identities = 47/241 (19%), Positives = 101/241 (41%), Gaps = 36/241 (14%)

Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKD 1104
            ++DEEE  ++V  D+ +                    +      P   E  +LP+ +  D
Sbjct: 3953 KEDEEEMSDDVGIDDEIQ----------------PDIQENNSQPPPENEDLDLPEDLKLD 3996

Query: 1105 DDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
            + E + S     ++ + E+  M      +++ D       Q+    +E++   D + +++
Sbjct: 3997 EKEGDVS-----KDSDLEDMDMEAADENKEEADAEKDEPMQDE-DPLEENNTLDEDIQQD 4050

Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEE-EKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
            +   + +      K  +   EE   E+EE  E  VK     ++    +  ED+  ++  K
Sbjct: 4051 DFSDLAEDDE---KMNEDGFEENVQENEESTEDGVKSDEELEQ---GEVPEDQAIDNHPK 4104

Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVE---EAEGEEEEEDDKASKPRK 1280
               KST   A  +  E++ +K +V + ++  + EE+ V     A+GE E+  +  S P++
Sbjct: 4105 MDAKSTFASA--EADEENTDKGIVGENEE--LGEEDGVRGNGTADGEFEQVQEDTSTPKE 4160

Query: 1281 S 1281
            +
Sbjct: 4161 A 4161



 Score = 35.7 bits (82), Expect = 0.20
 Identities = 35/167 (20%), Positives = 70/167 (41%), Gaps = 16/167 (9%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
            E+    E E    G GS    + D T++L + + L+ + ++     + + +E +    + 
Sbjct: 3805 EKSNSGELESG-TGLGSGVGAE-DITNTLNEDDDLEELANEEDTANQSDLDESEARELES 3862

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED-------ESPKKKKR 1226
                  K S V E    D EEE       +    +++D  ED  +       + P ++  
Sbjct: 3863 DMNGVTKDSVVSENENSDSEEEN------QDLDEEVNDIPEDLSNSLNEKLWDEPNEEDL 3916

Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
              T+  + ++   ++E   V  +   K +E+++  +E E EEE  DD
Sbjct: 3917 LETEQKSNEQSAANNESDLVSKEDDNKALEDKDR-QEKEDEEEMSDD 3962



 Score = 34.2 bits (78), Expect = 0.67
 Identities = 37/226 (16%), Positives = 94/226 (41%), Gaps = 18/226 (7%)

Query: 1047 DEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDD 1106
              + + +E    E  + +   T+D  ++  + +    + E +    E+ ++P+      +
Sbjct: 3847 ANQSDLDESEARELESDMNGVTKD--SVVSENENSDSEEENQDLDEEVNDIPE---DLSN 3901

Query: 1107 EVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEV 1166
             +   +++E  E +  E    + + Q    + +D ++ ++  KA+ED      +E+EEE+
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQ-KSNEQSAANNESDLVSKEDDNKALED-KDRQEKEDEEEM 3959

Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
                      +   +    +  PE+E+ + P +  +  +K     +D D ED        
Sbjct: 3960 SDDVGIDDEIQPDIQENNSQPPPENEDLDLP-EDLKLDEKEGDVSKDSDLED-------- 4010

Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
                  A D++KE+ + +     + +  +EE   ++E   +++  D
Sbjct: 4011 --MDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054



 Score = 30.7 bits (69), Expect = 6.8
 Identities = 45/248 (18%), Positives = 91/248 (36%), Gaps = 36/248 (14%)

Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAE--------RRKGETKPRLVEIKE 1096
            Q++EE  E+ V  DE + Q     +  +    KMDA+          +   K  + E +E
Sbjct: 4072 QENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEE 4131

Query: 1097 LPDW-MVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDG 1155
            L +   V+ +   +    + + +    +  M    RQ + +   D L  +EW +A     
Sbjct: 4132 LGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLG--DHL--REWQQANRIHE 4187

Query: 1156 ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDED 1215
             +D  E + +              +  +V+    EDEEE+        + +++  D DE 
Sbjct: 4188 WEDLTESQSQAF---------DDSEFMHVK----EDEEEDLQALGNAEKDQIKSIDRDES 4234

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK----------VVEEEEEVEEAE 1265
                          +  A +   +  ++   +S  K+            + + E+V E  
Sbjct: 4235 ANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELS 4294

Query: 1266 GEEEEEDD 1273
             +E+ ED+
Sbjct: 4295 EDEDIEDE 4302


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
            domain family is found in eukaryotes, and is typically
            between 99 and 114 amino acids in length. The family is
            found in association with pfam08697, pfam01585. There are
            two completely conserved residues (G and F) that may be
            functionally important. TIP is involved in enamel
            assembly by interacting with one of the major proteins
            responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 35.0 bits (81), Expect = 0.035
 Identities = 26/102 (25%), Positives = 49/102 (48%), Gaps = 10/102 (9%)

Query: 1185 EETAPEDEEEEKPVKKRRRR-KKVQI----SDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
            E T  + + E  P ++RRR+ K+  I    +D+D+D+E E  + + R+ +K   KD  K 
Sbjct: 10   EITDYDLDNEFNPNRRRRRQTKEQAIYGIFADDDDDDEGERRRGRGRRRSKKK-KDYSK- 67

Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                   VS   K+  +E EE ++ +  E++++         
Sbjct: 68   ---PVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
            regulatory subunit R2) is the regulatory subunit of the
            histone H2A phosphatase complex. It has been shown to
            confer resistance to the anticancer drug cisplatin in
            yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 37.5 bits (87), Expect = 0.038
 Identities = 31/138 (22%), Positives = 55/138 (39%), Gaps = 13/138 (9%)

Query: 1140 DSLTDQEWLKAIEDDGADDYEEEEEEV----KPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
             SL+  EWL          + E  + V    +P  +       G  S         +++ 
Sbjct: 156  VSLSKIEWLNEDFPSELSPFIERIDSVNGPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKN 215

Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
            K +++    K+   +   +D+  +    K +KS      D+  +  EEK        ++ 
Sbjct: 216  KSLEEYYE-KESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEK--------ELK 266

Query: 1256 EEEEEVEEAEGEEEEEDD 1273
            E+EEE E  E EEEE++D
Sbjct: 267  EDEEEEETEEEEEEEDED 284


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
            This family includes NS2 proteins from other members of
            the Orbivirus genus. NS2 is a non-specific
            single-stranded RNA-binding protein that forms large
            homomultimers and accumulates in viral inclusion bodies
            of infected cells. Three RNA binding regions have been
            identified in Bluetongue virus serotype 17 at residues
            2-11, 153-166 and 274-286. NS2 multimers also possess
            nucleotidyl phosphatase activity. The precise function of
            NS2 is not known, but it may be involved in the transport
            and condensation of viral mRNAs.
          Length = 363

 Score = 37.6 bits (87), Expect = 0.038
 Identities = 33/137 (24%), Positives = 55/137 (40%), Gaps = 7/137 (5%)

Query: 1093 EIKELPDWMVKDDD--EVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKA 1150
            E+ ELP     D D  E+   +  EERE+EKEE  M    +  + +   DS  D+  L  
Sbjct: 140  EMPELPGVTRSDYDVRELRQKI-REEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDE 198

Query: 1151 IEDD---GADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
               D        +++EE    V ++     +     +EE   +  +E++  +   R    
Sbjct: 199  EAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSD- 257

Query: 1208 QISDEDEDEEDESPKKK 1224
               +  ED ED   K +
Sbjct: 258  DEDEHGEDSEDGETKPE 274



 Score = 34.5 bits (79), Expect = 0.37
 Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 1/118 (0%)

Query: 1074 LFQKMDAERRKG-ETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQ 1132
            L QK+  ER K  E +P     K        D DE EN + EE  +   E +   +   +
Sbjct: 157  LRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEER 216

Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPE 1190
            R  V+   +   +     +E+   D  +E+E E           + G+ S   ET PE
Sbjct: 217  RAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPE 274



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 32/175 (18%), Positives = 74/175 (42%), Gaps = 22/175 (12%)

Query: 403 EQQKKMREEKKKKKKSIKQKILDG-----NMDKLDLDESSQLEDLRITVKETNSDNKLRG 457
           E ++K+REE++K+K+    K              D DE+   E+      ET+  ++   
Sbjct: 156 ELRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEE 215

Query: 458 EDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDA 517
             A   +++ E        E++  + E+      ++++++E       +   ED+  +D+
Sbjct: 216 RRAAVERRLAE------LVEMINWNLEER---RRDLRKEQELEENVERDSDDEDEHGEDS 266

Query: 518 GEEATYYS--IAHTYHEKVTEQASIMVNGKLKEYQIKGLEWMVSLYNNNLNGILA 570
            +  T     I   Y E+++E        K+K+ ++  L  M+   +   +G++ 
Sbjct: 267 EDGETKPESYITSEYIERISEIR------KMKDERLSSLASMMPQASGEFDGMIF 315


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 38.1 bits (89), Expect = 0.042
 Identities = 31/184 (16%), Positives = 67/184 (36%), Gaps = 47/184 (25%)

Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
            +  + E +     ++ +++     R  RQR++                     D+  + +
Sbjct: 655  ESQQAEVTEKARTQDEQQQAP---RRERQRRR--------------------NDEKRQAQ 691

Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK--PVKKRRRRK----KVQISDEDEDEE 1217
            +E K +               E++  E E+EE+   V+ RR+++    KV+I      EE
Sbjct: 692  QEAKALNV------------EEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQS-VAEE 738

Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
              +P  ++  + +   ++      E   V      +   E++E   AE  +         
Sbjct: 739  AVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVVAQTAPEQDEENNAENRDNN-----GM 793

Query: 1278 PRKS 1281
            PR+S
Sbjct: 794  PRRS 797



 Score = 36.2 bits (84), Expect = 0.15
 Identities = 16/79 (20%), Positives = 39/79 (49%), Gaps = 1/79 (1%)

Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK-KRKSTKTPAKDKHKEDDEEKHVVSKR 1250
            E  E+  + RR+ ++      +  + + + K + + +  + P +++ +  ++EK    + 
Sbjct: 634  ENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQE 693

Query: 1251 KKKVVEEEEEVEEAEGEEE 1269
             K +  EE+ V+E E EE 
Sbjct: 694  AKALNVEEQSVQETEQEER 712



 Score = 33.1 bits (76), Expect = 1.2
 Identities = 40/154 (25%), Positives = 59/154 (38%), Gaps = 20/154 (12%)

Query: 1045 QDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETK-PRLVEIKELPDWMVK 1103
            Q  E  E  +    E      ART+DE     + + +RR+ + K     E K L      
Sbjct: 649  QTAETRESQQAEVTEK-----ARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALN----- 698

Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQ-VDYTDSLTDQEWLKAIEDDGADDYEEE 1162
                 E S+ E E+E   ++    R  RQ  Q V    S+ ++     +E+  A      
Sbjct: 699  ---VEEQSVQETEQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAA----- 750

Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
            E  V+ V   R    K     V +TAPE +EE  
Sbjct: 751  EPVVQEVPAPRTELVKVPLPVVAQTAPEQDEENN 784


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
            proteins whose interaction is required for the maturation
            of the 18S rRNA and for 40S ribosome production.
          Length = 809

 Score = 37.7 bits (88), Expect = 0.042
 Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 49/165 (29%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
            E ER ++ E        R R+     +   ++E  K   DD  D++E ++++        
Sbjct: 267  EAERLKKLEAE------RLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNF------ 314

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
            G G+  +    EE   +DE+EE               D+D +EE+E              
Sbjct: 315  GLGQGEEDEEEEEDGVDDEDEE-------------DDDDDLEEEEED------------- 348

Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
             D   E+++E            +E+ + E+ E EEEEE +K  K 
Sbjct: 349  VDLSDEEEDE-----------EDEDSDDEDDEEEEEEEKEKKKKK 382



 Score = 31.9 bits (73), Expect = 2.6
 Identities = 37/184 (20%), Positives = 64/184 (34%), Gaps = 53/184 (28%)

Query: 1141 SLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK- 1199
            SL++ +    + DD   D ++  +           G +      +E   E++ E K  K 
Sbjct: 122  SLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGED-----DEEDEEEQPERKKSKK 176

Query: 1200 ----------KRRRRKKVQISDEDEDEEDE------------SPKKKKRKSTKTPAKDKH 1237
                      K  + ++ +  +EDED  +E               K   K   TP ++K 
Sbjct: 177  EVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTP-EEKD 235

Query: 1238 KEDD--------EEKHVVSKRKKKVVEEEEEVEEAE--------------GEEEEEDDKA 1275
             E D        + +   + R K   EEE   EEAE              GEEE+++++ 
Sbjct: 236  DEYDQRVRELTFDRRAQPTDRTK--TEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEE 293

Query: 1276 SKPR 1279
                
Sbjct: 294  DSKE 297


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is a
            family of proteins conserved in fungi. The function is
            not known.
          Length = 436

 Score = 37.5 bits (87), Expect = 0.043
 Identities = 25/115 (21%), Positives = 42/115 (36%), Gaps = 4/115 (3%)

Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKR 1226
            K  TKK   G   K S  ++    +  +E    ++ +  K       E     +P+  K+
Sbjct: 12   KAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKK----PKEASRPGTPRNPKK 67

Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             S  T +     E+   K   S    +    + +    E EEEEE ++   P  S
Sbjct: 68   SSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPDPIAS 122



 Score = 33.6 bits (77), Expect = 0.74
 Identities = 21/84 (25%), Positives = 36/84 (42%), Gaps = 15/84 (17%)

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKED-----DE-EKHVVSKRKKKVVE---------E 1257
            D   ++++P KK +K   +    +  ED     DE E+   +K  KK  E          
Sbjct: 6    DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65

Query: 1258 EEEVEEAEGEEEEEDDKASKPRKS 1281
            ++  +  E      ++K +KPRKS
Sbjct: 66   KKSSKPTESSAASSEEKPAKPRKS 89


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 36.3 bits (84), Expect = 0.047
 Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 1204 RKKVQISDEDEDEEDES--PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV 1261
            R  V+ + E  D+ ++   P+  +  S KT  K K     ++    +K+KKK  EE  E+
Sbjct: 21   RAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80

Query: 1262 EEAEGEEEEEDDKASKP 1278
               E  + EE+++  K 
Sbjct: 81   AAEELSDSEENEENDKK 97



 Score = 32.1 bits (73), Expect = 1.1
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 9/93 (9%)

Query: 1152 EDDGADDYEEEEEEV---KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
             D   DD   E+ E+   K   KK+ + +K  K   ++T  + +E+E+  +        +
Sbjct: 31   VDKVNDDIFPEDVEIPSKKTSKKKKTTPRK--KKTTKKTKKKKKEKEEVPEL----AAEE 84

Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDD 1241
            +SD +E+EE++     +    +  A + + ED 
Sbjct: 85   LSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 3/59 (5%)

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
            + KKKK    K     K K+  +EK  V    +   EE  + EE E  +++ D +  K 
Sbjct: 50   TSKKKKTTPRKKKTTKKTKKKKKEKEEV---PELAAEELSDSEENEENDKKVDYELPKV 105


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 36.7 bits (85), Expect = 0.049
 Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 15/97 (15%)

Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE 1192
            R + D       +E    +ED+G ++ EEE+ +V+   + RG          E+T    +
Sbjct: 51   RIKADVGGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRG--------LTEKTPELSD 102

Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKST 1229
            EE + + +RRR  K Q + +D         KKKR  T
Sbjct: 103  EEARLLTQRRREGKPQFNRQDY-------HKKKRTPT 132


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 37.4 bits (87), Expect = 0.052
 Identities = 19/109 (17%), Positives = 46/109 (42%), Gaps = 17/109 (15%)

Query: 361 AEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSIK 420
            + ++  ++   +KK+K+        D+  + L +++ E++E + K+  ++KKKKKK  K
Sbjct: 11  KQQKQQNKQKGTKKKNKKSK---KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67

Query: 421 QKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEW 469
           +K              +  E   +        +    +D    +++  W
Sbjct: 68  KK--------------NLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNW 102


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
            cell cycle arrest and pre-mRNA splicing. It has been
            shown to be a component of U4/U6 x U5 tri-snRNP complex
            in human, Schizosaccharomyces pombe and Saccharomyces
            cerevisiae. SART-1 is a known tumour antigen in a range
            of cancers recognised by T cells.
          Length = 603

 Score = 37.4 bits (87), Expect = 0.054
 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 13/125 (10%)

Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
            D Y+  E       KK+   KK ++  ++E   E E E         RK V     +E+ 
Sbjct: 265  DYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV-----EEEN 319

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHV------VSKRKKKVVEEE--EEVEEAEGEE 1268
                   KKRK  +        +DD +  +        K++KK+  E+   ++ E   E+
Sbjct: 320  ARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSED 379

Query: 1269 EEEDD 1273
            EEE  
Sbjct: 380  EEEAA 384



 Score = 36.3 bits (84), Expect = 0.11
 Identities = 36/175 (20%), Positives = 63/175 (36%), Gaps = 18/175 (10%)

Query: 1106 DEVEN-SMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEE 1164
            DE+EN  + E+E++++  E       +++K     D     +  K  +      Y+EE E
Sbjct: 166  DELENVELVEKEKDKKNLE------LKKKKPDYDPD-----DDDKFNKRSILSKYDEEIE 214

Query: 1165 EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI-----SDEDEDEEDE 1219
              K              S  +E   + +E +K +K               SD  +  E  
Sbjct: 215  G-KKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMV 273

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
              KK K+K  K   + K  ++DE +              ++VEE     E+   K
Sbjct: 274  KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKK 328



 Score = 31.7 bits (72), Expect = 2.8
 Identities = 48/249 (19%), Positives = 94/249 (37%), Gaps = 24/249 (9%)

Query: 1044 HQDDEEEEENEVPDDETVNQLIA--RTEDELTLFQKMDAERRKGETKPRLVEIKELPDWM 1101
              DDE E++ +    +     ++      E       D        KP+  + K+     
Sbjct: 231  STDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRK 290

Query: 1102 VKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEE 1161
              D+DE+E             E L    S  RK V+  ++  +    K  E+   DD+ E
Sbjct: 291  DLDEDELEPE----------AEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFVE 340

Query: 1162 EEEEVKPV--TKKRGSGKKGKKSYVEETAPE--DEEEEKPVKKRRRRKKVQISDE----- 1212
            ++++++     ++R + KK KK   E+ A +  +E  E   +         + DE     
Sbjct: 341  DDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENNDNGIVIDETSEFV 400

Query: 1213 ---DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
                ++  +E P+ K     +    ++  ED+E++      +   V+ +EE EE + E  
Sbjct: 401  RSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAI 460

Query: 1270 EEDDKASKP 1278
                   +P
Sbjct: 461  PSTILEEEP 469


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
            nuclear protein involved in bud-site selection. It plays
            a role in positioning the proximal bud pole signal. More
            recently it has been shown to be involved in ribosome
            biogenesis.
          Length = 424

 Score = 37.1 bits (86), Expect = 0.056
 Identities = 34/168 (20%), Positives = 51/168 (30%), Gaps = 24/168 (14%)

Query: 1085 GETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD 1144
            G          +      K  +  E+S  ++E E E E+      S +    D  +  +D
Sbjct: 141  GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200

Query: 1145 QEWLKAIEDDGADDYEEEEEEVKPVTKK-----RGSGKKGKKSYVEETAPEDEEEEKPVK 1199
             E     +    D  +EEE E  P              +      E  +  D EE  P  
Sbjct: 201  SEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPS 260

Query: 1200 KRRRRKKVQ-------------------ISDEDEDEEDESPKKKKRKS 1228
            K+ + KK                       DED D +    K  KRK+
Sbjct: 261  KKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPVKRKN 308



 Score = 33.3 bits (76), Expect = 0.96
 Identities = 15/51 (29%), Positives = 23/51 (45%)

Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
            + K K  K   K K  +  E K    K  ++  E E+E +  E  E++ DD
Sbjct: 143  ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDD 193



 Score = 32.9 bits (75), Expect = 1.2
 Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
            ET  +  + +K  KK ++++  + SD+D++EE ES  + K + +     D  +E+D +  
Sbjct: 143  ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 1246 VVSKRKKKVV----EEEEEVEEAEGEEEEEDDKASKP 1278
              S+    +V    EEE E   +    E+  +  S  
Sbjct: 203  DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDE 239



 Score = 32.9 bits (75), Expect = 1.2
 Identities = 22/111 (19%), Positives = 40/111 (36%), Gaps = 3/111 (2%)

Query: 403 EQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPP 462
           ++ K  ++ KK KKK  K+     + ++ + ++ S+ E+      +   +     ED   
Sbjct: 147 KKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED--- 203

Query: 463 AKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDE 513
             Q    L      E   E      +ED    E  E  S+    +   D E
Sbjct: 204 YSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254



 Score = 31.3 bits (71), Expect = 3.4
 Identities = 16/65 (24%), Positives = 30/65 (46%)

Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
              E +  +   KKK K +K     +  + D+E+   S+ + K  E  E+  + E EE+ +
Sbjct: 141  GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSD 200

Query: 1272 DDKAS 1276
             +  S
Sbjct: 201  SEDYS 205


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
            family consists of several hypothetical bacterial
            proteins of around 200 residues in length. The function
            of this family is unknown.
          Length = 214

 Score = 36.3 bits (84), Expect = 0.061
 Identities = 20/94 (21%), Positives = 38/94 (40%), Gaps = 9/94 (9%)

Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
             + A  DE+E K         +     E+  EE++     + K  K  A    +++DEE 
Sbjct: 41   SDQAAADEQEAKKSDD-----QETAEIEEVKEEEKEAANSEDKEDKGDA----EKEDEES 91

Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
               ++ + +   +E E E  E  E   + + + P
Sbjct: 92   EEENEEEDEESSDENEKETEEKTESNVEKEITNP 125



 Score = 31.2 bits (71), Expect = 2.6
 Identities = 17/92 (18%), Positives = 40/92 (43%), Gaps = 6/92 (6%)

Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
                 +D    +E+E K    +  +  +  K   +E A  +++E+K   ++   +     
Sbjct: 36   FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES---E 92

Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE 1242
            +E+E+E++ES  +      +T  K +   + E
Sbjct: 93   EENEEEDEESSDE---NEKETEEKTESNVEKE 121



 Score = 30.1 bits (68), Expect = 5.7
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 1/68 (1%)

Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
            S  D+   DE   KK         ++  KE+++E      ++ K   E+E+ E  E  EE
Sbjct: 39   SPSDQAAADEQEAKKSDDQETAEIEEV-KEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 1270 EEDDKASK 1277
            E+++ + +
Sbjct: 98   EDEESSDE 105


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 37.4 bits (86), Expect = 0.062
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%)

Query: 1178 KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
            K  + +   +  +  +E K + K+ ++K   I +ED D+ DE P+  +  S   P     
Sbjct: 468  KFDEYWTRISKIQFTQEIKKLIKKSKKKLAPIEEEDSDKHDEPPEGPE-ASGLPPKAPGD 526

Query: 1238 KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS--KPRK 1280
            KE +E +H  SK   +  E  +  E  EGE  ++   A   KP K
Sbjct: 527  KEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSK 571



 Score = 30.0 bits (67), Expect = 9.8
 Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 13/89 (14%)

Query: 1167 KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE--DEDEEDESPKKK 1224
            K V  KR       KS+ + T  E+ EE          +K+ + D+  + D+ED  P ++
Sbjct: 844  KIVKLKR------SKSFDDLTTVEEAEEMGA-----EARKIVVDDDGTEADDEDTHPPEE 892

Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
            K KS     +   K    +K    K+ KK
Sbjct: 893  KHKSEVRRRRPPKKPSKPKKPSKPKKPKK 921


>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit. 
          Length = 100

 Score = 34.1 bits (79), Expect = 0.074
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 6/42 (14%)

Query: 666 QWKCMIIDEGHRM--KNHHCKLTQTLNTFYNSSHRVLLTGTP 705
           Q   +IIDE H    K  + K+ +     +  +  + LT TP
Sbjct: 61  QALVIIIDEAHHSSAKTKYRKILE----KFKPAFLLGLTATP 98


>gnl|CDD|218481 pfam05177, RCSD, RCSD region.  Proteins contain this region include
            C.elegans UNC-89. This region is found repeated in UNC-89
            and shows conservation in prolines, lysines and glutamic
            acids. Proteins with RCSD are involved in muscle M-line
            assembly, but the function of this region RCSD is not
            clear.
          Length = 101

 Score = 34.1 bits (77), Expect = 0.084
 Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 2/101 (1%)

Query: 1165 EVKPVTKKRGSGKKGKKSYVE--ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
             VK   KK  S  +   S  E  E     EE + P KK +  +KV    +   ++++SP+
Sbjct: 1    GVKSPGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPE 60

Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
            K   +  K+P K +   +  E+   S  KK+ V+++E+  E
Sbjct: 61   KSATEEVKSPTKKEKSPEKVEEKPASPTKKERVQKKEKSPE 101


>gnl|CDD|206017 pfam13846, DUF4196, Domain of unknown function (DUF4196).  This is a
            short region of ccdc82_homologues that is conserved from
            Schizo. pombe up to humans. The function is not known.
          Length = 112

 Score = 34.5 bits (78), Expect = 0.086
 Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE-EKHVV 1247
            P+ E+     +    +   QI +ED +EE+    K+KR S+     D+  + D   + V 
Sbjct: 25   PDSEKHPGQEEGDLNKHTGQIIEEDSEEENIKIGKRKRLSSVMSDSDESDDSDILSRKVA 84

Query: 1248 SKRKKKVVEEEEEVEEAEGEEE 1269
             KR ++VVE+  E  E    E+
Sbjct: 85   IKRPRRVVEDSSEEMEQSSPEK 106


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 36.2 bits (84), Expect = 0.10
 Identities = 24/117 (20%), Positives = 41/117 (35%), Gaps = 4/117 (3%)

Query: 1168 PVTKKRGSGKKGKKSYVE----ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
               K +        S V     E      EEE  V     +K  +    +E ++    KK
Sbjct: 15   TTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKK 74

Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
            KK+K  K P  +   +   +    SK+ KK   + +  E+ +    +  + A K   
Sbjct: 75   KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131



 Score = 31.2 bits (71), Expect = 3.4
 Identities = 19/95 (20%), Positives = 38/95 (40%), Gaps = 5/95 (5%)

Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
            +    + + +P+        V   +      +E  K     + K   +DK+ E   +K  
Sbjct: 13   SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNES--KKKS 70

Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
              K+KKK  ++E    ++EGE +       K +K+
Sbjct: 71   EKKKKKKKEKKE---PKSEGETKLGFKTPKKSKKT 102


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes the
            fifth essential DNA polymerase in yeast EC:2.7.7.7. Pol5p
            is localised exclusively to the nucleolus and binds near
            or at the enhancer region of rRNA-encoding DNA repeating
            units.
          Length = 784

 Score = 36.4 bits (84), Expect = 0.11
 Identities = 29/112 (25%), Positives = 45/112 (40%), Gaps = 13/112 (11%)

Query: 1014 LNMDEKVIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELT 1073
            L  DE   +     Q   G E  E    +   DD+E+E   + D E+ ++      +E  
Sbjct: 632  LKADENKSR---HQQLFEGEE--EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDE 686

Query: 1074 LFQKMDAERRKGETKPR------LVEIKELPDWMVKDDDEVENSMYEEERER 1119
              Q+ DAE  +G           L ++  LPD +   D E E  M +E+  R
Sbjct: 687  --QEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQMMR 736


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 34.5 bits (80), Expect = 0.12
 Identities = 19/70 (27%), Positives = 42/70 (60%), Gaps = 4/70 (5%)

Query: 320 RNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRL 379
           ++   R ++L + ++ +    EKE+++ ++R EKE + +   E+E   R+   Q+  ++L
Sbjct: 61  KDTQLRAKRLMREMLLFWKKNEKEERELRKRAEKEALEQAKKEEEL--REAKRQQ--RKL 116

Query: 380 AFLLSQTDEY 389
            FL++QT+ Y
Sbjct: 117 NFLITQTELY 126


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 36.2 bits (84), Expect = 0.14
 Identities = 30/107 (28%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLE-FEKWAPSAN------VVAY 626
           G+GKT   + +  YL +K K +    +IVP +TL   V E  +K+A  A       V   
Sbjct: 107 GVGKTTFGLLMSLYLAKKGKRV---YIIVPTTTLVRQVYERLKKFAEDAGSLDVLVVYHS 163

Query: 627 KGSPLARRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIID 673
                 +     R+++  F++L+TT +F+ K    LSK+++  + +D
Sbjct: 164 ALPTKEKEEALERIESGDFDILITTSQFLSKRFEELSKLKFDFIFVD 210


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
            [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 35.4 bits (81), Expect = 0.14
 Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 15/118 (12%)

Query: 1158 DYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
            D +E+ +E  P         + ++   E    E+EE+E+           ++++E+E+E 
Sbjct: 159  DNDEDNDEAPPAQP--DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA 216

Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
            + S              D     D E   + K++ +  E EEEV      E EE+  +
Sbjct: 217  EGS-------------DDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPS 261



 Score = 34.2 bits (78), Expect = 0.41
 Identities = 26/120 (21%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 1157 DDYEEEEEEVK--PVTKKRGSGKKGKKSY-VEETAPEDEEEEKPVKKRRRRKKVQISDED 1213
             D EE  E+V    V ++  + K     + +     ED +E  P +     ++ +  +E 
Sbjct: 126  SDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEES 185

Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
            +  E+E  ++    S     ++ ++E++EE       +  V  E E +++ +GEEEE ++
Sbjct: 186  DGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEE 245



 Score = 31.1 bits (70), Expect = 3.0
 Identities = 27/122 (22%), Positives = 47/122 (38%), Gaps = 21/122 (17%)

Query: 404 QQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPA 463
            +K   EE  +K            +D+ D++     E  R+T      DN    ++APPA
Sbjct: 123 LEKSDVEEPSEK------------VDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPA 170

Query: 464 K-----QIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAG 518
           +     + +E L    G     E+ EDE    +   E   E ++   E+    D+ +D  
Sbjct: 171 QPDVDNEEEERLEESDG----REEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVV 226

Query: 519 EE 520
           + 
Sbjct: 227 DY 228


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner membrane
            complex of itself with TolQ and TolR to the outer
            membrane complex of TolB and OprL (also called Pal). Most
            of the length of the protein consists of low-complexity
            sequence that may differ in both length and composition
            from one species to another, complicating efforts to
            discriminate TolA (the most divergent gene in the tol-pal
            system) from paralogs such as TonB. Selection of members
            of the seed alignment and criteria for setting scoring
            cutoffs are based largely conserved operon struction.
            //The Tol-Pal complex is required for maintaining outer
            membrane integrity. Also involved in transport (uptake)
            of colicins and filamentous DNA, and implicated in
            pathogenesis. Transport is energized by the proton motive
            force. TolA is an inner membrane protein that interacts
            with periplasmic TolB and with outer membrane porins
            ompC, phoE and lamB [Transport and binding proteins,
            Other, Cellular processes, Pathogenesis].
          Length = 346

 Score = 35.6 bits (82), Expect = 0.15
 Identities = 25/120 (20%), Positives = 54/120 (45%), Gaps = 8/120 (6%)

Query: 1158 DYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
            D     ++   + +++   K   K   +E   + E++ +  +K+R  ++ +  + ++   
Sbjct: 44   DPGAVAQQANRIQQQK---KPAAKK-EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAA 99

Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEV---EEAEGEEEEEDDK 1274
             E   K+  ++ K  A++K K+ +E K   +   K   E E E    EEA+ + EEE   
Sbjct: 100  AEKAAKQAEQAAK-QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158



 Score = 35.2 bits (81), Expect = 0.22
 Identities = 32/165 (19%), Positives = 64/165 (38%), Gaps = 13/165 (7%)

Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS 1175
            + E++ EEA   R + Q +Q +       ++  K  E       E++++  +   K+   
Sbjct: 72   KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131

Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
             K        E   E + +E+  K+     K + + E + +  E+ KK +       A+ 
Sbjct: 132  AKA-----KAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE-------AEA 179

Query: 1236 KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
            K K + + K    + K K  E  +    AE   + E + A+    
Sbjct: 180  KAKAEAKAKAKAEEAKAK-AEAAKAKAAAEAAAKAEAEAAAAAAA 223


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
            envelope biogenesis, outer membrane].
          Length = 387

 Score = 35.7 bits (82), Expect = 0.16
 Identities = 41/224 (18%), Positives = 90/224 (40%), Gaps = 29/224 (12%)

Query: 1057 DDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE 1116
            D   V Q   R + + +  +K + +R+K + +    E+K                  E+E
Sbjct: 56   DPGAVVQQYGRIQSQQSSAKKGEQQRKK-KEEQVAEELKP-------------KQAAEQE 101

Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
            R ++ E+  +    +Q++         + E    +E       ++EE+  K   +++   
Sbjct: 102  RLKQLEKERLKAQEQQKQA-------EEAEKQAQLEQK-----QQEEQARKAAAEQKKKA 149

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
            +  K     E A      E    K++  +  + ++E + + + +  KKK ++    A +K
Sbjct: 150  EAAKAKAAAEAAKLKAAAEA---KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEK 206

Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
             K + E K    K+ +   EE+   E+ +   + + DKA+   K
Sbjct: 207  AKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250



 Score = 30.7 bits (69), Expect = 6.0
 Identities = 32/181 (17%), Positives = 66/181 (36%), Gaps = 8/181 (4%)

Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
             E+ K ++  E+ER+++L  E  +   +    ++ ++ A L  +  E      +  ++ 
Sbjct: 88  VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQE------EQARKA 141

Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
             EQ+KK    K K      +        K   + +   E+ +   +   +  K   E  
Sbjct: 142 AAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAK 201

Query: 461 PPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
             A++ K    +    E   E + +E +  E  K+K    +KA           + A   
Sbjct: 202 AAAEKAKAEAEAKAKAEKKAEAAAEEKAAAE--KKKAAAKAKADKAAAAAKAAERKAAAA 259

Query: 521 A 521
           A
Sbjct: 260 A 260


>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 442

 Score = 35.5 bits (82), Expect = 0.17
 Identities = 51/278 (18%), Positives = 101/278 (36%), Gaps = 27/278 (9%)

Query: 539 SIMVNGKLKEYQIKGLEWMVSLYNNNLNGILADEMGLGKTVQTIALLTYLMEKKKNLGPY 598
            +    +L+ YQ + L+ +V        G++    G GKTV     +  L   K++    
Sbjct: 30  IVAFEFELRPYQEEALDALVKNRRTERRGVIVLPTGAGKTVVAAEAIAEL---KRST--- 83

Query: 599 LVIVPLSTL-SNWVLEFEKWA-PSANVVAYKGSPLARRALQGRMKNSKFNVLLTTYEFII 656
           LV+VP   L   W    +K+   +  +  Y G         G  +     V + T + + 
Sbjct: 84  LVLVPTKELLDQWAEALKKFLLLNDEIGIYGG---------GEKELEPAKVTVATVQTLA 134

Query: 657 KDK--SVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTFYNSSHRVLLTGTP---LQNKLP 711
           + +        ++  +I DE H +        + L     +  R+ LT TP      ++ 
Sbjct: 135 RRQLLDEFLGNEFGLIIFDEVHHLPAP--SYRRILELLSAAYPRLGLTATPEREDGGRIG 192

Query: 712 ELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRR 771
           +L+ L+  ++  +       ++ + AP+     KV L E+E               LLR 
Sbjct: 193 DLFDLIGPIVYEVSLK-ELIDEGYLAPYKYVEIKVTLTEDE--EREYAKESARFRELLRA 249

Query: 772 LKKEVESQLPEKVEYIIKCDMSGLQKLLYQHLHNRGIL 809
                      ++    +  ++ ++ LL +H      L
Sbjct: 250 RGTLRAENEARRIAIASERKIAAVRGLLLKHARGDKTL 287


>gnl|CDD|217972 pfam04224, DUF417, Protein of unknown function, DUF417.  This
           family of uncharacterized proteins appears to be
           restricted to proteobacteria.
          Length = 153

 Score = 34.0 bits (78), Expect = 0.17
 Identities = 24/87 (27%), Positives = 33/87 (37%), Gaps = 11/87 (12%)

Query: 545 KLKEYQIKGLE-------WMVSLYNNNLNGILADEMGLGKTVQTIALLTYL---MEKKKN 594
           K   Y+ + ++        M  LY    NG       LG T   + +L  +     K   
Sbjct: 33  KFFPYEAEAIKPFVSNSPLMSFLYEFLKNGTYGVSYLLGITETIVGILLLIGLRNPKAGL 92

Query: 595 LGPYLVIV-PLSTLSNWVLEFEKWAPS 620
           LG  LV V  + TLS  +   E W PS
Sbjct: 93  LGGLLVFVTFIVTLSFLITTPEAWVPS 119


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
            consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 35.7 bits (82), Expect = 0.18
 Identities = 33/157 (21%), Positives = 69/157 (43%), Gaps = 11/157 (7%)

Query: 1117 REREKEE----ALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKK 1172
            +ERE +E    A   +    +KQ+D   +    ++ +   D   D+  ++++E K + K 
Sbjct: 204  KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263

Query: 1173 -RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
               S  K  K   E    E E+ +  +K     K  + + + +D +    K++ + S K 
Sbjct: 264  ADTSSPKEDKQVAENQKREIEKAQIEIK-----KNDEEALKAKDHKAFDLKQESKASEKE 318

Query: 1232 PAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
             A+DK  E  +++  V++  +K   + E    +  E+
Sbjct: 319  -AEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNED 354



 Score = 30.7 bits (69), Expect = 5.8
 Identities = 30/155 (19%), Positives = 67/155 (43%), Gaps = 13/155 (8%)

Query: 1139 TDSLTDQEWLKAI-EDDGAD--------DYEEEEEEV---KPVTKKRGSGKKGKKSYVEE 1186
            TDS++D++ ++A+ ED+           D +E E +    +    K    KK   +   +
Sbjct: 174  TDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQ 233

Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDE-EKH 1245
               +  ++    ++   R+K Q +       D S  K+ ++  +   ++  K   E +K+
Sbjct: 234  QKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKN 293

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
                 K K  +  +  +E++  E+E +DK  + +K
Sbjct: 294  DEEALKAKDHKAFDLKQESKASEKEAEDKELEAQK 328


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the protein
            product of the dentatorubral-pallidoluysian atrophy
            (DRPLA) gene. DRPLA OMIM:125370 is a progressive
            neurodegenerative disorder. It is caused by the expansion
            of a CAG repeat in the DRPLA gene on chromosome 12p. This
            results in an extended polyglutamine region in
            atrophin-1, that is thought to confer toxicity to the
            protein, possibly through altering its interactions with
            other proteins. The expansion of a CAG repeat is also the
            underlying defect in six other neurodegenerative
            disorders, including Huntington's disease. One
            interaction of expanded polyglutamine repeats that is
            thought to be pathogenic is that with the short glutamine
            repeat in the transcriptional coactivator CREB binding
            protein, CBP. This interaction draws CBP away from its
            usual nuclear location to the expanded polyglutamine
            repeat protein aggregates that are characteristic of the
            polyglutamine neurodegenerative disorders. This
            interferes with CBP-mediated transcription and causes
            cytotoxicity.
          Length = 979

 Score = 35.8 bits (82), Expect = 0.19
 Identities = 28/121 (23%), Positives = 45/121 (37%), Gaps = 27/121 (22%)

Query: 1171 KKRGSGKKGKKSYVEET-----------APEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            KK+ +   G+ S   E            A     + K    ++  KK++       EE  
Sbjct: 22   KKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKIK-------EEAT 74

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV-----EEEEEVEEAEGEEEEEDDK 1274
            SP K  ++  + PA D     +E + V +K+ K           E   E EGE E  D +
Sbjct: 75   SPLKSTKRQREKPASDT----EEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSR 130

Query: 1275 A 1275
            +
Sbjct: 131  S 131


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 35.7 bits (83), Expect = 0.20
 Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 10/107 (9%)

Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSNWVLE-FEKWAPSANVVAYKGSPLA 632
           G+GKT   + +  YL +K K      +I P   L   V+E  EK+              +
Sbjct: 105 GVGKTTFGLVMSLYLAKKGKKS---YIIFPTRLLVEQVVEKLEKFGEKVGCGVKILYYHS 161

Query: 633 ------RRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIID 673
                 +     R+K   F++L+TT +F+ K+   L K ++  + +D
Sbjct: 162 SLKKKEKEEFLERLKEGDFDILVTTSQFLSKNFDELPKKKFDFVFVD 208


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 35.1 bits (81), Expect = 0.25
 Identities = 24/98 (24%), Positives = 43/98 (43%), Gaps = 10/98 (10%)

Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
              Y  + +P  E  E P KK+ RR   ++S  +E+EE+E  +      +  P +    +D
Sbjct: 10   SPYSSDDSPSLEPPELPPKKKGRR---RVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKD 66

Query: 1241 DEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
                    KR  + ++EE++ E+    E    +  S P
Sbjct: 67   -------GKRPVRPLKEEKDSEKKASTEAAVRNPLSDP 97


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
            RPA34.5.  This is a family of proteins conserved from
            yeasts to human. Subunit A34.5 of RNA polymerase I is a
            non-essential subunit which is thought to help Pol I
            overcome topological constraints imposed on ribosomal DNA
            during the process of transcription.
          Length = 193

 Score = 33.9 bits (78), Expect = 0.29
 Identities = 20/79 (25%), Positives = 37/79 (46%)

Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
                    D    E       ++K  + K  K++ VEE   ++++++K VKK ++ KK +
Sbjct: 114  FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173

Query: 1209 ISDEDEDEEDESPKKKKRK 1227
                 E +  +  KKKK+K
Sbjct: 174  KEKMVEPKGSKKKKKKKKK 192



 Score = 33.2 bits (76), Expect = 0.53
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
              +    E E    +K    K  + ++ +E+E+ E  KKK+ K  K   KDK ++  E K
Sbjct: 124  PPSELGSESET--SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181

Query: 1245 HVVSKRKKK 1253
                K+KKK
Sbjct: 182  GSKKKKKKK 190



 Score = 29.7 bits (67), Expect = 6.8
 Identities = 28/100 (28%), Positives = 39/100 (39%), Gaps = 24/100 (24%)

Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDED 1213
             GA D    E      ++   S K+      +E   E+EE     KK +++KK       
Sbjct: 118  YGAPDGPPSE----LGSESETSEKETTAKVEKEAEVEEEE-----KKEKKKKK------- 161

Query: 1214 EDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
                 E  K+KK K  K   K+K  E    K    K+KKK
Sbjct: 162  -----EVKKEKKEKKDK---KEKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region [General
            function prediction only].
          Length = 303

 Score = 34.6 bits (79), Expect = 0.29
 Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 14/119 (11%)

Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
            E +   + EEE +            +K KK  +E+    D+  E    +     + + SD
Sbjct: 199  EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESE-SD 257

Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
            EDEDE+++   +K++      ++  H   +              E+E E E+    +  
Sbjct: 258  EDEDEDNKGKIRKRKTDDAKKSRKPHIHIE-------------YEQERENEKIPAVQHS 303



 Score = 32.7 bits (74), Expect = 1.1
 Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 1142 LTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE-TAPEDEEEEKPVKK 1200
             T++E  +  ++   ++ EE + E++ VT      +K KK  +E+    +   E    ++
Sbjct: 190  ATEREKRQDEKERYVEEEEESDTELEAVTDD-SEKEKTKKKDLEKWLGSDQSMETSESEE 248

Query: 1201 RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
                +     DEDED + +  K+K   + K+     H E ++E+  
Sbjct: 249  EESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQEREN 294



 Score = 32.7 bits (74), Expect = 1.3
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 1/101 (0%)

Query: 1181 KSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKED 1240
            KS +    P +  E    K    R+K Q   E   EE+E    +    T    K+K K+ 
Sbjct: 170  KSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKK 229

Query: 1241 DEEKHVVSKRKKKVVE-EEEEVEEAEGEEEEEDDKASKPRK 1280
            D EK + S +  +  E EEEE  E+E +E+E++D   K RK
Sbjct: 230  DLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRK 270


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
            eukaryotic proteins with undetermined function.
          Length = 321

 Score = 34.7 bits (80), Expect = 0.30
 Identities = 39/189 (20%), Positives = 64/189 (33%), Gaps = 15/189 (7%)

Query: 1058 DETVNQLIARTEDELTLFQKMDA----------ERRKGETKPRLVEIKELPDWMVKDDDE 1107
            D+   +   R E E       DA           +R+ E   RL E+KEL     +  D 
Sbjct: 111  DKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQS---RRADV 167

Query: 1108 VENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVK 1167
              NSM E    REK+E        +      +     +E  +  +D+  D  ++EE+   
Sbjct: 168  DVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDE--DSEDDEEDNDN 225

Query: 1168 PVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
              + K GS    K + + + +     E     K ++  +      D        KK   +
Sbjct: 226  TPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPE 285

Query: 1228 STKTPAKDK 1236
            ST       
Sbjct: 286  STSQSPSSA 294



 Score = 34.7 bits (80), Expect = 0.30
 Identities = 32/151 (21%), Positives = 57/151 (37%), Gaps = 15/151 (9%)

Query: 1103 KDDDEVENSMYEEEREREKEEA---LMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
            K D+E E  + +E  E    +A   L  R +  +++++  + L + + L++   D   + 
Sbjct: 112  KLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNS 171

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
              E              KK      EE   EDE   K +      ++ +   +DED ED+
Sbjct: 172  MLE--------ALFRREKK----EEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDD 219

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
                    S K+ +    K     K   +KR
Sbjct: 220  EEDNDNTPSPKSGSSSPAKPTSILKKSAAKR 250


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
            Reviewed.
          Length = 830

 Score = 34.7 bits (80), Expect = 0.33
 Identities = 17/63 (26%), Positives = 24/63 (38%), Gaps = 10/63 (15%)

Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
            G+KGK S   +              RRR +     D+ +D EDE P  +K K      + 
Sbjct: 10   GRKGKPSRPVKQK----------VSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRG 59

Query: 1236 KHK 1238
            K  
Sbjct: 60   KRG 62


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
            approximately 300 residues, found in plants and
            vertebrates. They contain a highly conserved DDRGK motif.
          Length = 189

 Score = 33.9 bits (78), Expect = 0.33
 Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 4/84 (4%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
            E EEEE+  +K+   K+     E+E+ E+E  KKK+ +  K   +   KE +E      K
Sbjct: 22   EAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE----YEK 77

Query: 1250 RKKKVVEEEEEVEEAEGEEEEEDD 1273
             K   V EEE  ++   +EE  + 
Sbjct: 78   LKSSFVVEEEGTDKLSADEESNEL 101



 Score = 30.1 bits (68), Expect = 5.7
 Identities = 22/93 (23%), Positives = 48/93 (51%), Gaps = 12/93 (12%)

Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
             EEK  ++++R  + +  +E +  E++   ++K +      ++K KE++E K    +R++
Sbjct: 11   LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK----EREE 66

Query: 1253 KVVEEEEEVE--------EAEGEEEEEDDKASK 1277
            +  +E+EE E        E EG ++   D+ S 
Sbjct: 67   QARKEQEEYEKLKSSFVVEEEGTDKLSADEESN 99


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 35.0 bits (81), Expect = 0.34
 Identities = 19/139 (13%), Positives = 50/139 (35%), Gaps = 5/139 (3%)

Query: 1144 DQEWLKAIEDDG-----ADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPV 1198
                +  I  DG      +D  +E  +    +++        +       PE+ E     
Sbjct: 783  KILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEE 842

Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
            +K  +++      E ++ E+E     + K+     K+  K+ ++         K V+  +
Sbjct: 843  QKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSD 902

Query: 1259 EEVEEAEGEEEEEDDKASK 1277
            E  ++ +  ++++ D    
Sbjct: 903  EIKQDEKTTKKKKKDLEKT 921


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 34.0 bits (78), Expect = 0.38
 Identities = 21/90 (23%), Positives = 50/90 (55%), Gaps = 17/90 (18%)

Query: 328 KLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTD 387
           K+ + I ++   A+K +++++ R  KER  RL+AE  E +   +D +  +          
Sbjct: 132 KMPQMIADWR--AQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPR---------- 179

Query: 388 EYISNLTQMVKEHKEEQQKKMREEKKKKKK 417
                  +M+++ ++E++KK++E K+++K+
Sbjct: 180 -----FQEMLQQKEKEEKKKVKEAKRREKE 204


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus spores
            are protected by a protein shell consisting of over 50
            different polypeptides, known as the coat. This family of
            proteins has an important morphogenetic role in coat
            assembly, it is involved in the assembly of at least 5
            different coat proteins including CotB, CotG, CotS, CotSA
            and CotW. It is likely to act at a late stage of coat
            assembly.
          Length = 185

 Score = 33.3 bits (76), Expect = 0.43
 Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
            K   + ++ + SDE    ++E  K +  ++ K   +    E ++      K +    EE+
Sbjct: 35   KADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEK-EAGEPEREDIAEQQEKEEIAQEEEK 93

Query: 1259 EEVEEAEGEEEEEDDKASKPRK 1280
            EE  E   ++E    K  KP K
Sbjct: 94   EEEAEDVKQQEVFSFKRKKPFK 115


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 34.7 bits (80), Expect = 0.46
 Identities = 39/196 (19%), Positives = 90/196 (45%), Gaps = 25/196 (12%)

Query: 342 KEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFL----------LSQTDEYIS 391
           +E +K+ E++E++      AE  E Y++L  + ++  LA L          L + +E +S
Sbjct: 196 EELEKQLEKLERQ------AEKAERYQELKAELRELELALLLAKLKELRKELEELEEELS 249

Query: 392 NLTQMVKEHKEEQQKKMRE--EKKKKKKSIKQKILDGNMDKLDLDESSQLEDL-----RI 444
            L + ++E +EE ++  +E  E K + + +++++ +  + +  L+   ++E+L      +
Sbjct: 250 RLEEELEELQEELEEAEKEIEELKSELEELREELEE--LQEELLELKEEIEELEGEISLL 307

Query: 445 TVKETNSDNKLRGEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAV 504
             +    +N+L   +    +  ++  A     E      E+      E++E KEE  + +
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 505 GEKKTEDDEYKDAGEE 520
                E +E  +A  E
Sbjct: 368 SALLEELEELFEALRE 383



 Score = 33.9 bits (78), Expect = 0.72
 Identities = 42/226 (18%), Positives = 91/226 (40%), Gaps = 10/226 (4%)

Query: 301 ERILAAARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLM 360
           E  L  A+ K    E E      +R ++  + +      AEKE ++ +  +E+ R     
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285

Query: 361 AEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNL-----TQMVKEHKEEQQKKMREEKKKK 415
            ++E    K   ++ +  ++ L  + +E  + L          + K E  K+  EE++  
Sbjct: 286 LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345

Query: 416 KKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPAKQIKEWLASHPG 475
            + ++Q + +  +++   +   +L  L   ++E      LR E A    ++ E       
Sbjct: 346 LEELEQLLAE--LEEAKEELEEKLSALLEELEELFEA--LREELAELEAELAEIRNELEE 401

Query: 476 WEVVMEDSEDEGSE-DEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
            +  +E  E+      E +++ KEE  +   E +    E ++  EE
Sbjct: 402 LKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEE 447



 Score = 31.2 bits (71), Expect = 4.3
 Identities = 53/281 (18%), Positives = 107/281 (38%), Gaps = 27/281 (9%)

Query: 997  ITVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVP 1056
            IT  S  +R   A + +L   E+ +       +    E       +   +D  EE     
Sbjct: 652  ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711

Query: 1057 DDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE 1116
            ++        + E           E    + + RL E++E  + + ++ +E++      E
Sbjct: 712  EELERQLEELKRELAA-------LEEELEQLQSRLEELEEELEELEEELEELQ------E 758

Query: 1117 REREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
            R  E EE L    S +       + + + E  +    +  ++ EEE EE +         
Sbjct: 759  RLEELEEEL---ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
             +  +   E    E EE           ++++   ++ +EE E  +K+  +      K++
Sbjct: 816  LESLEQRRERLEQEIEE------LEEEIEELEEKLDELEEELEELEKELEE-----LKEE 864

Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
             +E + EK  +    K++ EE+EE+EE   E E E  +  +
Sbjct: 865  LEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 34.2 bits (78), Expect = 0.48
 Identities = 36/178 (20%), Positives = 68/178 (38%), Gaps = 14/178 (7%)

Query: 1104 DDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE 1163
              DE    + E  R+ E E      G   R+   Y   +         ED   +  + EE
Sbjct: 81   LSDEWMQIITEALRQAENEP-----GGHPRENQPYNLQVEKNFPTDHSED--YETQQWEE 133

Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
            E++K +   R      ++ + + T    E    P  +   +   +     E  + +   K
Sbjct: 134  EKLKHMKFPRRYEDNSEEKHSKRTNEIVEVFYNPQSQATLKSVFE-----EVGKLDGQSK 188

Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV--EEEEEVEEAEGEEEEEDDKASKPR 1279
            +KR+ +    K    ++D+     +   + VV  E+   +EE E ++ +E+ K SK R
Sbjct: 189  QKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKER 246



 Score = 30.7 bits (69), Expect = 6.2
 Identities = 23/129 (17%), Positives = 48/129 (37%), Gaps = 5/129 (3%)

Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
            D+    Y+++E++             G + +      E+++ ++ VK+ + R     S  
Sbjct: 195  DEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLP 254

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKED-----DEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
            DE +       +     +   K +               V+ R +   +   E E +E  
Sbjct: 255  DESKRSGQLGLEDEAEEEKEEKGQESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEES 314

Query: 1268 EEEEDDKAS 1276
            EE+E D+AS
Sbjct: 315  EEKELDRAS 323


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 32.4 bits (74), Expect = 0.49
 Identities = 11/30 (36%), Positives = 24/30 (80%)

Query: 393 LTQMVKEHKEEQQKKMREEKKKKKKSIKQK 422
           + Q++KE  +++++K +++KKKKKK+ K+ 
Sbjct: 101 VKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130


>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
           prediction only].
          Length = 766

 Score = 34.3 bits (79), Expect = 0.50
 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 19/111 (17%)

Query: 574 GLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSN-WVLEFEKWAP-SANVVAYKG-SP 630
           G GKT+  IALL  L    +  G  + IVPL  L+     EF +       V    G   
Sbjct: 57  GSGKTL--IALLAILSTLLEGGGKVVYIVPLKALAEEKYEEFSRLEELGIRVGISTGDYD 114

Query: 631 LARRALQGRMKNSKFNVLLTTYEFIIKDKSVLSK----IQW-KCMIIDEGH 676
           L    L      ++++V++TT E   K  S+  K    I+    ++IDE H
Sbjct: 115 LDDERL------ARYDVIVTTPE---KLDSLTRKRPSWIEEVDLVVIDEIH 156


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone sialoprotein
            (BSP) is a major structural protein of the bone matrix
            that is specifically expressed by fully-differentiated
            osteoblasts. The expression of bone sialoprotein (BSP) is
            normally restricted to mineralised connective tissues of
            bones and teeth where it has been associated with mineral
            crystal formation. However, it has been found that
            ectopic expression of BSP occurs in various lesions,
            including oral and extraoral carcinomas, in which it has
            been associated with the formation of microcrystalline
            deposits and the metastasis of cancer cells to bone.
          Length = 291

 Score = 33.5 bits (76), Expect = 0.55
 Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 19/118 (16%)

Query: 1172 KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
            KR S + G  S  E    +  EEE   +     +  + SD +EDEE E+        T  
Sbjct: 40   KRFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLG 99

Query: 1232 PAKD----------------KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
               D                K   +  +K   + ++ +  E+EEE EE E EE E ++
Sbjct: 100  YGGDATPGTGNIGLAALQLPKKAGNAGKK---ATKEDESDEDEEEEEEEEEEEAEVEE 154



 Score = 32.0 bits (72), Expect = 2.1
 Identities = 24/97 (24%), Positives = 41/97 (42%), Gaps = 23/97 (23%)

Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTP 1232
            + +G  GKK+  E+ + EDEEEE+  ++          + + +E ++        ST+  
Sbjct: 120  KKAGNAGKKATKEDESDEDEEEEEEEEE---------EEAEVEENEQGTNGTSTNSTEVD 170

Query: 1233 AKDKHK--EDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
              +     ++ EE             EEE V EAE E
Sbjct: 171  HGNGSSGGDNGEE------------GEEESVTEAEAE 195


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
            Superfamily; Provisional.
          Length = 844

 Score = 34.1 bits (78), Expect = 0.58
 Identities = 23/146 (15%), Positives = 52/146 (35%), Gaps = 21/146 (14%)

Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPED 1191
            QRK +D  D           +D+    YE++E++    T    S        + +   + 
Sbjct: 82   QRKSLDDDD-----------DDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDA 130

Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKR 1250
            + +             Q +D D+D+   S   +  ++S    + +K  E+ + +   + +
Sbjct: 131  DTQAN---------NDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFK 181

Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKAS 1276
                  + EE    +G + +  D   
Sbjct: 182  FYNFGNDGEEAAAKDGGKSKSSDPGP 207


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 34.0 bits (78), Expect = 0.61
 Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 13/85 (15%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            + ++E V P+ KK  S  K           + EE ++  KK  +RKK    D   D++ +
Sbjct: 173  DSDQEAVLPLVKKSPSEAK---------EEKAEERKQESKKGAKRKK----DASGDDKSK 219

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEK 1244
              K  +  ST T A  K   D E K
Sbjct: 220  KAKTDRDVSTSTAASQKKSSDLESK 244



 Score = 31.7 bits (72), Expect = 2.9
 Identities = 25/119 (21%), Positives = 50/119 (42%), Gaps = 16/119 (13%)

Query: 1140 DSLT--DQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKP 1197
            D+L+  DQE +  +      + +EE+ E +    K+G+ +K           +D   +  
Sbjct: 169  DTLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRK-----------KDASGDDK 217

Query: 1198 VKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
             KK +  + V  S     ++    + K    +K     K   DD +KHV +   ++++E
Sbjct: 218  SKKAKTDRDVSTSTAASQKKSSDLESKLEAQSKELWSLK---DDLKKHVSTAELREMLE 273


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 33.2 bits (76), Expect = 0.61
 Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 12/87 (13%)

Query: 1202 RRRKKVQISDEDEDEED--------ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
            R   +      D+  +D         SPK+ K+K   TP K        +K     ++K 
Sbjct: 21   RAHVRSTTESVDKVRDDIFPEDIEIPSPKQPKKKRPTTPRKPATT----KKSKKKDKEKL 76

Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKPRK 1280
              EE++   + +  EE E+D  +    
Sbjct: 77   TEEEKKPESDDDKTEENENDPDNNEES 103



 Score = 30.1 bits (68), Expect = 5.8
 Identities = 17/94 (18%), Positives = 32/94 (34%), Gaps = 9/94 (9%)

Query: 1186 ETAPEDEEE------EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
            E+  +  ++      E P  K+ ++K+     +          KKK K   T  + K + 
Sbjct: 29   ESVDKVRDDIFPEDIEIPSPKQPKKKRPTTPRK---PATTKKSKKKDKEKLTEEEKKPES 85

Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
            DD++            E  +  E A      + D
Sbjct: 86   DDDKTEENENDPDNNEESGDSQESASANSLSDID 119


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
            stage VI sporulation protein D, is restricted to
            endospore-forming members of the bacteria, all of which
            are found among the Firmicutes. It is widely distributed
            but not quite universal in this group. Between
            well-conserved N-terminal and C-terminal domains is a
            poorly conserved, low-complexity region of variable
            length, rich enough in glutamic acid to cause spurious
            BLAST search results unless a filter is used. The seed
            alignment for this model was trimmed, in effect, by
            choosing member sequences in which these regions are
            relatively short. SpoVID is involved in spore coat
            assembly by the mother cell compartment late in the
            process of sporulation [Cellular processes, Sporulation
            and germination].
          Length = 338

 Score = 33.3 bits (76), Expect = 0.69
 Identities = 25/131 (19%), Positives = 60/131 (45%), Gaps = 5/131 (3%)

Query: 1152 EDDGADDYEEEEE----EVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
            + + A + EEEEE    E +   ++  +G++ +    +      E+ E+P        K+
Sbjct: 143  DAEPAREDEEEEESFSAEFEHPAQEETAGEEERTDEPKVEHEAHEQHEQPADDDPDEWKI 202

Query: 1208 QISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
              S+  + E +     ++    +   + + + +DEEK  + ++ +   +E+    +A+  
Sbjct: 203  SASEPFQLESEVEASPEEENYEEYEDETELEVEDEEK-ALDEQTEDPQQEDALAGDAKKA 261

Query: 1268 EEEEDDKASKP 1278
             EEE++K  +P
Sbjct: 262  LEEEEEKGERP 272



 Score = 32.2 bits (73), Expect = 1.8
 Identities = 33/132 (25%), Positives = 55/132 (41%), Gaps = 15/132 (11%)

Query: 1158 DYEEEEEEVKPVTKKRG-SGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
            DYE  E     +T     SG + + +   E A EDEEEE+             + E+   
Sbjct: 116  DYELLENGCLQLTADISISGIQQENNLDAEPAREDEEEEESFSAEFE----HPAQEETAG 171

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS-------KRKKKVVEEEEEVEEAEGE-- 1267
            E+E   + K +       ++  +DD ++  +S       + + +   EEE  EE E E  
Sbjct: 172  EEERTDEPKVEHEAHEQHEQPADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDETE 231

Query: 1268 -EEEEDDKASKP 1278
             E E+++KA   
Sbjct: 232  LEVEDEEKALDE 243


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive type)
            [DNA replication, recombination, and repair].
          Length = 1444

 Score = 33.8 bits (78), Expect = 0.71
 Identities = 33/128 (25%), Positives = 54/128 (42%), Gaps = 16/128 (12%)

Query: 1039 LQTILHQDDEEEEENEVPDDETVNQLI-ARTEDELTLFQKMDAERRKGE-TKPRLVEIKE 1096
            L      +D  EE+     +E +N+ +    ++ L   +K+ AE  K E  KP     K 
Sbjct: 161  LLIEFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKG 220

Query: 1097 LPDWMVKDDDEVE--NSMYEEER---------EREKEEALMGRGSRQRKQVDYTDSLTDQ 1145
                 +K  +E++    + EEE          + E +E   GR     K  DYT SL  +
Sbjct: 221  RK---IKSTEEIKPLIKINEEETRVKVEGYIFKIEIKELKSGRTLLNIKVTDYTSSLILK 277

Query: 1146 EWLKAIED 1153
            ++L+  ED
Sbjct: 278  KFLRDEED 285


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
            consists of several mammalian specific proacrosin binding
            protein sp32 sequences. sp32 is a sperm specific protein
            which is known to bind with with 55- and 53-kDa
            proacrosins and the 49-kDa acrosin intermediate. The
            exact function of sp32 is unclear, it is thought however
            that the binding of sp32 to proacrosin may be involved in
            packaging the acrosin zymogen into the acrosomal matrix.
          Length = 243

 Score = 33.1 bits (75), Expect = 0.73
 Identities = 17/56 (30%), Positives = 26/56 (46%)

Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
              S + K  K  +     K +E  +E     K+ ++  EEEE  EEA+ EE +  D
Sbjct: 187  GGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTD 242



 Score = 30.4 bits (68), Expect = 4.7
 Identities = 21/119 (17%), Positives = 43/119 (36%), Gaps = 18/119 (15%)

Query: 1161 EEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEE----EEKPVKKRRRRKKVQISDEDEDE 1216
            E   EV+P T      +    +  +   P  E      E+ ++                 
Sbjct: 139  EPSAEVQPTTMTLPIAEHPTITENQSFQPWPERLHNNVEELLQSSLS------LGGSVQV 192

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
            +   PK+++  S          ++  ++H   +++ +  +EEEEVEE   +EE +    
Sbjct: 193  KAPKPKQEQLLS--------KLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
            Provisional.
          Length = 695

 Score = 33.8 bits (78), Expect = 0.78
 Identities = 14/88 (15%), Positives = 32/88 (36%), Gaps = 4/88 (4%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT--PAKDKHKEDDEEKHVV 1247
            E E+  +  + R ++     + +D+D    +  + K K      P   K     +   V+
Sbjct: 463  EREKAAR--EARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVI 520

Query: 1248 SKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
            + R+ +  +      E +     +  KA
Sbjct: 521  AAREARKAQARARQAEKQAAAAADPKKA 548



 Score = 32.2 bits (74), Expect = 1.9
 Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 20/181 (11%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
             E+  RE            + +     +L   +  KA              +   V   R
Sbjct: 464  REKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAR 523

Query: 1174 GSGKKGKKSYVEETAPEDEEEEK------PVKKRRRRKKVQISDEDEDEEDESPKK---- 1223
             + K   ++   E       + K       + + + +K  Q +   E EE+  PKK    
Sbjct: 524  EARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVA 583

Query: 1224 ------KKRKSTKTPAKD----KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
                  K +K+ +  A      +  E D +K  V+    +   ++ E +     EE  D 
Sbjct: 584  AAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDP 643

Query: 1274 K 1274
            +
Sbjct: 644  R 644



 Score = 32.2 bits (74), Expect = 1.9
 Identities = 19/109 (17%), Positives = 40/109 (36%), Gaps = 15/109 (13%)

Query: 1184 VEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK----------KKRKSTKTPA 1233
            ++  A  D       ++ R+ +      E +      PKK          K +K+ +  A
Sbjct: 508  IKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAA 567

Query: 1234 KDKHKEDDEEKHV-----VSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
              + +E+ + K       +++ K K   ++    E E +  E D K + 
Sbjct: 568  NAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAA 616


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This family
            includes archaebacterial L12, eukaryotic P0, P1 and P2.
          Length = 88

 Score = 31.1 bits (71), Expect = 0.78
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 1256 EEEEEVEEAEGEEEEEDD 1273
              EEE +E E EEEE+DD
Sbjct: 65   AAEEEKKEEEEEEEEDDD 82



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query: 1256 EEEEEVEEAEGEEEEEDD 1273
               EE ++ E EEEEEDD
Sbjct: 64   AAAEEEKKEEEEEEEEDD 81


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
            family of proteins is found in bacteria and eukaryotes.
            Proteins in this family are typically between 227 and 387
            amino acids in length.
          Length = 213

 Score = 32.7 bits (75), Expect = 0.78
 Identities = 19/66 (28%), Positives = 24/66 (36%), Gaps = 14/66 (21%)

Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
            QVD  D L        +E++ A         V  V     +G         E  PE E E
Sbjct: 104  QVDEADLLA------LMEEEDAAAPTCSCCPVCSVNMTECTGP--------EPEPEPEPE 149

Query: 1195 EKPVKK 1200
            E+P KK
Sbjct: 150  EEPEKK 155



 Score = 30.8 bits (70), Expect = 3.5
 Identities = 18/83 (21%), Positives = 25/83 (30%), Gaps = 34/83 (40%)

Query: 1160 EEEEEEVKPVTKKRGSG---------------------KKGKKSYVEETAPEDEEEEKPV 1198
            E E E  +   KK G G                      K K+        + +E +   
Sbjct: 143  EPEPEPEEEPEKKSGMGPLLLVLAVALIGGGAYYYFKFYKPKQQEKGAPDDDLDEYDYG- 201

Query: 1199 KKRRRRKKVQISDEDEDEEDESP 1221
                        DEDE+E+DE P
Sbjct: 202  ------------DEDEEEDDEPP 212


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 32.4 bits (74), Expect = 0.82
 Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 4/59 (6%)

Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE----EEEEVEEAEGEEEEEDDKASKP 1278
            KK K +K     K    + EK V   R + V E    E   V   E    EE++    P
Sbjct: 117  KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAP 175


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is a
            family of uncharacterized proteins.
          Length = 449

 Score = 33.4 bits (76), Expect = 0.83
 Identities = 57/272 (20%), Positives = 90/272 (33%), Gaps = 50/272 (18%)

Query: 1054 EVPDDETVNQLIARTEDELTLFQKMDAE---RRKGETKPRLVEIKELPDWMVKDDDEVEN 1110
            E+ DD+ V   +   E +LTL   M  E   R+ G+        +E  D    DDD+ E+
Sbjct: 9    ELEDDDWVRGSLDYKE-KLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDED 67

Query: 1111 SMYEEEREREKEEALMGRGSRQRKQVDYTDSL-TDQEWLKAIEDDGADDYEEEEEEV--- 1166
               +++ + E +E      S         D   TD E   A  DD +DD  E        
Sbjct: 68   EDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEYVFWAPGT 127

Query: 1167 --------KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS-------- 1210
                    K  T +R S ++   S  +      +  +    +R   K  +I         
Sbjct: 128  TTAATSPRKLETMRRKSRRRTSDSSADSLNERKQRRKWKRPRRSPIKPPKIRPGTPELPD 187

Query: 1211 ---------DEDEDEEDE--SPKKKKRKSTK----------TPAKDKHKEDDEEKHVVSK 1249
                     DED   E    S  + +R S +           P  D   E+DE       
Sbjct: 188  STDFVCGTLDEDRPLEAAYKSCMEARRLSKQVVIPQDIDPSFPTSDPEDEEDEL-----D 242

Query: 1250 RKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
              ++V+E ++        E     +   PR S
Sbjct: 243  DVEEVIESDDHFFLDLDGERGRRRRKRSPRTS 274


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
            effective cell-replication that replication is not
            stalled at any point by, for instance, damaged bases.
            Rtf2 stabilizes the replication fork stalled at the
            site-specific replication barrier RTS1 by preventing
            replication restart until completion of DNA synthesis by
            a converging replication fork initiated at a flanking
            origin. The RTS1 element terminates replication forks
            that are moving in the cen2-distal direction while
            allowing forks moving in the cen2-proximal direction to
            pass through the region. Rtf2 contains a C2HC2 motif
            related to the C3HC4 RING-finger motif, and would appear
            to fold up, creating a RING finger-like structure but
            forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.1 bits (76), Expect = 0.84
 Identities = 19/87 (21%), Positives = 32/87 (36%), Gaps = 11/87 (12%)

Query: 1206 KVQISDEDEDEE-------DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR----KKKV 1254
             V ++  +E+ E       +E  KKKK+K  K   K+       E  V S          
Sbjct: 159  VVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGA 218

Query: 1255 VEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             +  E  +  +      D++ S+  KS
Sbjct: 219  GQVGEAKKLKKKRSIAPDNEKSEVYKS 245


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
            membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 32.8 bits (75), Expect = 0.86
 Identities = 20/90 (22%), Positives = 27/90 (30%), Gaps = 15/90 (16%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE-----KPVKKRRRRKKVQISDEDE 1214
            E   EE +P  +      K       ET PE    +     KP KK ++ K         
Sbjct: 50   EAPTEEPQPEPEPPEEQPKPPT--EPETPPEPTPPKPKEKPKPEKKPKKPKPKPK----- 102

Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
                +   K K K    P K   K   +  
Sbjct: 103  ---PKPKPKPKVKPQPKPKKPPSKTAAKAP 129


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.  This
            family represents Cwf15/Cwc15 (from Schizosaccharomyces
            pombe and Saccharomyces cerevisiae respectively) and
            their homologues. The function of these proteins is
            unknown, but they form part of the spliceosome and are
            thus thought to be involved in mRNA splicing.
          Length = 241

 Score = 32.8 bits (75), Expect = 0.87
 Identities = 28/116 (24%), Positives = 47/116 (40%), Gaps = 20/116 (17%)

Query: 1170 TKKRGSGKKGKKSYVE-----------ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED 1218
             ++    KK  K  +E               ED++EE  +K++R  +  + SD D+ +  
Sbjct: 66   AERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSS 125

Query: 1219 ESPKKKKRKSTKTPAKDKHKEDDEEKHV---VSKRKKKVVEEEEEVEEAEGEEEEE 1271
                     S      D    +DE   +   + K KK+  EE+E  EE +  EEE+
Sbjct: 126  SDSDSSDDDS------DDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEK 175


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 33.0 bits (76), Expect = 0.91
 Identities = 14/90 (15%), Positives = 34/90 (37%), Gaps = 13/90 (14%)

Query: 341 EKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEH 400
            KE+  E ER + E          E  ++     + +          E++  L + ++  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQE-----RSYQEHVKQLIEKMEAE 255

Query: 401 KEEQQ--------KKMREEKKKKKKSIKQK 422
           +E+           K++E+++  K+  K +
Sbjct: 256 REKLLAEQERMLEHKLQEQEELLKEGFKTE 285


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 33.6 bits (76), Expect = 0.96
 Identities = 45/241 (18%), Positives = 95/241 (39%), Gaps = 12/241 (4%)

Query: 1034 ERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVE 1093
            E  +F       ++E+E +NE    ++ +Q    +E  L   + + ++    ++  R   
Sbjct: 442  EDVDFTGKKGAINNEDESDNEEVAFDSDSQF-DESEGNLRWKEGLASKLAYSQSGKR--- 497

Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIED 1153
                    ++     E+   EE  E  K E+     S    Q +  D     +   A E 
Sbjct: 498  -----GRNIQKIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSK--VANES 550

Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEE-TAPEDEEEEKPVKKRRRRKKVQISDE 1212
              ++  +  E E + + KK  S  + K  + ++ T    E EE+ ++   +     + DE
Sbjct: 551  ISSNHEKLMESEFEELKKKWSSLAQLKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDE 610

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
            +   ++E  + +    T    +   + D E +   + RKK+ +    E+EE    E+++ 
Sbjct: 611  ENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDV 670

Query: 1273 D 1273
            D
Sbjct: 671  D 671


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.9 bits (76), Expect = 0.97
 Identities = 18/41 (43%), Positives = 23/41 (56%)

Query: 1233 AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
              DK   D+E K V+S + +    EEEE EE E EEEE  +
Sbjct: 278  LADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318


>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen.  The
            hepatitis delta virus (HDV) encodes a single protein, the
            hepatitis delta antigen (HDAg). The central region of
            this protein has been shown to bind RNA. Several
            interactions are also mediated by a coiled-coil region at
            the N terminus of the protein.
          Length = 194

 Score = 32.5 bits (74), Expect = 1.0
 Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 20/98 (20%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSK 1249
            +D E   P K+ R          D+ E D  P K+      T   D+ + D   +  +  
Sbjct: 62   KDGEGAPPAKRPR---------TDQMEVDSGPGKRPHAGGFT---DQERRDHRRRKALEN 109

Query: 1250 RKKKVV--------EEEEEVEEAEGEEEEEDDKASKPR 1279
            +KK++         EEEEE+     E+EE + + + PR
Sbjct: 110  KKKQLSSGGKHLSREEEEELRRLTEEDEERERRVAGPR 147


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
            unknown].
          Length = 769

 Score = 33.1 bits (75), Expect = 1.0
 Identities = 44/215 (20%), Positives = 74/215 (34%), Gaps = 27/215 (12%)

Query: 1082 RRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEE-REREKEEALMGRGSRQRKQVDYTD 1140
            R  G    RL  I +  D++   +  + N  YE+  RE   + A       Q       +
Sbjct: 274  RGIGFNANRLKHIPDFGDFLS--NKIIVNGQYEQTIREIFADRATKLELDLQTVFESNMN 331

Query: 1141 SLTDQEWLKAIEDDGADDYEEEEEEV--------------KPVTKKRGSGKKGKKSYVEE 1186
              T  E+    ED  +D  E+ E +               +  T K+ +  KG   Y + 
Sbjct: 332  RDTLDEYAPEGEDLRSDYDEDFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFY-QA 390

Query: 1187 TAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHV 1246
               EDEEEE             I D+D  E D           +    ++   DD E   
Sbjct: 391  KWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNE---------EVAGDEESAIDDNEGFE 441

Query: 1247 VSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                +++  +  E  +  + + E  D+   +P +S
Sbjct: 442  ELSPEEEERQLREFRDMEKEDREFPDEAELQPSES 476


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
            region.  This family includes the N-terminal regions of
            the junctin, junctate and aspartyl beta-hydroxylase
            proteins. Junctate is an integral ER/SR membrane calcium
            binding protein, which comes from an alternatively
            spliced form of the same gene that generates aspartyl
            beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
            catalyzes the post-translational hydroxylation of
            aspartic acid or asparagine residues contained within
            epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 32.6 bits (74), Expect = 1.0
 Identities = 20/115 (17%), Positives = 47/115 (40%), Gaps = 7/115 (6%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKS---YVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
            +E + +         S  + +      V E   +    E+   ++ ++   + + E E+E
Sbjct: 128  KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENE 187

Query: 1217 EDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
            + + P +K  ++         ++ DEE + V   K      +EE+ +   EE++E
Sbjct: 188  DSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSK----AIKEELAKEPVEEQQE 238



 Score = 32.6 bits (74), Expect = 1.2
 Identities = 22/111 (19%), Positives = 49/111 (44%), Gaps = 8/111 (7%)

Query: 1172 KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE--DEDEEDESPKKKKRKST 1229
            K  S  +      E     +EE +  V+K +++ + ++ ++     E+    K+++    
Sbjct: 68   KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPG 127

Query: 1230 KTPAKDKHK------EDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
            K P  D+ K       DD ++ + + +  +  E+   VEE   E+ ++D K
Sbjct: 128  KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178



 Score = 31.8 bits (72), Expect = 1.7
 Identities = 15/85 (17%), Positives = 37/85 (43%), Gaps = 1/85 (1%)

Query: 1194 EEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKK 1253
              K           + ++   +EE +   +K ++  +   K++ +    EK VVSK+++ 
Sbjct: 66   GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQS-LLEKIVVSKQEED 124

Query: 1254 VVEEEEEVEEAEGEEEEEDDKASKP 1278
               +E +++E +    E+ D   + 
Sbjct: 125  GPGKEPQLDEDKFLLAEDSDDRQET 149



 Score = 31.8 bits (72), Expect = 1.8
 Identities = 22/129 (17%), Positives = 52/129 (40%), Gaps = 2/129 (1%)

Query: 1150 AIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQI 1209
            A+        EE +E+++ + +K    K+ +    +E   ++++         R++ ++ 
Sbjct: 93   AVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEA 152

Query: 1210 SDEDEDEEDESPKKK-KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEE 1268
                E+ ED    ++   +  K   K+K  E + E       +K    + E  +  E + 
Sbjct: 153  GKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPV-EKAERTKAETDDVTEEDY 211

Query: 1269 EEEDDKASK 1277
            +EED+    
Sbjct: 212  DEEDNPVED 220



 Score = 31.8 bits (72), Expect = 2.1
 Identities = 20/122 (16%), Positives = 41/122 (33%), Gaps = 5/122 (4%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE---EKPVKKRRRRKKVQISDEDEDE 1216
               E+ V    ++ G GK+ +    +    ED ++        K     +     E+   
Sbjct: 111  SLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETAS 170

Query: 1217 EDESPKKKKRKSTKTPAKDKH--KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDK 1274
            E      K++ S +     K   ++ +  K       ++  +EE+   E     +EE  K
Sbjct: 171  EQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAK 230

Query: 1275 AS 1276
              
Sbjct: 231  EP 232



 Score = 31.4 bits (71), Expect = 2.4
 Identities = 33/141 (23%), Positives = 64/141 (45%), Gaps = 9/141 (6%)

Query: 401 KEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDA 460
             + ++K+ EE K++ +S+ +KI+    ++    +  QL++ +  + E + D   R E  
Sbjct: 94  VRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDD---RQETL 150

Query: 461 PPAKQIKEWLASHPGWEVVME----DSEDEGSEDEEMKEKKEEPSKAVGEKKTEDD-EYK 515
              K  +E   S+   E   E    D +++ SE +E ++ KE   KA   K   DD   +
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKASE-QENEDSKEPVEKAERTKAETDDVTEE 209

Query: 516 DAGEEATYYSIAHTYHEKVTE 536
           D  EE      +    E++ +
Sbjct: 210 DYDEEDNPVEDSKAIKEELAK 230


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
            family represents the C-terminus (approximately 300
            residues) of proteins that are involved as binding
            partners for Prp19 as part of the nuclear pore complex.
            The family in Drosophila is necessary for pre-mRNA
            splicing, and the human protein has been found in
            purifications of the spliceosome. In the past this family
            was thought, erroneously, to be associated with
            microfibrillin.
          Length = 277

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 3/124 (2%)

Query: 1152 EDDGADDYEEEEEEVKPV-TKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
            E++  D  ++ E  +KPV T+K+      ++          EEE K   + R+R+ ++I 
Sbjct: 23   EEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIV 82

Query: 1211 DEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK--KKVVEEEEEVEEAEGEE 1268
            +E+  +E E  K+           D   E++EE++   K +  K++  + EE EE E E+
Sbjct: 83   EEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIKRDREEREEMEREK 142

Query: 1269 EEED 1272
             E +
Sbjct: 143  AEIE 146



 Score = 31.4 bits (71), Expect = 2.8
 Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 21/189 (11%)

Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
            E+ EL +     ++E E S  EEE + E +     +    RK+   T    ++E  K   
Sbjct: 3    EVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKA 62

Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRR-KKVQISD 1211
                    EEE + K   +KR + K      VEE   ++ E +K           V   D
Sbjct: 63   -------LEEEAKRKAEERKRETLK-----IVEEEVKKELELKKRNTLLEANIDDVDTDD 110

Query: 1212 EDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
            E+E+EE E+ K ++ K        + K D EE+  + + K ++ +     EE    E  +
Sbjct: 111  ENEEEEYEAWKLRELK--------RIKRDREEREEMEREKAEIEKMRNMTEEERRAELRK 162

Query: 1272 DDKASKPRK 1280
            + K S  + 
Sbjct: 163  NPKVSTNKA 171


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
            known as chromatin decondensation proteins. They bind to
            core histones and transfer DNA to them in a reaction that
            requires ATP. This is thought to play a role in the
            assembly of regular nucleosomal arrays.
          Length = 146

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 38/79 (48%)

Query: 1148 LKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
            L A E+D +DD EE+EEE                        +DEE++            
Sbjct: 106  LVASEEDESDDDEEDEEEE-----------------------DDEEDD------------ 130

Query: 1208 QISDEDEDEEDESPKKKKR 1226
               DEDE EE+ESP KK +
Sbjct: 131  ---DEDESEEEESPVKKVK 146



 Score = 29.2 bits (66), Expect = 7.8
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 1256 EEEEEVEEAEGEEEEEDDKASKPRK 1280
            E+EEE ++ E ++E+E ++   P K
Sbjct: 119  EDEEEEDDEEDDDEDESEEEESPVK 143


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
            This group consists of Eukaryotic TruB proteins similar
            to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
            makes psi55 in the T loop of both cytoplasmic and
            mitochondrial tRNAs. Psi55 is almost universally
            conserved.  Pseudouridine synthases catalyze the
            isomerization of specific uridines in an RNA molecule to
            pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 7/107 (6%)

Query: 1133 RKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETA---- 1188
            R+QV  ++ L  ++W++          EE ++  + V K  G   K      E TA    
Sbjct: 170  RRQVVVSELLV-KDWIEPGPLFTRTVEEEGKQYERSVVKMLGKELKTFAEVTELTATAEG 228

Query: 1189 PEDEEEEKPVK--KRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
               EE E   +  KR+   + + +++    E  S +    + + +PA
Sbjct: 229  DPVEEVEATHEESKRKSEVEEEANEKSLGPEARSLESDAGRGSFSPA 275


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
            [Posttranslational modification, protein turnover,
            chaperones].
          Length = 926

 Score = 33.4 bits (76), Expect = 1.1
 Identities = 23/96 (23%), Positives = 34/96 (35%), Gaps = 9/96 (9%)

Query: 1159 YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEED 1218
              EEEE   P   +  SGK  +K      +         +K   R K+        D+E 
Sbjct: 759  DLEEEEFEYPRMYEEASGKSVRKVNTAVLSTT-------IKAAARAKQKPKEKGPNDKEI 811

Query: 1219 --ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
              ESP  +      T  + + K  D    +  K+KK
Sbjct: 812  KIESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 32.9 bits (76), Expect = 1.1
 Identities = 26/105 (24%), Positives = 43/105 (40%), Gaps = 2/105 (1%)

Query: 1170 TKK-RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKS 1228
             KK R  G  GKK  VE TA  DE+E        R +K + +      +    K+  R +
Sbjct: 31   NKKVRKGGFLGKK-LVEVTAAVDEDETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTN 89

Query: 1229 TKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
            ++           E K  + K  +   EE +   E++  ++E D+
Sbjct: 90   SQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDE 134


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
            timeless (tim) gene is essential for circadian function
            in Drosophila. Putative homologues of Drosophila tim have
            been identified in both mice and humans (mTim and hTIM,
            respectively). Mammalian TIM is not the true orthologue
            of Drosophila TIM, but is the likely orthologue of a fly
            gene, timeout (also called tim-2). mTim has been shown to
            be essential for embryonic development, but does not have
            substantiated circadian function. Some family members
            contain a SANT domain in this region.
          Length = 507

 Score = 33.1 bits (75), Expect = 1.1
 Identities = 42/238 (17%), Positives = 86/238 (36%), Gaps = 42/238 (17%)

Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEERE--------REKEEALMGRGSR 1131
             ++R+ + KP+    +E  +   ++D E E  + E + E         + E + +     
Sbjct: 226  GKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQLPESKPEETEKRVSAFQVEGSTLISAEN 285

Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEE--------------------EEVKPVTK 1171
             R+Q+    +     WL++     ADD EE+E                    E+ + + K
Sbjct: 286  LRQQLKQEKTSWPLLWLQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLK 345

Query: 1172 K--------------RGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEE 1217
                           R   K          A    EEE+P  + +     + +DE E E 
Sbjct: 346  ALGLRPPRSGQEGFWRIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHET 405

Query: 1218 DESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
                K  +++     + ++    +EE+    K+K+   + +++       +EE DD +
Sbjct: 406  LALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTS 463



 Score = 31.2 bits (70), Expect = 4.3
 Identities = 49/328 (14%), Positives = 100/328 (30%), Gaps = 34/328 (10%)

Query: 970  WNPHQDLQAQD--RAHRIGQKNEVRVLRLITVNSVEERI--------LAAARYKLNMDEK 1019
            W   +DL+ Q      R        +++ IT      ++        L + + +L   +K
Sbjct: 169  WTGDEDLELQRLYEEFRDSDDVLGHIMKNITGKRSRAQVVDKLLQLGLVSEKRELRGGKK 228

Query: 1020 VIQAGMFDQKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMD 1079
              +     Q +      +  Q      +EEE+  E   +ET  ++ A   +  TL    +
Sbjct: 229  RRKKLKPKQPNGEESGEDDFQEDP---EEEEQLPESKPEETEKRVSAFQVEGSTLISAEN 285

Query: 1080 AERRKGETKPRLVEIKELPDWMVKDDDEVENSMYE-----EEREREKEEALMGRGSRQRK 1134
              ++  + K     +      +   DD  E+   +        E  +E     +  R  K
Sbjct: 286  LRQQLKQEKTSWPLLWLQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQFQRLLK 345

Query: 1135 QVDYTDSLTDQE--WLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDE 1192
             +      + QE  W    +                  +     K        E A E E
Sbjct: 346  ALGLRPPRSGQEGFWRIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDG---EQADESE 402

Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKK 1252
             E   ++K  R++K  ++             ++    +   K   K+    +    +   
Sbjct: 403  HETLALRKNARQRKAGLA-----------SPEEEALGEEEQKAPPKKKQLNQKNKQQTGS 451

Query: 1253 KVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
                +EE  + +  E+ +  D    PR 
Sbjct: 452  GTNSDEERDDTSLDEDRDLADDGGLPRI 479


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This family
            consists of several bovine specific leukaemia virus
            receptors which are thought to function as transmembrane
            proteins, although their exact function is unknown.
          Length = 561

 Score = 33.1 bits (75), Expect = 1.2
 Identities = 48/210 (22%), Positives = 85/210 (40%), Gaps = 30/210 (14%)

Query: 1097 LPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTD------------ 1144
            + D  VK ++E  +    E+ +REK++    +  R+R     T+S  D            
Sbjct: 77   MSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTE 136

Query: 1145 ---QEWLKAIEDDG--ADDYEE---------EEEEVKPVTKKRGSGKKGKKSYVEETAPE 1190
               +  L + EDD    D Y            + E  PV K R +  +  KS  +   P 
Sbjct: 137  EMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNA--ETSKSPEKGDVPA 194

Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKR 1250
             E++ K  KK+ +++K +  D+D+ +E E  K        +PA      + +E  + +  
Sbjct: 195  VEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTV 254

Query: 1251 KKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
                   + E +E +  E EE  K+ K +K
Sbjct: 255  SGT--APDSEPDEPKDAEAEETKKSPKHKK 282



 Score = 32.7 bits (74), Expect = 1.5
 Identities = 22/108 (20%), Positives = 39/108 (36%)

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            +E++E+ K   K +    +G KS +              +         +S    D E +
Sbjct: 205  KEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPD 264

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGE 1267
             PK  + + TK   K K K+  +EK    K+KK         +    +
Sbjct: 265  EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQ 312


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
            homolog A (Rrp7A) and similar proteins.  The family
            corresponds to the RRP7 domain of Rrp7A, also termed
            gastric cancer antigen Zg14, and similar proteins which
            are yeast ribosomal RNA-processing protein 7 (Rrp7p)
            homologs mainly found in Metazoans. The cellular function
            of Rrp7A remains unclear currently. Rrp7A harbors an
            N-terminal RNA recognition motif (RRM), also termed RBD
            (RNA binding domain) or RNP (ribonucleoprotein domain),
            and a C-terminal RRP7 domain.
          Length = 129

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 14/91 (15%)

Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
            +Y  S+ D   L++  D+  ++Y++EEEE              +K   E+ A  DE+   
Sbjct: 8    EYNASIPDPAELQSEIDEYMEEYDKEEEE--------------EKEEKEKEAEPDEDGWV 53

Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
             V K+ RR K    +    +  E  KKKK+K
Sbjct: 54   TVTKKGRRPKTARKESVAAKAAEKEKKKKKK 84


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
            [Transcription].
          Length = 392

 Score = 32.8 bits (74), Expect = 1.3
 Identities = 33/181 (18%), Positives = 69/181 (38%), Gaps = 6/181 (3%)

Query: 1102 VKDDDEVENSMYEEEREREKEEALMG--RGSRQRKQVDYTDSLTDQEWLKAIEDDGADDY 1159
            V  ++   N   ++ERE      L    +  R R+    +  +  +E  K ++D    D 
Sbjct: 158  VYHENSFLNKHLKKEREYYYLHGLSPPLKYVRARRFRKKSSKIEIEEVEKKVDDLLEKDM 217

Query: 1160 EEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            + E   V    +K    ++ + S  E    E  E       ++ ++  +   E+   E++
Sbjct: 218  KAESVSVVLKDEKE-LARQERVSSWENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEED 276

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPR 1279
                      K  ++   ++  EE       K+   E + +  +  GEE+EEDD+  +  
Sbjct: 277  LDVGAAEIENKEVSEGDKEQQQEEVENAEAHKE---EVQSDRPDEIGEEKEEDDENEENE 333

Query: 1280 K 1280
            +
Sbjct: 334  R 334


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 32.6 bits (74), Expect = 1.3
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 475 GWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEE 520
             +VV E+S DE SEDE   E  E   + V E +  + +  D  E 
Sbjct: 339 EGDVVTEESTDEESEDEV--EIDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are RNase
            Y, an endoribonuclease. The member from Bacillus
            subtilis, YmdA, has been shown to be involved in turnover
            of yitJ riboswitch [Transcription, Degradation of RNA].
          Length = 514

 Score = 32.6 bits (75), Expect = 1.4
 Identities = 21/76 (27%), Positives = 43/76 (56%), Gaps = 5/76 (6%)

Query: 1201 RRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEE 1260
            + RR ++Q  +    + +E+  +K     K   K+++ E  E++  +S ++K + E+EEE
Sbjct: 72   KERRNELQRLERRLLQREETLDRKMESLDK---KEENLEKKEKE--LSNKEKNLDEKEEE 126

Query: 1261 VEEAEGEEEEEDDKAS 1276
            +EE   E+ EE ++ S
Sbjct: 127  LEELIAEQREELERIS 142


>gnl|CDD|180481 PRK06231, PRK06231, F0F1 ATP synthase subunit B; Validated.
          Length = 205

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 6/67 (8%)

Query: 327 QKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQT 386
           ++ N  I       EKE+++ +E+++KE +   M   EE  +K +D++ D +L       
Sbjct: 140 RQANLIIFQARQEIEKERRELKEQLQKESVELAMLAAEELIKKKVDREDDDKL------V 193

Query: 387 DEYISNL 393
           DE+I  L
Sbjct: 194 DEFIREL 200


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain fusion
            protein; Provisional.
          Length = 131

 Score = 31.1 bits (70), Expect = 1.5
 Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 23/100 (23%)

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
             +G+  K      TA  +  EE            + +D DED ++ +             
Sbjct: 54   EAGRAEKGPAAAATAAAEAAEEA-----------EAADADEDADEAA------------E 90

Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
             D   E DEE+       +   E + E EEA+ EE+EE +
Sbjct: 91   ADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
            Med19 represents a family of conserved proteins which are
            members of the multi-protein co-activator Mediator
            complex. Mediator is required for activation of RNA
            polymerase II transcription by DNA binding
            transactivators.
          Length = 178

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 4/65 (6%)

Query: 1172 KRGSGKKGKKSYVEETAPE----DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK 1227
            K+    K KK   ++  PE    D E  K  +K+ ++KK +   E + ++ E  KKKKR 
Sbjct: 110  KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169

Query: 1228 STKTP 1232
            S + P
Sbjct: 170  SPEHP 174



 Score = 29.4 bits (66), Expect = 8.7
 Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 11/65 (16%)

Query: 1178 KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKH 1237
            + +    EET  + E  +   KK +++K            ++  ++KK+K  K   K +H
Sbjct: 121  RTQDPLPEETPSDSEGLKGHEKKHKKKKH-----------EDDKERKKKKKEKKKKKKRH 169

Query: 1238 KEDDE 1242
              +  
Sbjct: 170  SPEHP 174


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
            ribosomal biogenesis [Translation, ribosomal structure
            and biogenesis].
          Length = 821

 Score = 32.7 bits (74), Expect = 1.6
 Identities = 25/121 (20%), Positives = 50/121 (41%), Gaps = 12/121 (9%)

Query: 1161 EEEEEVKP--------VTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
            ++ E++KP         TKK   GK  K +     + +DE +E  +     + +  + D+
Sbjct: 646  KKVEDIKPEDKFFYQYFTKKTADGKGKKSNKASFDS-DDEMDENEIWSALVKSRPDVEDD 704

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
             +D E +  +     ST   + D+ K D  +         +  ++EE ++E     + E 
Sbjct: 705  SDDSELDFAEDDFSDST---SDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQ 761

Query: 1273 D 1273
            D
Sbjct: 762  D 762



 Score = 30.4 bits (68), Expect = 9.1
 Identities = 34/139 (24%), Positives = 60/139 (43%), Gaps = 24/139 (17%)

Query: 1050 EEENEVPDDETVNQLIAR-------TEDELTLFQKMDAERRKGETKPRLVEIKELPDWMV 1102
            + ++E+ ++E  + L+         ++D    F + D      + +P+L  I        
Sbjct: 680  DSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAI-------- 731

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
              DDE   S  E  +E ++EE L         + D +DS  +       E+D + + E+E
Sbjct: 732  --DDEDAKS--EGSQESDQEEGLDEIFYSFDGEQDNSDSFAESS-----EEDESSEEEKE 782

Query: 1163 EEEVKPVTKKRGSGKKGKK 1181
            EEE K V+ KR   K+ K 
Sbjct: 783  EEENKEVSAKRAKKKQRKN 801


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 1104

 Score = 32.7 bits (74), Expect = 1.8
 Identities = 19/129 (14%), Positives = 40/129 (31%), Gaps = 13/129 (10%)

Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSY------VE 1185
            +  +    ++L        ++      +     + +    KR +    +         + 
Sbjct: 324  ETDRAREMETLAHD----NVKQIVGAQHGILSAKREGAVDKRRTISTARAGLDALVKGLG 379

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
              APE  EE   +    R    +     E  E      ++    +T A D+ K  D+E  
Sbjct: 380  GAAPESAEELLELNNAARLTVDEYPAAREALESA---GQRNVEDRTRAVDEFKAADQELS 436

Query: 1246 VVSKRKKKV 1254
             +SK    +
Sbjct: 437  SLSKGSSNI 445


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter subunit
            G; Reviewed.
          Length = 197

 Score = 31.6 bits (71), Expect = 2.0
 Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 24/98 (24%)

Query: 1180 KKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKE 1239
            ++  +E+   E EE E+ ++  RR +K+   DE ED+E++  +++++         + + 
Sbjct: 124  RQEQIEKARQEREELEERMEWERREEKI---DEREDQEEQEREREEQTI-------EEQS 173

Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
            DD E  ++              E+ E E E +DDK  K
Sbjct: 174  DDSEHEII--------------EQDESETESDDDKTEK 197


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 31.9 bits (72), Expect = 2.3
 Identities = 40/210 (19%), Positives = 74/210 (35%), Gaps = 48/210 (22%)

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
            KD ++++       +E  ++  +   G+   K         ++E  KA+  +     EE 
Sbjct: 5    KDLEDLKKEAARRAKEEARKRLVAKHGAEISKL-----EEENREKEKALPKNDDMTIEEA 59

Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRR------------RRKKVQIS 1210
            +       K + +    +K    E   E  EEEK   K +             ++K + +
Sbjct: 60   KRRAAAAAKAKAAALAKQK---REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT 116

Query: 1211 DEDEDEEDESPKKK--------------KRKSTKTPAKDKHKEDDEEK-----------H 1245
            +E  +EE  + K K              +++       ++ +E D+EK            
Sbjct: 117  EEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
              +  K+K  E  E  EE     EEE  KA
Sbjct: 177  AAALAKQKAAEAGEGTEEV---TEEEKAKA 203


>gnl|CDD|147445 pfam05250, UPF0193, Uncharacterized protein family (UPF0193).  This
            family of proteins is functionally uncharacterized.
          Length = 212

 Score = 31.5 bits (71), Expect = 2.3
 Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 22/143 (15%)

Query: 1132 QRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPED 1191
            QR+Q++    L + E L   ++     +E+E +    +  +R + KK     +EE+   D
Sbjct: 44   QRRQINDH--LRNGEPLPVPKEP---KFEQEYDNTLAMADRRKNIKKRSLRTIEESGAFD 98

Query: 1192 EEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
             E+ +P   +                 E  +K+KRK   T A     +    K V  +R 
Sbjct: 99   VEKYRPADNK-----------------EPGEKEKRKLQNTMAGTHIPDPGIRKRVPRRRS 141

Query: 1252 KKVVEEEEEVEEAEGEEEEEDDK 1274
            ++ ++  EE    E  EE  + K
Sbjct: 142  EEEMDVTEEDRFDELLEEINERK 164


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
            (Rrp7p), ribosomal RNA-processing protein 7 homolog A
            (Rrp7A), and similar proteins.  This CD corresponds to
            the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
            YCL031C gene from Saccharomyces cerevisiae. It is an
            essential yeast protein involved in pre-rRNA processing
            and ribosome assembly, and is speculated to be required
            for correct assembly of rpS27 into the pre-ribosomal
            particle. Rrp7A, also termed gastric cancer antigen Zg14,
            is the Rrp7p homolog mainly found in Metazoans. The
            cellular function of Rrp7A remains unclear currently.
            Both Rrp7p and Rrp7A harbor an N-terminal RNA recognition
            motif (RRM), also termed RBD (RNA binding domain) or RNP
            (ribonucleoprotein domain), and a C-terminal RRP7 domain.
          Length = 118

 Score = 30.3 bits (69), Expect = 2.3
 Identities = 22/95 (23%), Positives = 38/95 (40%), Gaps = 13/95 (13%)

Query: 1137 DYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEK 1196
            +Y  S  D   L++  D+  +++++ EEE K   K+                  DE+   
Sbjct: 8    EYKRSRPDPAELQSEVDEYMEEFDKREEEEKEEAKEAR-------------NEPDEDGFV 54

Query: 1197 PVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKT 1231
             V +  R+ K    +  E +  E  KKKK+K    
Sbjct: 55   TVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1 protein
            homologues. SDA1 is a Saccharomyces cerevisiae protein
            which is involved in the control of the actin
            cytoskeleton. The protein is essential for cell viability
            and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 2.4
 Identities = 36/147 (24%), Positives = 53/147 (36%), Gaps = 47/147 (31%)

Query: 1135 QVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEE 1194
            +    +    +E  + +E D  DD EEE E                   VEE    D+E 
Sbjct: 76   EKWKEEERKKKEAEQGLESDDDDDEEEEWE-------------------VEEDEDSDDEG 116

Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
            E             I  E + E + S                  ED+EEK   +K+ K+ 
Sbjct: 117  E------------WIDVESDKEIESS----------------DSEDEEEKDEAAKKAKED 148

Query: 1255 VEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             +EE   E+ E   EEE+ +A K + S
Sbjct: 149  SDEELSEEDEEEAAEEEEAEAEKEKAS 175


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
            conserved protein active in cell division in a number of
            Proteobacteria. The N-terminal 30 residue region tends to
            by Lys/Arg-rich, and is followed by a membrane-spanning
            region. This is followed by an acidic low-complexity
            region of variable length and a well-conserved C-terminal
            domain of two tandem regions matched by pfam05036
            (Sporulation related repeat), found in several cell
            division and sporulation proteins. The role of FtsN as a
            suppressor for other cell division mutations is poorly
            understood; it may involve cell wall hydrolysis [Cellular
            processes, Cell division].
          Length = 298

 Score = 31.6 bits (71), Expect = 2.4
 Identities = 34/152 (22%), Positives = 53/152 (34%), Gaps = 13/152 (8%)

Query: 1093 EIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIE 1152
             I+EL    V  +D  E S      E  +  A   R   ++ Q D   +           
Sbjct: 85   YIEELEAREVLINDPEEPSNGGGVEESAQLTAEQ-RQLLEQMQADMRAAEK--------- 134

Query: 1153 DDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE 1212
                      E+ V    +K+ + KK +K+   E      E EK   K +  K+ Q +  
Sbjct: 135  ---VLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALP 191

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
             +  E +S  K    + K    DK K   +EK
Sbjct: 192  KQTAETQSNSKPIETAPKADKADKTKPKPKEK 223


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.  Genome
            duplication is precisely regulated by cyclin-dependent
            kinases CDKs, which bring about the onset of S phase by
            activating replication origins and then prevent
            relicensing of origins until mitosis is completed. The
            optimum sequence motif for CDK phosphorylation is
            S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
            potential phosphorylation sites. Drc1 is required for DNA
            synthesis and S-M replication checkpoint control. Drc1
            associates with Cdc2 and is phosphorylated at the onset
            of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
            replication by phosphorylating Drc1 and regulating its
            association with Cut5. Sld2 and Sld3 represent the
            minimal set of S-CDK substrates required for DNA
            replication.
          Length = 397

 Score = 31.7 bits (72), Expect = 2.5
 Identities = 28/127 (22%), Positives = 49/127 (38%), Gaps = 17/127 (13%)

Query: 1168 PVTKKRGSGK------KGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESP 1221
            P+  +R  GK      + ++S  +E     E E +        ++V  S    DE     
Sbjct: 225  PLRPQRPVGKGLSELVQEEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVF 284

Query: 1222 KKKKRKST-----------KTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
            KKK +K T           K   +    E D  + +    +K + E    +   + ++E+
Sbjct: 285  KKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDED 344

Query: 1271 EDDKASK 1277
            EDD+ SK
Sbjct: 345  EDDEESK 351


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
            domain family is found in eukaryotes, and is typically
            between 140 and 163 amino acids in length.
          Length = 148

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 18/90 (20%)

Query: 1195 EKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE------KHVVS 1248
            E P +K +R             +DE    +   S    A  K  + D+E      ++   
Sbjct: 37   ETPEEKAKR------------LQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEK 84

Query: 1249 KRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
            KR K +VE+ ++ ++ + +EEE DD + +P
Sbjct: 85   KRGKSLVEQHQKKQKKKKKEEENDDPSRRP 114


>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
          Length = 603

 Score = 31.8 bits (72), Expect = 2.7
 Identities = 33/172 (19%), Positives = 63/172 (36%), Gaps = 5/172 (2%)

Query: 1094 IKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIED 1153
            I+EL     + DD   +++ +EE E  K   L+         V   +S          E 
Sbjct: 43   IEELSTIRYRTDDLRLSAIEQEEGEGLKGPKLVIYQDGDFNSVVSYNSSDKNNDSLESEV 102

Query: 1154 DGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVK-KRRRRKKVQISDE 1212
            DG ++++ +EE+ +   K   S  +  +    +     + +  P   K  +  +VQ    
Sbjct: 103  DGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKTQFNPPTVKHEKNTRVQP--- 159

Query: 1213 DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
                 DE  K+ + K  +  A            +V + + +  E E  V +A
Sbjct: 160  -RRATDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTKELERAVGDA 210


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
            Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
            CPA1 family is a large family of proteins derived from
            Gram-positive and Gram-negative bacteria, blue green
            bacteria, yeast, plants and animals. Transporters from
            eukaryotes have been functionally characterized, and all
            of these catalyze Na+:H+ exchange. Their primary
            physiological functions may be in (1) cytoplasmic pH
            regulation, extruding the H+ generated during metabolism,
            and (2) salt tolerance (in plants), due to Na+ uptake
            into vacuoles. This model is specific for the fungal
            members of this family [Transport and binding proteins,
            Cations and iron carrying compounds].
          Length = 810

 Score = 31.8 bits (72), Expect = 2.8
 Identities = 53/282 (18%), Positives = 88/282 (31%), Gaps = 54/282 (19%)

Query: 1043 LHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMV 1102
            L  + E E++ E+ D         +     T         + G    R   ++E     +
Sbjct: 516  LRNNKEMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLEERSKTNI 575

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
            K+  E  N++Y  ++     E           + D  +S  D  +    + +     E  
Sbjct: 576  KERSENVNTIYGLDKLARDTE-NRDVTYVPTSRYDGIESEIDDVYTYENDSESIASSERR 634

Query: 1163 ------EEEVKPVTK---------KRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRR--- 1204
                  EEE +             +   G+  +   + +    D E       R +R   
Sbjct: 635  RIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAEVGVHNGGRLKRALS 694

Query: 1205 ----KKVQISDED----------------EDEEDE-------SPKKKKRKSTKTPAKDKH 1237
                K  QI++E                 EDE  E       SP   KRK       D  
Sbjct: 695  PPLEKLHQITNEAKKSKYYAYKVGNDLIIEDESGEVFRRYRISPHGGKRKIK--KRNDSV 752

Query: 1238 KEDDEEKHV-----VSKRKKK-VVEEEEEVEEAEGEEEEEDD 1273
               DEEK +     V +R    ++  E+E+ + E E E  DD
Sbjct: 753  VSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDD 794


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 31.3 bits (71), Expect = 2.8
 Identities = 13/58 (22%), Positives = 28/58 (48%)

Query: 368 RKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKKKKKSIKQKILD 425
            +LI +  +K+    L Q +       +  +E++E+ +K + + ++ KK  I  KI  
Sbjct: 6   TELISKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGIFSKIFG 63


>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is approximately 90 amino acids in length. The domain is
           found in several amino-acyl tRNA synthetase enzymes as
           well as in isolation in single domain proteins.
          Length = 90

 Score = 29.5 bits (67), Expect = 3.0
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 5/36 (13%)

Query: 204 YFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 239
           Y  W  F Y  L     SEDR ELL K  E + +  
Sbjct: 59  YSGW--FVYRYL---LFSEDRQELLAKIQELKKEIL 89



 Score = 29.5 bits (67), Expect = 3.0
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 5/36 (13%)

Query: 907 YFIWRDFSYLRLDGTTKSEDRGELLRKFNEPESKYF 942
           Y  W  F Y  L     SEDR ELL K  E + +  
Sbjct: 59  YSGW--FVYRYL---LFSEDRQELLAKIQELKKEIL 89


>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
          Length = 653

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 23/95 (24%), Positives = 51/95 (53%), Gaps = 8/95 (8%)

Query: 352 EKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMREE 411
           E ERMR+    + E Y +  D+++ K+L  L +Q D  + +    +KE+ E   +++++ 
Sbjct: 506 EIERMRQ----EAEKYAE-EDRRR-KQLIELKNQADSLLYSYESTLKENGELISEELKQR 559

Query: 412 KKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITV 446
            ++K + ++  + D N+   +L    QLE+ +  +
Sbjct: 560 AEQKVEQLEAALTDPNISLEEL--KQQLEEFQQAL 592


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
            N-terminal domain.  Autophagocytosis is a
            starvation-induced process responsible for transport of
            cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
            ubiquitin like modifier that is topologically similar to
            the canonical E2 enzyme. It catalyzes the conjugation of
            Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 30.4 bits (69), Expect = 3.0
 Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 9/71 (12%)

Query: 1152 EDDGADD-YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQIS 1210
              DGA++  E+E+E+   VT      K+      EE  PE  +++           V  S
Sbjct: 84   YGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDD--------DVVDSS 135

Query: 1211 DEDEDEEDESP 1221
            D DED++D+ P
Sbjct: 136  DADEDDDDDIP 146


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
            (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
            phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 31.7 bits (72), Expect = 3.0
 Identities = 39/201 (19%), Positives = 74/201 (36%), Gaps = 22/201 (10%)

Query: 1078 MDAERRKGETKPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVD 1137
            +  E+  G+  P     K L  W     DE+E+   E+      E       +R      
Sbjct: 159  LVREKLTGDAPPDSAG-KVLDLW----RDEIEDKAGEDLDGLAAEIDDQQAFAR-----V 208

Query: 1138 YTDSLTDQEWLKAIEDDGA--DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEE 1195
              D L   +  +   DDG   D  EE+ ++ +P   +     + +    +E+  ++ E  
Sbjct: 209  VRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEAT 268

Query: 1196 KPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVV 1255
                +       + S++ E +E +       + T+TP +D  +       ++ +   KV 
Sbjct: 269  DGEGEEGEMDAAEASEDSESDESD-------EDTETPGEDA-RPATPFTELMEEVDYKVF 320

Query: 1256 EEE--EEVEEAEGEEEEEDDK 1274
              E  E V   E  +E E D+
Sbjct: 321  TREFDEIVLAEELCDEAELDR 341


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
            includes the B. subtilis YqfQ protein, also known as
            VrrA, which is functionally uncharacterized. This family
            of proteins is found in bacteria. Proteins in this family
            are typically between 146 and 237 amino acids in length.
            There are two conserved sequence motifs: QYGP and PKLY.
          Length = 155

 Score = 30.5 bits (69), Expect = 3.1
 Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 10/70 (14%)

Query: 1210 SDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
            S +DE+EE E     + +    P      ++ +++ V   + +K   + E  +       
Sbjct: 93   SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP------ 146

Query: 1270 EEDDKASKPR 1279
                K SKP+
Sbjct: 147  ----KPSKPK 152


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family consists
            of various plant development proteins which are
            homologues of floricaula (FLO) and Leafy (LFY) proteins
            which are floral meristem identity proteins. Mutations in
            the sequences of these proteins affect flower and leaf
            development.
          Length = 382

 Score = 31.5 bits (72), Expect = 3.2
 Identities = 32/107 (29%), Positives = 54/107 (50%), Gaps = 15/107 (14%)

Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGS 1175
            ER R +EE + GR    R+ +   D+      L A+  +G       EE V+   +  GS
Sbjct: 114  ERRRLEEEDVEGR----RRHLLSGDTTNA---LDALSQEGL-----SEEPVQQEKEAAGS 161

Query: 1176 GKKGKKSYVEETAPE---DEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
            G  G  ++     P    D E++K  KK+RR++  ++ ++D+D+EDE
Sbjct: 162  GGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDE 208



 Score = 30.4 bits (69), Expect = 7.1
 Identities = 9/41 (21%), Positives = 22/41 (53%)

Query: 1233 AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDD 1273
                H  D E+K    K+++K  +E  E ++ + +E+++ +
Sbjct: 173  GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
            consist of three concentric proteinaceous capsid layers.
            The innermost capsid (core) is made of VP2. The genomic
            RNA and the two minor proteins VP1 and VP3 are
            encapsidated within this layer. The N-terminus of
            rotavirus VP2 is necessary for the encapsidation of VP1
            and VP3.
          Length = 887

 Score = 31.8 bits (72), Expect = 3.3
 Identities = 18/101 (17%), Positives = 43/101 (42%), Gaps = 9/101 (8%)

Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
             +R +              +DE+++   +K R   K ++ D+ E+   ++     +   +
Sbjct: 5    NRREANINNNDR---MQEKDDEKQD---QKNRMELKEKVLDKKEEVVTDNVDSPVK---E 55

Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
              +++  K  DE K    +  K+++E  +  EE + E + E
Sbjct: 56   QSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYE 96


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
            required for initiation of DNA replication in S.
            cerevisiae, forming a complex with MCM5/CDC46. Homologues
            of CDC45 have been identified in human, mouse and smut
            fungus among others.
          Length = 583

 Score = 31.5 bits (72), Expect = 3.3
 Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS- 1248
            E+E +++P      R   +  D+DE+ ++E  +  K +  +    D   +DD      S 
Sbjct: 107  EEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDED-DDDDDDDDDIATRERSL 165

Query: 1249 --KRKKKVVEEEEEVEEAE 1265
              +R+++  EE+    E  
Sbjct: 166  ERRRRRREWEEKRAELEFY 184


>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
            (TLP20).  This family consists of several
            Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
            sequences. The function of this family is unknown but
            TLP20 is known to shares some antigenic similarities to
            the smooth muscle protein telokin although the amino acid
            sequence shows no homologies to telokin.
          Length = 162

 Score = 30.4 bits (69), Expect = 3.4
 Identities = 14/61 (22%), Positives = 20/61 (32%), Gaps = 13/61 (21%)

Query: 1221 PKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRK 1280
               KK  S   P      E+DE+             EE+  + A  E   +D     P+K
Sbjct: 107  WNSKKFVSAPVPHHSSEDENDED-------------EEDNADRAGIESGIDDSAPPSPKK 153

Query: 1281 S 1281
             
Sbjct: 154  Q 154


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 31.1 bits (70), Expect = 3.5
 Identities = 20/96 (20%), Positives = 33/96 (34%), Gaps = 12/96 (12%)

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKH 1245
             T  +  EE+KP  K+    K            E+ K KK   T +    K     ++  
Sbjct: 7    TTKKKTTEEKKPAAKKATTSK------------ETAKTKKTAKTTSTKAAKKAAKVKKTK 54

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
             V    KKV  + E+ E  + E   +     +   +
Sbjct: 55   SVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 31.7 bits (72), Expect = 3.6
 Identities = 39/240 (16%), Positives = 73/240 (30%), Gaps = 22/240 (9%)

Query: 287 EVRVLRLIT--VNSVEERILAA-ARYKLNMDEKEYHRNNTARTQKLNKAIINYHINAEKE 343
            VR  +     +     R+L      +L +D     R      Q               +
Sbjct: 518 AVREEKAAVTDIPEELARLLITDELPELAVDLLVQSRIRQHWQQLRKA----------LD 567

Query: 344 QKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEE 403
           Q +      + R     A   E ++   ++  D+ L      + E   ++   +K+ K+ 
Sbjct: 568 QLEAAYEALEGRFAAAEAAMAE-WQSEWEEALDE-LGLSRELSPEQQLDILSTMKDLKKL 625

Query: 404 QQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLRGEDAPPA 463
            QKK     +  +   +Q   +  ++ L     +Q  DL                     
Sbjct: 626 MQKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFIDLSTLF-CVQRLRVAA------E 678

Query: 464 KQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKDAGEEATY 523
            Q +E   +  G     ++  DE   + E+  K+       G   TED   + A EE   
Sbjct: 679 LQKEEARLALEGNIERTKELNDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQL 738


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
            subunit 1; Provisional.
          Length = 319

 Score = 31.2 bits (71), Expect = 3.6
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 1240 DDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
             DEE       K +  EEE++  E+E E+EE++D+  + 
Sbjct: 274  GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEE 312


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
            family of proteins has no known function. However, one of
            the members is annotated as an EF-hand family protein.
          Length = 349

 Score = 31.3 bits (71), Expect = 3.7
 Identities = 23/98 (23%), Positives = 52/98 (53%)

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
             +G++  K    +  P  ++ E  VK  R  +K + + E +++++   +KKK+K      
Sbjct: 251  SAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKV 310

Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
            K + + +D  + +  +RK+++ +  +E +E E E E+E
Sbjct: 311  KQRFEANDPAQKLQEERKEQLAKLRKEEKEREKEYEQE 348


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This family
            includes several uncharacterized proteins from
            Caenorhabditis elegans.
          Length = 517

 Score = 31.2 bits (69), Expect = 3.8
 Identities = 24/97 (24%), Positives = 54/97 (55%), Gaps = 1/97 (1%)

Query: 1186 ETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRK-STKTPAKDKHKEDDEEK 1244
            E   + ++ EK  K  +++ + QI+ + E E+  + KK   K + +  A+ K +   EE+
Sbjct: 59   ELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEE 118

Query: 1245 HVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
            H   K +++ +++E+E +EA+ ++ + + K  K  K+
Sbjct: 119  HKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKA 155


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.3 bits (72), Expect = 3.8
 Identities = 23/138 (16%), Positives = 58/138 (42%), Gaps = 17/138 (12%)

Query: 339 NAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVK 398
             E +  +E E+  ++ ++    E +E  ++L   +K    +    +  E    L +  +
Sbjct: 562 EEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANE 621

Query: 399 EHKEEQQKKMREEKKKKK------KSIKQK--ILDGNMDKLDLDESS-QLEDLRITVKET 449
           + +++++K+  ++++ K        S+ QK  +L       D  E+  Q   +++ V  +
Sbjct: 622 KKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVL----SIPDDKEAIVQAGIMKMKVPLS 677

Query: 450 NSDNKLRGEDAPPAKQIK 467
           +    L     P  K+ K
Sbjct: 678 D----LEKIQKPKKKKKK 691


>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
          Length = 259

 Score = 30.9 bits (69), Expect = 3.9
 Identities = 18/68 (26%), Positives = 30/68 (44%), Gaps = 3/68 (4%)

Query: 1157 DDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDE 1216
            D  + EE+ V+   ++R      KK      +  D+E   P +K RR++         D 
Sbjct: 176  DSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDE---PPRKSRRKRHSHRISSSTDS 232

Query: 1217 EDESPKKK 1224
            +DE P+ K
Sbjct: 233  DDEEPRHK 240


>gnl|CDD|197366 cd09916, CpxP_like, CpxP component of the bacterial
           Cpx-two-component system and related proteins.  This
           family summarizes bacterial proteins related to CpxP, a
           periplasmic protein that forms part of a two-component
           system which acts as a global modulator of cell-envelope
           stress in gram-negative bacteria. CpxP aids in combating
           extracytoplasmic protein-mediated toxicity, and may also
           be involved in the response to alkaline pH. Functioning
           as a dimer, it inhibits activation of the kinase CpxA,
           but also plays a vital role in the quality control
           system of P pili. It has been suggested that CpxP
           directly interacts with CpxA via its concave polar
           surface. Another member of this family, Spy, is also a
           periplasmic protein that may be involved in the response
           to stress. The homology between CpxP and Spy suggests
           similar functions. A characteristic 5-residue sequence
           motif LTXXQ is found repeated twice in many members of
           this family.
          Length = 96

 Score = 29.1 bits (66), Expect = 4.0
 Identities = 17/80 (21%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 340 AEKEQKKEQERIEKERMRRLMAE---DEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQM 396
           A+ +  +EQ R  +E +R L+     DE   R L  +  + +    + +         Q+
Sbjct: 21  AQMKALREQMRAAREELRALLTADTFDEAAVRALAAEMAELQQELAVERAK----ARNQI 76

Query: 397 VKEHKEEQQKKMREEKKKKK 416
            +    EQ+ K+ E   K++
Sbjct: 77  YQVLTPEQRAKLNELFAKRR 96


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
            protein.  Members of this protein family are designated
            TraM and are found in a proposed transfer region of a
            class of conjugative transposon found in the Bacteroides
            lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 31.2 bits (71), Expect = 4.0
 Identities = 25/137 (18%), Positives = 46/137 (33%), Gaps = 17/137 (12%)

Query: 1151 IEDDGADDYEEEEEEVKPVTKKRGSG------KKGKKSYVEETAPEDEEEEKPVKKRRRR 1204
            I  D    YE+E  E K   KKR         +K  K   + ++ +   +E+P +     
Sbjct: 54   IIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPA--- 110

Query: 1205 KKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEA 1264
             +   S         + +   R+        K  E+ E         ++V E E  +   
Sbjct: 111  -ETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKEL-------LREVEELESRLATE 162

Query: 1265 EGEEEEEDDKASKPRKS 1281
                 E +++ +   KS
Sbjct: 163  PSPAPELEEQLALMEKS 179


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
            domain.  This domain is found in a number of different
            types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 30.0 bits (68), Expect = 4.0
 Identities = 22/74 (29%), Positives = 35/74 (47%)

Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
            K+         ++E+EDE+DES  + KR   +  AK+K + D  +       K+K  EE 
Sbjct: 34   KRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEER 93

Query: 1259 EEVEEAEGEEEEED 1272
                 AE E+E  +
Sbjct: 94   FMKALAEAEKERAE 107


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.4 bits (71), Expect = 4.0
 Identities = 26/95 (27%), Positives = 43/95 (45%)

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
            K+  + Y EE   E + E + +K+   R   + +  D  +E+ S K+K  +S +    DK
Sbjct: 78   KEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDK 137

Query: 1237 HKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEE 1271
             K  DE +  V K +++   E E V      E  E
Sbjct: 138  SKHIDEREEQVEKLEEQKKAELERVAALSQAEARE 172


>gnl|CDD|222875 PHA02558, uvsW, UvsW helicase; Provisional.
          Length = 501

 Score = 31.1 bits (71), Expect = 4.3
 Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 17/133 (12%)

Query: 576 GKTVQTIALLTYLMEKKKNLGPYLVIVPLSTL-SNWVLEFE--KWAPSANVVAYKGSPLA 632
           GK++    L  Y +E  +  G  L+IVP ++L +  + +F   +  P   +         
Sbjct: 141 GKSLIQYLLSRYYLENYE--GKVLIIVPTTSLVTQMIDDFVDYRLFPREAMHKIYS---- 194

Query: 633 RRALQGRMKNSKFNVLLTTYEFIIKDKSVLSKIQWKCMIIDEGHRMKNHHCKLTQTLNTF 692
                G  K++   ++++T++  +K        Q+  +I+DE H        LT  +   
Sbjct: 195 -----GTAKDTDAPIVVSTWQSAVKQPKEWFD-QFGMVIVDECHLFTGK--SLTSIITKL 246

Query: 693 YNSSHRVLLTGTP 705
            N   +  LTG+ 
Sbjct: 247 DNCKFKFGLTGSL 259


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
            subunit [Translation, ribosomal structure and
            biogenesis].
          Length = 591

 Score = 31.2 bits (70), Expect = 4.4
 Identities = 21/78 (26%), Positives = 39/78 (50%)

Query: 1185 EETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEK 1244
            EET    EE+     +  R + V   ++D+D+E+   +K+     +     +  E D++ 
Sbjct: 466  EETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDV 525

Query: 1245 HVVSKRKKKVVEEEEEVE 1262
            +    +K+KV EEEEE +
Sbjct: 526  NKSKNKKRKVDEEEEEKK 543


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score = 31.2 bits (71), Expect = 4.7
 Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 11/146 (7%)

Query: 321 NNTARTQKLNKAIINYHINAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKK--DKR 378
           +++   +++   +       E    +EQ ++ +  +  + A  +   ++    K   D  
Sbjct: 347 DSSREIEEIKFDLSEDRSEIEILVFREQSQLSQSSLSGIFAYMQSLKKETETLKSRIDLI 406

Query: 379 LAFLLSQTDEYISNLTQMVKEHKEEQQKK---------MREEKKKKKKSIKQKILDGNMD 429
           +  ++S T E    L +   ++K   Q           +REE+  KKK+   K+     D
Sbjct: 407 MKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELSKKKTKIHKLNKLRHD 466

Query: 430 KLDLDESSQLEDLRITVKETNSDNKL 455
              L  S   E       E  S  + 
Sbjct: 467 LSSLLSSIPEETSDRVESEKASKLRS 492


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
            unknown].
          Length = 523

 Score = 31.0 bits (70), Expect = 4.8
 Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 4/59 (6%)

Query: 1215 DEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVS--KRKKKVVEEEEEVEEAEGEEEEE 1271
            D   +  + KK   TK   ++ H E+  E       K K+K+ +   +V+ +  EEEE+
Sbjct: 21   DRRKQLARSKKVYGTKN--RNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEED 77


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 31.1 bits (71), Expect = 4.9
 Identities = 23/96 (23%), Positives = 34/96 (35%), Gaps = 35/96 (36%)

Query: 1164 EEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKK 1223
            +E++P TK     K GK S             K V  +R  KK               +K
Sbjct: 374  DELRPKTKAPSEKKTGKPS-------------KKVLAKRAEKKE-------------KEK 407

Query: 1224 KKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEE 1259
            +K K      K +H+    +   + KR+K     EE
Sbjct: 408  EKPK-----VKKRHR----DTKNIGKRRKPSGTSEE 434


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
            Provisional.
          Length = 387

 Score = 30.9 bits (70), Expect = 4.9
 Identities = 15/105 (14%), Positives = 44/105 (41%), Gaps = 5/105 (4%)

Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
            +++ + +  ++   +E    +E ++K   ++ R K+++       E+ +  ++  ++   
Sbjct: 70   QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ--- 126

Query: 1231 TPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKA 1275
              A  K K+ +E     +   K   E E +   A  ++   + K 
Sbjct: 127  --AALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169



 Score = 30.5 bits (69), Expect = 5.8
 Identities = 36/173 (20%), Positives = 63/173 (36%), Gaps = 18/173 (10%)

Query: 1116 EREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE---EEEVKPVTKK 1172
            E +R+K+E       +Q++  +       QE LK +E +     E++   EE  K    K
Sbjct: 78   EEQRKKKEQQQAEELQQKQAAE-------QERLKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 1173 RGSGKKGKKSYVEETAPEDEEEEKPV----KKRRRRKKVQISDEDEDEEDESPKKKKRKS 1228
            +   ++           + E E K      KK     K +   E   +     KKK    
Sbjct: 131  QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAE 190

Query: 1229 TKTP----AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
                    AK K + + ++K     +KK   E +    +A  E +   +KA+ 
Sbjct: 191  AAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243



 Score = 30.2 bits (68), Expect = 8.7
 Identities = 35/185 (18%), Positives = 70/185 (37%), Gaps = 27/185 (14%)

Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
           AE+++KK++++  +E  ++  AE E      + Q + +RLA              Q  K+
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQER-----LKQLEKERLA-------------AQEQKK 118

Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLD-ESSQLEDLRITVKETNSDNKLRGE 458
             EE  K+   ++K+ +++  +        K   + E+ +        K+  ++ K + E
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAA---KAKAEAEAKRAAAA---AKKAAAEAKKKAE 172

Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEK--KEEPSKAVGEKKTEDDEYKD 516
                K   E              +E +   + E K+K   E   KA  E K    +   
Sbjct: 173 AEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAA 232

Query: 517 AGEEA 521
             + A
Sbjct: 233 EAKAA 237


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like protein.
             This domain family is found in eukaryotes, and is
            typically between 212 and 238 amino acids in length. The
            family is found in association with pfam01805. There are
            two completely conserved residues (W and H) that may be
            functionally important. PRP21 is required for assembly of
            the prespliceosome and it interacts with U2 snRNP and/or
            pre-mRNA in the prespliceosome. This family also contains
            proteins similar to PRP21, such as the mammalian SF3a.
            SF3a also interacts with U2 snRNP from the
            prespliceosome, converting it to its active form.
          Length = 230

 Score = 30.5 bits (69), Expect = 4.9
 Identities = 16/76 (21%), Positives = 30/76 (39%), Gaps = 6/76 (7%)

Query: 1184 VEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEE 1243
             EE   E+++EE   K     K+++I    E       + K RK           E  ++
Sbjct: 114  EEEDIDEEKKEESAKKSTPPIKEMKIPAAGE------SRLKIRKDYVPSGNKPRAEKPKK 167

Query: 1244 KHVVSKRKKKVVEEEE 1259
            K +      +++ E+E
Sbjct: 168  KMIKCPITGELIPEDE 183


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 486

 Score = 30.9 bits (70), Expect = 4.9
 Identities = 20/102 (19%), Positives = 41/102 (40%)

Query: 1176 GKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKD 1235
             K+G K   +E   + EEE++ +   +   ++  ++           + K +  K   K 
Sbjct: 364  LKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKV 423

Query: 1236 KHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
            +  E+D     +   KK     E E  + E E+E+E  +  +
Sbjct: 424  EKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVE 465


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is a
            family of eukaryotic proteins of unknown function. Some
            of the proteins in this family are putative nucleic acid
            binding proteins.
          Length = 158

 Score = 29.8 bits (67), Expect = 5.0
 Identities = 16/88 (18%), Positives = 36/88 (40%), Gaps = 7/88 (7%)

Query: 1191 DEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKK------KRKSTKTPAKDKHKEDDEEK 1244
                    ++RR R + +  D        SP +       +R  +++ +  + ++   E+
Sbjct: 17   RSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRER 76

Query: 1245 HVVSKRKKKVVEEEEEVEEA-EGEEEEE 1271
               ++  KK   ++   EE  EG+ +EE
Sbjct: 77   DKDAREPKKRERQKLIKEEDLEGKSDEE 104


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 30.6 bits (69), Expect = 5.3
 Identities = 32/155 (20%), Positives = 52/155 (33%), Gaps = 19/155 (12%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTD--QEWLKAIEDDGADDYEEEEEEV----- 1166
            EE R  +  E L  R +  R Q   T+ L     E  K +  DG    +E   E+     
Sbjct: 136  EEVRADDTPEVLAKRLASYRAQ---TEPLVHYYSEKRKLLTVDGMMTIDEVTREIGRVLA 192

Query: 1167 -------KPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDE 1219
                   K   K   +    KK+  +  +   +  +K   K     K   + +   +  +
Sbjct: 193  AVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKK--AAKTAAKAAK 250

Query: 1220 SPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKV 1254
              KK  +K+ K  AK   K   +     +K  K  
Sbjct: 251  KAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGA 285


>gnl|CDD|222428 pfam13872, AAA_34, P-loop containing NTP hydrolase pore-1. 
          Length = 303

 Score = 30.6 bits (70), Expect = 5.6
 Identities = 26/93 (27%), Positives = 35/93 (37%), Gaps = 28/93 (30%)

Query: 637 QGRMKNSKFNVLLTTYEFII-------KDKSVLSKI-QW------KCMIIDEGHRMKNHH 682
           QG+    K  +L TTY  +        K +S L +I  W        ++ DE H+ KN  
Sbjct: 129 QGKPIRLKEGILFTTYATLRSESQAGGKKRSRLDQIVDWLGEDFDGVIVFDECHKAKNAA 188

Query: 683 ---CKLTQTLNTFYNSSHRVLLTGTPLQNKLPE 712
               K +Q             L G  LQN LP 
Sbjct: 189 GGEKKPSQQ-----------GLAGLRLQNALPN 210


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
            (DUF874).  This family consists of several hypothetical
            proteins specific to Helicobacter pylori. The function of
            this family is unknown.
          Length = 417

 Score = 30.6 bits (68), Expect = 5.8
 Identities = 28/131 (21%), Positives = 59/131 (45%), Gaps = 4/131 (3%)

Query: 1148 LKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKV 1207
            L A   D  D   E  +  K     R    K      +E    ++E++K  K+       
Sbjct: 116  LAACFADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANS 175

Query: 1208 QISDEDEDEEDESPKKK----KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
            QI  E E ++ E  K+K    K+K++    K+  + + E++   ++++  + E+++ ++E
Sbjct: 176  QIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKE 235

Query: 1264 AEGEEEEEDDK 1274
            AE   +E  ++
Sbjct: 236  AEQNCQENHNQ 246


>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
           ATPase domain and a DNA-binding domain [Transcription /
           Signal transduction mechanisms].
          Length = 403

 Score = 30.4 bits (69), Expect = 5.9
 Identities = 32/148 (21%), Positives = 55/148 (37%), Gaps = 17/148 (11%)

Query: 201 LEDYFIWRDFSYLRLDGTTKSEDRGELLRKF-------NEPESKYFIFLLSTRAGGLGLN 253
           L ++F+    S  R  G   S D  E LR         N  E K  +     +A G G +
Sbjct: 259 LAEHFLK---SEARRLGLPLSVDSPEALRALLAYDWPGNIRELKNLVERAVAQASGEGQD 315

Query: 254 LQAADTVIIFDSDWNPHQDLQAQDRAHRIGQKNEVRVLRLITVNSVEERILAAARYKLNM 313
           L+     I  D    P  +      + ++        LR  ++  VEER+L  A  +   
Sbjct: 316 LEDLIISIRLDEL--PGLNDVPAGISLQLNLPELPLSLRETSLEQVEERLLQKALEQNKG 373

Query: 314 DEKEYHRNNTARTQKLNKAIINYHINAE 341
           ++K+      AR   +++  + Y +   
Sbjct: 374 NKKK-----AARLLGISRKTLRYRLKKY 396


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.1 bits (68), Expect = 6.1
 Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 1/62 (1%)

Query: 1219 ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKP 1278
            ES K K  K    P         ++     K   K  +  E V++   +E++E+ K  K 
Sbjct: 17   ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK-AQMAERVKKLHSQEKKEEKKKPKK 75

Query: 1279 RK 1280
            +K
Sbjct: 76   KK 77


>gnl|CDD|218595 pfam05460, ORC6, Origin recognition complex subunit 6 (ORC6).  This
            family consists of several eukaryotic origin recognition
            complex subunit 6 (ORC6) proteins. Despite differences in
            their structure and sequences among eukaryotic
            replicators, ORC is a conserved feature of replication
            initiation in all eukaryotes. ORC-related genes have been
            identified in organisms ranging from S. pombe to plants
            to humans. All DNA replication initiation is driven by a
            single conserved eukaryotic initiator complex termed he
            origin recognition complex (ORC). The ORC is a six
            protein complex. The function of ORC is reviewed in.
          Length = 297

 Score = 30.3 bits (68), Expect = 6.2
 Identities = 27/151 (17%), Positives = 45/151 (29%), Gaps = 13/151 (8%)

Query: 1028 QKSTGSERHEFLQTILHQDDEEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGET 1087
                G E   F      + D+    + +   E  NQ     E  ++  +  +A + K   
Sbjct: 138  HVLLGVESIVFNDGRKKKKDKISIADII---ELCNQFGLPVEVTVSALEGFEAYKHKFLK 194

Query: 1088 KPRLVEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYT-----DSL 1142
            K        L   +          + +          L      QR+ VD       + L
Sbjct: 195  KSAWQLACGLV--LAAYFRINHRLLRQTPGIDSL---LPQLEGSQREDVDLWIPIVEEWL 249

Query: 1143 TDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
             D++W + IE         EEE+V     K 
Sbjct: 250  EDEDWFREIEIKYDYFDGAEEEQVIRKRVKL 280


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
            found in a wide range of eukaryotes. It is a nuclear
            protein and is suggested to be DNA binding. In plants,
            this family is essential for correct circadian clock
            functioning by acting as a light-quality regulator
            coordinating the activities of blue and red light
            signalling pathways during plant growth - inhibiting
            growth in red light but promoting growth in blue light.
          Length = 233

 Score = 30.0 bits (68), Expect = 6.4
 Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 20/106 (18%)

Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTK 1230
            KK+   KK K S+ ++   EDE+E +  KK  +             ++ +    K+K  K
Sbjct: 1    KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEP--------DEANVNPNKKKIGK 52

Query: 1231 TP------AKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEE 1270
             P        DK +E+ E         ++ + EE   ++   +EEE
Sbjct: 53   NPSVDTSFLPDKAREEKEA------ELREELREEFLKKQEAVKEEE 92


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part of
            a large ribonucleoprotein complex containing the U3
            snoRNA. Depletion of the Utp proteins impedes production
            of the 18S rRNA, indicating that they are part of the
            active pre-rRNA processing complex. This large RNP
            complex has been termed the small subunit (SSU)
            processome.
          Length = 239

 Score = 30.0 bits (68), Expect = 6.6
 Identities = 26/137 (18%), Positives = 54/137 (39%), Gaps = 6/137 (4%)

Query: 1118 EREKEE-ALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSG 1176
            E+ KEE  L   GS   K + + D   +Q+     E         +  E +P   +    
Sbjct: 106  EKLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKT 165

Query: 1177 KKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDK 1236
                +   +++A +  +  K +K+R+ R+K     + +  E     +++        K K
Sbjct: 166  SLVDEKQKKKSAKKKRKLYKELKERKEREK-----KLKKVEQRLELQRELMKKGKGKKKK 220

Query: 1237 HKEDDEEKHVVSKRKKK 1253
              +D + K V   +K++
Sbjct: 221  IVKDKDGKVVYKWKKER 237


>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family.  All
           proteins in this family for which functions are known
           are 3'-5' DNA-DNA helicases. These proteins are used for
           recombination, recombinational repair, and possibly
           maintenance of chromosome stability. This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 470

 Score = 30.5 bits (69), Expect = 6.7
 Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 5/83 (6%)

Query: 599 LVIVPL-STLSNWVLEFEKWAPSANVVAYKGSPLARRALQGRMKNSKFNVLLTTYEFIIK 657
           LVI PL S + + VL+ +     A  +    S   ++ +   +K+ K  +L  T E    
Sbjct: 55  LVISPLISLMEDQVLQLKASGIPATFLNSSQSKEQQKNVLTDLKDGKIKLLYVTPEKCSA 114

Query: 658 DKSVLSKIQ--WK-CMI-IDEGH 676
              +L  ++      +I +DE H
Sbjct: 115 SNRLLQTLEERKGITLIAVDEAH 137


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.5 bits (69), Expect = 7.0
 Identities = 22/92 (23%), Positives = 33/92 (35%), Gaps = 12/92 (13%)

Query: 1171 KKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDED----EDEEDESPKKKKR 1226
              R  GK  +     E A + +  +K  K+R+  KKV    +     E +      KK  
Sbjct: 730  APRNVGKTAR-----EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDA 784

Query: 1227 KSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
            +  K P+    K     K   +KR  K    E
Sbjct: 785  RKAKKPSAKTQKIAAATK---AKRAAKKKVAE 813


>gnl|CDD|240653 cd12176, PGDH_3, Phosphoglycerate dehydrogenases, NAD-binding and
           catalytic domains.  Phosphoglycerate dehydrogenases
           (PGDHs) catalyze the initial step in the biosynthesis of
           L-serine from D-3-phosphoglycerate. PGDHs come in 3
           distinct structural forms, with this first group being
           related to 2-hydroxy acid dehydrogenases, sharing
           structural similarity to formate and glycerate
           dehydrogenases. PGDH in E. coli and Mycobacterium
           tuberculosis form tetramers, with subunits containing a
           Rossmann-fold NAD binding domain. Formate/glycerate and
           related dehydrogenases of the D-specific 2-hydroxyacid
           dehydrogenase superfamily include groups such as formate
           dehydrogenase, glycerate dehydrogenase, L-alanine
           dehydrogenase, and S-adenosylhomocysteine hydrolase.
           Despite often low sequence identity, these proteins
           typically have a characteristic arrangement of 2 similar
           subdomains of the alpha/beta Rossmann fold NAD+ binding
           form. The NAD+ binding domain is inserted within the
           linear sequence of the mostly N-terminal catalytic
           domain, which has a similar domain structure to the
           internal NAD binding domain. Structurally, these domains
           are connected by extended alpha helices and create a
           cleft in which NAD is bound, primarily to the C-terminal
           portion of the 2nd (internal) domain. Some related
           proteins have similar structural subdomain but with a
           tandem arrangement of the catalytic and NAD-binding
           subdomains in the linear sequence.
          Length = 304

 Score = 30.2 bits (69), Expect = 7.1
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 32  FNAPFATTGEKVELNEEETILIIRRL 57
           FNAPF+ T    EL   E I++ RRL
Sbjct: 91  FNAPFSNTRSVAELVIGEIIMLARRL 116



 Score = 30.2 bits (69), Expect = 7.1
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 735 FNAPFATTGEKVELNEEETILIIRRL 760
           FNAPF+ T    EL   E I++ RRL
Sbjct: 91  FNAPFSNTRSVAELVIGEIIMLARRL 116


>gnl|CDD|217920 pfam04137, ERO1, Endoplasmic Reticulum Oxidoreductin 1 (ERO1).
            Members of this family are required for the formation of
            disulphide bonds in the ER.
          Length = 348

 Score = 30.4 bits (69), Expect = 7.1
 Identities = 13/66 (19%), Positives = 29/66 (43%)

Query: 1048 EEEEENEVPDDETVNQLIARTEDELTLFQKMDAERRKGETKPRLVEIKELPDWMVKDDDE 1107
               +E+E+P+    ++L      E   F ++D          +  +  +L D+ V DD++
Sbjct: 45   CTCDESEIPEVWKPSELEGSACKEGKDFGELDCSLSDCRASCKWKDTDDLDDYCVVDDED 104

Query: 1108 VENSMY 1113
             +  +Y
Sbjct: 105  SDKGVY 110


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C terminus.
             The C terminus of the plasma membrane Nha1 antiporter
            plays an important role in the immediate cell response to
            hypo-osmotic shock which prevents an execessive loss of
            ions and water. This domain is found with pfam00999.
          Length = 430

 Score = 30.5 bits (69), Expect = 7.1
 Identities = 24/128 (18%), Positives = 49/128 (38%), Gaps = 24/128 (18%)

Query: 1092 VEIKELPDWMVKDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAI 1151
            V+     +    D   + +S  EE+ E +   +L+G  + +R++     S +     ++ 
Sbjct: 326  VKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRPRSR 385

Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
             D   +D E ++ +                         D+EEE P ++RRR   +    
Sbjct: 386  RDSEDEDTERDDSD------------------------SDDEEETPAERRRRLAALGELP 421

Query: 1212 EDEDEEDE 1219
               D++DE
Sbjct: 422  SARDDDDE 429


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 30.7 bits (69), Expect = 7.2
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 10/68 (14%)

Query: 1216 EEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE--AEGEEEEEDD 1273
             E +S    K++++KT +     E    K    KRK K  EE +++E      EE+    
Sbjct: 724  AEKDSLSAPKKQTSKTAS-----EKSSSK---GKRKHKNDEEADKIESKKQRLEEKSSSC 775

Query: 1274 KASKPRKS 1281
              S     
Sbjct: 776  SPSSSSSH 783


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This family
            includes gbp a protein from Soybean that binds to GAGA
            element dinucleotide repeat DNA. It seems likely that the
            this domain mediates DNA binding. This putative domain
            contains several conserved cysteines and a histidine
            suggesting this may be a zinc-binding DNA interaction
            domain.
          Length = 301

 Score = 30.2 bits (68), Expect = 7.2
 Identities = 21/106 (19%), Positives = 29/106 (27%), Gaps = 31/106 (29%)

Query: 1152 EDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISD 1211
             D+     E    EV P+                 TAP + +E K  KK +  K      
Sbjct: 120  GDNPYGTREMHHLEVPPI----------------STAPPEAKEVKKPKKGQSPKV----- 158

Query: 1212 EDEDEEDESPKKKK-RKSTKTPAKDKHKEDDEEKHVVSKRKKKVVE 1256
                     PK  K +K  K  +              SK   K  +
Sbjct: 159  ---------PKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195


>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
          Length = 253

 Score = 30.0 bits (67), Expect = 7.3
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 7/105 (6%)

Query: 507 KKTEDDEYKDAGEEA-TYYSIAHTYHEKVTEQASIMVNGKLKEYQIKGLE-WMVSLYNNN 564
           K+  DD Y+    +  +Y  I + Y +KV    +  +N     Y +   E + V   NN 
Sbjct: 104 KQRLDDLYRQLPTKTRSYIDIINMYCDKVNNDYNRDMN---IMYDMASTESFTVYDINNE 160

Query: 565 LNGILADEMGLGKTVQTIALLTYLMEKKKNLGPYLVIVPLSTLSN 609
           +N IL D  GLG  + TI+ +T L   ++ + P   I   + LS+
Sbjct: 161 VNTILMDNKGLGVRLATISFITEL--GRRCMNPVETIKMFTLLSH 203


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
            Cobaltochelatase is responsible for the insertion of
            cobalt into the corrin ring of coenzyme B12 during its
            biosynthesis. Two versions have been well described.
            CbiK/CbiX is a monomeric, anaerobic version which acts
            early in the biosynthesis (pfam06180). CobNST is a
            trimeric, ATP-dependent, aerobic version which acts late
            in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
            number of genomes (actinobacteria, cyanobacteria,
            betaproteobacteria and pseudomonads) which apparently
            biosynthesize B12, encode a cobN gene but are
            demonstrably lacking cobS and cobT. These genomes do,
            however contain a homolog (modelled here) of the
            magnesium chelatase subunits BchI/BchD family. Aside from
            the cyanobacteria (which have a separate magnesium
            chelatase trimer), these species do not make chlorins, so
            do not have any use for a magnesium chelatase.
            Furthermore, in nearly all cases the members of this
            family are proximal to either CobN itself or other genes
            involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 30.4 bits (69), Expect = 7.4
 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 1199 KKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEE 1258
              RRRRK     ++ + ++++  ++K  +    P K    EDD E+     R       +
Sbjct: 306  PHRRRRKP---FEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQS--GPRGHPTPGND 360

Query: 1259 EEVEEAEGEEEEEDDKASKPR 1279
            +E E    EE +    ++ P 
Sbjct: 361  DEKEPDPQEEADGQGSSTDPA 381


>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489).  This is
           a bacterial family of uncharacterized proteins.
          Length = 131

 Score = 29.1 bits (66), Expect = 7.4
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 8/62 (12%)

Query: 389 YISNLTQMVKEHKEEQQKKMREEKKKKKKSIKQ--KILDGNM--DKLDLDESSQLEDLRI 444
           Y   L   +K+ KE Q+K   E K+ ++  I +  +I+   M  ++ +L E +    +RI
Sbjct: 7   YAGYLLLKLKKQKELQEKAQAEAKQARRARILESIRIIARAMLQEQCELSEGA----IRI 62

Query: 445 TV 446
            V
Sbjct: 63  KV 64


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
            consists of several mammalian dentin matrix protein 1
            (DMP1) sequences. The dentin matrix acidic phosphoprotein
            1 (DMP1) gene has been mapped to human chromosome 4q21.
            DMP1 is a bone and teeth specific protein initially
            identified from mineralised dentin. DMP1 is primarily
            localised in the nuclear compartment of undifferentiated
            osteoblasts. In the nucleus, DMP1 acts as a
            transcriptional component for activation of
            osteoblast-specific genes like osteocalcin. During the
            early phase of osteoblast maturation, Ca(2+) surges into
            the nucleus from the cytoplasm, triggering the
            phosphorylation of DMP1 by a nuclear isoform of casein
            kinase II. This phosphorylated DMP1 is then exported out
            into the extracellular matrix, where it regulates
            nucleation of hydroxyapatite. DMP1 is a unique molecule
            that initiates osteoblast differentiation by
            transcription in the nucleus and orchestrates mineralised
            matrix formation extracellularly, at later stages of
            osteoblast maturation. The DMP1 gene has been found to be
            ectopically expressed in lung cancer although the reason
            for this is unknown.
          Length = 514

 Score = 30.4 bits (68), Expect = 7.5
 Identities = 30/168 (17%), Positives = 63/168 (37%), Gaps = 10/168 (5%)

Query: 1114 EEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKR 1173
            E  R   +E++   +     ++V    S + QE     +    ++  E +EEV  V++ R
Sbjct: 306  EHSRSESQEDSEENQSQEDSQEVQDPSSESSQE----ADLPSQENSSESQEEV--VSESR 359

Query: 1174 GSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPKKKKRKSTKTPA 1233
            G       S+ E+    +  EE  +      +    S   E++ D    +    S ++P 
Sbjct: 360  GDNPDNTTSHSEDQEDSESSEEDSLDTPSSSE----SQSTEEQADSESNESLSSSEESPE 415

Query: 1234 KDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
              + +    ++ + S         +E   E +   EE+D  +    +S
Sbjct: 416  STEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRS 463


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 30.6 bits (69), Expect = 7.7
 Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 20/151 (13%)

Query: 1129 GSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETA 1188
            G +Q   +  +  L+D+E  K +++  A++Y+E++E+ K +   +               
Sbjct: 528  GKKQEITIQSSGGLSDEEIEKMVKE--AEEYKEQDEKKKELVDAKN-------------- 571

Query: 1189 PEDEEEEKPVKKRRRRKKVQISDEDEDE-EDESPKKKKRKSTKTPA--KDKHKEDDEEKH 1245
             E E     V+K+    K +ISD D+DE + +  K +   S++     KDK K+  E   
Sbjct: 572  -EAETLIYSVEKQLSDLKDKISDADKDELKQKITKLRSTLSSEDVDSIKDKTKQLQEASW 630

Query: 1246 VVSKRKKKVVEEEEEVEEAEGEEEEEDDKAS 1276
             +S++  K    + +  E     EE ++K  
Sbjct: 631  KISQQAYKQGNSDNQQSEQSTNSEESEEKND 661


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 30.2 bits (68), Expect = 7.7
 Identities = 30/161 (18%), Positives = 64/161 (39%), Gaps = 5/161 (3%)

Query: 340 AEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKE 399
            +       E  EKE   +  A+     +K + Q +++ +     Q + +   + Q    
Sbjct: 36  DQSRILNTLEEFEKEANEKR-AQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELA 94

Query: 400 HKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLED-LRITVKETNSDNKLRGE 458
            ++E  K    E    +K  + ++L+  +D+L  +   QL++   I  K+  ++      
Sbjct: 95  LQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERL 154

Query: 459 DAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEE 499
                K+++E L          E+   E + D E KE +E+
Sbjct: 155 KFENEKKLEESLELE---REKFEEQLHEANLDLEFKENEEQ 192


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 30.1 bits (68), Expect = 7.8
 Identities = 23/160 (14%), Positives = 49/160 (30%), Gaps = 20/160 (12%)

Query: 293 LITVNSVEERILAAARYKLNMDEKEYHRNNTARTQKL---NKAIINYHINAEKEQKKEQE 349
           LI +  V    +     +L   E+          Q L           +       +   
Sbjct: 9   LILLLLVNLAFIRQLLLRLGRLEQMLGELAAVLEQLLLLLAFRAEAEQLRTFARSLQALN 68

Query: 350 RIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYISNLTQMVKEHKEEQQKKMR 409
               + +  L A  ++   +  +Q +              I +L Q+  E  +E   ++ 
Sbjct: 69  LELIQELNELKARLQQQLLQSREQLQLL------------IESLAQLSSE-FQELANEIF 115

Query: 410 EEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKET 449
           EE  ++   + Q+ L     +L       LE  R  +++ 
Sbjct: 116 EELNRRLAELNQQNL----KQLLKPLREVLEKFREQLEQR 151


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 30.6 bits (69), Expect = 8.1
 Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 16/90 (17%)

Query: 1194 EEKPVKKRRRRKKVQISDEDEDEED--ESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRK 1251
            +E+    +R  K +QI D+ +D+E      K K+ K+     K K   +   K       
Sbjct: 4    KERKSGSKRGLKVIQIGDDSDDDEPIGSLFKLKRPKN---SKKVKVGLESTGK------- 53

Query: 1252 KKVVEEEEEVEEAEGEEEEEDDKASKPRKS 1281
                 EE+     E  E  +D  AS  ++ 
Sbjct: 54   ----REEKLSALDEDSEGMDDTLASFRKRL 79


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 605

 Score = 30.2 bits (68), Expect = 8.2
 Identities = 23/131 (17%), Positives = 49/131 (37%), Gaps = 14/131 (10%)

Query: 1159 YEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDE------ 1212
              E E  +  +        + KK    E + + E++     +    KK+ I +E      
Sbjct: 350  VFEFEIYLYKIINANPDKLENKKI---EISIKPEKKTINNSQIITTKKINIVEESNKNSV 406

Query: 1213 ---DEDEEDESPKKKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEE 1269
                  +      KKK          K +E  E++ V  K+ +K+ + +E ++  E  ++
Sbjct: 407  HFDTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFV--KKSEKIPKNDELLDNLELAKQ 464

Query: 1270 EEDDKASKPRK 1280
            +  +K  +  K
Sbjct: 465  KFFNKDIELSK 475


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
            family is found in eukaryotes, and is approximately 40
            amino acids in length. The family is found in association
            with pfam07719, pfam00515. There is a single completely
            conserved residue L that may be functionally important.
            NARP1 is the mammalian homologue of a yeast N-terminal
            acetyltransferase that regulates entry into the G(0)
            phase of the cell cycle.
          Length = 516

 Score = 30.3 bits (69), Expect = 8.4
 Identities = 26/120 (21%), Positives = 48/120 (40%), Gaps = 20/120 (16%)

Query: 1149 KAIEDDGADDYEEEEEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQ 1208
            +  E++G +      E  K   K+R + KK +K   E      +++ +   K+ +     
Sbjct: 395  EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE--EAEKAAAKKKAEAAAKKAKGP--- 449

Query: 1209 ISDEDEDEEDESPKKKKRKSTKTPAKDKHK---------EDDEEKHV----VSKRKKKVV 1255
              D +  + D  P  +K   T+ P ++  K          D+ E H+    V  RKKK +
Sbjct: 450  --DGETKKVDPDPLGEKLARTEDPLEEAMKFLKPLLQLSPDNIETHLLAFEVYIRKKKYL 507


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 30.3 bits (68), Expect = 8.6
 Identities = 20/106 (18%), Positives = 41/106 (38%), Gaps = 23/106 (21%)

Query: 1190 EDEEEEKPVKKRRRRKKVQISDEDEDEE--------------DESPKKK----KRKSTKT 1231
              +EEE+   +  + K     +++ DE                ++ K K     R     
Sbjct: 1476 NPDEEEEEAFREEKEKVFTYLNKELDEAGLKRVLRRCVIETYQKTDKSKVLLRLRVGNVG 1535

Query: 1232 PAKDKH-----KEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEED 1272
              K K       E D+  H++  +  +V ++E++ E+ +  E+E D
Sbjct: 1536 KGKKKETLATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEKEVD 1581


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.4 bits (69), Expect = 8.8
 Identities = 36/181 (19%), Positives = 81/181 (44%), Gaps = 14/181 (7%)

Query: 342 KEQKKEQERIEKE--RMRRLMAEDEEGYRKLIDQKKD-KRLAFLLSQTDEYISNLTQMVK 398
           KE++KE E + +E   +   + E  E   KL  + K+ + L   + + ++ + +L    +
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255

Query: 399 --EHKEEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSDNKLR 456
             E K  + ++  EE KK+ + +++K+ +    K   +E  +L +      +   + + R
Sbjct: 256 KLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKR 315

Query: 457 GEDAPPAKQIKEWLASHPGWEVVMEDSEDEGSEDEEMKEKKEEPSKAVGEKKTEDDEYKD 516
                             G E  +++ E++    EE+K+K +E  K + E +   + Y++
Sbjct: 316 LSR---------LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEE 366

Query: 517 A 517
           A
Sbjct: 367 A 367


>gnl|CDD|176532 cd08590, PI-PLCc_Rv2075c_like, Catalytic domain of uncharacterized
           Mycobacterium tuberculosis Rv2075c-like proteins.  This
           subfamily corresponds to the catalytic domain present in
           uncharacterized Mycobacterium tuberculosis Rv2075c and
           its homologs. Members in this family are more closely
           related to the Streptomyces antibioticus
           phosphatidylinositol-specific phospholipase
           C1(SaPLC1)-like proteins rather than the typical
           bacterial phosphatidylinositol-specific phospholipase C
           (PI-PLC, EC 4.6.1.13), which participate in
           Ca2+-independent PI metabolism, hydrolyzing the membrane
           lipid phosphatidylinositol (PI) to produce
           phosphorylated myo-inositol and diacylglycerol (DAG). In
           contrast, SaPLC1-like proteins have two Ca2+-chelating
           amino acid substitutions which convert them to
           metal-dependent bacterial PI-PLC. Rv2075c and its
           homologs have the same amino acid substitutions as well,
           which might suggest they have metal-dependent PI-PLC
           activity.
          Length = 267

 Score = 29.7 bits (67), Expect = 8.8
 Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 4/31 (12%)

Query: 464 KQIKEWLASHPGWEVV---MEDSEDEGSEDE 491
            +I +WL ++P  EVV   +ED  D G +DE
Sbjct: 95  NEIADWLNANPD-EVVILYLEDHGDGGKDDE 124


>gnl|CDD|178144 PLN02529, PLN02529, lysine-specific histone demethylase 1.
          Length = 738

 Score = 30.2 bits (68), Expect = 9.0
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 9/57 (15%)

Query: 1193 EEEKPVKKRRRRKKVQISDEDEDEEDESPKK------KKRKSTKTPAKDKHKEDDEE 1243
             ++    KR  RKK  + + DE+  DE  +K      KK+  TK   +D  KE + E
Sbjct: 5    GQDGTAPKRSLRKKAGLKNYDENLMDELIEKHLGGSFKKKNRTK---QDLEKETETE 58


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.9 bits (68), Expect = 9.2
 Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 6/73 (8%)

Query: 402 EEQQKKMREEKKKKKKSIKQKILDGNMDKLDLDESSQLEDLRITVKETNSD-NKLRGEDA 460
           EE+ K++ EE +K KK +K K+      +L  ++  QLE L  T ++ + +  +L  E  
Sbjct: 340 EEELKELEEELEKIKKLLK-KLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEEL- 397

Query: 461 PPAKQIKEWLASH 473
              K++KE L S 
Sbjct: 398 ---KELKEELESL 407


>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2.  This family of
           mediator complex subunit 2 proteins is conserved in
           fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
           with and phosphorylates Med2. Post-translational
           modifications of Mediator subunits are important for
           regulation of gene expression.
          Length = 99

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 379 LAFLLSQTDEYISNLTQMVKEHKEEQQKKMREEKKK 414
               L+ T  Y+  L +  K+ +EE++KK  EE+KK
Sbjct: 64  TESKLNDTKWYVDTLVENKKKQEEEERKKKEEEEKK 99


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
            unknown].
          Length = 622

 Score = 30.1 bits (67), Expect = 9.6
 Identities = 29/161 (18%), Positives = 65/161 (40%), Gaps = 10/161 (6%)

Query: 1103 KDDDEVENSMYEEEREREKEEALMGRGSRQRKQVDYTDSLTDQEWLKAIEDDGADDYEEE 1162
                + + S Y   +  ++++         + Q  +++   D      + D  +DD E  
Sbjct: 353  DGIRDKDFSAYTASKLSDEDD---DSVMESKMQKLFSEKEIDFGLNSELVDM-SDDGENG 408

Query: 1163 EEEVKPVTKKRGSGKKGKKSYVEETAPEDEEEEKPVKKRRRRKKVQISDEDEDEEDESPK 1222
            E E    +    S +      +E T  + + + +   +R+  +K +   + +D+ D   K
Sbjct: 409  EMEDTFTSHLPASNESESDDKLETTIEKLDRKLR---ERQENRKERQLKKTKDDSDVDLK 465

Query: 1223 KKKRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEE 1263
             KK    K   K KH     E+   SK + ++++ ++E +E
Sbjct: 466  DKKESINKKNKKGKHA---IERTAASKEELELIKADDEDDE 503


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
            fungal and plant proteins and contains many hypothetical
            proteins. VID27 is a cytoplasmic protein that plays a
            potential role in vacuolar protein degradation.
          Length = 794

 Score = 30.1 bits (68), Expect = 9.7
 Identities = 17/53 (32%), Positives = 27/53 (50%)

Query: 1225 KRKSTKTPAKDKHKEDDEEKHVVSKRKKKVVEEEEEVEEAEGEEEEEDDKASK 1277
            K+K TK    ++    D    +  +      ++EEE +E E EEE+ED+  SK
Sbjct: 358  KQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK 410


>gnl|CDD|178164 PLN02549, PLN02549, asparagine synthase (glutamine-hydrolyzing).
          Length = 578

 Score = 30.1 bits (68), Expect = 9.7
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 324 ARTQKLNKAIINYHINAEKEQK---KEQERIEKERMRRLMAEDEEGY 367
           AR   L+K  I+  ++ + E K     + RIEK  +R+   ++E+ Y
Sbjct: 394 ARVPFLDKEFIDVAMSIDPEWKMIRPGEGRIEKWVLRKAFDDEEDPY 440


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.374 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 66,791,967
Number of extensions: 6921755
Number of successful extensions: 16399
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12495
Number of HSP's successfully gapped: 1122
Length of query: 1281
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1173
Effective length of database: 6,147,370
Effective search space: 7210865010
Effective search space used: 7210865010
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 65 (28.7 bits)