RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7704
         (124 letters)



>gnl|CDD|215689 pfam00067, p450, Cytochrome P450.  Cytochrome P450s are
           haem-thiolate proteins involved in the oxidative
           degradation of various compounds. They are particularly
           well known for their role in the degradation of
           environmental toxins and mutagens. They can be divided
           into 4 classes, according to the method by which
           electrons from NAD(P)H are delivered to the catalytic
           site. Sequence conservation is relatively low within the
           family - there are only 3 absolutely conserved residues
           - but their general topography and structural fold are
           highly conserved. The conserved core is composed of a
           coil termed the 'meander', a four-helix bundle, helices
           J and K, and two sets of beta-sheets. These constitute
           the haem-binding loop (with an absolutely conserved
           cysteine that serves as the 5th ligand for the haem
           iron), the proton-transfer groove and the absolutely
           conserved EXXR motif in helix K. While prokaryotic P450s
           are soluble proteins, most eukaryotic P450s are
           associated with microsomal membranes. their general
           enzymatic function is to catalyze regiospecific and
           stereospecific oxidation of non-activated hydrocarbons
           at physiological temperatures.
          Length = 461

 Score = 76.2 bits (188), Expect = 2e-17
 Identities = 32/74 (43%), Positives = 47/74 (63%), Gaps = 3/74 (4%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHI 60
           LA HP++Q+++ +EI +V+GD     PTYD +Q +  L  VIKETLRL+P  P+ + R +
Sbjct: 288 LAKHPEVQEKLREEIDEVIGDK--RSPTYDDLQNMPYLDAVIKETLRLHPVVPLLLPREV 345

Query: 61  TSEIQADKYTIPIG 74
           T +     Y IP G
Sbjct: 346 TKDTVIPGYLIPKG 359


>gnl|CDD|215164 PLN02290, PLN02290, cytokinin trans-hydroxylase.
          Length = 516

 Score = 60.6 bits (147), Expect = 6e-12
 Identities = 27/74 (36%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
           +LA +P  Q +V  E+ +V G +    P+ D + +L LL  VI E+LRLYP A ++ R  
Sbjct: 342 LLASNPTWQDKVRAEVAEVCGGET---PSVDHLSKLTLLNMVINESLRLYPPATLLPRMA 398

Query: 61  TSEIQADKYTIPIG 74
             +I+     IP G
Sbjct: 399 FEDIKLGDLHIPKG 412


>gnl|CDD|225035 COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis,
           transport, and catabolism].
          Length = 411

 Score = 59.4 bits (144), Expect = 1e-11
 Identities = 21/75 (28%), Positives = 30/75 (40%), Gaps = 18/75 (24%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
           L  HP    ++  E                   +  LL  V++ETLRLYP  P+  R  T
Sbjct: 263 LLRHPDQLAKLRAE------------------PDRPLLEAVVEETLRLYPPVPLARRVAT 304

Query: 62  SEIQADKYTIPIGRK 76
            +++   Y IP G  
Sbjct: 305 EDVELGGYRIPAGTV 319


>gnl|CDD|215354 PLN02655, PLN02655, ent-kaurene oxidase.
          Length = 466

 Score = 53.6 bits (129), Expect = 2e-09
 Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPII-GRHI 60
           LA +P  Q+ +Y EI +V GD  E V T + +  L  L  V  ETLR Y   P++  R +
Sbjct: 289 LAKNPDKQERLYREIREVCGD--ERV-TEEDLPNLPYLNAVFHETLRKYSPVPLLPPRFV 345

Query: 61  TSEIQADKYTIPIG 74
             +     Y IP G
Sbjct: 346 HEDTTLGGYDIPAG 359


>gnl|CDD|215393 PLN02738, PLN02738, carotene beta-ring hydroxylase.
          Length = 633

 Score = 53.4 bits (128), Expect = 2e-09
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
           +L+  P +  ++ +E+  VLGD     PT + +++L   TRVI E+LRLYP  P++ R  
Sbjct: 417 LLSKEPSVVAKLQEEVDSVLGD---RFPTIEDMKKLKYTTRVINESLRLYPQPPVLIRRS 473

Query: 61  TSEIQADKYTIPIGRKEN 78
                   Y  PI R E+
Sbjct: 474 LENDMLGGY--PIKRGED 489


>gnl|CDD|215235 PLN02426, PLN02426, cytochrome P450, family 94, subfamily C
           protein.
          Length = 502

 Score = 53.2 bits (128), Expect = 2e-09
 Identities = 14/55 (25%), Positives = 36/55 (65%), Gaps = 3/55 (5%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI 55
           +L+ HP++   + +E  +V+G + +   ++++++E+H L   + E++RL+P  P+
Sbjct: 319 LLSKHPEVASAIREEADRVMGPN-QEAASFEEMKEMHYLHAALYESMRLFP--PV 370


>gnl|CDD|215221 PLN02394, PLN02394, trans-cinnamate 4-monooxygenase.
          Length = 503

 Score = 51.3 bits (123), Expect = 1e-08
 Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 5   HPKIQQEVYDEIVQVLGDD-PETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-S 62
           HP+IQ+++ DE+  VLG     T P      +L  L  V+KETLRL+ A P++  H+   
Sbjct: 323 HPEIQKKLRDELDTVLGPGNQVTEPD---THKLPYLQAVVKETLRLHMAIPLLVPHMNLE 379

Query: 63  EIQADKYTIPIGRK 76
           + +   Y IP   K
Sbjct: 380 DAKLGGYDIPAESK 393


>gnl|CDD|178524 PLN02936, PLN02936, epsilon-ring hydroxylase.
          Length = 489

 Score = 50.2 bits (120), Expect = 3e-08
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRH 59
           +L+ +P+  ++  +E+ +VL   P   PTY+ I+EL  LTR I E++RLYP  P+ I R 
Sbjct: 304 LLSKNPEALRKAQEELDRVLQGRP---PTYEDIKELKYLTRCINESMRLYPHPPVLIRRA 360

Query: 60  ITSEIQADKYTIPIGR 75
              ++    Y +  G+
Sbjct: 361 QVEDVLPGGYKVNAGQ 376


>gnl|CDD|215371 PLN02687, PLN02687, flavonoid 3'-monooxygenase.
          Length = 517

 Score = 45.6 bits (108), Expect = 1e-06
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
           L  HP I ++  +E+  V+G D   V   D + +L  L  VIKET RL+P+ P+    + 
Sbjct: 324 LIRHPDILKKAQEELDAVVGRD-RLVSESD-LPQLTYLQAVIKETFRLHPSTPLSLPRMA 381

Query: 62  SE-IQADKYTIPIG 74
           +E  + + Y IP G
Sbjct: 382 AEECEINGYHIPKG 395


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 45.7 bits (109), Expect = 1e-06
 Identities = 14/50 (28%), Positives = 27/50 (54%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R + K   +KK     +K   K+ +KK+K K++ K KK+ +  +   K++
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 44.9 bits (107), Expect = 2e-06
 Identities = 15/48 (31%), Positives = 27/48 (56%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            E K  K  KK   K+ ++K+K+++K K+KK+ +  K   K++K    
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 43.4 bits (103), Expect = 6e-06
 Identities = 15/49 (30%), Positives = 25/49 (51%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K   +KK  K   K   K+ +KK+K+++  K KK  +  K   K++K  
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPS 429



 Score = 42.2 bits (100), Expect = 1e-05
 Identities = 11/46 (23%), Positives = 24/46 (52%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK  +  K+   ++ +KK+K+K++   K++    K   K++K    
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 39.9 bits (94), Expect = 9e-05
 Identities = 12/53 (22%), Positives = 29/53 (54%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           P  +  ++KK  K  +    K+ +KK+++K+K K +K+ +  K   ++++   
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430



 Score = 38.0 bits (89), Expect = 4e-04
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           P  +K  K  K+   +  ++K+++K+K K +K  +  K   K++K    
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 38.0 bits (89), Expect = 5e-04
 Identities = 12/48 (25%), Positives = 27/48 (56%)

Query: 66  ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
           ++K T    +K   K+ +KK++ K++ K KK+ +  + + K++K    
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431


>gnl|CDD|215171 PLN02302, PLN02302, ent-kaurenoic acid oxidase.
          Length = 490

 Score = 44.3 bits (105), Expect = 3e-06
 Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 5   HPKIQQEVYDEIVQVLGDDPET--VPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITS 62
           HP++ Q+   E  ++    P      T   ++++  L++VI ETLRL   +  + R   +
Sbjct: 317 HPEVLQKAKAEQEEIAKKRPPGQKGLTLKDVRKMEYLSQVIDETLRLINISLTVFREAKT 376

Query: 63  EIQADKYTIPIGRK 76
           +++ + YTIP G K
Sbjct: 377 DVEVNGYTIPKGWK 390


>gnl|CDD|215086 PLN00168, PLN00168, Cytochrome P450; Provisional.
          Length = 519

 Score = 43.8 bits (103), Expect = 4e-06
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
           L  +P IQ +++DEI    GDD E V   D + ++  L  V+ E LR +P A  +  H  
Sbjct: 333 LVKNPSIQSKLHDEIKAKTGDDQEEVSEED-VHKMPYLKAVVLEGLRKHPPAHFVLPHKA 391

Query: 62  SE-IQADKYTIPIG 74
           +E ++   Y IP G
Sbjct: 392 AEDMEVGGYLIPKG 405


>gnl|CDD|215627 PLN03195, PLN03195, fatty acid omega-hydroxylase; Provisional.
          Length = 516

 Score = 42.1 bits (99), Expect = 2e-05
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 18/78 (23%)

Query: 1   MLALHPKIQQEVYDEI--------VQVLGDDPETVP----------TYDQIQELHLLTRV 42
           M+ ++P + +++Y E+         +   +D ++            TYD + +L  L  V
Sbjct: 318 MIMMNPHVAEKLYSELKALEKERAKEEDPEDSQSFNQRVTQFAGLLTYDSLGKLQYLHAV 377

Query: 43  IKETLRLYPAAPIIGRHI 60
           I ETLRLYPA P   + I
Sbjct: 378 ITETLRLYPAVPQDPKGI 395


>gnl|CDD|173595 PTZ00404, PTZ00404, cytochrome P450; Provisional.
          Length = 482

 Score = 40.5 bits (95), Expect = 7e-05
 Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 4/74 (5%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
           ML  +P+IQ++ Y+EI   +    +        Q       +IKETLR  P +P      
Sbjct: 309 MLCNYPEIQEKAYNEIKSTVNGRNKV--LLSDRQSTPYTVAIIKETLRYKPVSPFGLPRS 366

Query: 61  TSE--IQADKYTIP 72
           TS   I    + IP
Sbjct: 367 TSNDIIIGGGHFIP 380


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 40.7 bits (95), Expect = 7e-05
 Identities = 16/50 (32%), Positives = 30/50 (60%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           RK+ K++K K+K+ KK K  +K+ + K + ++        KK +KK +K+
Sbjct: 20  RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69



 Score = 33.7 bits (77), Expect = 0.014
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + +KK  T+++ ++KKKK++K K K+ KK K  +K+ + K + +
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50



 Score = 28.3 bits (63), Expect = 1.1
 Identities = 15/46 (32%), Positives = 29/46 (63%), Gaps = 2/46 (4%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK   +++  +K+KK++K K+K  ++KK K  +K+ K K + Q+  
Sbjct: 10  KKILTEEELERKKKKEEKAKEK--ELKKLKAAQKEAKAKLQAQQAS 53


>gnl|CDD|215583 PLN03112, PLN03112, cytochrome P450 family protein; Provisional.
          Length = 514

 Score = 39.0 bits (91), Expect = 2e-04
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 5   HPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAP-IIGRHITSE 63
           +P++ +++ +E+  V+G     +     +  L+ L  V++ET R++PA P +I       
Sbjct: 326 NPRVLRKIQEELDSVVG--RNRMVQESDLVHLNYLRCVVRETFRMHPAGPFLIPHESLRA 383

Query: 64  IQADKYTIPIG 74
              + Y IP  
Sbjct: 384 TTINGYYIPAK 394


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 17/48 (35%), Positives = 28/48 (58%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             ++KK +KKQ K +KK +K++ +K   KK+ +   KK K    + KK
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455



 Score = 34.1 bits (79), Expect = 0.011
 Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 76  KENKK--KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
            E KK  KKQ+K + KK +K++ +K   +K  +    K K    + K+ 
Sbjct: 409 AERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 30.7 bits (70), Expect = 0.18
 Identities = 11/42 (26%), Positives = 22/42 (52%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
           RK+ +K ++K ++ + +K   KKK +    K +    + KK 
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 21/46 (45%), Positives = 35/46 (76%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           NK+ ++ K++  K  K+K++++K QK KK+KK KKK KK++KK +K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 36.9 bits (86), Expect = 0.001
 Identities = 17/42 (40%), Positives = 31/42 (73%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           +E K+K  K  + K+++K+ +K++KK+K KK+ K +KKK +K
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 35.7 bits (83), Expect = 0.003
 Identities = 19/49 (38%), Positives = 34/49 (69%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KE   K+ ++ + K  K  KKK+++K+  K++KK K+KKK K++K+K +
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 34.6 bits (80), Expect = 0.007
 Identities = 18/40 (45%), Positives = 29/40 (72%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
           I  K  K  K+K+++ K QK++KKKK+KK+  K++KK +K
Sbjct: 375 IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 32.6 bits (75), Expect = 0.033
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 68  KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
           KY  P  +K  +KK QK+K+ KK+KK+ KK++KK +
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 28.8 bits (65), Expect = 0.69
 Identities = 14/32 (43%), Positives = 25/32 (78%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
           P  ++E KK +++KK+ K++KK KK+K+K +K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 37.7 bits (88), Expect = 6e-04
 Identities = 13/43 (30%), Positives = 30/43 (69%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K   K+  +++K++K+  K+QK  ++ K +++KKKK+ ++ +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 34.2 bits (79), Expect = 0.010
 Identities = 14/41 (34%), Positives = 32/41 (78%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           K++ ++ K++K+  K++++ +K+KKQ++ KKK+ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 33.1 bits (76), Expect = 0.027
 Identities = 13/40 (32%), Positives = 30/40 (75%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           +E +++K++K+  K+QK+ +K K++++K KK+ +  +K K
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 30.8 bits (70), Expect = 0.17
 Identities = 11/37 (29%), Positives = 29/37 (78%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTK 111
           ++E ++K+  K+Q + +K +K++++KK++++K +K K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 29.2 bits (66), Expect = 0.53
 Identities = 13/44 (29%), Positives = 29/44 (65%)

Query: 65  QADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQK 108
             DK  +   ++E +  K++K+  K +K+++KKK++ +K++K K
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 36.8 bits (86), Expect = 0.001
 Identities = 15/47 (31%), Positives = 33/47 (70%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             KK  KK +   +K +KK++++K++ KK+    KKK+++++++K+K
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 35.7 bits (83), Expect = 0.003
 Identities = 13/46 (28%), Positives = 37/46 (80%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
            +    KKK+++++ +K+KK+++K++++++ +++K+ +++KKKKQ 
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 34.9 bits (81), Expect = 0.006
 Identities = 15/53 (28%), Positives = 38/53 (71%), Gaps = 1/53 (1%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKK-KKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            G K+  KK +K  +  ++K++++ K++KK+    +KK ++++++K+KK+++K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 34.1 bits (79), Expect = 0.011
 Identities = 13/49 (26%), Positives = 38/49 (77%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+ KK  +K ++ ++++K++KKK+     KK+++ +++K+KK+++++++
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458



 Score = 32.2 bits (74), Expect = 0.048
 Identities = 13/49 (26%), Positives = 39/49 (79%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+  +K +KK++ +K++K+KK    K+K +++++ K+KK++++++++++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 31.8 bits (73), Expect = 0.069
 Identities = 16/52 (30%), Positives = 40/52 (76%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           I  K  KK++++KK+ KK+    KKK+++++ +K+KK ++K++++++ +++K
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466



 Score = 30.3 bits (69), Expect = 0.23
 Identities = 9/50 (18%), Positives = 40/50 (80%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K+ +++K++KK+     K+K+++++++K KK+++ ++++++ +++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 29.9 bits (68), Expect = 0.26
 Identities = 12/50 (24%), Positives = 42/50 (84%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R+E KK+K+KK    K+K+++++++K++K +++++ +++ +++++++++K
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 36.7 bits (86), Expect = 0.001
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           KE K++ +     KK+ K   K QKK++ K++ K  K K K 
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKE-KQEAKAAKAKSKP 90



 Score = 36.0 bits (84), Expect = 0.002
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           ++ K+ K++ K     KK+ K   K QK KK+K+  K  K K K 
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKP 90



 Score = 34.8 bits (81), Expect = 0.006
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 2/41 (4%)

Query: 86  KQTKKQKKQKK--KKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+ K++ K     KK+ K   K QKK +K++ K  K + K 
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 31.3 bits (72), Expect = 0.080
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
           KK+ K   K QKK++K++ K  K K + +
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 30.6 bits (70), Expect = 0.16
 Identities = 14/46 (30%), Positives = 26/46 (56%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++KK +K + +     ++ K+ K+++K     KK+ K   K QKKK
Sbjct: 31  QRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKK 76



 Score = 29.0 bits (66), Expect = 0.54
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
           K+ K   K  KK++KQ+ K  K +   +
Sbjct: 64  KELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 75 RKENKKKKQKKKQTKKQKKQKKKK 98
          +  +K +K+K+KQ  K  K K K 
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 25.6 bits (57), Expect = 8.7
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 75 RKENKKKKQKKKQTKKQKKQK 95
          + + KK+KQ+ K  K + K +
Sbjct: 71 KAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 36.3 bits (84), Expect = 0.002
 Identities = 14/43 (32%), Positives = 20/43 (46%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           + +KK   K  K+  +K KK   K QK     K K+  K+K  
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 35.9 bits (83), Expect = 0.003
 Identities = 15/42 (35%), Positives = 20/42 (47%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
            +KK + K   K  +K KK   K QK+    K K+  KKK  
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 35.5 bits (82), Expect = 0.004
 Identities = 15/42 (35%), Positives = 19/42 (45%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
            ++  K K  KK  +K KK   K QK     K K+  KKK  
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 33.9 bits (78), Expect = 0.014
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           + +K+ K +  K+  +K KK   +  K    TK K+  K+K 
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811



 Score = 32.4 bits (74), Expect = 0.039
 Identities = 16/57 (28%), Positives = 26/57 (45%)

Query: 67  DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +K       K+  K++Q  K+   +     + +KK K K  KK  +K KK   K +K
Sbjct: 740 EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 31.3 bits (71), Expect = 0.094
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQK--------MKKQKKTKKKKKKKQKKQKKK 124
           R   K  ++K K+    KK  K++Q  +K         + +KK K K  KK  ++ KK
Sbjct: 732 RNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789



 Score = 30.5 bits (69), Expect = 0.19
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 11/62 (17%)

Query: 74  GRKENKKKKQKKKQT----------KKQKKQKKKKQKKQKMKKQKKTKKKKKK-KQKKQK 122
           G+K  K+++  KK            KK K +  KK  ++  K   KT+K     K K+  
Sbjct: 748 GKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAA 807

Query: 123 KK 124
           KK
Sbjct: 808 KK 809



 Score = 29.3 bits (66), Expect = 0.53
 Identities = 14/58 (24%), Positives = 25/58 (43%), Gaps = 9/58 (15%)

Query: 76  KENKKKKQKKKQTKKQKKQKKK---------KQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +E  KK    K+  K+++  KK         + +K+   K  K   +K KK   + +K
Sbjct: 739 REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 35.2 bits (82), Expect = 0.002
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           K+ K+  +     KK+ K  +K +KK +  K K  KKK KK++ K +
Sbjct: 52  KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 32.9 bits (76), Expect = 0.017
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 92  KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK+ K  +K +K  ++ K K +KKK +K++ K 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 27.1 bits (61), Expect = 1.7
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQK 100
           + E K +K K K  KK+ K+++ K +
Sbjct: 73  KAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 27.1 bits (61), Expect = 1.7
 Identities = 8/29 (27%), Positives = 17/29 (58%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
           +   K +K+ +K   K +K+K KK++ + 
Sbjct: 69  KAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 34.7 bits (80), Expect = 0.006
 Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 9/57 (15%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQ---------KMKKQKKTKKKKKKKQKKQKKK 124
           EN+ K+QK++  +K  K+K KK KK          ++  + +  + K+  +KK+KKK
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63



 Score = 28.5 bits (64), Expect = 0.79
 Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 20/69 (28%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQ--------------------KKTKKKKK 115
             N+  + + KQ K+Q KQK  K+K +K KK                     K+  KKKK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 116 KKQKKQKKK 124
           KK+KK+KKK
Sbjct: 61  KKKKKKKKK 69



 Score = 26.6 bits (59), Expect = 4.2
 Identities = 11/47 (23%), Positives = 18/47 (38%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +   + ++   KQ  +KKKK+KK+K KK                  
Sbjct: 41  AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSS 87



 Score = 25.8 bits (57), Expect = 7.0
 Identities = 11/45 (24%), Positives = 19/45 (42%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
               ++ + K+  KK+KK+KKKK+KK   +               
Sbjct: 44  ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSA 88



 Score = 25.8 bits (57), Expect = 7.1
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
            ++ ++ + KQ  K+KK+KKKK+KK+ + +               
Sbjct: 44  ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSA 88


>gnl|CDD|165828 PLN02183, PLN02183, ferulate 5-hydroxylase.
          Length = 516

 Score = 34.8 bits (80), Expect = 0.006
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 13/84 (15%)

Query: 6   PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQ 65
           P+  + V  E+  V+G +   V   D +++L  L   +KETLRL+P  P++      + +
Sbjct: 335 PEDLKRVQQELADVVGLN-RRVEESD-LEKLTYLKCTLKETLRLHPPIPLLLHETAEDAE 392

Query: 66  ADKYTIP-----------IGRKEN 78
              Y IP           IGR +N
Sbjct: 393 VAGYFIPKRSRVMINAWAIGRDKN 416


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 34.8 bits (81), Expect = 0.007
 Identities = 12/47 (25%), Positives = 20/47 (42%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           E   +K +K +       K    KK  +K  +KTK++    +K  K 
Sbjct: 812 ELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 33.2 bits (77), Expect = 0.020
 Identities = 9/43 (20%), Positives = 15/43 (34%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
            K  K +       K    +K   +  +K K+Q    +K  K 
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 30.9 bits (71), Expect = 0.15
 Identities = 9/45 (20%), Positives = 15/45 (33%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
              K +K +       K    KK   +  +K K+     +K  K 
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 30.5 bits (70), Expect = 0.21
 Identities = 8/36 (22%), Positives = 13/36 (36%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +K +       K    KK      +K K+Q    +K
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 28.2 bits (64), Expect = 1.2
 Identities = 11/48 (22%), Positives = 18/48 (37%)

Query: 63  EIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
           E+ A+K        +   K    K+   +  +K K+Q     K  K T
Sbjct: 812 ELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.3 bits (79), Expect = 0.008
 Identities = 20/48 (41%), Positives = 31/48 (64%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           KE+K  + KKK  KK+KK+K+KK+ K + + +   K  KK K+ K+K 
Sbjct: 59  KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106



 Score = 34.3 bits (79), Expect = 0.010
 Identities = 16/50 (32%), Positives = 26/50 (52%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             E + K       K +K+ K  + KK+  KK+KK K+KK+ K + + K 
Sbjct: 42  FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91



 Score = 33.9 bits (78), Expect = 0.012
 Identities = 21/51 (41%), Positives = 29/51 (56%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              E+KKK +KKK+ KK+KK+ K + + +   K  K  KK KKK  K K  
Sbjct: 62  KNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 32.0 bits (73), Expect = 0.051
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
               +++ K   T  +K +K+ K  + K K +KK KKKK+KK+ K + +
Sbjct: 40  STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88



 Score = 32.0 bits (73), Expect = 0.059
 Identities = 16/49 (32%), Positives = 25/49 (51%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K      +K K+  K  + KKK +KK+K KK+KK  K + + +   K  
Sbjct: 48  KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96



 Score = 30.8 bits (70), Expect = 0.13
 Identities = 16/45 (35%), Positives = 23/45 (51%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           E KKKK+K+K+  K + + K   K  K  K+ K K  K K  +  
Sbjct: 71  EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDV 115



 Score = 29.7 bits (67), Expect = 0.33
 Identities = 15/44 (34%), Positives = 25/44 (56%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
           ++ KKKK++KK+ K + + K   +  +K KK KK   K K  + 
Sbjct: 71  EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114



 Score = 27.4 bits (61), Expect = 2.1
 Identities = 15/49 (30%), Positives = 21/49 (42%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KE K + + K   K  KK KK K+K  K K  +       K  +  +K 
Sbjct: 81  KEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129



 Score = 26.6 bits (59), Expect = 4.0
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
           +KE K+ K + +     K  KK K+ K+K  K K  + 
Sbjct: 77  KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 33.9 bits (78), Expect = 0.011
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           E K  KQK++   +++K+      K+    QK   KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 31.2 bits (71), Expect = 0.085
 Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
           RK+ K+K++   + +K+      K+     K +  TKKKK
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVE--TKKKK 196



 Score = 25.4 bits (56), Expect = 8.3
 Identities = 9/38 (23%), Positives = 22/38 (57%)

Query: 85  KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           +K  K++++   +++K+      KK+   +K + KK+K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 33.8 bits (78), Expect = 0.011
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           E K  K  ++   ++ +     ++++  + +KK  KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.4 bits (69), Expect = 0.15
 Identities = 8/38 (21%), Positives = 18/38 (47%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
            K+ K  ++   +  +     K+++  +  KK+ K KK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 28.8 bits (65), Expect = 0.51
 Identities = 6/35 (17%), Positives = 16/35 (45%)

Query: 90  KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K  K  ++   ++        +++  + +KK+ KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 7/36 (19%), Positives = 18/36 (50%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K  K  ++   ++ +     K ++  + ++K+ KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 25.4 bits (56), Expect = 8.0
 Identities = 6/35 (17%), Positives = 18/35 (51%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            K  ++   ++ +     +++   + +KK+ K+KK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|178773 PLN03234, PLN03234, cytochrome P450 83B1; Provisional.
          Length = 499

 Score = 33.9 bits (77), Expect = 0.012
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHI 60
           L  +P+  ++  DE+  V+GD  +   + + I  L  L  VIKE+LRL P  PI + R  
Sbjct: 315 LIKYPEAMKKAQDEVRNVIGD--KGYVSEEDIPNLPYLKAVIKESLRLEPVIPILLHRET 372

Query: 61  TSEIQADKYTIP 72
            ++ +   Y IP
Sbjct: 373 IADAKIGGYDIP 384


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 34.0 bits (78), Expect = 0.013
 Identities = 24/75 (32%), Positives = 30/75 (40%), Gaps = 12/75 (16%)

Query: 50  YPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
           YPA  +IG  I + I   KY              K    KK K +KK    K K  K  K
Sbjct: 722 YPACSVIGNSIDAVIT--KYA----------GTPKTPYEKKTKAKKKSASTKGKAAKTVK 769

Query: 110 TKKKKKKKQKKQKKK 124
            K K K K+  +K+ 
Sbjct: 770 KKSKAKSKKTTKKRA 784


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 33.8 bits (78), Expect = 0.014
 Identities = 14/50 (28%), Positives = 35/50 (70%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K  ++++Q++ Q KK++K+K++++ K      ++ +K ++K++KKQ +K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|177725 PLN00110, PLN00110, flavonoid 3',5'-hydroxylase (F3'5'H);
           Provisional.
          Length = 504

 Score = 33.3 bits (76), Expect = 0.018
 Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 3/68 (4%)

Query: 6   PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHITSEI 64
           P I +  ++E+ QV+G +   V +   + +L  L  + KE+ R +P+ P+ + R  T   
Sbjct: 320 PSILKRAHEEMDQVIGRNRRLVES--DLPKLPYLQAICKESFRKHPSTPLNLPRVSTQAC 377

Query: 65  QADKYTIP 72
           + + Y IP
Sbjct: 378 EVNGYYIP 385


>gnl|CDD|177847 PLN02196, PLN02196, abscisic acid 8'-hydroxylase.
          Length = 463

 Score = 33.0 bits (75), Expect = 0.025
 Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPE-TVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
           LA +P + + V +E + +  D  E    T++  +++ L +RVI+ETLR+        R  
Sbjct: 291 LAENPSVLEAVTEEQMAIRKDKEEGESLTWEDTKKMPLTSRVIQETLRVASILSFTFREA 350

Query: 61  TSEIQADKYTIPIGRK 76
             +++ + Y IP G K
Sbjct: 351 VEDVEYEGYLIPKGWK 366


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 32.5 bits (75), Expect = 0.037
 Identities = 12/49 (24%), Positives = 28/49 (57%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            ++K++ +K +  K+++ +K  K+  + MKK +  KKK+ K    ++  
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 31.7 bits (73), Expect = 0.073
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E    K++ ++ + QK+++ +K  K+ ++  KK + +KKK+ K    K
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK 211



 Score = 30.2 bits (69), Expect = 0.29
 Identities = 14/69 (20%), Positives = 31/69 (44%)

Query: 54  PIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
            I+      +   ++      +KE + +K  K+  +  KK + +K+K+ K    K+   +
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQ 216

Query: 114 KKKKQKKQK 122
             KK  K++
Sbjct: 217 IGKKIDKEE 225



 Score = 26.3 bits (59), Expect = 4.5
 Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 76  KENKKKKQKKKQTKKQKKQ--KKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +  K + QK+++ +K  K+  +  K+ + + KKQ K    K+   +  KK
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220


>gnl|CDD|178550 PLN02966, PLN02966, cytochrome P450 83A1.
          Length = 502

 Score = 32.4 bits (73), Expect = 0.037
 Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 2   LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAP-IIGRHI 60
           L  +P++ ++   E+ + + +   T  T D ++ L     ++KETLR+ P  P +I R  
Sbjct: 316 LMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLPYFRALVKETLRIEPVIPLLIPRAC 375

Query: 61  TSEIQADKYTIPIGRKEN 78
             + +   Y IP G   N
Sbjct: 376 IQDTKIAGYDIPAGTTVN 393


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 32.0 bits (73), Expect = 0.059
 Identities = 14/48 (29%), Positives = 29/48 (60%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E + +K K+K  K   K K ++ KK++  + ++ KK+KK K +++  +
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394



 Score = 30.1 bits (68), Expect = 0.23
 Identities = 14/50 (28%), Positives = 30/50 (60%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R+E +K+ +K K+   +   K K ++ +K +  +  +KKK+KK K +++ 
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392



 Score = 30.1 bits (68), Expect = 0.26
 Identities = 14/64 (21%), Positives = 31/64 (48%)

Query: 59  HITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
             + E         + ++  K K++  K   K K ++ KK++  + +++KK KK K +++
Sbjct: 332 AFSGEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391

Query: 119 KKQK 122
             Q 
Sbjct: 392 GLQN 395



 Score = 27.4 bits (61), Expect = 2.1
 Identities = 14/57 (24%), Positives = 31/57 (54%)

Query: 68  KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
                I  +E  +K+ +K + K  K   K K ++ K ++  + ++KKK+K+ K +++
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 31.7 bits (72), Expect = 0.064
 Identities = 13/50 (26%), Positives = 32/50 (64%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             E K +    ++ + + K++K KQ++ + K++++  + K+K+QK ++KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 25.9 bits (57), Expect = 8.0
 Identities = 9/46 (19%), Positives = 29/46 (63%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK  + K +T   ++ + K ++++  +++ + K+K++  + K+K++
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428


>gnl|CDD|235047 PRK02509, PRK02509, hypothetical protein; Provisional.
          Length = 973

 Score = 31.5 bits (72), Expect = 0.078
 Identities = 25/107 (23%), Positives = 40/107 (37%), Gaps = 35/107 (32%)

Query: 6   PKIQQ------EVYD-EIVQVLGDDPETVPTYDQIQELHLLTRVIK--------ET---- 46
           P IQ+        +D + ++V   DP+   T   +    L  R I+        ET    
Sbjct: 400 PYIQRSIALTRAAFDLDKIEVRTFDPQGNLTAADLAANRLTIRNIRLWDTRPLLETNRQL 459

Query: 47  --LRLY---PAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQT 88
             +RLY   P A I           D+YT+     +N+    +K+Q 
Sbjct: 460 QQIRLYYRFPDADI-----------DRYTLKTENDDNRSISTEKQQV 495


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 31.4 bits (72), Expect = 0.082
 Identities = 10/45 (22%), Positives = 28/45 (62%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           ++ +Q+ K+ +   +++K  ++++K K +   KK+K + QK + +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 31.0 bits (71), Expect = 0.095
 Identities = 12/49 (24%), Positives = 29/49 (59%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K   + ++ ++Q KK++   +K++  ++ +K K    +KK+K + QK +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 30.7 bits (70), Expect = 0.11
 Identities = 12/49 (24%), Positives = 31/49 (63%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +E K K Q ++  ++ KK++   +K++  ++++K K K  +K++K + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 30.7 bits (70), Expect = 0.12
 Identities = 11/48 (22%), Positives = 27/48 (56%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           R E + KK++    K++  ++++K K +  +K++K + +K + Q    
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 8/49 (16%), Positives = 27/49 (55%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +  + ++Q KK+    +K+K  +++++   K  + K+K + ++ + +  
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159



 Score = 27.6 bits (62), Expect = 1.7
 Identities = 11/49 (22%), Positives = 23/49 (46%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E +K  +++++ K +  +KK+K + QK + Q        +    Q  K
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174



 Score = 26.4 bits (59), Expect = 3.6
 Identities = 9/48 (18%), Positives = 30/48 (62%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E+ +K+ ++ + K Q ++ +++ KK++   +K+   ++++K K +  +
Sbjct: 97  EHARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPR 144



 Score = 26.0 bits (58), Expect = 4.7
 Identities = 7/50 (14%), Positives = 23/50 (46%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R+   +K++  ++ +K K +  +K++K + +K +          +    +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQ 171



 Score = 25.7 bits (57), Expect = 6.8
 Identities = 8/48 (16%), Positives = 21/48 (43%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             +K+K  +++ K + K  +KK+K +  K + +        +    + 
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.3 bits (72), Expect = 0.085
 Identities = 9/56 (16%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 73  IGRKENKKKKQKKK----QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           I   E  +++ ++K    +   ++ +K K++ ++K +K ++ + K  ++ +K+ ++
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577



 Score = 28.3 bits (64), Expect = 1.0
 Identities = 13/46 (28%), Positives = 29/46 (63%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+ +Q +K      K  +  + ++++ K  + K+KKKKKQK+++++
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636



 Score = 27.5 bits (62), Expect = 2.3
 Identities = 13/50 (26%), Positives = 29/50 (58%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           I ++  + +K      K  +  + +K+  +  +K++K KKK+K+KQ++ K
Sbjct: 589 IIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 26.7 bits (60), Expect = 4.0
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           QK      +  +  + +K+     +KK KKKKK+K+K+++ K
Sbjct: 597 QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 31.2 bits (71), Expect = 0.086
 Identities = 24/42 (57%), Positives = 32/42 (76%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           +E KK+K+KKK+ KK+KK+KK K++K    K  K KKKKKKK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 26.2 bits (58), Expect = 4.9
 Identities = 24/75 (32%), Positives = 45/75 (60%)

Query: 50  YPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
           Y A       + SE +  +       ++  + ++++K+ KK+KK+ KK++K++K KK+K 
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177

Query: 110 TKKKKKKKQKKQKKK 124
            + K  KK+KK+KKK
Sbjct: 178 VEPKGSKKKKKKKKK 192


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.6 bits (72), Expect = 0.088
 Identities = 15/50 (30%), Positives = 24/50 (48%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              E K+  ++++   K K +  K +K +  KK+KK KK    K KK    
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV 1195



 Score = 30.4 bits (69), Expect = 0.20
 Identities = 19/48 (39%), Positives = 27/48 (56%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E  ++K+  K+ + + K K K  K +K K +KK KKKKK    K KK 
Sbjct: 1145 EEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA 1192



 Score = 29.6 bits (67), Expect = 0.40
 Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 75   RKENKKKKQKKKQTKKQ-KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            ++  K+++ K K   K  K +K K +KK+K KK+    K KK       K+
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200



 Score = 29.2 bits (66), Expect = 0.48
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
             ++   K+Q+ K   K K  K +K K +K +K+KK     K K+     
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196



 Score = 29.2 bits (66), Expect = 0.58
 Identities = 17/48 (35%), Positives = 30/48 (62%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E +++ ++K+  K+Q+ + K K K  K++K K  KK+KKKK+    K 
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKS 1189



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 12/62 (19%), Positives = 25/62 (40%)

Query: 60   ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
             +     +           KKK +K+ +      +KKKK +K+  +K+K   + K+    
Sbjct: 1288 PSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347

Query: 120  KQ 121
            + 
Sbjct: 1348 QS 1349



 Score = 25.4 bits (56), Expect = 9.9
 Identities = 15/47 (31%), Positives = 22/47 (46%)

Query: 78   NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +K+   +     K     KKK KK+        KKKKK ++K  +KK
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKK 1335


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 30.6 bits (69), Expect = 0.15
 Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 8/92 (8%)

Query: 41  RVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKK--------KQKKKQTKKQK 92
           +    TLR      II      E+ A+   I   R   K +         QK++  +++ 
Sbjct: 302 KAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREI 361

Query: 93  KQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            ++ ++++   M + +  +   K K+K     
Sbjct: 362 NREARQERAAAMARARARRAAVKAKKKGLIDA 393


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.1 bits (68), Expect = 0.22
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              + +   +    K K++ K ++K +K K + K K K K K K Q K 
Sbjct: 69  PPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKP 117



 Score = 27.1 bits (60), Expect = 2.7
 Identities = 12/49 (24%), Positives = 22/49 (44%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +    ++Q K  T+ +   +    K ++  K +K  KK K K K + K 
Sbjct: 59  EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKP 107



 Score = 26.7 bits (59), Expect = 3.7
 Identities = 16/49 (32%), Positives = 23/49 (46%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E   +    K  +K K +KK K+ K K K + K K K K + K +K  
Sbjct: 73  PETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPP 121



 Score = 25.5 bits (56), Expect = 8.5
 Identities = 19/53 (35%), Positives = 24/53 (45%)

Query: 70  TIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
             P   K  +K K +KK  K + K K K + K K+K Q K KK   K   K  
Sbjct: 77  PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAP 129



 Score = 25.5 bits (56), Expect = 9.4
 Identities = 14/50 (28%), Positives = 21/50 (42%)

Query: 71  IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
            P  + + +KK +K K   K K + K K K Q   K+  +K   K     
Sbjct: 83  KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAP 132



 Score = 25.5 bits (56), Expect = 9.6
 Identities = 15/46 (32%), Positives = 21/46 (45%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K K++ K + K +K + K K K +   K K   K KK   K   K 
Sbjct: 83  KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKA 128


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.1 bits (69), Expect = 0.27
 Identities = 10/50 (20%), Positives = 35/50 (70%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++EN  +K +  + ++++ +KK+K+ +QK ++ +K +++ ++  ++Q ++
Sbjct: 94  KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.0 bits (61), Expect = 2.5
 Identities = 10/57 (17%), Positives = 35/57 (61%), Gaps = 6/57 (10%)

Query: 74  GRKENKKKKQ------KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            ++E  K +       ++++ + QK +K+  QK++ + ++ +  +K++++ +K++K+
Sbjct: 62  AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 29.6 bits (67), Expect = 0.28
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++ +KK+   K+ +  + K+K    +++KK KKKK  K+KK KK 
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198



 Score = 27.3 bits (61), Expect = 1.8
 Identities = 16/42 (38%), Positives = 23/42 (54%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           KE +  + KKK    ++++KKKK+K  K KK KK      K 
Sbjct: 164 KEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205


>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein. 
          Length = 193

 Score = 29.8 bits (68), Expect = 0.28
 Identities = 10/38 (26%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 86  KQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           KQ K   K ++ + + +++  Q+K K   +K Q++  K
Sbjct: 24  KQYKSMAKMQELQPEIKEI--QEKYKDDPQKLQQEMMK 59


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 30.0 bits (68), Expect = 0.28
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
           KK +K     Q+K   K  +K K ++ +K  KK +  +
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAK 370



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKK-QKKQKMKKQKKTKKKKKKKQKK 120
            ++     Q+K   K  +K K ++ +KK K  +  K  K   K  K 
Sbjct: 335 LRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 15/61 (24%)

Query: 79  KKKKQKKKQTKK---------------QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           KKK  K++  KK                KK +K     Q+    K  +K K ++ +K+ K
Sbjct: 305 KKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAK 364

Query: 124 K 124
           K
Sbjct: 365 K 365


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 29.9 bits (68), Expect = 0.28
 Identities = 11/54 (20%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQK-----KKKQKKQKMKKQKKTKKKKKKKQKK 120
           P+   E + +++ +++ K+ + +K     +KK  ++K KK  K  K ++   ++
Sbjct: 62  PLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.0 bits (68), Expect = 0.28
 Identities = 16/54 (29%), Positives = 31/54 (57%)

Query: 71  IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            P+ +   + KK+K++  +  +  + + +KK+K K++ K KK K+ K  K  KK
Sbjct: 221 KPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 28.8 bits (65), Expect = 0.54
 Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query: 55  IIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKK-QKKQKKKKQKKQKMKKQKKTKKK 113
           +    I      DKY     +K+ + ++  +    + +KK+K K++ K+K  K+ K  K 
Sbjct: 212 LKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKA 271

Query: 114 KKKKQKKQKKK 124
            KK   K  KK
Sbjct: 272 LKKVVAKGMKK 282



 Score = 25.8 bits (57), Expect = 8.2
 Identities = 11/53 (20%), Positives = 24/53 (45%)

Query: 54  PIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
               +  T E      +    ++++K++ +KKK  + +  +  KK   + MKK
Sbjct: 230 SKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 29.2 bits (66), Expect = 0.34
 Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
               +++     K+ +  KK  K  +     K+K K+KK + K+   
Sbjct: 1   MASSRKRLPFEPKRSR--KKGVKALRKAAVAKSKDKQKKPKSKRAAS 45



 Score = 26.1 bits (58), Expect = 3.7
 Identities = 10/39 (25%), Positives = 17/39 (43%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
           R   + K+ +KK  K  +K    K K ++ K + K    
Sbjct: 7   RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45



 Score = 26.1 bits (58), Expect = 4.8
 Identities = 9/49 (18%), Positives = 19/49 (38%), Gaps = 1/49 (2%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
                K+   + K+  ++K  K  ++      K K+ K K K+   +  
Sbjct: 1   MASSRKRLPFEPKR-SRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 29.8 bits (67), Expect = 0.37
 Identities = 7/46 (15%), Positives = 21/46 (45%)

Query: 78   NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            +K+  + ++  +K ++   + +     K  ++  KK   K+  +K 
Sbjct: 1169 DKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 27.5 bits (61), Expect = 2.0
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
            E ++K Q+     +    KK  ++  K    KKT KK 
Sbjct: 1177 EAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 27.1 bits (60), Expect = 2.5
 Identities = 11/53 (20%), Positives = 21/53 (39%)

Query: 63   EIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
            E + DK      + E  ++K ++   + +    KK  ++   K   K   KK 
Sbjct: 1162 EKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 26.7 bits (59), Expect = 3.7
 Identities = 8/35 (22%), Positives = 16/35 (45%)

Query: 90   KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K ++ ++K Q+     +    KK  ++  KK   K
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPK 1208



 Score = 26.7 bits (59), Expect = 4.1
 Identities = 9/45 (20%), Positives = 21/45 (46%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
            +++ K ++ ++K  +   + +    KK   +  KK   KK  K+ 
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.4 bits (66), Expect = 0.38
 Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQ-KKQKMKKQKKTKKKKKKKQKKQKK 123
           K++K+ K   KK K  K KK  KK  ++ +KK  + + +   ++K 
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152



 Score = 28.6 bits (64), Expect = 0.75
 Identities = 14/58 (24%), Positives = 24/58 (41%)

Query: 60  ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
            T      K+      KE K   +K K +K +K  KK   + +K   + + +   +KK
Sbjct: 94  FTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKK 151


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 29.7 bits (66), Expect = 0.42
 Identities = 11/59 (18%), Positives = 38/59 (64%)

Query: 66   ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + +  +P+  +  +K + K +  K++ K +++K++ + +++ K+ +  KK+K+++Q+ +
Sbjct: 1000 SRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIR 1058



 Score = 28.6 bits (63), Expect = 0.84
 Identities = 13/47 (27%), Positives = 27/47 (57%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
             KE  +  Q+ K+ +  KK+K+++Q+ +K       +  KK+ +KK+
Sbjct: 1031 EKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 28.6 bits (63), Expect = 1.0
 Identities = 11/51 (21%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 76   KENKKKKQKKKQTKKQKKQ--KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K+ ++K++ +   + ++++  KK+K+++Q+++K      K+  K++ +KK+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 28.2 bits (62), Expect = 1.3
 Identities = 12/57 (21%), Positives = 35/57 (61%)

Query: 68   KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K  I   R +++++K++ +  ++ K+++  K++K++ ++ +KT     K+  K++ K
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.6 bits (66), Expect = 0.47
 Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 10/92 (10%)

Query: 29  TYDQIQELHLLTRVIKET-LRLYPAAPIIGRHIT------SEIQADKYTIPIGRKENKKK 81
            +D ++  H   +    + L       I  RHI+      SE+  D+Y  PI +  N+ +
Sbjct: 63  VFDDVETKHSRNQSTMASHLNDTQDVGIHARHISSVSTVDSELN-DEYGNPIWK--NRVE 119

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
             K K+ KK+K  KKK+  K ++  +++ ++K
Sbjct: 120 SWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEK 151


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 29.1 bits (65), Expect = 0.48
 Identities = 18/53 (33%), Positives = 24/53 (45%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              +K +KKK  K   + K+  +   K  K+  K  KK  KK  K  KK  KK
Sbjct: 221 SAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK 273



 Score = 28.3 bits (63), Expect = 1.0
 Identities = 19/49 (38%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +K  K     KK  K   K  KK +K  K K  KK  K  KK  KK  K
Sbjct: 229 KKAAKTAVSAKKAAKTAAKAAKKAKKTAK-KALKKAAKAVKKAAKKAAK 276



 Score = 27.9 bits (62), Expect = 1.4
 Identities = 18/50 (36%), Positives = 23/50 (46%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +K + K K   K+  K+K  K     K+  K   K  KK KK  KK  KK
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKK 262



 Score = 27.9 bits (62), Expect = 1.6
 Identities = 20/59 (33%), Positives = 22/59 (37%)

Query: 66  ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           A K      +   K KK  KK  KK  K  KK  KK      K  K   K  + K K K
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296



 Score = 26.4 bits (58), Expect = 4.4
 Identities = 19/59 (32%), Positives = 23/59 (38%)

Query: 66  ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           A K      +   K  K+  K  KK  K+  K   K      K TK K K K+K  KK 
Sbjct: 245 AAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 28.9 bits (65), Expect = 0.48
 Identities = 24/39 (61%), Positives = 27/39 (69%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           E K  KQ  K+  KQKKQKKKK+KK+K K  KK  KKKK
Sbjct: 97  EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.0 bits (60), Expect = 2.2
 Identities = 21/34 (61%), Positives = 25/34 (73%)

Query: 90  KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           KQ  ++K KQKKQK KK+KK KKK  KK  K+KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 25.8 bits (57), Expect = 4.6
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 91  QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           QK  K+  ++K K KKQKK KKKKKKK+  +K  
Sbjct: 98  QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.3 bits (65), Expect = 0.51
 Identities = 8/50 (16%), Positives = 26/50 (52%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + +  KKK ++ +   +  + + +    + +  ++  +  +KK+++ KKK
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 28.6 bits (63), Expect = 1.0
 Identities = 12/50 (24%), Positives = 25/50 (50%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + E  KK  +  + + +    + +  ++K +  +K K++ KKK    KKK
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 28.2 bits (62), Expect = 1.4
 Identities = 11/49 (22%), Positives = 24/49 (48%)

Query: 76   KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +E KKK    K+  ++ K+  +  K +      + +  ++K +  +KKK
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 27.8 bits (61), Expect = 1.7
 Identities = 9/46 (19%), Positives = 23/46 (50%)

Query: 79   KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            KK ++ KK+    KK+ ++ +K  +  K +      + +  ++K +
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 27.8 bits (61), Expect = 1.8
 Identities = 13/50 (26%), Positives = 25/50 (50%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             +      + +   +K +  +KKK++ +K     K K ++KKK  + KKK
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 27.4 bits (60), Expect = 2.3
 Identities = 11/50 (22%), Positives = 25/50 (50%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +     K + +    + +  ++K +  +K K++ K K    KK+ ++KKK
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 27.4 bits (60), Expect = 2.3
 Identities = 15/50 (30%), Positives = 29/50 (58%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              E +  ++K +  +K+K++ KKK    K K ++K K  + KK+ ++ KK
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405



 Score = 27.4 bits (60), Expect = 2.7
 Identities = 9/50 (18%), Positives = 24/50 (48%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + +  KKK ++ + K    +KK ++ K+  +  K   +    + +  ++K
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365



 Score = 27.4 bits (60), Expect = 2.7
 Identities = 11/52 (21%), Positives = 27/52 (51%)

Query: 73   IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              + E +    + +  +++ +  +KK+++ K K     KK ++KK+  + KK
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398



 Score = 26.6 bits (58), Expect = 4.1
 Identities = 10/48 (20%), Positives = 23/48 (47%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E KKK ++ K+     K+K ++ KK     + + +    + +  ++K 
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366



 Score = 26.6 bits (58), Expect = 4.3
 Identities = 10/50 (20%), Positives = 26/50 (52%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +K ++ KK+ ++  KK    KKK ++ +K  +  K + +    + +  ++
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 26.3 bits (57), Expect = 5.8
 Identities = 11/50 (22%), Positives = 26/50 (52%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +K  +  K + +    + +  ++K +  + KK++  KK    K+K ++KK
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391



 Score = 26.3 bits (57), Expect = 6.5
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 79   KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            KK ++ KK  + +KK ++ K+K    KK+ +  KK  +  K + + 
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA 1354



 Score = 26.3 bits (57), Expect = 6.7
 Identities = 14/48 (29%), Positives = 27/48 (56%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E KKK ++ K+  + KK+ ++ +KK    K+K  + KK  +  K + +
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353



 Score = 25.9 bits (56), Expect = 7.3
 Identities = 11/60 (18%), Positives = 26/60 (43%)

Query: 65   QADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +A K      +K  + KK  +    + +    + +  ++  +  + KK++ KK+    KK
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 25.9 bits (56), Expect = 9.2
 Identities = 8/47 (17%), Positives = 23/47 (48%)

Query: 76   KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
            K+  ++ +KK    K+K ++ KK  +    + +    + +  ++K +
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 25.5 bits (55), Expect = 9.8
 Identities = 11/49 (22%), Positives = 24/49 (48%)

Query: 76   KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +E KK  + KK+ ++ KK+    +KK +  K+     K + +    + +
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 29.1 bits (66), Expect = 0.51
 Identities = 11/48 (22%), Positives = 25/48 (52%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           K  + +++     K+QK+  K+++  ++ K      KK K + K+ +K
Sbjct: 242 KAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK 289



 Score = 29.1 bits (66), Expect = 0.52
 Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKK-QKMKKQKKTKKKKKKKQKKQKKK 124
           ++K ++ +Q     ++++K+  K Q+  ++ K    K KK K + K+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKR 286



 Score = 26.1 bits (58), Expect = 6.7
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
           ++    +KQ+K+  K+Q+  ++ K    K KK K   K+ +K
Sbjct: 248 QEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK 289


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 28.7 bits (64), Expect = 0.52
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 101 KQKMKKQKKTKKKKKKKQKK 120
           K+K KK KKTKK KK  +K 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 28.3 bits (63), Expect = 0.58
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKK 98
            ++ KKKK+K K+TKK KK  KK 
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 27.5 bits (61), Expect = 1.2
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKK 101
           K K+KKK++KK KK KK  +K 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 27.5 bits (61), Expect = 1.2
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 106 KQKKTKKKKKKKQKKQKKK 124
           K+KK K KK KK KK  KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119



 Score = 27.2 bits (60), Expect = 1.8
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 96  KKKQKKQKMKKQKKTKKKKKKK 117
           K K+KK+K KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 25.6 bits (56), Expect = 6.4
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 87  QTKKQKKQKKKKQKKQKMKKQKK 109
           +  K+KK+K KK KK K   +K 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 25.2 bits (55), Expect = 7.4
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 93  KQKKKKQKKQKMKKQKKTKKKK 114
           K KKKK+K +K KK KK+ KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 24.8 bits (54), Expect = 9.3
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 102 QKMKKQKKTKKKKKKKQKKQKKK 124
           +K KK+KK  KK KK +K  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
           family, C-terminal domain.  This model describes
           full-length from some species, and the C-terminal region
           only from other species, of the YidC/Oxa1 family of
           proteins. This domain appears to be univeral among
           bacteria (although absent from Archaea). The
           well-characterized YidC protein from Escherichia coli
           and its close homologs contain a large N-terminal
           periplasmic domain in addition to the region modeled
           here [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 181

 Score = 28.6 bits (65), Expect = 0.57
 Identities = 11/39 (28%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 85  KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            KQ K  +K ++ + K +++  Q+K K   +K Q++  K
Sbjct: 22  LKQYKSMRKMQELQPKLKEI--QEKYKDDPQKLQQEMMK 58


>gnl|CDD|166628 PLN02987, PLN02987, Cytochrome P450, family 90, subfamily A.
          Length = 472

 Score = 29.2 bits (65), Expect = 0.57
 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 8/70 (11%)

Query: 10  QEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIG---RHITSEIQA 66
           +E +++I  +  D       +   + +     V+ ETLR+   A IIG   R   ++I+ 
Sbjct: 305 KEEHEKIRAMKSD--SYSLEWSDYKSMPFTQCVVNETLRV---ANIIGGIFRRAMTDIEV 359

Query: 67  DKYTIPIGRK 76
             YTIP G K
Sbjct: 360 KGYTIPKGWK 369


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 29.0 bits (65), Expect = 0.62
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++ K  T+KQK + K+ + +Q  KK KK KK+        K  
Sbjct: 15  EEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSA 57


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 29.2 bits (65), Expect = 0.64
 Identities = 18/51 (35%), Positives = 23/51 (45%)

Query: 74   GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             +K+  KK   KK   K+   KK   KK+  KK    K  +K   KK  KK
Sbjct: 992  MKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKK 1042



 Score = 28.8 bits (64), Expect = 0.74
 Identities = 17/52 (32%), Positives = 23/52 (44%)

Query: 73   IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + +K   KK   KK   K+   KKK  KK   KK  +    KK  +K  +K 
Sbjct: 996  VAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKA 1047



 Score = 26.5 bits (58), Expect = 5.2
 Identities = 13/45 (28%), Positives = 20/45 (44%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
            +K   KK   KK+  K+   KK  +K    K  KK  +K   ++ 
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052



 Score = 26.1 bits (57), Expect = 5.8
 Identities = 14/45 (31%), Positives = 20/45 (44%)

Query: 79   KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            KK   KK   KK+  +K   +K  +    KK  KK  +K   +K 
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052



 Score = 26.1 bits (57), Expect = 7.3
 Identities = 16/50 (32%), Positives = 21/50 (42%)

Query: 75   RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +K   KK   KK   K+K  KK   KK   K   K   KK  ++   +K 
Sbjct: 1003 KKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.6 bits (64), Expect = 0.65
 Identities = 12/47 (25%), Positives = 25/47 (53%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
            +  +++   +   + K++KK++  K K +K+K   + KK K  K K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 27.8 bits (62), Expect = 1.2
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++   +T+++   + K + K+K K++    K +K+K K + KK
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145


>gnl|CDD|213371 cd12837, EcCorA-like_u1, uncharacterized subgroup of the
          Escherichia coli Mg2+ transporter CorA_like subfamily. 
          A uncharacterized subgroup of the Escherichia coli
          CorA-Salmonella typhimurium ZntB_like family
          (EcCorA_ZntB-like) family of the MIT superfamily of
          essential membrane proteins involved in transporting
          divalent cations (uptake or efflux) across membranes.
          The EcCorA_ZntB-like family includes the Mg2+
          transporters Escherichia coli and Salmonella
          typhimurium CorAs, which can also transport Co2+, and
          Ni2+. Structures of the intracellular domain of
          EcCorA_ZntB-like family members, Vibrio
          parahaemolyticus and Salmonella typhimurium ZntB, form
          funnel-shaped homopentamers, the tip of the funnel is
          formed from two C-terminal transmembrane (TM) helices
          from each monomer, and the large opening of the funnel
          from the N-terminal cytoplasmic domains. The GMN
          signature motif of the MIT superfamily occurs just
          after TM1, mutation within this motif is known to
          abolish Mg2+ transport through Salmonella typhimurium
          CorA. Natural variants such as GVN and GIN, such as
          occur in some ZntB family proteins, may be associated
          with the transport of different divalent cations, such
          as zinc and cadmium. The functional diversity of MIT
          transporters may also be due to minor structural
          differences regulating gating, substrate selection, and
          transport.
          Length = 298

 Score = 28.7 bits (65), Expect = 0.66
 Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 5/55 (9%)

Query: 4  LHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGR 58
          L+P  ++E   E   +LG     +PT +++ E+   +R+ +E   LY  AP++ R
Sbjct: 6  LNPTPEEEAAVE--ALLG---IAIPTREEMSEIEASSRLYREGGALYMTAPLLSR 55


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 28.8 bits (64), Expect = 0.74
 Identities = 15/51 (29%), Positives = 29/51 (56%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             K  +K K +  +T K ++++KKK+K++  KK     KK++K  K++   
Sbjct: 543 KMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDG 593


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 28.9 bits (65), Expect = 0.76
 Identities = 10/47 (21%), Positives = 21/47 (44%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +N  K +   Q  K  ++K  +++       ++  K+ K+ Q K K 
Sbjct: 896 QNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942



 Score = 26.6 bits (59), Expect = 4.5
 Identities = 12/60 (20%), Positives = 25/60 (41%), Gaps = 8/60 (13%)

Query: 73  IGRKENKKKKQ--------KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           I  +EN  K Q        K +   +  K  ++K  +++M      ++  K+ ++ Q K 
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.5 bits (64), Expect = 0.78
 Identities = 12/46 (26%), Positives = 31/46 (67%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           E + +++++++ K   K K KK  K K+++++K K++K++K  ++ 
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 27.7 bits (62), Expect = 1.4
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
           ++ +K+++K K   K K +K  K K ++ +K K+ K++K  ++ 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           + ++K++ K +   K K +K  K K ++K K K++K++K  ++ 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84


>gnl|CDD|129295 TIGR00191, thrB, homoserine kinase.  Homoserine kinase is part of
          the threonine biosynthetic pathway.Homoserine kinase is
          a member of the GHMP kinases (Galactokinase, Homoserine
          kinase, Mevalonate kinase, Phosphomevalonate kinase)
          and shares with them an amino-terminal domain probably
          related to ATP binding.P.aeruginosa homoserine kinase
          seems not to be homologous (see PROSITE:PDOC0054)
          [Amino acid biosynthesis, Aspartate family].
          Length = 302

 Score = 28.5 bits (64), Expect = 0.80
 Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 9/70 (12%)

Query: 6  PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQ 65
            + QE  D  ++  G+  E +PT       +L+ +V K  L       +    +T E  
Sbjct: 33 DVVAQESDDTEIEAEGEGVEKIPTEPTD---NLIYQVAKRFLDQLGIR-MPPVKVTLEKN 88

Query: 66 ADKYTIPIGR 75
               IP+GR
Sbjct: 89 -----IPLGR 93


>gnl|CDD|166612 PLN02971, PLN02971, tryptophan N-hydroxylase.
          Length = 543

 Score = 28.8 bits (64), Expect = 0.82
 Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 3/72 (4%)

Query: 6   PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-SEI 64
           P+I  +  +EI +V+G   E       I +L+ +  +I+E  RL+P A     H+  S+ 
Sbjct: 358 PEILHKAMEEIDRVVGK--ERFVQESDIPKLNYVKAIIREAFRLHPVAAFNLPHVALSDT 415

Query: 65  QADKYTIPIGRK 76
               Y IP G +
Sbjct: 416 TVAGYHIPKGSQ 427


>gnl|CDD|169556 PRK08703, PRK08703, short chain dehydrogenase; Provisional.
          Length = 239

 Score = 28.4 bits (63), Expect = 0.86
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 1  MLALHPKIQQEVYDEIVQVLGDDPETVP 28
          ++A H K  ++VYD IV+    +P  + 
Sbjct: 35 LVARHQKKLEKVYDAIVEAGHPEPFAIR 62


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 28.8 bits (64), Expect = 0.88
 Identities = 5/40 (12%), Positives = 22/40 (55%)

Query: 85  KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K   + ++K K+K    +++K +  + + + ++   ++++
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQRE 831



 Score = 28.4 bits (63), Expect = 1.1
 Identities = 9/44 (20%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 80  KKKQKKKQTKKQK--KQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           K   + KQ  K+K    K+ K +   ++ + +    K++++K  
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGI 835



 Score = 27.6 bits (61), Expect = 1.8
 Identities = 10/63 (15%), Positives = 25/63 (39%)

Query: 62  SEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
            +++ +++  P   +E   K  +K  T       K   + ++  K+K    K+ K +   
Sbjct: 758 EDLEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPS 817

Query: 122 KKK 124
            + 
Sbjct: 818 VET 820



 Score = 26.5 bits (58), Expect = 4.8
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           +K+ K +    + + ++   KQ+++K          KKKK  K
Sbjct: 808 DKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850



 Score = 26.5 bits (58), Expect = 5.4
 Identities = 8/40 (20%), Positives = 18/40 (45%)

Query: 84  KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           K     KQK ++K    K+   +    + + ++   KQ++
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQRE 831



 Score = 25.7 bits (56), Expect = 8.3
 Identities = 13/87 (14%), Positives = 27/87 (31%), Gaps = 5/87 (5%)

Query: 43  IKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQ 102
           ++E    YP         +             +   + K++ K++    K+ K +    +
Sbjct: 760 LEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVE 819

Query: 103 KMK-----KQKKTKKKKKKKQKKQKKK 124
                   KQ++ K          KKK
Sbjct: 820 TEGERCTIKQREEKGIDAPAILNVKKK 846


>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
          Length = 256

 Score = 28.6 bits (64), Expect = 0.88
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMK-KQKKTKKKKKKKQKKQ 121
           Q K Q   Q+K    K +  K++ K K TK  +K+K+ ++
Sbjct: 82  QYKSQRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQK 121


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 28.5 bits (63), Expect = 0.90
 Identities = 12/67 (17%), Positives = 42/67 (62%)

Query: 58  RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           R   S  + D Y  P+   +  KK+++  +  K++ ++K ++++++ K+++K +++++++
Sbjct: 557 RGYNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERER 616

Query: 118 QKKQKKK 124
           + ++  K
Sbjct: 617 EAERAAK 623


>gnl|CDD|99959 cd01635, Glycosyltransferase_GTB_type, Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. The structures of the formed
           glycoconjugates are extremely diverse, reflecting a wide
           range of biological functions. The members of this
           family share a common GTB topology, one of the two
           protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility.
          Length = 229

 Score = 28.4 bits (63), Expect = 0.93
 Identities = 11/56 (19%), Positives = 21/56 (37%)

Query: 1   MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPII 56
           ++     +++   D  + + GD PE     + +  L LL RVI            +
Sbjct: 122 LIEAFALLKERGPDLKLVIAGDGPEREYLEELLAALLLLDRVIFLGGLDPEELLAL 177


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 28.4 bits (64), Expect = 0.94
 Identities = 9/41 (21%), Positives = 18/41 (43%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           K   K+  K K+ +++K  K++ K        K + +K   
Sbjct: 10  KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAA 50



 Score = 26.5 bits (59), Expect = 5.1
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
            K   KK+ K KQ +++K  K++ +        K   +K
Sbjct: 9   SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 28.1 bits (63), Expect = 0.95
 Identities = 12/50 (24%), Positives = 20/50 (40%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +  KKKK+KKK+TKK        +         +      +  + +K K
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLK 228



 Score = 25.8 bits (57), Expect = 7.6
 Identities = 12/49 (24%), Positives = 20/49 (40%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E  K + ++++ KK+KK+KKKK KK                  +    
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSG 217


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 28.5 bits (63), Expect = 0.99
 Identities = 17/85 (20%), Positives = 30/85 (35%)

Query: 39  LTRVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKK 98
           L R+++   +  P             +  K+    G +       K K   K K + + +
Sbjct: 218 LKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAE 277

Query: 99  QKKQKMKKQKKTKKKKKKKQKKQKK 123
                   +KK K+  KK  K +KK
Sbjct: 278 ALASATAVKKKAKEVMKKALKMEKK 302


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 28.1 bits (63), Expect = 1.1
 Identities = 12/41 (29%), Positives = 27/41 (65%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           E K K+ +K++ K+++  +KK ++ +  KK +K  +K++ K
Sbjct: 151 ERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 27.3 bits (61), Expect = 2.3
 Identities = 10/40 (25%), Positives = 28/40 (70%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           +K K+ + +++K+++  ++K ++ +  KK +K+ +K+Q K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 28.1 bits (63), Expect = 1.1
 Identities = 17/48 (35%), Positives = 29/48 (60%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              KKK+KKK+    KK K KK   ++ K + K KK +++++ ++K K
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300



 Score = 27.7 bits (62), Expect = 1.7
 Identities = 16/43 (37%), Positives = 24/43 (55%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
              +   +K +KK+KKKK+   K  K KK   K++K + K KK
Sbjct: 246 AAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288



 Score = 27.3 bits (61), Expect = 1.8
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           + N  K +KKK+ KK+   KK K KK   K++K   K KK +++++ +K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQK 297



 Score = 26.2 bits (58), Expect = 6.0
 Identities = 18/48 (37%), Positives = 25/48 (52%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E+  KK          K KKKK+KK+K+  +K   KK   KQ+K + K
Sbjct: 238 EDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVK 285


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 28.3 bits (63), Expect = 1.1
 Identities = 11/23 (47%), Positives = 20/23 (86%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQ 102
           +KK+KKK++K++KK+K ++  KQ
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 27.1 bits (60), Expect = 2.9
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMK---KQKKTKKKKKKKQKKQKKK 124
           RK+ KKKK K+K+ KK ++  K+    + ++   +    +K  +  Q K +KK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736



 Score = 26.0 bits (57), Expect = 6.4
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 104 MKKQKKTKKKKKKKQKKQKK 123
            +K+KK KK K+KK+KK ++
Sbjct: 683 SRKKKKKKKSKEKKKKKNRE 702


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 28.4 bits (64), Expect = 1.1
 Identities = 10/42 (23%), Positives = 21/42 (50%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +Q++ Q    K+   KK+++    K +  K+  KK   K ++
Sbjct: 527 RQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQR 568


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 28.3 bits (63), Expect = 1.2
 Identities = 20/51 (39%), Positives = 40/51 (78%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           G++E K+K+Q K++ KK+K++ K++ K +K K++ K K+  K+K+K+++KK
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 26.8 bits (59), Expect = 3.6
 Identities = 14/51 (27%), Positives = 36/51 (70%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
                +++K+K++  +++KK+K+K +++ K +K K+  K+K+  ++K+K+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150



 Score = 26.0 bits (57), Expect = 6.1
 Identities = 15/50 (30%), Positives = 35/50 (70%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            KE  K+++KKK+ K +++ K +K K++  +K+   +K+K+K++K ++ +
Sbjct: 109 EKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 28.2 bits (63), Expect = 1.2
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E K K + K +T   K+   K +K  + K  +     K+     +KKK
Sbjct: 242 EAKPKAKAKPKTAGIKRPVVKMEKTAE-KGGRPASNGKRFTSPGRKKK 288


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 16/50 (32%), Positives = 27/50 (54%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K  KK     +     K++ K K ++   K   + K+KKK+K+K++KKK
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 13/48 (27%), Positives = 31/48 (64%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            + K+K + K +   +K   + K+KK++ KK++K KK+ ++ +++ +K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354



 Score = 26.5 bits (59), Expect = 3.8
 Identities = 13/40 (32%), Positives = 29/40 (72%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           +  +KKK+KKK+ KK+K+ ++ +++ +K++ Q   K++ K
Sbjct: 327 EVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366



 Score = 25.8 bits (57), Expect = 7.7
 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 76  KENKKKKQKKKQTKKQKKQ----KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K  +K K  K Q K+ KK     +     K+K+K + +   +K   + K+KKK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKK 333



 Score = 25.8 bits (57), Expect = 8.0
 Identities = 13/48 (27%), Positives = 29/48 (60%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +     K+K K   ++  +K   + K+K K++KK +KKKK+ ++ +++
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEER 351


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 28.1 bits (62), Expect = 1.2
 Identities = 13/39 (33%), Positives = 27/39 (69%)

Query: 67  DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMK 105
           +K  +P   K++KK K+K+K+ K++++ K KK++ +  K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.3 bits (60), Expect = 2.4
 Identities = 16/59 (27%), Positives = 33/59 (55%)

Query: 66  ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           AD   +P+ +  N +  +  ++      +KK K+ K+K KK+K+ ++ K KK++ +  K
Sbjct: 168 ADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 26.6 bits (58), Expect = 4.2
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +  + K  + ++ KK  K K KKQ+K K++KKKK+K    +
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 26.6 bits (58), Expect = 4.4
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 70  TIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
              +       +  + K  + ++ +K  K KK+K +K+K+ KKKKKK    +
Sbjct: 251 ANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
           Reviewed.
          Length = 830

 Score = 28.2 bits (63), Expect = 1.3
 Identities = 12/53 (22%), Positives = 25/53 (47%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           PIGRK    +  K+K ++++ +           + ++   +K K K +K + K
Sbjct: 8   PIGRKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGK 60


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 28.1 bits (63), Expect = 1.4
 Identities = 15/48 (31%), Positives = 26/48 (54%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
            EN  K  K K+ +K++  KK K  K+K+K +       KK+++ + K
Sbjct: 192 VENFLKYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPK 239


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 27.6 bits (62), Expect = 1.4
 Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 15/74 (20%)

Query: 56  IGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQ---------KKQKKKKQKKQK--M 104
           I R + SE+      I    ++ +K ++++K+ +K+         K   KK +K+Q+  M
Sbjct: 24  ILRGLLSEL-LQWLLID---RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELM 79

Query: 105 KKQKKTKKKKKKKQ 118
           + QK+      K  
Sbjct: 80  EDQKEMMMDMMKPN 93



 Score = 25.3 bits (56), Expect = 9.8
 Identities = 11/36 (30%), Positives = 24/36 (66%), Gaps = 2/36 (5%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKT--KKKKKKKQKKQK 122
           K +K QK++K+ +++ ++ +K   K   KK +K+Q+
Sbjct: 41  KMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQE 76


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.8 bits (62), Expect = 1.4
 Identities = 15/50 (30%), Positives = 27/50 (54%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           G  E   K   + +  ++ K   K Q  +++KK    +KK++KK+ K+KK
Sbjct: 28  GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 26.7 bits (59), Expect = 3.2
 Identities = 13/46 (28%), Positives = 22/46 (47%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           K  +K K   K    ++ +K   Q+K++ KK+ K KK   +    Q
Sbjct: 41  KAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQ 86


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 13/64 (20%), Positives = 21/64 (32%)

Query: 61  TSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
             +I      +    K+     +     KK K   K    K   KK+ K +    KK +K
Sbjct: 49  PEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108

Query: 121 QKKK 124
           +   
Sbjct: 109 KNAL 112



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 11/48 (22%), Positives = 23/48 (47%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           + +   +++ + K +K   K K K    K++ K   + KKK  +Q  +
Sbjct: 7   KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQ 54



 Score = 26.9 bits (60), Expect = 3.0
 Identities = 14/84 (16%), Positives = 28/84 (33%)

Query: 41  RVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQK 100
            + KE ++    +         ++      +     +  +    KK+TK   K    K  
Sbjct: 32  FITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAP 91

Query: 101 KQKMKKQKKTKKKKKKKQKKQKKK 124
            +K  K +    KK +K+    K 
Sbjct: 92  AKKKLKDELDSSKKAEKKNALDKD 115



 Score = 26.9 bits (60), Expect = 3.3
 Identities = 10/49 (20%), Positives = 18/49 (36%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
                  +KK +T  +    K   KK+   +   +KK +KK    +   
Sbjct: 69  ATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDD 117



 Score = 26.9 bits (60), Expect = 3.4
 Identities = 9/48 (18%), Positives = 22/48 (45%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           K     +++ K+  K+   K K +     ++ K+  + KKK  ++  +
Sbjct: 7   KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQ 54


>gnl|CDD|233383 TIGR01373, soxB, sarcosine oxidase, beta subunit family,
           heterotetrameric form.  This model describes the beta
           subunit of a family of known and putative
           heterotetrameric sarcosine oxidases. Five operons of
           such oxidases are found in Mesorhizobium loti and three
           in Agrobacterium tumefaciens, a high enough copy number
           to suggest that not all members are share the same
           function. The model is designated as subfamily rather
           than equivalog for this reason. Sarcosine oxidase
           catalyzes the oxidative demethylation of sarcosine to
           glycine. The reaction converts tetrahydrofolate to
           5,10-methylene-tetrahydrofolate. The enzyme is known in
           monomeric and heterotetrameric (alpha,beta,gamma,delta)
           forms [Energy metabolism, Amino acids and amines].
          Length = 407

 Score = 27.9 bits (62), Expect = 1.6
 Identities = 13/50 (26%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 3   ALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPA 52
           A+H  + Q    E+V  +G   +   +Y Q   L  L  V+   L ++P 
Sbjct: 278 AVHFYVSQSDKGELV--IGGGIDGYNSYAQRGNLPTLEHVLAAILEMFPI 325


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 14/50 (28%), Positives = 27/50 (54%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++E + KK+K+ +TK  K+  KKK+KK     +       + K+K ++  
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERIS 128



 Score = 25.8 bits (57), Expect = 5.8
 Identities = 14/50 (28%), Positives = 22/50 (44%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            +  KKK+ K K  K+  K+KKKK        +    + KKK ++     
Sbjct: 82  ERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131


>gnl|CDD|234988 PRK01889, PRK01889, GTPase RsgA; Reviewed.
          Length = 356

 Score = 27.6 bits (62), Expect = 1.7
 Identities = 14/52 (26%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 70  TIPIGRKENKKKKQKK------KQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
            +   R ++ +K Q++      K   +  + ++KK KK   K+ KK K+KK+
Sbjct: 306 ELDERRLQSYRKLQREQAYNAAKLAARLARAERKKWKK-ISKQVKKMKRKKR 356


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 27.5 bits (61), Expect = 1.7
 Identities = 9/46 (19%), Positives = 18/46 (39%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           +    ++KQ  +  K    + +    + KM+       K K+K  K
Sbjct: 17  KNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 27.3 bits (61), Expect = 1.7
 Identities = 12/51 (23%), Positives = 38/51 (74%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           GR  + ++K+++K+ +K++K++K++++K+  K   + ++++K+ ++ +K K
Sbjct: 9   GRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAK 59



 Score = 25.3 bits (56), Expect = 6.8
 Identities = 12/46 (26%), Positives = 33/46 (71%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           ++E K++++++K+ K++K+++  K   QK +++K+ ++ +K K K 
Sbjct: 17  KREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62



 Score = 25.0 bits (55), Expect = 9.6
 Identities = 13/51 (25%), Positives = 37/51 (72%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           I  +E +++K+++K+ K++K++K+K+  K  ++K+++ K+ ++ ++ K K 
Sbjct: 12  IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 27.3 bits (61), Expect = 1.7
 Identities = 19/49 (38%), Positives = 30/49 (61%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            E  +K +K+ + K++ K KKKK KK+K K + K   KK  K +K+ +K
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 27.3 bits (61), Expect = 1.9
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+  ++KQK K  KK+ K+KK K K +K  K+    +KK +K+ + K +
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 17/50 (34%), Positives = 33/50 (66%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +KE ++K++ K + KK KK+K K + K+  KK  K++KK +K+ + + + 
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 26.6 bits (59), Expect = 3.6
 Identities = 20/50 (40%), Positives = 31/50 (62%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +KE  ++ +K K+  ++K++ K K+KK K KK K   KK  KK  K +KK
Sbjct: 64  KKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113



 Score = 25.8 bits (57), Expect = 6.3
 Identities = 20/50 (40%), Positives = 30/50 (60%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K  K+ ++K+K   K+KK KKKK K +  K  KK  K +KK +K+ + K
Sbjct: 72  EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 25.4 bits (56), Expect = 8.4
 Identities = 16/48 (33%), Positives = 29/48 (60%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E +K K KKK++KK+K + K K+  +K  K +K  +K+ + + +   K
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127


>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
           component [Carbohydrate transport and metabolism].
          Length = 472

 Score = 27.7 bits (62), Expect = 1.8
 Identities = 9/42 (21%), Positives = 13/42 (30%), Gaps = 1/42 (2%)

Query: 76  KENKKKKQKKKQTK-KQKKQKKKKQKKQKMKKQKKTKKKKKK 116
           K   +   +    K K+ K + K Q        K      KK
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380



 Score = 26.9 bits (60), Expect = 3.3
 Identities = 6/42 (14%), Positives = 12/42 (28%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           K + +        K K+ K + +          K      K+
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380



 Score = 26.5 bits (59), Expect = 4.2
 Identities = 10/45 (22%), Positives = 14/45 (31%), Gaps = 1/45 (2%)

Query: 81  KKQKKKQTKK-QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K  K +     +    K K+ K + K Q        K      KK
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 27.1 bits (61), Expect = 1.8
 Identities = 11/35 (31%), Positives = 25/35 (71%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           K++KK+KKKK++ +  +++KK     +++++ QK 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 27.7 bits (62), Expect = 1.9
 Identities = 16/47 (34%), Positives = 20/47 (42%)

Query: 68  KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
             T    +K N K K KKK+ K       K +  Q+ K   K K KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 26.5 bits (59), Expect = 4.0
 Identities = 16/47 (34%), Positives = 23/47 (48%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              KK  KK   K K +KKKK+       + +T ++ K   KK+ KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 26.1 bits (58), Expect = 5.9
 Identities = 12/46 (26%), Positives = 20/46 (43%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
              +   K   + K + KK+K       K +  ++ K + KKK KK
Sbjct: 348 ATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393


>gnl|CDD|234438 TIGR03999, thiol_BshA, N-acetyl-alpha-D-glucosaminyl L-malate
           synthase BshA.  Members of this protein family are BshA,
           a glycosyltransferase required for bacillithiol
           biosynthesis. This enzyme combines UDP-GlcNAc and
           L-malate to form N-acetyl-alpha-D-glucosaminyl L-malate
           synthase. Bacillithiol is a low-molecular-weight thiol,
           an analog of glutathione and mycothiol, and is found
           largely in the Firmicutes [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Glutathione and
           analogs].
          Length = 374

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 7   KIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRV 42
           ++QQEV  +++ V GD PE  P    ++EL L  RV
Sbjct: 223 RVQQEVPAKLLLV-GDGPERSPAEQLVRELGLTDRV 257


>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
          Length = 259

 Score = 27.3 bits (60), Expect = 2.0
 Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 1/53 (1%)

Query: 63  EIQADKYTIPI-GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
           E+  D Y IP    K  + +      TKK    KK K    K  +  K ++ +
Sbjct: 194 ELNKDIYKIPTQAVKRKQDELHPVSPTKKAALSKKSKWTGTKSSQSSKKRRGR 246


>gnl|CDD|178592 PLN03018, PLN03018, homomethionine N-hydroxylase.
          Length = 534

 Score = 27.7 bits (61), Expect = 2.0
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 5   HPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-SE 63
           +P+I ++   E+ +V+G D   V   D I  L+ L    +ET R++P+A  +  H+   +
Sbjct: 344 NPEILRKALKELDEVVGKD-RLVQESD-IPNLNYLKACCRETFRIHPSAHYVPPHVARQD 401

Query: 64  IQADKYTIPIG 74
                Y IP G
Sbjct: 402 TTLGGYFIPKG 412


>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
           subunit.
          Length = 1044

 Score = 27.6 bits (61), Expect = 2.1
 Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 5/45 (11%)

Query: 74  GRKENKKKKQKKKQTKKQ-----KKQKKKKQKKQKMKKQKKTKKK 113
           GR+   KK +K K+ ++      KK+KKK      M  +K  +KK
Sbjct: 606 GRRGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKK 650


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 27.5 bits (61), Expect = 2.1
 Identities = 13/74 (17%), Positives = 18/74 (24%), Gaps = 7/74 (9%)

Query: 51  PAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
                   H                 +  KK Q  K  K  K +K K       KK   +
Sbjct: 122 NPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPK-------KKGSVS 174

Query: 111 KKKKKKKQKKQKKK 124
            +  K      + K
Sbjct: 175 NRSVKMPGIDPRSK 188


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 27.5 bits (62), Expect = 2.1
 Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            Q+ +Q +++ KQ K +      ++++K K+KKK K+KK K  
Sbjct: 375 LQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVP 417


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 26.8 bits (60), Expect = 2.1
 Identities = 20/54 (37%), Positives = 35/54 (64%), Gaps = 4/54 (7%)

Query: 75  RKENKKKKQKKKQTKKQ----KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R+  K+ K+    TK Q     + ++ KQ+K+K  K+KK ++K++K+Q KQ+KK
Sbjct: 72  RQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKK 125



 Score = 25.3 bits (56), Expect = 7.9
 Identities = 14/39 (35%), Positives = 31/39 (79%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
           + E+++ KQ+KK+  K+KK+++K++K+Q  +++KK K +
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130


>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
          Length = 374

 Score = 27.2 bits (60), Expect = 2.1
 Identities = 13/55 (23%), Positives = 25/55 (45%)

Query: 67  DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
             Y I   +K+ + +    K   +Q+  + K+ K+ K  KQ +  +  + KQ  Q
Sbjct: 11  KIYRITSYKKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAETKQDNQ 65


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 27.5 bits (61), Expect = 2.2
 Identities = 13/46 (28%), Positives = 17/46 (36%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
              K   KQ          ++ K K +K KK     K   KK+KK 
Sbjct: 97  FDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142



 Score = 26.3 bits (58), Expect = 5.3
 Identities = 11/43 (25%), Positives = 18/43 (41%)

Query: 67  DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
                 + + + K +K KKK T  +   KKKK+      +  K
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151



 Score = 26.3 bits (58), Expect = 5.7
 Identities = 14/65 (21%), Positives = 22/65 (33%)

Query: 60  ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
           I  ++  D         +N  K+          ++ K K +K K K        KKKK+ 
Sbjct: 83  IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142

Query: 120 KQKKK 124
              K 
Sbjct: 143 LSSKD 147


>gnl|CDD|223016 PHA03233, PHA03233, DNA packaging protein UL32; Provisional.
          Length = 518

 Score = 27.4 bits (61), Expect = 2.2
 Identities = 7/31 (22%), Positives = 16/31 (51%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
           K ++   KK+K    +++  + +KK  +  K
Sbjct: 219 KIRRFLQKKKKPAAAEEELLRLLKKHTRPSK 249


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 27.1 bits (60), Expect = 2.2
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 87  QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           +  K  ++K KK+K +  K++KK KK+KKKK+K+ 
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 27.3 bits (61), Expect = 2.2
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query: 59  HITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQ 102
             T+   A     P     ++K+K  KK  KK    KK+KQK+ 
Sbjct: 205 DKTAASSASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 25.4 bits (56), Expect = 9.8
 Identities = 6/30 (20%), Positives = 16/30 (53%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
           ++    +K+K  KK  K+    +K+++ + 
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 27.3 bits (60), Expect = 2.2
 Identities = 17/66 (25%), Positives = 33/66 (50%)

Query: 58  RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
           + +  E Q  KY+      ++  K + KK+   +++++KK +      KQKK  KK K  
Sbjct: 504 KELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYS 563

Query: 118 QKKQKK 123
             K+++
Sbjct: 564 NAKKEE 569



 Score = 25.8 bits (56), Expect = 7.9
 Identities = 20/48 (41%), Positives = 27/48 (56%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +E KK K      K++K  KK K    K ++Q +  KKKKK+  KQKK
Sbjct: 539 EEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKK 586



 Score = 25.4 bits (55), Expect = 10.0
 Identities = 19/49 (38%), Positives = 24/49 (48%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+ K      KQ K  KK K    KK++  +  K KKK+  KQKK   K
Sbjct: 542 KKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSK 590


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.3 bits (61), Expect = 2.3
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 66  ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKM 104
           A  Y I +G+K+  KK+++ K+ KK+ +    K + +K 
Sbjct: 114 AKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKK 152


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 27.2 bits (60), Expect = 2.4
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 80  KKKQKKKQTKKQKKQ---KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            K Q  K +K+Q+K    +   + K   KKQ + + K+ K  KK  K+
Sbjct: 417 TKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKE 464


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 27.1 bits (60), Expect = 2.8
 Identities = 9/25 (36%), Positives = 13/25 (52%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQ 102
            K+K  +K  + K KK  KK Q + 
Sbjct: 358 RKRKGDRKGVSHKAKKGGKKNQAET 382



 Score = 26.3 bits (58), Expect = 5.3
 Identities = 9/31 (29%), Positives = 15/31 (48%)

Query: 88  TKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
            K +  +K+K  +K    K KK  KK + + 
Sbjct: 352 KKAEAARKRKGDRKGVSHKAKKGGKKNQAET 382



 Score = 25.6 bits (56), Expect = 8.6
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 90  KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
            +K +  +K+K  +     K KK  KK Q + 
Sbjct: 351 PKKAEAARKRKGDRKGVSHKAKKGGKKNQAET 382


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 27.1 bits (60), Expect = 2.8
 Identities = 13/46 (28%), Positives = 17/46 (36%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           K  K K  +   T     QKK    + K++ Q K     K   KK 
Sbjct: 217 KSKKAKTDRDVSTSTAASQKKSSDLESKLEAQSKELWSLKDDLKKH 262


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 26.7 bits (59), Expect = 2.9
 Identities = 14/34 (41%), Positives = 21/34 (61%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
           ++K  T   +K+KK K KK K+K +K   KKK +
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 26.4 bits (58), Expect = 2.9
 Identities = 14/49 (28%), Positives = 25/49 (51%)

Query: 58  RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
           R  +S  QAD  ++ I     K KK   ++ +K+  QKK+K+    + +
Sbjct: 32  RSNSSLNQADFPSVSIVGLNKKPKKISAEEAEKKLLQKKEKKALTNVLR 80


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 13/44 (29%), Positives = 24/44 (54%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +K+ K+  +      KKKK+KK+     KK +++ KK  +  +K
Sbjct: 37  EKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEK 80



 Score = 26.5 bits (59), Expect = 4.5
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 85  KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K+   K+ ++ +K+ K+         KKKK+KK+ K   K
Sbjct: 26  KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIK 65



 Score = 26.1 bits (58), Expect = 6.1
 Identities = 11/52 (21%), Positives = 28/52 (53%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           I   ++      KK+ +K++ +   K+ ++++KK  +  +K KK+  ++K  
Sbjct: 40  IKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSS 91


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 13/45 (28%), Positives = 28/45 (62%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +K K+ +K  K+ +K+ ++ Q+   MKK KK ++K+ +    Q++
Sbjct: 72  EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 27.1 bits (60), Expect = 3.0
 Identities = 8/40 (20%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKM-KKQKKTKKK 113
           + ++K ++ ++ + K++KK ++ +++ +K  +K ++ KK 
Sbjct: 16  QAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55



 Score = 26.7 bits (59), Expect = 3.4
 Identities = 9/42 (21%), Positives = 28/42 (66%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +K+ ++K Q+ ++ + ++++K ++ ++  KK  +K ++ KK 
Sbjct: 14  EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 26.9 bits (60), Expect = 3.0
 Identities = 18/49 (36%), Positives = 22/49 (44%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           +EN   K      K  KKQ  KK+   K+ K      K   K KK+KKK
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKK 540



 Score = 26.9 bits (60), Expect = 3.7
 Identities = 17/53 (32%), Positives = 23/53 (43%)

Query: 72  PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           P  +  +   K  KKQ  K+K   K  +   K+ K     KKKKKK+K     
Sbjct: 495 PWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLD 547


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 26.7 bits (59), Expect = 3.1
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           ++ ++KK  +  KK+ +      +  KQKK KK KKKK  K
Sbjct: 64  RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 92  KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK+K +++   ++ KQ K KKK  KK++K   K
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKIYFK 37



 Score = 25.4 bits (56), Expect = 8.0
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 89  KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           KK+K +++   +  K  K KK   KKK+K   ++
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 26.5 bits (59), Expect = 3.1
 Identities = 23/46 (50%), Positives = 35/46 (76%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           ++E KK+K ++K  KK+ K++KKKQKK+K KK KK  KK++K+  K
Sbjct: 71  KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116



 Score = 25.4 bits (56), Expect = 6.8
 Identities = 18/49 (36%), Positives = 37/49 (75%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++ KK+ + ++  +K++++K+K ++K   K+ K+ KKK+KKK+KK+ KK
Sbjct: 57  EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 26.9 bits (60), Expect = 3.2
 Identities = 8/48 (16%), Positives = 26/48 (54%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E   + + + +  K++ +K +++ +Q   + +K +++ +K + K   K
Sbjct: 88  EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135



 Score = 26.9 bits (60), Expect = 3.2
 Identities = 8/47 (17%), Positives = 27/47 (57%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
             + + ++ + + +K ++K +Q + +++K+++  +K K K   +  K
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWK 138



 Score = 26.1 bits (58), Expect = 5.5
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            + KK+ KK  +K ++ + + +KK++  ++ K K   K  +K  KK 
Sbjct: 98  EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 27.1 bits (60), Expect = 3.2
 Identities = 13/61 (21%), Positives = 34/61 (55%), Gaps = 4/61 (6%)

Query: 65  QADKYTIPIGRKENKKKKQKKKQTKKQKK----QKKKKQKKQKMKKQKKTKKKKKKKQKK 120
             ++  + I +    +K+ ++K    +K     +K KK+ +Q+M++ K+ ++ KK + +K
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEK 568

Query: 121 Q 121
           +
Sbjct: 569 E 569


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 26.9 bits (60), Expect = 3.3
 Identities = 13/51 (25%), Positives = 18/51 (35%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           G  E   K+  +K   K K +   K        Q+K   K  +K KK    
Sbjct: 438 GTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488



 Score = 26.9 bits (60), Expect = 3.3
 Identities = 11/47 (23%), Positives = 17/47 (36%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
            E    K K K   K        Q+K  +K  +K KK     ++ + 
Sbjct: 448 CEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFET 494



 Score = 26.6 bits (59), Expect = 4.8
 Identities = 8/46 (17%), Positives = 16/46 (34%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K K   K        Q+K   K  +  K+     K+ +    + ++
Sbjct: 456 KNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQE 501


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 25.9 bits (57), Expect = 3.3
 Identities = 18/51 (35%), Positives = 25/51 (49%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           G K +KK+   K + K+ K +K      Q    +  T+K K KK KK  KK
Sbjct: 15  GCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 26.4 bits (59), Expect = 3.3
 Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 11/57 (19%)

Query: 79  KKKKQKKKQTKKQKKQKKKKQ-----------KKQKMKKQKKTKKKKKKKQKKQKKK 124
           KK K+ KK+ +KQ+KQ +K             ++ K +K ++ ++  +++Q + ++K
Sbjct: 15  KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 26.9 bits (59), Expect = 3.4
 Identities = 10/47 (21%), Positives = 26/47 (55%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           +E+ K+ Q+ K+   +K+    K +++    Q    K++ + ++KQ+
Sbjct: 209 QEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255



 Score = 25.7 bits (56), Expect = 7.5
 Identities = 13/50 (26%), Positives = 24/50 (48%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           R +  K++  KKQ    K Q+K    +    KQ+   ++K+++ K   K 
Sbjct: 214 RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.5 bits (59), Expect = 3.6
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQK 100
            +    +KKKQ KKQ++++ K+ +
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELR 199



 Score = 25.4 bits (56), Expect = 9.2
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 71  IPIGRKENKKKKQKKKQTKKQKKQKK 96
           +P    +++KKKQ+KKQ +K+ K+ +
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELR 199


>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
          Length = 490

 Score = 26.7 bits (59), Expect = 3.7
 Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 3/74 (4%)

Query: 6   PKIQQEVYDE---IVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITS 62
           PK  QE+ +E   I +      E+   ++  +++     VI ETLRL      + R    
Sbjct: 310 PKAVQELREEHLEIARAKKQSGESELNWEDYKKMEFTQCVINETLRLGNVVRFLHRKALK 369

Query: 63  EIQADKYTIPIGRK 76
           +++   Y IP G K
Sbjct: 370 DVRYKGYDIPSGWK 383


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 26.6 bits (59), Expect = 3.8
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 82  KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
               +  + +KK++ + +KK++M++ K  +KK
Sbjct: 82  ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 25.9 bits (57), Expect = 7.0
 Identities = 11/55 (20%), Positives = 29/55 (52%)

Query: 69  YTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
           +      ++ K+      + ++ KK+++ + KK+K  ++ K  +KK  +  K+K+
Sbjct: 68  FLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 26.8 bits (59), Expect = 3.9
 Identities = 9/38 (23%), Positives = 22/38 (57%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
           N+++       + Q+K  +K+ +K +M+ ++K   KK+
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE 42


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 26.6 bits (59), Expect = 4.0
 Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 12/60 (20%)

Query: 77   ENKKKKQKKKQTKKQKKQKKKKQKKQKMK----KQKK--------TKKKKKKKQKKQKKK 124
            E + + Q++K+   Q + +  K+KK  ++     Q+K        +  KK+K +K+ K  
Sbjct: 1519 EIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 26.5 bits (58), Expect = 4.3
 Identities = 19/48 (39%), Positives = 21/48 (43%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E KK   KK  T K+  + KK  K    K  KK  K KK K  K   K
Sbjct: 14  EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTK 61


>gnl|CDD|223778 COG0706, YidC, Preprotein translocase subunit YidC [Intracellular
           trafficking and secretion].
          Length = 314

 Score = 26.3 bits (58), Expect = 4.3
 Identities = 9/40 (22%), Positives = 24/40 (60%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
             +K T+   K ++ + K ++++++ K   K+K++Q+  K
Sbjct: 128 LSQKSTRSMAKMQELQPKIKEIQEKYKGTDKQKQQQEMMK 167


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 26.1 bits (58), Expect = 4.5
 Identities = 15/50 (30%), Positives = 38/50 (76%), Gaps = 1/50 (2%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK-KKKKKQKKQKKK 124
           KE KKK+++  + ++   + ++++ K++ +K+K+T+K ++KK +++QK+K
Sbjct: 79  KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128


>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767).  This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 112 and 199 amino
           acids in length.
          Length = 119

 Score = 25.7 bits (57), Expect = 4.7
 Identities = 8/31 (25%), Positives = 20/31 (64%)

Query: 94  QKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           + ++++  Q MKK  +   +KK K+ ++K++
Sbjct: 89  RYRRRKSFQGMKKAVEAMAEKKAKKLREKEE 119


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 26.3 bits (58), Expect = 4.7
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 95  KKKKQKKQKMKKQKKTKKK 113
           K+K +K   M   KK K+K
Sbjct: 332 KEKNKKVPVMFLGKKPKEK 350



 Score = 25.9 bits (57), Expect = 7.8
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 92  KKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
            K+K KK     + K+ K K+ + K Q
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQ 357


>gnl|CDD|222806 PHA00645, PHA00645, hypothetical protein.
          Length = 125

 Score = 25.8 bits (56), Expect = 4.9
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
           +K     K   Q   KQ+ ++ KK++  + K K K+
Sbjct: 87  RKGINVGKGVYQNSNKQQVKREKKRQSIRAKAKNKR 122


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 26.3 bits (58), Expect = 5.0
 Identities = 14/48 (29%), Positives = 33/48 (68%)

Query: 71  IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
           +P   +E +K+++K+++ K   K+++  ++++K +K+KKTKK K+   
Sbjct: 35  VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 26.1 bits (58), Expect = 5.0
 Identities = 9/41 (21%), Positives = 20/41 (48%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
           +E  + K+     K++ K+   ++KK      +  +K +KK
Sbjct: 89  EELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129



 Score = 25.8 bits (57), Expect = 6.7
 Identities = 9/44 (20%), Positives = 20/44 (45%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           K  ++    K+     K++ K+   ++KK      +  +K +KK
Sbjct: 86  KSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 25.7 bits (57), Expect = 5.2
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
              KK   KKK  K   K+ K +++  ++K+  K  K  KK 
Sbjct: 3   FLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKL 44


>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase.  The valyl-tRNA synthetase
           (ValS) is a class I amino acyl-tRNA ligase and is
           particularly closely related to the isoleucyl tRNA
           synthetase [Protein synthesis, tRNA aminoacylation].
          Length = 861

 Score = 26.2 bits (58), Expect = 5.5
 Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 11/111 (9%)

Query: 14  DEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQADKYTIPI 73
            E  + L  +   +       E+ ++    + T  +    P     I           P+
Sbjct: 760 AETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGFEIII-----------PV 808

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
               NK K+  + Q +  K++K+  + + K++ +   KK  K+  +K+K+K
Sbjct: 809 KGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKEKEK 859


>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 260 amino acids in length.
          Length = 242

 Score = 26.0 bits (57), Expect = 5.9
 Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 7/59 (11%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQK-------KQKMKKQKKTKKKKKKKQKKQKKK 124
           I RKE    K+     K++KK+KK   K       K KMK  K  +K    K  ++ + 
Sbjct: 150 IHRKEKDDNKENINLFKQKKKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKYEH 208


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 26.0 bits (57), Expect = 6.2
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
           ++E + KKQ+ +Q K +  +KKKK KK+K+  + K  KK
Sbjct: 49  KREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKK 87


>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
          Length = 312

 Score = 25.8 bits (57), Expect = 6.3
 Identities = 17/52 (32%), Positives = 27/52 (51%)

Query: 73  IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           I R +NK  KQK++     +    +K +K K K     KKKKK ++  ++K 
Sbjct: 114 IKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEKS 165


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 25.9 bits (57), Expect = 6.4
 Identities = 11/43 (25%), Positives = 16/43 (37%)

Query: 78  NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
             K+   ++  K  K         +K  K K   KK  KK +K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 26.1 bits (58), Expect = 6.4
 Identities = 7/57 (12%), Positives = 15/57 (26%)

Query: 68  KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            YT    +++  K + +  +    K  +K K    +               K     
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVS 819


>gnl|CDD|235325 PRK04974, PRK04974, glycerol-3-phosphate acyltransferase;
           Validated.
          Length = 818

 Score = 26.0 bits (58), Expect = 6.5
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%)

Query: 11  EVYDEIVQVL--------GDDPETVPTYDQIQELHLLTRVIKETLRLY 50
           EV D+I+  L         DD   +    + +EL LL   ++ETL+ Y
Sbjct: 668 EVIDQIIDELVRQGLITLDDDELHINP-PRSRELQLLAAGVRETLQRY 714


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 25.9 bits (57), Expect = 6.5
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 83  QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
             + Q   QK + KKK+KK+K       + K+KKK+K  
Sbjct: 3   NAQGQGNNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGP 41


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 26.0 bits (58), Expect = 6.6
 Identities = 8/14 (57%), Positives = 11/14 (78%)

Query: 107 QKKTKKKKKKKQKK 120
           Q+K+KK KK K+ K
Sbjct: 342 QRKSKKYKKNKKNK 355


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 25.8 bits (57), Expect = 6.7
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
                 ++    + K+K++  KKQKK+K KK K T
Sbjct: 342 SPPAAFERPLALSPKRKREGDKKQKKKKSKKLKLT 376


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 26.2 bits (57), Expect = 6.9
 Identities = 10/41 (24%), Positives = 22/41 (53%)

Query: 81  KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
           K+ KK   + ++K        ++ K +KK +K++K K+  +
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDE 148


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 25.7 bits (57), Expect = 6.9
 Identities = 10/31 (32%), Positives = 22/31 (70%)

Query: 90  KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
            Q  +++K++KK   K++KK  K++K+K ++
Sbjct: 91  YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 25.9 bits (57), Expect = 7.3
 Identities = 16/47 (34%), Positives = 22/47 (46%)

Query: 76  KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
           K +KKKK K   TK    +     ++   KK KK +KK + K  K  
Sbjct: 418 KGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVP 464


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 25.6 bits (56), Expect = 7.7
 Identities = 10/51 (19%), Positives = 25/51 (49%)

Query: 74  GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
            R   +K+K++ + ++K K+ ++KKQ++++   + K           Q   
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGL 211


>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
           Provisional.
          Length = 599

 Score = 25.9 bits (57), Expect = 8.0
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQ 107
           K++QKK Q    +K K KK   Q  +K+
Sbjct: 57  KERQKKLQQAALEKSKPKKISDQARRKK 84


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 25.8 bits (57), Expect = 8.1
 Identities = 10/48 (20%), Positives = 16/48 (33%)

Query: 77  ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           E + ++ K    KK   +       +  K   K    KK   KK   +
Sbjct: 812 EFEPREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAAR 859



 Score = 25.8 bits (57), Expect = 9.2
 Identities = 10/46 (21%), Positives = 15/46 (32%)

Query: 75  RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
           R+      +K           K ++   K    KKT  KK   +K 
Sbjct: 816 REPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861


>gnl|CDD|198457 cd10314, FAM20_C, C-terminal putative kinase domain of FAM20
           (family with sequence similarity 20) proteins.  This
           family contains the C-terminal domain of FAM20A, -B, -C
           and related proteins. FAM20A may participate in enamel
           development and gingival homeostasis, FAM20B in
           proteoglycan production, and FAM20C in bone development.
           FAM20B is a xylose kinase that may regulate the number
           of glycosaminoglycan chains by phosphorylating the
           xylose residue in the glycosaminoglycan-protein linkage
           region of proteoglycans. FAM20C, also called Dentin
           Matrix Protein 4, is abundant in the dentin matrix, and
           may participate in the differentiation of mesenchymal
           precursor cells into functional odontoblast-like cells.
           Mutations in FAM20C are associated with lethal
           Osteosclerotic Bone Dysplasia (Raine Syndrome), and
           mutations in FAM20A with Amelogenesis imperfecta (AI)
           and Gingival Hyperplasia Syndrome. The C-terminal
           domains of members of this family are putative kinase
           domains, based on mutagenesis of the C-terminal domain
           of Drosophila Four-Jointed, a related Golgi kinase. This
           domain family is also known as DUF1193.
          Length = 209

 Score = 25.3 bits (56), Expect = 8.3
 Identities = 8/32 (25%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 29  TYDQIQELH----LLTRVIKETLRLYPAAPII 56
           T+ ++Q L      L+ +++E+L   P +P++
Sbjct: 151 TWLRLQLLAKGGGSLSDLLRESLSHDPLSPVL 182


>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 329

 Score = 25.5 bits (56), Expect = 8.4
 Identities = 7/31 (22%), Positives = 20/31 (64%)

Query: 87  QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
            T+ +++ K++  ++Q+ K+Q + K++K   
Sbjct: 297 TTEVREQTKRQTVQRQQPKRQSRAKRQKGGA 327


>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus. 
           The RuvB protein makes up part of the RuvABC revolvasome
           which catalyzes the resolution of Holliday junctions
           that arise during genetic recombination and DNA repair.
           Branch migration is catalyzed by the RuvB protein that
           is targeted to the Holliday junction by the structure
           specific RuvA protein. This family contains the
           N-terminal region of the protein.
          Length = 231

 Score = 25.5 bits (57), Expect = 8.9
 Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 2/20 (10%)

Query: 33  IQELHLLTRVIKETLRLYPA 52
           I E+H L R ++E   LYPA
Sbjct: 107 IDEIHRLNRAVEEI--LYPA 124


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 25.4 bits (56), Expect = 9.0
 Identities = 9/45 (20%), Positives = 27/45 (60%)

Query: 80  KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
           ++K+K+ +  +Q   +++K+   K+++ +K  +  +K+ +K K  
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSN 371


>gnl|CDD|216690 pfam01767, Birna_VP3, Birnavirus VP3 protein.  VP3 is a minor
           structural component of the virus. The large RNA segment
           of birnaviruses codes for a polyprotein
           (N-VP2-VP4-VP3-C).
          Length = 229

 Score = 25.3 bits (55), Expect = 9.4
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 10  QEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIK 44
              YDE+  V  ++    P  +Q+++L LL R +K
Sbjct: 193 DAFYDEVAAVYAENQGRGPNQEQMRDLRLLARRMK 227


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 25.7 bits (57), Expect = 9.8
 Identities = 10/37 (27%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 86  KQTKKQKKQKKKKQKKQKMKKQKKTKK--KKKKKQKK 120
           ++ KK KK  +K+ +  ++   KK K+     +KQ+ 
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.126    0.331 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,070,789
Number of extensions: 566546
Number of successful extensions: 15173
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8627
Number of HSP's successfully gapped: 2651
Length of query: 124
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 39
Effective length of database: 7,167,512
Effective search space: 279532968
Effective search space used: 279532968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.0 bits)