RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1374
         (263 letters)



>gnl|CDD|214980 smart01024, BCS1_N, This domain is found at the N terminal of the
           mitochondrial ATPase BSC1. It encodes the import and
           intramitochondrial sorting for the protein. 
          Length = 170

 Score =  202 bits (517), Expect = 3e-66
 Identities = 80/150 (53%), Positives = 109/150 (72%), Gaps = 2/150 (1%)

Query: 82  RRHYMITLEIPCRDKSYHWLLHWITVRGA-KKTQHLSVETSFEKFDTGYVKTKYDFIPSI 140
           RR +++TLE+P +D+SY WLL W++ +   ++++HLSVETS+ + D G   T + F+P  
Sbjct: 22  RRRFLVTLEVPSKDESYAWLLEWLSQQPFRQRSRHLSVETSYTQHDNGKSSTSFSFVPGP 81

Query: 141 GTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFNILEEARTLALKQ 200
           GTH F Y+G WI V R RE T  D++ G P+ET+TLT  GRDR +F  +LEEAR LALK+
Sbjct: 82  GTHWFWYKGRWIWVTREREKTMADMRTGSPFETLTLTTLGRDRDVFKELLEEARELALKR 141

Query: 201 YEGKTVMYTALGSEWRPFGHPKRRRPLNSV 230
            EGKTV+YTA G EWR F  P+R+RPL+SV
Sbjct: 142 TEGKTVIYTADGPEWRRF-APRRKRPLSSV 170


>gnl|CDD|219997 pfam08740, BCS1_N, BCS1 N terminal.  This domain is found at the N
           terminal of the mitochondrial ATPase BSC1. It encodes
           the import and intramitochondrial sorting for the
           protein.
          Length = 179

 Score =  156 bits (397), Expect = 6e-48
 Identities = 67/161 (41%), Positives = 96/161 (59%), Gaps = 15/161 (9%)

Query: 82  RRHYMITLEIPCRDKSYHWLLHWITVRG-AKKTQHLSVET----------SFEKFDTGYV 130
           RR++  TLE+   D+SY+W+L W++ +  A++++ L   T          S E       
Sbjct: 22  RRYFTSTLEVRSDDESYNWVLEWLSQQPFAQRSRRLVASTSLNSRRDSHLSVETSSAQEK 81

Query: 131 KTKYDFIPSIGTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFNIL 190
           K  + F+P  GTH F Y+G  I  ER RE+ + D+    P+ET+TLT  GR R +   +L
Sbjct: 82  KAPFRFVPGFGTHWFRYKGRLIWFERERETKSYDLS--TPFETLTLTTLGRSRQILKELL 139

Query: 191 EEARTLALKQYEGKTVMYTALGSE-WRPFGHPKRRRPLNSV 230
           EEAR LALK+ EGKTV+Y A GSE WR  G P+ +RPL++V
Sbjct: 140 EEARALALKKDEGKTVIYRADGSEGWRR-GAPRPKRPLSTV 179


>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
           modification, protein turnover, chaperones].
          Length = 596

 Score = 47.7 bits (114), Expect = 3e-06
 Identities = 20/147 (13%), Positives = 36/147 (24%), Gaps = 12/147 (8%)

Query: 90  EIPCRDKSYHWLLHWITVRGAKKTQHLSVE--------TSFEKFDTGYVKTKYDFIPSIG 141
            +P    S    +      GA + + L  +           ++ D    +         G
Sbjct: 208 GVPFFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDEIDAVGRQRG--AGLGGG 265

Query: 142 THLFSYQGNWIRVERSR--ESTTLDIQQGIPWETVTLTAFGRDRSLFFNILEEARTLALK 199
                   N + VE      +  + +        V   A  R       IL E   +  +
Sbjct: 266 NDEREQTLNQLLVEMDGFGGNEGVIVIAATNRPDVLDPALLRPGRFDRQILVELPDIKGR 325

Query: 200 QYEGKTVMYTALGSEWRPFGHPKRRRP 226
           +   K        +E        R  P
Sbjct: 326 EQILKVHAKNKPLAEDVDLKKIARGTP 352



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 6   LTILDPALVRPGRVD 20
             +LDPAL+RPGR D
Sbjct: 298 PDVLDPALLRPGRFD 312


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 38.8 bits (91), Expect = 0.002
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 49  KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           K+    KKK+  ++ +K+K KKK+KKK  K+K + 
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 36.9 bits (86), Expect = 0.007
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           KK++  + +K+KK KK+KKK K +KK  R
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 32.2 bits (74), Expect = 0.26
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           + +KK   R+KKKK+ KK KK+KK  +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 29.6 bits (67), Expect = 2.0
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKK 74
            Q +KKK  R+KK KK KKK +K
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 29.6 bits (67), Expect = 2.0
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 56  KKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           K+K  +  KKK+ +KK +K+K KKK ++
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKK 403



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 61  RRKKKKKTKKKKKKKKTKKKNRRH 84
           ++K+++K  +K+KKKK +KK  + 
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK 407


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 36.1 bits (84), Expect = 0.017
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 54  AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +KK    R +KK+K K+K K KK  +  + 
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 35.7 bits (83), Expect = 0.019
 Identities = 10/33 (30%), Positives = 19/33 (57%)

Query: 51  ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +++   K + +   K+  KK+K+K+K K K R 
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 35.3 bits (82), Expect = 0.023
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 48  SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           SK     + +K  + K+K K KK+ +  K   K R+
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 35.3 bits (82), Expect = 0.027
 Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNRRHY 85
            KK   +R +KK+ +K+K K K + ++ ++ 
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422



 Score = 34.5 bits (80), Expect = 0.052
 Identities = 14/56 (25%), Positives = 28/56 (50%)

Query: 29  SRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
            R+  E +  R   ++   +K   + K  K  ++   K+ +KK+K+K+  K  +RH
Sbjct: 361 ERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 34.1 bits (79), Expect = 0.057
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           A       K  K    K+ +K +K+K+K K KK++R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 31.5 bits (72), Expect = 0.45
 Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 46  NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
              K + +   K+  +++K+K+  K KK+ +  K 
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 31.5 bits (72), Expect = 0.50
 Identities = 7/35 (20%), Positives = 16/35 (45%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           A +   + K+K+  + KK+ +  K   K++     
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 30.3 bits (69), Expect = 1.00
 Identities = 9/35 (25%), Positives = 17/35 (48%)

Query: 48  SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           +K A++ +K+K   + KK+    K   K+ K    
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 7/29 (24%), Positives = 16/29 (55%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           + +K+K   +K+ + TK   K++K    +
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 27.6 bits (62), Expect = 7.3
 Identities = 4/29 (13%), Positives = 12/29 (41%)

Query: 54  AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
            K+K   +++ +      K++K +     
Sbjct: 406 EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 35.3 bits (82), Expect = 0.022
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 1   MSTQALTILDPALVRPGRVDLK 22
           M+T    ILDPAL+RPGR D K
Sbjct: 295 MATNRPDILDPALLRPGRFDRK 316


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 35.0 bits (81), Expect = 0.022
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 3/47 (6%)

Query: 34  EQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           E++ K+   ++D     A + KKK N   ++KKK KKK  KKK  KK
Sbjct: 154 ERLKKK---ELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 34.5 bits (80), Expect = 0.029
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           A +   QAKK++    K+K   +++K K K  +K 
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146



 Score = 32.6 bits (75), Expect = 0.12
 Identities = 8/36 (22%), Positives = 20/36 (55%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
            ++ A+Q  KK+    +K+K  ++++K K    + +
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146



 Score = 31.0 bits (71), Expect = 0.38
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           A K     +K+K  RR++K K K  +KK+K + +
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 30.3 bits (69), Expect = 0.68
 Identities = 7/37 (18%), Positives = 20/37 (54%)

Query: 48  SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           +K   QA++ +   +K++   +K+K  ++ +K   + 
Sbjct: 106 AKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKA 142



 Score = 27.6 bits (62), Expect = 5.5
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 38  KRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           +R  Q         ++ K  +  R+ K K  +KK+K +  K +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 26.8 bits (60), Expect = 9.5
 Identities = 5/38 (13%), Positives = 17/38 (44%)

Query: 45  DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           + A +     ++++  + K  +K +K + +K   +   
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 35.1 bits (81), Expect = 0.032
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 1   MSTQALTILDPALVRPGRVDLK 22
           M+T  +  LDPAL+RPGR+D K
Sbjct: 327 MATNRIESLDPALIRPGRIDRK 348


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 34.7 bits (80), Expect = 0.043
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 8/68 (11%)

Query: 18  RVDL--KEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKK 75
           R+D    E  G   R +LE+  ++         K      K K  R KK++  + ++KKK
Sbjct: 329 RIDAFSGEPDGISLREELEKRIEKL------KEKPPKPPTKAKPERDKKERPGRYRRKKK 382

Query: 76  KTKKKNRR 83
           + K K+ R
Sbjct: 383 EKKAKSER 390


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 33.6 bits (77), Expect = 0.057
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 6/49 (12%)

Query: 41  YQDVDNASKFADQA------KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
            QD       +D        KK K  + +  K+ KKKKK+KK KKK   
Sbjct: 122 TQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 30.6 bits (69), Expect = 0.60
 Identities = 16/55 (29%), Positives = 26/55 (47%)

Query: 30  RHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           +H+ +              K  ++  KKK     K++K KKK+KKKK K+ +  H
Sbjct: 119 KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEH 173


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 32.7 bits (75), Expect = 0.078
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 49  KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           K   Q  K+K  ++K+KKK KKKKKKK +KK 
Sbjct: 99  KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 32.4 bits (74), Expect = 0.090
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 50  FADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
             ++AK+KK  ++KKKKK KK  KK   KKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 29.7 bits (67), Expect = 0.87
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 49  KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           K   + K K+  ++KKKKK KKKK  KK  KK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 26.6 bits (59), Expect = 9.6
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
             K   + +KKK  ++KKKK +KK  KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 31.9 bits (73), Expect = 0.15
 Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 51  ADQAK--KKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
            D+ K  KK+  R+KKK+K KKKKK KK  KK  +    + E
Sbjct: 78  KDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119



 Score = 27.7 bits (62), Expect = 4.2
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           ++ K+K   +  KK+  ++KKK+KK KKK  
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103


>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
           protein; Provisional.
          Length = 398

 Score = 32.8 bits (75), Expect = 0.17
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 1   MSTQALTILDPALVRPGRVDLK 22
           M+T     LDPAL+RPGR+D K
Sbjct: 289 MATNRADTLDPALLRPGRLDRK 310


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
          structure and biogenesis].
          Length = 67

 Score = 30.4 bits (69), Expect = 0.17
 Identities = 14/55 (25%), Positives = 31/55 (56%), Gaps = 4/55 (7%)

Query: 33 LEQMYKRFYQDVDNASKFAD----QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          L++  +RF + V+      +    +  +K + +RK+KK   +K+K K+ +K+ +R
Sbjct: 12 LDKALRRFKRKVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 32.5 bits (75), Expect = 0.17
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query: 8   ILDPALVRPGRVD 20
           ILDPA++RPGR D
Sbjct: 282 ILDPAILRPGRFD 294


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 30.9 bits (70), Expect = 0.18
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKN 81
          K KK   +KKKKK KKK K K+     
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTE 39



 Score = 27.0 bits (60), Expect = 4.2
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 63 KKKKKTKKKKKKKKTKKKNRRHYMIT 88
          K KK   KKKKKKK KK   +  ++T
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVT 38


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 32.3 bits (74), Expect = 0.20
 Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 5/58 (8%)

Query: 33  LEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
           LE MYK+ +  +      A+ +K KK  ++KKK   KK K KK T K+ +        
Sbjct: 237 LEDMYKKAHAAIR-----ANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKA 289


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.3 bits (72), Expect = 0.23
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 1/51 (1%)

Query: 33 LEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          L + YK   + +  A+    +  K      KK +K K K +KKK KK+  +
Sbjct: 47 LNEEYKDLKESL-EAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 29.4 bits (67), Expect = 1.2
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
            K +K   + K K  KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 30.0 bits (68), Expect = 0.24
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 3/31 (9%)

Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
          AD  KKKK   +KKKKK +      K KK  
Sbjct: 20 ADLVKKKK---KKKKKKAEDTAATAKAKKAT 47


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 29.9 bits (68), Expect = 0.32
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          +++K  ++K  +     KKKKK K K +R
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKR 60


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 31.8 bits (72), Expect = 0.35
 Identities = 13/30 (43%), Positives = 13/30 (43%)

Query: 54  AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
            K KK    K  K  K KK KKK    NR 
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177


>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
          Length = 644

 Score = 31.9 bits (72), Expect = 0.38
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 2   STQALTILDPALVRPGRVDLKEYIGFCSRHQLEQMYK 38
           +T    +LDPAL+RPGR D +  +G       EQ+ K
Sbjct: 296 ATNRPDVLDPALLRPGRFDRQVVVGLPDVRGREQILK 332


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 31.6 bits (71), Expect = 0.39
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 51  ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           A++ KK    ++KK++K K++KKKKK    +R H
Sbjct: 271 AEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCH 304



 Score = 30.0 bits (67), Expect = 1.3
 Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 51  ADQAKKKKNFRRKKK---KKTKKKKKKKKTKKKNRRH 84
           +DQ  K +  RR ++   K  ++KKK++K K+  RRH
Sbjct: 78  SDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 28.9 bits (64), Expect = 3.4
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           +A++ K   + KKKK +K+K++KK KKK+  H
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301



 Score = 28.5 bits (63), Expect = 4.6
 Identities = 7/40 (17%), Positives = 22/40 (55%)

Query: 45  DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           D   K  ++ + ++   + K++K K++K+K+  ++ +   
Sbjct: 79  DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 30.3 bits (69), Expect = 0.40
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           +  K++  + K+ +K ++KK K++ K+K +
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 29.5 bits (67), Expect = 0.87
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
           KK++   R ++++TK++ +K+K  +KNR
Sbjct: 90  KKRRIALRLRRERTKERAEKEKRTRKNR 117



 Score = 26.9 bits (60), Expect = 7.4
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
            A + +  R K++ + +K+ +K + KK  RR
Sbjct: 94  IALRLRRERTKERAEKEKRTRKNREKKFKRR 124


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 31.3 bits (72), Expect = 0.41
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
          K+ K + + +KKK K++ K  K K K R
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 31.0 bits (71), Expect = 0.56
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 57 KKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          KK  +   K + KK+K++ K  K   +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 30.2 bits (69), Expect = 1.2
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 37 YKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
          YK   +++  A+    +  K  +  +KKK+K + K  K K+K +
Sbjct: 49 YKEMKEEL-KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 31.7 bits (72), Expect = 0.45
 Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 16/61 (26%)

Query: 40  FYQDVDNASKFADQAKKKKNFRR----------------KKKKKTKKKKKKKKTKKKNRR 83
              D     +   + K K N                    +  K KK KKKKK KKK R+
Sbjct: 231 STDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRK 290

Query: 84  H 84
            
Sbjct: 291 D 291



 Score = 30.9 bits (70), Expect = 0.64
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
           AS + D ++  K  + KKKKK KKK++K  
Sbjct: 263 ASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 30.8 bits (70), Expect = 0.46
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 49  KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           K   + KK+K  ++ KK+K  + K  KK KKK ++ 
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 30.8 bits (70), Expect = 0.53
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           A    +   +++  + KKKKK  KK+KK+K  KK 
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 29.7 bits (67), Expect = 1.2
 Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)

Query: 45  DNASKFADQAKKKKNFRRKKKKK-----TKKKKKKKKTKKK 80
           +   +   + + KK  + KK KK      K  KKKKK KKK
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +KKK    KK+KK KK KK+K  + K  +
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184



 Score = 27.8 bits (62), Expect = 4.4
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           ++KK  ++KK+ K +KK+KK K +K     
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181



 Score = 27.8 bits (62), Expect = 4.4
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           A    ++ K+KK  +  KK+K +KK KK+K  + 
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 28.7 bits (65), Expect = 0.49
 Identities = 11/54 (20%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 33 LEQMYKRFYQDVDNASKFADQAKKKKNF-----RRKKKKKTKKKKKKKKTKKKN 81
          +++  +RF + V+ A     + ++++ +     +RK+KK   +K+++KK  ++ 
Sbjct: 12 IDKALRRFKRKVEKAGILR-ELRRREFYEKPSEKRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 31.3 bits (72), Expect = 0.49
 Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 1/43 (2%)

Query: 44  VDNASKFADQ-AKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHY 85
           V+ A +   +  +K +       K    KK   K  +K ++ Y
Sbjct: 807 VEEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849



 Score = 30.1 bits (69), Expect = 1.2
 Identities = 7/35 (20%), Positives = 15/35 (42%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
               AD   K    ++   K ++K K++    +K+
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 27.8 bits (63), Expect = 7.4
 Identities = 8/28 (28%), Positives = 12/28 (42%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
           D A KK   +  +K K +    +K  K 
Sbjct: 831 DPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 27.4 bits (62), Expect = 9.3
 Identities = 7/34 (20%), Positives = 12/34 (35%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           A   A     KK   +  +K  ++    +K  K 
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.6 bits (71), Expect = 0.49
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 45   DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
            D A K A++AKK     + + +    + +  + K +
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 29.0 bits (64), Expect = 3.7
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 45   DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
            + A K AD AKKK    +K  +  K + +    + +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360



 Score = 29.0 bits (64), Expect = 3.8
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 45   DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
            D A K A++AKKK +  +KK ++ KK  +  K + +   
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356



 Score = 28.6 bits (63), Expect = 4.8
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 48   SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
            +K A +A K +      + +  ++K +   KKK
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 28.2 bits (62), Expect = 5.9
 Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 47   ASKFADQAKKKKNFRRKK---KKKTKKKKKKKKTKKK 80
            A K AD+AKKK   ++K    KKK ++ KK  + KKK
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452



 Score = 28.2 bits (62), Expect = 6.0
 Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 4/37 (10%)

Query: 48   SKFADQAKKKKNFRRKK----KKKTKKKKKKKKTKKK 80
            +K AD+AKKK    +KK    KKK ++ KK  +  K 
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350



 Score = 27.8 bits (61), Expect = 6.9
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 45   DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
            D A K A++AKK +  ++K ++  K  + KKK ++  +
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 27.4 bits (60), Expect = 9.3
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 45   DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
              A+K   +A   +    ++K +  +KKK++  KK
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379


>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
           HflB(FtsH) is a pleiotropic protein required for correct
           cell division in bacteria. It has ATP-dependent zinc
           metalloprotease activity. It was formerly designated
           cell division protein FtsH [Cellular processes, Cell
           division, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 495

 Score = 31.5 bits (72), Expect = 0.50
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query: 8   ILDPALVRPGRVD 20
           +LDPAL+RPGR D
Sbjct: 205 VLDPALLRPGRFD 217


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 31.3 bits (71), Expect = 0.55
 Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 6/43 (13%)

Query: 45  DNASKFADQAKKKKNF------RRKKKKKTKKKKKKKKTKKKN 81
                   Q  KK NF      R +KK K K  KK  +  KK 
Sbjct: 748 GKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 51  ADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
           A + +KK   +  KK   K KK   KT+K
Sbjct: 768 AFRGEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 27.8 bits (62), Expect = 8.2
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
             K A     KK+ R+ KK   K +K    TK K
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAK 804


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 30.8 bits (70), Expect = 0.57
 Identities = 8/38 (21%), Positives = 15/38 (39%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           ++      +     R++K  K   KK     K+K +R 
Sbjct: 211 SASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family.  Many
           proteins may score above the trusted cutoff because an
           internal.
          Length = 364

 Score = 30.9 bits (70), Expect = 0.66
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 2   STQALTILDPALVRPGRVD 20
           +T    ILDPAL+RPGR D
Sbjct: 267 ATNRPDILDPALLRPGRFD 285


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 29.5 bits (66), Expect = 0.71
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 62  RKKKKKTKKKKKKKKTKKKN 81
           +KKKKK+KK KK KK+ KK+
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 28.7 bits (64), Expect = 1.3
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 61  RRKKKKKTKKKKKKKKTKKK 80
           ++KKKK  K KK KK +KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 26.8 bits (59), Expect = 7.1
 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 44  VDNASKFADQAKKKKNFRRKKKKKTKKKKKKK 75
            D+  +  ++ KKKK  + KK KK KK  KK 
Sbjct: 90  DDHPPEPTEKPKKKK-KKSKKTKKPKKSSKKD 120



 Score = 26.4 bits (58), Expect = 8.2
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 61  RRKKKKKTKKKKKKKKTKKKNRR 83
             K KKK KK KK KK KK +++
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
          pigmentation in the mouse hair follicle producing a
          black hair with a subapical yellow band. A highly
          homologous protein agouti signal protein (ASIP) is
          present in humans and is expressed at highest levels in
          adipose tissue where it may play a role in energy
          homeostasis and possibly human pigmentation.
          Length = 124

 Score = 29.5 bits (66), Expect = 0.75
 Identities = 9/27 (33%), Positives = 13/27 (48%)

Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKK 79
            K KK    + +KK  +KK+KK    
Sbjct: 51 NKKPKKISAEEAEKKLLQKKEKKALTN 77


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.4 bits (69), Expect = 0.77
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 42  QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           +DV+ A   A++ +K K     KKKK K+ K  K  KK  
Sbjct: 239 EDVEAAESRAEKKRKSKE--EIKKKKPKESKGVKALKKVV 276



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKK 79
            KKK  +  K  K  KK   K  KK
Sbjct: 258 IKKKKPKESKGVKALKKVVAKGMKK 282


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.4 bits (69), Expect = 0.80
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 46  NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           N ++   +  K +    + KKK KKKKKK K     
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNAT 198


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.0 bits (68), Expect = 0.86
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           Q K+++N +++K  K  KKKKK K K +
Sbjct: 177 QKKREENLKKRKDDKKNKKKKKAKKKGR 204


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.7 bits (70), Expect = 0.88
 Identities = 11/49 (22%), Positives = 18/49 (36%)

Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
          Q++Q+       V +     +    KK  +   K    K   KKK K +
Sbjct: 51 QIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 30.0 bits (68), Expect = 0.89
 Identities = 13/38 (34%), Positives = 18/38 (47%)

Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
            KK+K  +    +  K+ K KKK  KK R+ Y    E
Sbjct: 3  LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAE 40


>gnl|CDD|220816 pfam10592, AIPR, AIPR protein.  This family of proteins was
           identified in as an abortive infection phage resistance
           protein often found in restriction modification system
           operons.
          Length = 300

 Score = 30.4 bits (69), Expect = 0.95
 Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 2/58 (3%)

Query: 32  QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITL 89
            L+ +YK  + D      +     K +   +  +K  KKKKK       N   Y I  
Sbjct: 193 DLKGIYKIIFNDHLEEDYYKSIVAKYRLIFKALEKIIKKKKKYF--GGANIVAYTILA 248


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.1 bits (68), Expect = 0.96
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query: 63  KKKKKTKKKKKKKKTKKKNRR 83
             +K  K K   KK  KK R+
Sbjct: 194 GDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|219553 pfam07751, Abi_2, Abi-like protein.  This family, found in various
           bacterial species, contains sequences that are similar
           to the Abi group of proteins, which are involved in
           bacteriophage resistance mediated by abortive infection
           in Lactococcus species. The proteins are thought to have
           helix-turn-helix motifs, found in many DNA-binding
           proteins, allowing them to perform their function.
          Length = 190

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 3/63 (4%)

Query: 31  HQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMIT-L 89
           + L + Y  F     + S + D+    K+F +K  K  K+ K K +   K+ R    T L
Sbjct: 67  YHLSEKYGPFAYL--DKSLYKDRFNYSKDFIQKLSKIIKRSKDKLEDFIKHYRKNYDTEL 124

Query: 90  EIP 92
             P
Sbjct: 125 LFP 127


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
          heteromeric structure-specific endonuclease found in
          fungi. Slx4 with Slx1 acts as a nuclease on branched
          DNA substrates, particularly simple-Y, 5'-flap, or
          replication fork structures by cleaving the strand
          bearing the 5' non-homologous arm at the branch
          junction and thus generating ligatable nicked products
          from 5'-flap or replication fork substrates.
          Length = 627

 Score = 30.3 bits (68), Expect = 1.1
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 7/61 (11%)

Query: 30 RHQLEQMYKRF--YQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMI 87
          R QL Q+  RF  ++  D A+      K+       KKKK KK K +K+TK+KN++   +
Sbjct: 42 RLQLRQVISRFTSFESDDQANSGNVSGKRVP-----KKKKIKKPKLRKRTKRKNKKIKSL 96

Query: 88 T 88
          T
Sbjct: 97 T 97


>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase.  In the Actinobacteria, as
           shown for Mycobacterium tuberculosis, some proteins are
           modified by ligation between an epsilon-amino group of a
           lysine side chain and the C-terminal carboxylate of the
           ubiquitin-like protein Pup. This modification leads to
           protein degradation by the archaeal-like proteasome
           found in the Actinobacteria. Members of this protein
           family belong to the AAA family of ATPases and tend to
           be clustered with the genes for Pup, the Pup ligase
           PafA, and structural components of the proteasome. This
           protein forms hexameric rings with ATPase activity
           [Protein fate, Degradation of proteins, peptides, and
           glycopeptides].
          Length = 512

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 10/19 (52%), Positives = 16/19 (84%)

Query: 8   ILDPALVRPGRVDLKEYIG 26
           ++DPA++RPGR+D+K  I 
Sbjct: 345 MIDPAILRPGRLDVKIRIE 363


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 30.2 bits (68), Expect = 1.2
 Identities = 12/26 (46%), Positives = 12/26 (46%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKK 80
           K KK     K K  K  KKK K K K
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSK 777



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 2/31 (6%)

Query: 54  AKKKKNFRRKKKKKTKKKKKK--KKTKKKNR 82
           AKKK    + K  KT KKK K   K   K R
Sbjct: 753 AKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 28.6 bits (64), Expect = 4.2
 Identities = 14/44 (31%), Positives = 19/44 (43%), Gaps = 5/44 (11%)

Query: 48  SKFADQAK---KKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMIT 88
           +K+A   K   +KK   + KKK    K K  KT KK  +     
Sbjct: 737 TKYAGTPKTPYEKKT--KAKKKSASTKGKAAKTVKKKSKAKSKK 778



 Score = 27.5 bits (61), Expect = 8.5
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 54  AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
            KK  + + K  K  KKK K K  K   +R
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
          KK+K   + + K+ KK+  KK  +KK R
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKR 31



 Score = 26.0 bits (58), Expect = 5.5
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          KK+    K + K  KK+  KK  +K +R
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKR 31



 Score = 26.0 bits (58), Expect = 6.4
 Identities = 9/34 (26%), Positives = 15/34 (44%)

Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
            K +    +K+  K   +KKK+K   K    Y+
Sbjct: 9  NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYV 42


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 28.5 bits (64), Expect = 1.4
 Identities = 11/52 (21%), Positives = 22/52 (42%)

Query: 32  QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           QL  + ++  Q      +    ++  K  R+K +  + +  KK   KK+ R 
Sbjct: 53  QLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104



 Score = 27.0 bits (60), Expect = 5.6
 Identities = 10/50 (20%), Positives = 19/50 (38%)

Query: 31  HQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           H  +Q+ +   +  +         KK+K  R    +  KK   KK+ +  
Sbjct: 56  HLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 30.1 bits (69), Expect = 1.5
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 40  FYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
             Q+ + A K A++A K    R +  KK K+KKK   
Sbjct: 362 LNQNPELAEKLAEKAIKAAQARLRAAKKVKRKKKTSG 398


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 51  ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           A++  K    ++  KK T  K KK   KK   +
Sbjct: 253 AERRAKGGPGKKPAKKATAAKAKKTTAKKAAAK 285


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.0 bits (65), Expect = 1.6
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           + +K+ +   ++K+  K KK+KK+ K+K
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 27.8 bits (62), Expect = 4.2
 Identities = 9/32 (28%), Positives = 20/32 (62%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +  K++K   ++ ++K   K KK+K +KK ++
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125


>gnl|CDD|233478 TIGR01577, oligosac_amyl, oligosaccharide amylase.  The name of
           this type of amylase is based on the characterization of
           an glucoamylase family enzyme from Thermoactinomyces
           vulgaris. The T. vulgaris enzyme was expressed in E.
           coli and, like other glucoamylases, it releases
           beta-D-glucose from starch. However, unlike previously
           characterized glucoamylases, this T. vulgaris amylase
           hydrolyzes maltooligosaccharides (maltotetraose,
           maltose) more efficiently than starch (PMID: 11549021),
           indicating this enzyme belongs to a class of
           glucoamylase-type enzymes with
           oligosaccharide-metabolizing activity.
          Length = 616

 Score = 29.9 bits (67), Expect = 1.6
 Identities = 16/81 (19%), Positives = 26/81 (32%), Gaps = 11/81 (13%)

Query: 155 ERSRESTTLDIQQGIPWET----------VTLTAFGRDRSLFFNILEEARTLALKQYEGK 204
             ++E   +       WE               AFG+        LEE RTL  ++   +
Sbjct: 169 NCNKEDIGMHTDGACLWELGLCPPGSKCFTIFFAFGKTVEKALEELEELRTLGFERIRSQ 228

Query: 205 TVMY-TALGSEWRPFGHPKRR 224
           T+    A   + +P   P   
Sbjct: 229 TLADWEAYFGQAKPVQTPCLE 249


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 27.6 bits (62), Expect = 1.7
 Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 5/37 (13%)

Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTK-----KKNRR 83
          D +K ++  +R ++KK KKKK KKK+K     KK + 
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64


>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily.  This
           subfamily of the AAA family ATPases includes two members
           each from three archaeal species. It also includes yeast
           CDC48 (cell division control protein 48) and the human
           ortholog, transitional endoplasmic reticulum ATPase
           (valosin-containing protein). These proteins in
           eukaryotes are involved in the budding and transfer of
           membrane from the transitional endoplasmic reticulum to
           the Golgi apparatus.
          Length = 733

 Score = 29.9 bits (67), Expect = 1.7
 Identities = 11/13 (84%), Positives = 12/13 (92%)

Query: 8   ILDPALVRPGRVD 20
           ILDPAL+RPGR D
Sbjct: 602 ILDPALLRPGRFD 614


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 27.9 bits (63), Expect = 1.7
 Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 11/49 (22%)

Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           +E+MYK+           A  A +    R+KK+KK  K + K+   KK
Sbjct: 43 DIEEMYKK-----------AHAAIRADPSRKKKEKKEVKAESKRYNAKK 80


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 29.7 bits (66), Expect = 1.7
 Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 34   EQMYKRFYQDVDNASKFADQAKK--KKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
            E   K   +D     +  DQ +K   ++ +R K+++  KK+K+++ + +   H
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIH 1062


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 12/25 (48%), Positives = 20/25 (80%)

Query: 61  RRKKKKKTKKKKKKKKTKKKNRRHY 85
           R+KKKKK  K+KKKKK ++ +++ +
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 27.9 bits (62), Expect = 7.4
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 60  FRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
              +KKKK KK K+KKK K +     +  LE
Sbjct: 681 CGSRKKKKKKKSKEKKKKKNREASKQIHALE 711


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 29.2 bits (66), Expect = 1.8
 Identities = 9/26 (34%), Positives = 10/26 (38%)

Query: 56  KKKNFRRKKKKKTKKKKKKKKTKKKN 81
           K K  +  K  K KK K K K     
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGG 219



 Score = 27.7 bits (62), Expect = 5.1
 Identities = 11/36 (30%), Positives = 16/36 (44%)

Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
          D       + +K K   + K KK  K K ++K K K
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAK 73



 Score = 27.7 bits (62), Expect = 5.9
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query: 43 DVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
          D +   + A  A K K  +  K K  +K+K K++ ++K  R
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82



 Score = 27.7 bits (62), Expect = 6.1
 Identities = 14/44 (31%), Positives = 19/44 (43%)

Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
            K    AK K     K K + K+K K++K +K  R     T E
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 42  QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           ++ +N +    + KK +  +RK +KK +K++ +K   KK    
Sbjct: 399 EEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
          identified as the binding site of the SRP72 protein to
          SRP RNA.
          Length = 57

 Score = 26.6 bits (59), Expect = 2.2
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 7/52 (13%)

Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKSY 98
            K   + KKKK+ ++K+K K  K      T    R        +P RD+SY
Sbjct: 10 KKKPGTEQKKKKSAKKKRKPKLPKNFNPVVTPDPERW-------LPMRDRSY 54



 Score = 26.3 bits (58), Expect = 3.1
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
           +K   +  KKK    +KKKK+ KKK+K K  K
Sbjct: 1  EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKLPK 33


>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein. 
          Length = 96

 Score = 27.5 bits (62), Expect = 2.3
 Identities = 10/28 (35%), Positives = 14/28 (50%), Gaps = 2/28 (7%)

Query: 62 RKKKKKTKKKKKKKKTKKKN--RRHYMI 87
          R KK    K K++K  +KK   R+ Y  
Sbjct: 68 RGKKVIVFKYKRRKNYRKKQGHRQPYTR 95


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.5 bits (64), Expect = 2.3
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 42  QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           ++++   K  ++ +K K  ++K KKK  K K KK  KK +
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108



 Score = 28.5 bits (64), Expect = 2.4
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 54  AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
             KKK  + K KK  KK  K +K  +K  
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 27.3 bits (61), Expect = 6.2
 Identities = 12/30 (40%), Positives = 13/30 (43%)

Query: 51  ADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
             Q  K K  + KKKK   K KK  K   K
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 27.3 bits (61), Expect = 6.5
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
            + K KK   +KKK K K KK  KK  K  
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
          contains a C-terminal presumed domain in Sas10 which
          hash been identified as a regulator of chromatin
          silencing.
          Length = 76

 Score = 27.2 bits (61), Expect = 2.4
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
          K ++N R KK+KK +K  K+ K++++  R   
Sbjct: 25 KDRRNPRVKKRKKYEKAVKRLKSQRRVVREED 56


>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
          Length = 382

 Score = 29.0 bits (65), Expect = 2.5
 Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 4/61 (6%)

Query: 21  LKEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           L EYI +            F       +    Q   ++N  +  KKK K KKK KK ++ 
Sbjct: 63  LYEYIAYKQNAS----NPNFQNSDSVVNLTKKQISSEENTMKMDKKKYKLKKKIKKVREI 118

Query: 81  N 81
           N
Sbjct: 119 N 119


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 13/47 (27%), Positives = 18/47 (38%), Gaps = 6/47 (12%)

Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKK------TKKKNRRHY 85
           N  K   + K+     R K  K  +KK KKK       ++  RR  
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRT 65



 Score = 26.7 bits (59), Expect = 5.7
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +      + +KKK  R  K  + K  KK +K  KK   
Sbjct: 14 IDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 9/34 (26%), Positives = 19/34 (55%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
                   KK++  +++KKKK    KKK++ +++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.9 bits (65), Expect = 3.0
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 42 QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
          +DVD+   F  +   +      K+   KKKKKKKK KKKN
Sbjct: 31 KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKN 70


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 28.0 bits (63), Expect = 3.0
 Identities = 10/30 (33%), Positives = 22/30 (73%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           ++KK   ++KK++ K++K++ K +KK  +H
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
           D  KK+K  R++K++   K +  KK K+K+++ 
Sbjct: 624 DGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.0 bits (65), Expect = 3.1
 Identities = 16/49 (32%), Positives = 21/49 (42%), Gaps = 5/49 (10%)

Query: 37  YKRFYQDVDNASKFA-----DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           Y  F+   +N+ +FA        +K K    K KKK    K   K KKK
Sbjct: 93  YDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKK 141



 Score = 27.9 bits (62), Expect = 6.6
 Identities = 13/52 (25%), Positives = 17/52 (32%)

Query: 46  NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKS 97
                +   K K    + KKK T  K   KK KK       +      + KS
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKS 159


>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
          Length = 788

 Score = 29.0 bits (65), Expect = 3.4
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 81  NRRHYMITLEIPCRDKSYHWLLH 103
           N  H + TLE   R  SY+WLL 
Sbjct: 456 NITHSLCTLEFETRRASYYWLLD 478


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 29.0 bits (65), Expect = 3.4
 Identities = 14/36 (38%), Positives = 16/36 (44%)

Query: 44  VDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
              A K A Q KKK      KKK+ KK     K+K 
Sbjct: 597 AAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKA 632


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 28.8 bits (64), Expect = 3.5
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           D+  K +  R+KK+K+   KK   K KK+
Sbjct: 555 DRTDKNRERRKKKRKRRAAKKAVTKAKKE 583



 Score = 28.4 bits (63), Expect = 4.1
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 58  KNFRRKKKKKTKKKKKKKKTKKKNRR 83
           KN  R+KKK+ ++  KK  TK K  R
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKER 584


>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
           [Amino acid transport and metabolism].
          Length = 542

 Score = 28.6 bits (64), Expect = 3.6
 Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 213 SEWRPFGHPKRRRPLNSVVLDSGVADRILADIRQFVADPAWYLDRG 258
           S  R +  P  +RP N  +L      RIL +  + V       D G
Sbjct: 203 SAARAYLKPALKRP-NLTLLTGARVRRILLEGDRAVGVEVEIGDGG 247


>gnl|CDD|187679 cd09621, CBM9_like_5, DOMON-like type 9 carbohydrate binding
           module.  Family 9 carbohydrate-binding modules (CBM9)
           play a role in the microbial degradation of cellulose
           and hemicellulose (materials found in plants). The
           domain has previously been called cellulose-binding
           domain. The polysaccharide binding sites of CBMs with
           available 3D structure have been found to be either flat
           surfaces with interactions formed by predominantly
           aromatic residues (tryptophan and tyrosine), or extended
           shallow grooves. CBM9 domains found in this
           uncharacterized heterogeneous subfamily are often
           located at the C-terminus of longer proteins and may
           co-occur with various other functional domains such as
           glycosyl hydrolases. The CBM9 module in these
           architectures may be involved in binding to
           carbohydrates.
          Length = 188

 Score = 28.1 bits (63), Expect = 3.7
 Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 152 IRVERSR-ESTTLDIQQGIPWETVTLTAFGRDRSLFFNIL 190
           IR+E SR E+     +  IPW  + LT         FN+L
Sbjct: 116 IRLETSRDEAKVTVYEAAIPWAELGLTGPAAGDGFRFNLL 155


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 28.3 bits (63), Expect = 3.9
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 47  ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
           A K A +AKKK     K K + K K K ++ K K 
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKA 198



 Score = 27.5 bits (61), Expect = 6.8
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 45  DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
           +  +K A +AKKK    +KK +   K K + K K K
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 4.0
 Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 10/47 (21%)

Query: 44  VDNASKFADQAKKKKNFRRKK----------KKKTKKKKKKKKTKKK 80
           V+  +    ++KK     R+K          +K  KK +K++K KK 
Sbjct: 100 VNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKM 146


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 18/66 (27%)

Query: 34 EQMYKRFYQDV-----DNASKFADQAKKKKNFRRKKKK-------------KTKKKKKKK 75
           Q     Y +V     +  S F+++  K      KK K             K KKKKK+K
Sbjct: 21 LQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEK 80

Query: 76 KTKKKN 81
          K  K  
Sbjct: 81 KEPKSE 86



 Score = 28.1 bits (63), Expect = 5.4
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
          +   KKK+ ++KKKKK KK+ K +   K   
Sbjct: 63 NNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93


>gnl|CDD|222928 PHA02778, PHA02778, major capsid L1 protein; Provisional.
          Length = 503

 Score = 28.2 bits (64), Expect = 4.4
 Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 18/68 (26%)

Query: 19  VDLKEYIGFCSRHQLEQMYKRFYQDVDNAS---KFADQAKKKKNFRRKKKKKTKKKKKKK 75
           VDL E               RF  D+D      KF  QA  ++  RR+K+K T     + 
Sbjct: 451 VDLTE---------------RFSLDLDQFPLGRKFLFQAGLRRRTRRRKRKSTSSSSSRS 495

Query: 76  KTKKKNRR 83
             +++ +R
Sbjct: 496 TKRRRRKR 503


>gnl|CDD|237625 PRK14147, PRK14147, heat shock protein GrpE; Provisional.
          Length = 172

 Score = 27.6 bits (61), Expect = 4.4
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 30 RHQLEQMYKRFYQDVDNASKFADQ 53
          R  LE   KR  +DV+ A KFA++
Sbjct: 45 RADLENQRKRIARDVEQARKFANE 68


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 27.9 bits (62), Expect = 4.8
 Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 7/32 (21%)

Query: 59  NFRRK-------KKKKTKKKKKKKKTKKKNRR 83
           ++RRK       KKKK K KK K K KK   +
Sbjct: 79  SWRRKPATTAVEKKKKGKSKKNKLKGKKDEDK 110



 Score = 27.5 bits (61), Expect = 6.9
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTK 78
           KKKK   +K K K KK + KKK +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 27.7 bits (62), Expect = 5.2
 Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 4/44 (9%)

Query: 29  SRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKK 72
           +  +  ++ + F  +       A  A K++     +KK++KKKK
Sbjct: 157 AEWKDAKLLEEFAAE----EAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|188874 cd09768, Luminal_EIF2AK3, The Luminal domain, a dimerization
           domain, of the Serine/Threonine protein kinase,
           eukaryotic translation Initiation Factor 2-Alpha Kinase
           3.  The Luminal domain is a dimerization domain present
           in eukaryotic translation Initiation Factor 2-Alpha
           Kinase 3 (EIF2AK3), also called PKR-like Endoplasmic
           Reticulum Kinase (PERK). EIF2AK3 is a serine/threonine
           protein kinase (STK) and a type I transmembrane protein
           that is localized in the endoplasmic reticulum (ER). As
           a EIF2AK, it phosphorylates the alpha subunit of eIF-2,
           resulting in the downregulation of protein synthesis.
           eIF-2 phosphorylation is induced in response to cellular
           stresses including virus infection, heat shock, nutrient
           deficiency, and the accummulation of unfolded proteins,
           among others. There are four distinct kinases that
           phosphorylate eIF-2 and control protein synthesis:
           General Control Non-derepressible-2 (GCN2), protein
           kinase regulated by RNA (PKR), heme-regulated inhibitor
           kinase (HRI), and PERK. PERK contains a luminal domain
           bound with the chaperone BiP under unstressed conditions
           and a cytoplasmic catalytic kinase domain. In response
           to the accumulation of misfolded or unfolded proteins in
           the ER, PERK is activated through the release of BiP,
           allowing it to dimerize through its luminal domain and
           autophosphorylate. It functions as the central regulator
           of translational control during the Unfolded Protein
           Response (UPR) pathway. In addition to the eIF-2 alpha
           subunit, PERK also phosphorylates Nrf2, a leucine zipper
           transcription factor which regulates cellular redox
           status and promotes cell survival during the UPR.
          Length = 301

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 8/35 (22%), Positives = 13/35 (37%)

Query: 118 VETSFEKFDTGYVKTKYDFIPSIGTHLFSYQGNWI 152
           V +S    +          IPS+   L+ + G  I
Sbjct: 35  VSSSLSTLELINNGKSVRLIPSLDGSLYQFDGESI 69


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
          region.  This highly conserved region is found towards
          the C-terminus of the transmembrane domain. The
          function is unclear.
          Length = 151

 Score = 27.3 bits (60), Expect = 5.4
 Identities = 11/29 (37%), Positives = 21/29 (72%)

Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
          AK++K   + + +K+ KKKK+K ++KK +
Sbjct: 11 AKQRKESGKTQSQKSDKKKKEKVSEKKGK 39



 Score = 26.9 bits (59), Expect = 7.5
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKK 80
          K ++ +  KKKK K  +KK K+KKK
Sbjct: 19 KTQSQKSDKKKKEKVSEKKGKSKKK 43


>gnl|CDD|130083 TIGR01010, BexC_CtrB_KpsE, polysaccharide export inner-membrane
           protein, BexC/CtrB/KpsE family.  This family contains
           gamma proteobacterial proteins involved in capsule
           polysaccharide export [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids].
          Length = 362

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 141 GTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFN--ILEEARTL 196
             H    Q  + R  + R S  +D   GI   T+ +TAF  + +   N  +L+E   L
Sbjct: 104 RLHKNGTQEAFYRYYKKRLSVDVDSVSGI--LTLNVTAFDAEEAQKINQRLLKEGERL 159


>gnl|CDD|153356 cd07672, F-BAR_PSTPIP2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Protein 2.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Proline-Serine-Threonine
           Phosphatase-Interacting Protein 2 (PSTPIP2), also known
           as Macrophage Actin-associated tYrosine Phosphorylated
           protein (MAYP), is mostly expressed in hematopoietic
           cells but is also expressed in the brain. It is involved
           in regulating cell adhesion and motility. Mutations in
           the gene encoding murine PSTPIP2 can cause
           autoinflammatory disorders such as chronic multifocal
           osteomyelitis and macrophage autoinflammatory disease.
           PSTPIP2 contains an N-terminal F-BAR domain and lacks
           the PEST motifs and SH3 domain that are found in
           PSTPIP1. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 240

 Score = 27.6 bits (61), Expect = 5.7
 Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 15/72 (20%)

Query: 40  FYQDVDN--------ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR-HYMITLE 90
           F Q +DN        A    D+AKK ++FR ++K   KK +       K R   +  T+E
Sbjct: 66  FKQQIDNVGQSHIQLAQTLRDEAKKMEDFRERQKLARKKIELIMDAIHKQRAMQFKKTME 125

Query: 91  IP------CRDK 96
                   CRDK
Sbjct: 126 SKKNYEQKCRDK 137


>gnl|CDD|177142 MTH00070, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 346

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 9/20 (45%), Positives = 10/20 (50%), Gaps = 6/20 (30%)

Query: 86  MITLEIPCRDKSYHWLLHWI 105
           MITL       S HW L W+
Sbjct: 18  MITL------SSNHWFLAWM 31


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.7 bits (62), Expect = 6.2
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 52  DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
              KKK+   +  K+ TKKKKKK  T  K
Sbjct: 83  RLKKKKRVKTKAYKEPTKKKKKKDPTAAK 111


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 27.3 bits (61), Expect = 6.9
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
          + K ++  R+ + +  +++ K +K K K R  ++
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRSDWL 69


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 27.2 bits (61), Expect = 6.9
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 62 RKKKKKTKKKKKKKKTKKKNR 82
          +KKK   K+ K   K+KKK R
Sbjct: 1  KKKKSSPKRSKGMAKSKKKTR 21


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.9 bits (63), Expect = 7.2
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 62  RKKKKKTKKKKKKKKTKKKNRRH 84
            K +K  KKKKKK KT K   R 
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702


>gnl|CDD|214619 smart00325, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases.  Guanine nucleotide exchange
           factor for Rho/Rac/Cdc42-like GTPases Also called
           Dbl-homologous (DH) domain. It appears that PH domains
           invariably occur C-terminal to RhoGEF/DH domains.
           Improved coverage.
          Length = 180

 Score = 27.3 bits (61), Expect = 7.3
 Identities = 12/60 (20%), Positives = 25/60 (41%), Gaps = 2/60 (3%)

Query: 20  DLKEYIGFCSRHQLE--QMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKT 77
           D  E IG       E  ++Y  +  +  +A +   + KK   F++  K+     + ++ T
Sbjct: 69  DSVERIGDVFLKLEEFFKIYSEYCSNHPDALELLKKLKKNPRFQKFLKEIESSPQCRRLT 128


>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
          Length = 638

 Score = 27.7 bits (62), Expect = 7.3
 Identities = 10/13 (76%), Positives = 11/13 (84%)

Query: 8   ILDPALVRPGRVD 20
           ILD AL+RPGR D
Sbjct: 333 ILDAALLRPGRFD 345


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 26.8 bits (60), Expect = 7.6
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 55  KKKKNFRRKKKKKTKKKKKKKKTKKK 80
           KK K F+ ++  K K++K K K KKK
Sbjct: 112 KKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 26.8 bits (60), Expect = 8.4
 Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 44  VDNASKFADQAKKK-KNFRRKKKKKTKKKKKKKKTKK 79
           V +A K A++AK   +N RR    K KK +K+K+  +
Sbjct: 110 VKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISE 146


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 27.3 bits (61), Expect = 8.6
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 60  FRRKKKKKTKKKKKKKKTKKKNRR 83
           FR    K+T KK KKKK KKK   
Sbjct: 82  FRTPYYKRTTKKMKKKKKKKKVVM 105


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 27.0 bits (60), Expect = 8.7
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 61 RRKKKKKTKKKKKKKKTKKKNRRH 84
          R+KKK ++   K  +K  K+  R 
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRK 25


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 27.2 bits (60), Expect = 8.8
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 57  KKNFRRKKKKKTKKKKKKKKTKKKN 81
           K  +  KKK K+KK KKK+  ++ N
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 27.7 bits (62), Expect = 9.0
 Identities = 13/56 (23%), Positives = 24/56 (42%), Gaps = 8/56 (14%)

Query: 34   EQMYKR-------FYQDVDN-ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
            + M+           ++ +    K   + ++ K+  + K  K +K K KKK KKK 
Sbjct: 1127 KDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKK 1182


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 27.5 bits (61), Expect = 9.4
 Identities = 9/31 (29%), Positives = 21/31 (67%)

Query: 53  QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
           +  K ++ + ++K+K + K++KKK K+K + 
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 26.5 bits (59), Expect = 9.5
 Identities = 9/28 (32%), Positives = 20/28 (71%)

Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
          D  +K++   R+K++K +K++K+K+  K
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGK 40


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 27.1 bits (60), Expect = 9.6
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 61  RRKKKKKTKKKKKKKKTKKKNRRH 84
              K KK K KKK KK+KKK  + 
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKE 165


>gnl|CDD|220628 pfam10198, Ada3, Histone acetyltransferases subunit 3.  Ada3 is a
           family of proteins conserved from yeasts to humans. It
           is an essential component of the Ada transcriptional
           coactivator (alteration/deficiency in activation)
           complex. Ada3 plays a key role in linking histone
           acetyltransferase-containing complexes to p53 (tumour
           suppressor protein) thereby regulating p53 acetylation,
           stability and transcriptional activation following DNA
           damage.
          Length = 127

 Score = 26.5 bits (59), Expect = 9.8
 Identities = 17/52 (32%), Positives = 22/52 (42%), Gaps = 3/52 (5%)

Query: 21  LKEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKK 72
            K  +         Q YK    D+D   K  DQA  K+   R KKKK +K+ 
Sbjct: 59  KKRLLPLAKEEMARQEYKTILDDLD---KQVDQAYLKRIRARGKKKKKEKRP 107


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 27.5 bits (62), Expect = 9.8
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 58  KNFRRKKKKKTKKKKKKKKTK 78
           +  R KKKKK KK KK +  K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 27.1 bits (60), Expect = 9.9
 Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 3/46 (6%)

Query: 61  RRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKSYHWLLHWIT 106
           R + ++K K+KK    T ++NR   +IT   P R    H     I 
Sbjct: 291 RAQYREKQKEKKAFLWTLRRNRLRMIIT---PWRAPELHAENAEIK 333


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.135    0.409 

Gapped
Lambda     K      H
   0.267   0.0771    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,602,352
Number of extensions: 1288958
Number of successful extensions: 6991
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5501
Number of HSP's successfully gapped: 524
Length of query: 263
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 168
Effective length of database: 6,723,972
Effective search space: 1129627296
Effective search space used: 1129627296
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.0 bits)