RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5528
         (182 letters)



>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.1 bits (92), Expect = 6e-04
 Identities = 18/60 (30%), Positives = 34/60 (56%)

Query: 123 MVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +      K  T  +++ + K +K  +EE++EKKKK     K+    +++K ++EEE+EE
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 34.9 bits (81), Expect = 0.015
 Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 138 RRRMRKKKKVEKEEE-EEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             + R+++K EK+++    KKK + + +     +++K EEEEE EE
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464



 Score = 34.9 bits (81), Expect = 0.016
 Identities = 18/47 (38%), Positives = 30/47 (63%)

Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +  +  KK++ EK+E+++K    K+K +     K+KK EE+EEEEE
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460



 Score = 32.2 bits (74), Expect = 0.14
 Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            ++ +KKK    +++EE++++ K K +     ++++ EEE+EEEE
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 31.8 bits (73), Expect = 0.18
 Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++  +KK    K++EEE++++ ++K +     +++  EE+EEEEE
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 28.7 bits (65), Expect = 2.0
 Identities = 12/46 (26%), Positives = 28/46 (60%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            R   +K+KK +    ++K+++ + + ++    K+++ EE EEE+E
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 28.3 bits (64), Expect = 2.5
 Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + K       ++ +     +KEEEEE++K+ K + K     + ++ +EEEEE++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 35.3 bits (82), Expect = 0.012
 Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 138 RRRMRKKKKVEK---EEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           R+++RKK++  +   E+EE +K   K+K +      K  + E ++ + 
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 34.5 bits (80), Expect = 0.020
 Identities = 12/46 (26%), Positives = 22/46 (47%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           LR++ RK +K  ++EE EK    K+        K    E ++ + +
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 35.3 bits (82), Expect = 0.013
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
            K+  +KE+E+EK K  KR     N  K++K     EE
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 32.6 bits (75), Expect = 0.10
 Identities = 15/48 (31%), Positives = 22/48 (45%)

Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
           K    S +   ++ +K EKE+E+ K KK  R  K     +K     EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 32.2 bits (74), Expect = 0.11
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
             K+ EK+E+E++K K K++ +   N  K++      EE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 31.1 bits (71), Expect = 0.28
 Identities = 11/39 (28%), Positives = 20/39 (51%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             K  +++E+EK+K   +K  R+  N  K+ +     EE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 34.3 bits (79), Expect = 0.018
 Identities = 13/41 (31%), Positives = 28/41 (68%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++K K +K++ ++KK K+K K     ++K +K +E+E E++
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 32.4 bits (74), Expect = 0.072
 Identities = 12/41 (29%), Positives = 28/41 (68%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + +KV+KE EE++K K K+K  +   +K K  ++++++++
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 31.6 bits (72), Expect = 0.14
 Identities = 13/47 (27%), Positives = 31/47 (65%)

Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +++   +K+K + ++++ KKKK+K K K+++    K  +++E+E E
Sbjct: 73  KVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 30.0 bits (68), Expect = 0.49
 Identities = 12/51 (23%), Positives = 30/51 (58%)

Query: 132 IYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           IY        +KKK++ +E E+ KK+  +++  +    K KK ++++++++
Sbjct: 52  IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 33.5 bits (77), Expect = 0.022
 Identities = 15/43 (34%), Positives = 29/43 (67%)

Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + ++++K  K+EE+  KK+ KR+ K+    KKKK ++  ++EE
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111



 Score = 32.3 bits (74), Expect = 0.063
 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 129 QHKIYTTSLRRRMRK-KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           + + +      + RK ++K  K+  + +KKK K+K K+      KK E+E  +
Sbjct: 64  EDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116



 Score = 28.9 bits (65), Expect = 0.83
 Identities = 16/48 (33%), Positives = 33/48 (68%), Gaps = 1/48 (2%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           T+ +R  R+KKK +K+++++K KK  +K ++  +   +++ +EEEE E
Sbjct: 83  TAKKRAKRQKKK-QKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGE 129



 Score = 26.9 bits (60), Expect = 4.0
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 5/50 (10%)

Query: 138 RRRMRKKKKVE-----KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++R  KK+K E     K  + +KKK+ K+K K+     KK+ +E  +  E
Sbjct: 70  QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.5 bits (76), Expect = 0.046
 Identities = 9/49 (18%), Positives = 23/49 (46%)

Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             S +   R ++K      + +++K ++K ++    KK   + E++  E
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 33.1 bits (76), Expect = 0.051
 Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 18/55 (32%)

Query: 126 LFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRK-------GKRNNNNKKKK 173
              QH ++            K++KEEE +K+ K +R+       G +   +KKK 
Sbjct: 260 FGQQHYVF-----------GKIDKEEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303


>gnl|CDD|182885 PRK10991, fucI, L-fucose isomerase; Provisional.
          Length = 588

 Score = 33.1 bits (76), Expect = 0.063
 Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 6/51 (11%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEEE 181
           T LRRR+   +K+  EEE E      KKN ++G+  N  + ++N E++   
Sbjct: 209 TELRRRI--DQKIYDEEELEMALAWAKKNCKEGEDENAEQYQRNAEQKRAV 257


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 33.0 bits (75), Expect = 0.075
 Identities = 9/49 (18%), Positives = 19/49 (38%)

Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           T     R ++K K +   ++E K ++            K+ EE+  +  
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 32.2 bits (74), Expect = 0.089
 Identities = 14/45 (31%), Positives = 24/45 (53%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +R  ++ KK E   E+EK  + +RK K     KK+K   ++ E +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 26.4 bits (59), Expect = 8.6
 Identities = 9/39 (23%), Positives = 21/39 (53%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           KK++   E+E+  +++ K K K     +K + ++ E + 
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 31.9 bits (73), Expect = 0.16
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
           KKK+ EK+ ++ KKKK ++K  +    K +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.1 bits (71), Expect = 0.32
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKK 172
           +KK++ +K ++ +KKKK K+KGK+     +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.1 bits (71), Expect = 0.32
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
            KK  +E++ +K+KK K++ K+    KKK 
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 30.3 bits (69), Expect = 0.49
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
            KKK E+++ +++KKK KRK K     KK +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 28.4 bits (64), Expect = 2.1
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
           + +  K  K ++EE++ +K+K K+K K+    +KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKR 165
           ++R  KK +  K++++ KKK  KRK K 
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 26.9 bits (60), Expect = 7.5
 Identities = 15/47 (31%), Positives = 27/47 (57%), Gaps = 5/47 (10%)

Query: 137 LRRRMRKKKKV-----EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
           L +R+ + K+      +K+ EE+K +K K+K KR    KK+K +  +
Sbjct: 368 LNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.5 bits (71), Expect = 0.19
 Identities = 12/46 (26%), Positives = 26/46 (56%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           L+++  +++KV +EEE+ +K++   +  R    K++  EE E    
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264



 Score = 31.2 bits (70), Expect = 0.29
 Identities = 12/45 (26%), Positives = 25/45 (55%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++R  ++K +E+EE+  K+++  RK +     ++ K E E    E
Sbjct: 221 KKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAE 265



 Score = 28.1 bits (62), Expect = 3.1
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNN 168
           K    +  R   + +K EKE E E+++K KR     NN
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171



 Score = 26.5 bits (58), Expect = 9.6
 Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE----EEEE 182
           K+++   E EE KKK+ +R+       +++K EE +    EEEE
Sbjct: 208 KQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEE 251


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 31.9 bits (73), Expect = 0.19
 Identities = 8/42 (19%), Positives = 21/42 (50%), Gaps = 2/42 (4%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNN--NNKKKKNEEEEEEEE 182
           + K  E+   + + K  +++ +R    NN++ +NE  +  + 
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDN 626



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 8/41 (19%), Positives = 22/41 (53%), Gaps = 3/41 (7%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNN---NKKKKNEEEEEEEE 182
           +   K E + ++++++RK ++NN    N+++   +     E
Sbjct: 591 QPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRRE 631


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 30.8 bits (70), Expect = 0.19
 Identities = 12/41 (29%), Positives = 27/41 (65%)

Query: 126 LFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRN 166
           L  + KI    L+ + ++KK+ +K+++++KKK +K+  K+ 
Sbjct: 94  LVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 25.8 bits (57), Expect = 9.0
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
           K+V KE+ ++KK+K K+K K+     KK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 31.9 bits (73), Expect = 0.19
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           + EKEE+E  K++ + +  +    KKKK  E+ E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 27.7 bits (62), Expect = 4.7
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 144 KKKVEKEEEE-EKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           K+++++E+EE E  K+ KR  K     +KKK E E+ E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 31.8 bits (72), Expect = 0.21
 Identities = 15/52 (28%), Positives = 24/52 (46%)

Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           Q K Y      R++  +K +  EE ++ K  K+ GK         +EE +EE
Sbjct: 428 QRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEVDEE 479


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 31.2 bits (71), Expect = 0.21
 Identities = 15/39 (38%), Positives = 27/39 (69%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
           ++  +KKK+V+KE++E+K KK K    + +  KKKK ++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 29.3 bits (66), Expect = 0.90
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + +K+ E EEEE+K+KK K++ K+    KK K E+  E + 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182



 Score = 28.5 bits (64), Expect = 1.5
 Identities = 13/39 (33%), Positives = 26/39 (66%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           +K+KK +KE ++EKK+K  +K K       KK ++++++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 27.8 bits (62), Expect = 2.6
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
            +    K  ++ E EE++KK K+K K     KK+K +++E+ 
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 26.6 bits (59), Expect = 6.2
 Identities = 10/40 (25%), Positives = 26/40 (65%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +K+   K E+E + ++ ++K K+     KK+ +E+++++E
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 30.3 bits (69), Expect = 0.22
 Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K + E   E+EKKKK KR GK+     + + E  +E  E
Sbjct: 70  KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAE 108



 Score = 30.3 bits (69), Expect = 0.29
 Identities = 11/38 (28%), Positives = 22/38 (57%), Gaps = 6/38 (15%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNE 175
           +RR+  + + E+ +E  +K+K  RK      N++KK +
Sbjct: 91  KRRIALRLRRERTKERAEKEKRTRK------NREKKFK 122



 Score = 29.2 bits (66), Expect = 0.66
 Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKN-KRKGKRNNNNKKK 172
           +  R+R+++  E+ E+E++ +KN ++K KR    K+K
Sbjct: 94  IALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 28.0 bits (63), Expect = 1.6
 Identities = 10/38 (26%), Positives = 17/38 (44%), Gaps = 7/38 (18%)

Query: 138 RRRMRKKKKVEKEEEE-------EKKKKNKRKGKRNNN 168
           R  +R +++  KE  E        ++KK KR+ K    
Sbjct: 93  RIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.9 bits (70), Expect = 0.23
 Identities = 12/40 (30%), Positives = 24/40 (60%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++    +E +EE+K+    + K +  + +K++EE EEE E
Sbjct: 57  QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96



 Score = 29.7 bits (67), Expect = 0.73
 Identities = 9/40 (22%), Positives = 25/40 (62%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + K   ++E+EE +++N+ + + +++  +K+ EE+ E   
Sbjct: 79  EDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNV 118



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 8/38 (21%), Positives = 22/38 (57%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
              +KE++ + +K+++   + N    ++ ++E E+E E
Sbjct: 74  NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 8/40 (20%), Positives = 23/40 (57%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + K +K + E++ ++++ + +  +     +NE+E EE+ 
Sbjct: 75  SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114



 Score = 27.8 bits (62), Expect = 2.7
 Identities = 13/46 (28%), Positives = 27/46 (58%), Gaps = 6/46 (13%)

Query: 143 KKKKVEKEEEEEK------KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +  ++E+ +EEEK       K++K   ++ +   +++NEEE+EE  
Sbjct: 58  ETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 31.6 bits (72), Expect = 0.24
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+    +EE+         G +NN  ++K N+   E E+
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELED 252



 Score = 28.5 bits (64), Expect = 2.1
 Identities = 10/39 (25%), Positives = 16/39 (41%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K K     +EEK   +      N NNK+++   +   E 
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNEL 250


>gnl|CDD|130161 TIGR01089, fucI, L-fucose isomerase.  This enzyme catalyzes the
           first step in fucose metabolism, and has been
           characterized in Escherichia coli and Bacteroides
           thetaiotaomicron [Energy metabolism, Sugars].
          Length = 587

 Score = 31.5 bits (71), Expect = 0.24
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEE 180
           T +RRR+   +K+  EEE E       K  + G+  NN + ++N E+   
Sbjct: 208 TEIRRRI--DQKIYDEEELEMALAWADKYCKYGEDENNKEYQRNAEQSRA 255


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.4 bits (71), Expect = 0.26
 Identities = 18/41 (43%), Positives = 26/41 (63%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            KKKK EK +EE K +K K + K     K+K+ E+E++ EE
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156



 Score = 30.2 bits (68), Expect = 0.61
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           +KKK+  KEE +++K K + K KR    K+K+ E++ EE 
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 29.9 bits (67), Expect = 0.71
 Identities = 12/48 (25%), Positives = 27/48 (56%)

Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
               + + + K++ +K++E+ K++   RK K     K+   E+E+E+E
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151



 Score = 29.5 bits (66), Expect = 1.0
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           KK +  ++ EEEKK++  R   R     KKK   +++E  
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191



 Score = 29.1 bits (65), Expect = 1.6
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
              K    +E + +  K + K K     +KKK +E+ +EE 
Sbjct: 88  PAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128



 Score = 28.7 bits (64), Expect = 1.8
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
              K+ EKE+E+ K++K K+K K     K +K +EE +E+
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140



 Score = 28.7 bits (64), Expect = 1.8
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            K K  KE + E  K+ +++ ++    KKKK E+ +EE +
Sbjct: 90  AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129



 Score = 27.9 bits (62), Expect = 3.2
 Identities = 8/42 (19%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 142 RKKKKVEKEEEEEKKKK-NKRKGKRNNNNKKKKNEEEEEEEE 182
            K+   EKE+E+EKK +  + + +     + +     ++  +
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180



 Score = 27.9 bits (62), Expect = 3.2
 Identities = 12/52 (23%), Positives = 25/52 (48%)

Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++     +    K K  ++ + E  K++ K K +     KKKK + +EE ++
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+VEK   +    K K   +  N + K++ +E+E+ +E
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115



 Score = 27.9 bits (62), Expect = 3.6
 Identities = 8/41 (19%), Positives = 23/41 (56%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           R  K+ EKE+E++ ++   R+ ++     + K+  ++  ++
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181



 Score = 27.5 bits (61), Expect = 4.5
 Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 138 RRRMRKKKKVEKEEEE-EKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +   K++K +K+E+  E+ K  K K +       K+ E+E+E++ 
Sbjct: 109 EKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154



 Score = 26.8 bits (59), Expect = 8.1
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+++ EKE+ +E+KKK K K K    ++K K E +E+   
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 26.4 bits (58), Expect = 9.8
 Identities = 9/41 (21%), Positives = 22/41 (53%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + K   E + E  K+++ +++  +    KKK+  +EE ++ 
Sbjct: 91  KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.4 bits (69), Expect = 0.26
 Identities = 15/45 (33%), Positives = 25/45 (55%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           R    ++K+ EKE E+E+K++ + K K       +K E E+  EE
Sbjct: 10  RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEE 54



 Score = 26.9 bits (60), Expect = 4.0
 Identities = 14/54 (25%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             +I     +R  ++++K E++E +E+K+K   KG      ++K+ EE E+ + 
Sbjct: 8   SGRIIDIEEKREEKEREK-EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 31.0 bits (70), Expect = 0.27
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
            +   +  KK +  K  +  K KK K+KG  +N + K    +  
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186


>gnl|CDD|223440 COG0363, NagB, 6-phosphogluconolactonase/Glucosamine-6-phosphate
          isomerase/deaminase [Carbohydrate transport and
          metabolism].
          Length = 238

 Score = 30.7 bits (70), Expect = 0.29
 Identities = 15/67 (22%), Positives = 20/67 (29%), Gaps = 14/67 (20%)

Query: 5  LHYTPSEDDIAHLLKEFTVDFLLKGY------------GRLVSQLYDLLVTDLHSQIDTS 52
          L      +++A    E   D L                G     LY+ LV     Q+D S
Sbjct: 3  LIIFEDAEELAKAAAEIIADKLQAAKAERGRAVLALSGGSTPLALYEALVKLPQGQLDWS 62

Query: 53 --HFFWL 57
              F L
Sbjct: 63 KVTIFNL 69


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.8 bits (70), Expect = 0.30
 Identities = 12/53 (22%), Positives = 23/53 (43%)

Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           ++ K      +++ +K+ K E E +   K   K K  +    K K NE+ +  
Sbjct: 66  SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118



 Score = 27.8 bits (62), Expect = 3.9
 Identities = 10/39 (25%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 145 KKVEKEEEEEKKKKNKR-KGKRNNNNKKKKNEEEEEEEE 182
               +EE +      K+ K +  NN  KKK+E+++++++
Sbjct: 40  STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78



 Score = 26.6 bits (59), Expect = 7.7
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 4/53 (7%)

Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
            ++K+ TTS     +K KK +K  E +KK + K+K K+     K + E +   
Sbjct: 45  EENKVATTS----TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.0 bits (70), Expect = 0.31
 Identities = 10/46 (21%), Positives = 19/46 (41%), Gaps = 2/46 (4%)

Query: 139 RRMRKKKKVEKEEEEEKKK--KNKRKGKRNNNNKKKKNEEEEEEEE 182
             ++K +  EK E + +KK    KR+      N++ + E       
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMAR 375


>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
          Length = 601

 Score = 31.1 bits (71), Expect = 0.32
 Identities = 12/48 (25%), Positives = 19/48 (39%), Gaps = 2/48 (4%)

Query: 13  DIAHLLKEFTVDFLLKGYGRLVSQLYDLLVTDLHSQIDTSHFFWLVTY 60
           D    L+  TV+ L K YG L+     + V      ++ +   W   Y
Sbjct: 167 DAEEWLERETVEELAKKYGSLLP--VPIRVEGEKGGVNETPPPWTRDY 212


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 30.7 bits (70), Expect = 0.33
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 5/68 (7%)

Query: 120 IHTMVRLFNQ-----HKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKN 174
           +  ++ L N       K+    L   + +K    +EEEEEK  K   + ++    +KK+ 
Sbjct: 231 LEDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEE 290

Query: 175 EEEEEEEE 182
           +++EE E 
Sbjct: 291 KKKEEREA 298


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.4 bits (69), Expect = 0.41
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
            K ++E+E  ++KKKK K+K K+NN          
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATV 206


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 30.2 bits (68), Expect = 0.42
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
           R+K      E+++K K  K K K   +  KKK  E 
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREG 116


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 30.6 bits (69), Expect = 0.43
 Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 14/55 (25%)

Query: 142 RKKKKVEK-EEEEEKKKKNKRK-------------GKRNNNNKKKKNEEEEEEEE 182
           RKKKK EK +E+E KK K  +K             G       +KK+ + + E+E
Sbjct: 20  RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDE 74



 Score = 27.9 bits (62), Expect = 3.2
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           +KK+  EEE E+KKK + K K     K K  ++E + +
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 30.6 bits (69), Expect = 0.44
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query: 130 HKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           H   T   R+++ + KKV   +      +NK +   N+NNK K+   +   + 
Sbjct: 15  HLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDV 67


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.4 bits (68), Expect = 0.50
 Identities = 10/41 (24%), Positives = 30/41 (73%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
           L    R ++++EK++ E+KK++ +++G+R +++   +++E+
Sbjct: 84  LEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.5 bits (68), Expect = 0.54
 Identities = 8/41 (19%), Positives = 19/41 (46%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +K  +  K E E    + +   ++    +KKK E +++ + 
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382



 Score = 30.5 bits (68), Expect = 0.56
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +K ++++K EEE K K  +   K   + KK +  ++ EE+E
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687



 Score = 30.1 bits (67), Expect = 0.67
 Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 129  QHKIYTTSLRRRMRKKKKVE---KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + KI    L++   +KKKVE   K+E EEKKK  + K     N  K   E ++ EE+
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673



 Score = 30.1 bits (67), Expect = 0.83
 Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 4/51 (7%)

Query: 136  SLRRRMRKKKKVE----KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +L++   + KK E    KE EE+KK +  +K +  N  K ++ ++E EE++
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query: 119  IIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
            I   M     + K+     ++    K K E+ ++ E++KK   + K+    +KKK EE +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653

Query: 179  EEEE 182
            + EE
Sbjct: 1654 KAEE 1657



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            +K ++ +K EE+EKK     K +     K ++ +++E EE
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714



 Score = 28.2 bits (62), Expect = 3.0
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 137  LRRRMRKKKKVEKEEEEEKKK----KNKRKGKRNNNNKKKKNEEEEEEEE 182
                  K +  EK++EE KKK    K K + K+  +  KKK EE++++ +
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD 1408



 Score = 28.2 bits (62), Expect = 3.0
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            KKK  E ++ EE +KK     K+     KK  E +++E E
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713



 Score = 28.2 bits (62), Expect = 3.3
 Identities = 12/47 (25%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 139  RRMRKKKKVEKEEEEEKKKKNKRKG---KRNNNNKKKKNEEEEEEEE 182
            ++  ++ K++  EE +K +++K+K    K+   ++KK  E  ++E E
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699



 Score = 28.2 bits (62), Expect = 3.6
 Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 138  RRRMRKKKKVE--KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            ++    KKK E  K+ EE++KK  +   K     KK +  +++E EE
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714



 Score = 27.8 bits (61), Expect = 3.8
 Identities = 13/41 (31%), Positives = 26/41 (63%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +K ++++K EEE K K  + K +   + KK +  +++EEE+
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756



 Score = 27.8 bits (61), Expect = 4.4
 Identities = 8/41 (19%), Positives = 15/41 (36%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             KK     + E E         +      +KK EE +++ +
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381



 Score = 27.8 bits (61), Expect = 4.7
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 144  KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            K   E ++ EE KKK +   K   + KK     ++E EE
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700



 Score = 27.4 bits (60), Expect = 5.6
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            +K  +  K+E EE KK  + K K     KK +  ++ EEE
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727



 Score = 27.4 bits (60), Expect = 5.8
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 142  RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +KK +  K+ EEE K K   + K+   +KKK  E ++ EE+
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686



 Score = 27.4 bits (60), Expect = 6.0
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            K  +  K+ EE+KKK  + K    +  K  +  ++E EE
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700



 Score = 27.4 bits (60), Expect = 6.3
 Identities = 9/42 (21%), Positives = 19/42 (45%)

Query: 141  MRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             +KK +  K+  E  K + +         ++K    E+++EE
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375



 Score = 27.0 bits (59), Expect = 6.6
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            KKK  E ++ +E KKK +   K+ +  KK    +++ +E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADE 1514



 Score = 27.0 bits (59), Expect = 6.6
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +KK  E  ++E ++ K   + K+    +KKK EE ++ EE
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726



 Score = 27.0 bits (59), Expect = 7.1
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            KKK  EK++ +E KKK +   K+ +  KK    +++ +E
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422



 Score = 27.0 bits (59), Expect = 7.2
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            KKK  E ++ +E KKK +   K+ +  KKK  E ++  E
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346



 Score = 27.0 bits (59), Expect = 7.7
 Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 143  KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            KKK  E ++ +E KKK +    +     KKK EE ++ +E
Sbjct: 1437 KKKAEEAKKADEAKKKAEEA--KKAEEAKKKAEEAKKADE 1474



 Score = 26.6 bits (58), Expect = 9.1
 Identities = 15/45 (33%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 142  RKKKKVEKEEEEEKKK----KNKRKGKRNNNNKKKKNEEEEEEEE 182
            +K  + +K+ EE+KKK    K     K+  +  KKK EE+++ +E
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 30.0 bits (67), Expect = 0.59
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKN 174
            S +  ++ +     +E +E+    KRK  + N ++KK N
Sbjct: 23  ASFKNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKKCN 62


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 30.0 bits (68), Expect = 0.60
 Identities = 14/54 (25%), Positives = 33/54 (61%), Gaps = 8/54 (14%)

Query: 137 LRRRMRKKKKVEKEEEEEK--------KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +R+++R  +++E+EEEE++         +  K + +++   +K K  ++EE+EE
Sbjct: 112 VRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 30.3 bits (68), Expect = 0.65
 Identities = 10/32 (31%), Positives = 22/32 (68%)

Query: 147 VEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
            +K  E+  ++K+K+K K+   NKKK +++++
Sbjct: 229 SDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKK 260



 Score = 28.8 bits (64), Expect = 1.6
 Identities = 6/48 (12%), Positives = 20/48 (41%)

Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           +   + +  +       +E+EE + +    G+  +   K    ++E++
Sbjct: 178 FQLMMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDD 225



 Score = 28.4 bits (63), Expect = 2.6
 Identities = 12/38 (31%), Positives = 23/38 (60%), Gaps = 3/38 (7%)

Query: 148 EKEEEEEKKKKNKRKG---KRNNNNKKKKNEEEEEEEE 182
           E  +EE+ KKK K+     K+ +++KK K   +++ +E
Sbjct: 234 EDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271



 Score = 28.0 bits (62), Expect = 3.1
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +++K E+E    KK K  +K K   N   K + +  ++ +
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363



 Score = 28.0 bits (62), Expect = 3.2
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 145 KKVEKEEEEEKKKKNK-----RKGKRNNNNKKKKNEEEEEEEE 182
           ++ E  EE E++K  +     +KGK+    K KKN  ++++ +
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357


>gnl|CDD|150820 pfam10204, DuoxA, Dual oxidase maturation factor.  DuoxA (Dual
           oxidase maturation factor) is the essential protein
           necessary for the final release of DUOX2 (an NADPH:O2
           oxidoreductase flavoprotein) from the endoplasmic
           reticulum. Dual oxidases (DUOX1 and DUOX2) constitute
           the catalytic core of the hydrogen peroxide generator,
           which generates H2O2 at the apical membrane of thyroid
           follicular cells, essential for iodination of
           thyroglobulin by thyroid peroxidases. DuoxA carries five
           membrane-integral regions including a reverse
           signal-anchor with external N-terminus (type III) and
           two N-glycosylation sites. It is conserved from
           nematodes to humans.
          Length = 281

 Score = 29.8 bits (67), Expect = 0.70
 Identities = 12/62 (19%), Positives = 21/62 (33%), Gaps = 4/62 (6%)

Query: 52  SHFFWLVTYFLKFVVQLELDIDHVSS----VISFDIISYLSFEGINLCEHLQIVKQHKGI 107
           S  FWLV   L   +   +   + +S            Y +F    L   + +     G+
Sbjct: 41  SRLFWLVRVLLSLFIGAVILAVNFTSDWHVARVTTKAPYKAFSRSKLPARVGLHIGLMGV 100

Query: 108 NL 109
           N+
Sbjct: 101 NI 102


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 29.8 bits (67), Expect = 0.78
 Identities = 8/44 (18%), Positives = 15/44 (34%)

Query: 137  LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
              +  R   + E    ++  ++  +K       KK    E  EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222



 Score = 28.3 bits (63), Expect = 3.2
 Identities = 9/44 (20%), Positives = 17/44 (38%)

Query: 138  RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            R ++++     +    +K  +   K        KK +E E  EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222



 Score = 27.5 bits (61), Expect = 4.4
 Identities = 11/44 (25%), Positives = 14/44 (31%)

Query: 139  RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            R   ++     E    KK   +   K       KK  E E  EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 29.7 bits (67), Expect = 0.84
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+VEKE  +E++++ K + K        K EE +EEEE
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEE 66



 Score = 27.4 bits (61), Expect = 4.4
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
           ++K  +KEEE +++++ + K K+    K+   E E 
Sbjct: 51  EEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 27.4 bits (61), Expect = 5.4
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 7/40 (17%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K    E+E +EE++K+ K+K       K KK +E   E E
Sbjct: 53  KTTDKEEEVDEEEEKEEKKK-------KTKKVKETTTEWE 85


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.4 bits (66), Expect = 0.85
 Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 4/44 (9%)

Query: 143 KKKKVEKEEEEEKKKK----NKRKGKRNNNNKKKKNEEEEEEEE 182
           KK K+ K ++  KK       K    R     +KK  E      
Sbjct: 117 KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160


>gnl|CDD|129780 TIGR00697, TIGR00697, conserved hypothetical integral membrane
           protein.  All known members of this family are proteins
           or 210-250 amino acids in length. Conserved regions of
           hydrophobicity suggest that all members of the family
           are integral membrane proteins [Hypothetical proteins,
           Conserved].
          Length = 202

 Score = 29.2 bits (66), Expect = 0.86
 Identities = 29/117 (24%), Positives = 41/117 (35%), Gaps = 31/117 (26%)

Query: 1   MKALLHYTPSEDDIA--HLLKEFTVDFL-----LKGYGRLVSQLYDLLVTD--------- 44
            +  L + PS  D +  H    F+         L  Y  +VSQL D+ V           
Sbjct: 76  TQLHLFFIPSPGDESQTHFEALFSSSPRIALASLVAY--IVSQLLDVKVFTFLKKRTPGR 133

Query: 45  ----------LHSQ-IDTSHFFWLVTYF-LKFVVQLELDIDHVSSVISFDIISYLSF 89
                     L SQ IDT  F ++  Y    F  +  L+I   S +I    I  L+ 
Sbjct: 134 FLWLRSNGSTLFSQLIDTFIFSFIAFYGSPPFPAENLLNIMIFSYLIKG-TIGVLAT 189


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 28.0 bits (63), Expect = 0.89
 Identities = 9/13 (69%), Positives = 12/13 (92%)

Query: 170 KKKKNEEEEEEEE 182
           +++K EEEEEEEE
Sbjct: 67  EEEKKEEEEEEEE 79



 Score = 27.2 bits (61), Expect = 1.6
 Identities = 9/13 (69%), Positives = 12/13 (92%)

Query: 170 KKKKNEEEEEEEE 182
           ++KK EEEEEEE+
Sbjct: 68  EEKKEEEEEEEED 80



 Score = 26.8 bits (60), Expect = 2.3
 Identities = 8/13 (61%), Positives = 12/13 (92%)

Query: 170 KKKKNEEEEEEEE 182
           +KK+ EEEEEE++
Sbjct: 69  EKKEEEEEEEEDD 81


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 29.0 bits (65), Expect = 0.98
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNN 168
           +K KK + E+++E+KKK K K K+   
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 28.3 bits (63), Expect = 1.7
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKRN 166
           +  KKKK E ++E +KKKK K+K K+ 
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.4 bits (67), Expect = 1.1
 Identities = 10/44 (22%), Positives = 29/44 (65%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           +R ++K++ ++++ E  +K++ + + K     +K++  E++EEE
Sbjct: 89  KRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132



 Score = 27.0 bits (61), Expect = 6.2
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           RK + +EK EEE +KK+ + + K+    KK++  EE  EE+
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
           KK EKEE E +K+  K         + KK EE  E +
Sbjct: 79  KKNEKEERELRKRAEKEA-----LEQAKKEEELREAK 110


>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase.  The valyl-tRNA synthetase
           (ValS) is a class I amino acyl-tRNA ligase and is
           particularly closely related to the isoleucyl tRNA
           synthetase [Protein synthesis, tRNA aminoacylation].
          Length = 861

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +++K+ ++EKK+  + +GK  N    KK  +E  E+E
Sbjct: 819 ARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKE 856


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 11/54 (20%)

Query: 140 RMRKKKKVEKEEEEEKKKK--NKRKGK-------RNNNNKKKKNEEE--EEEEE 182
           R R+  ++E+ +E  +KKK     K +        N N KK+K + +  +E E+
Sbjct: 118 RERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQ 171


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 27.6 bits (62), Expect = 1.3
 Identities = 11/31 (35%), Positives = 21/31 (67%), Gaps = 2/31 (6%)

Query: 136 SLRRRMRKKKKVEKEEE--EEKKKKNKRKGK 164
           + R++ +KKK  +K +E    KK++ +R+GK
Sbjct: 39  NRRKKKKKKKLKKKSKEWILRKKEQMRRRGK 69


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++K+ EK +   K K  K    +    +K K E+EE+   
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 10/40 (25%), Positives = 19/40 (47%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +K++ + +   + K K   K K     K K+ +EE+   E
Sbjct: 44  EKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + ++ E+E+ +   K   +K  +    +K+K + E+EE+ 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80



 Score = 26.9 bits (60), Expect = 5.2
 Identities = 9/40 (22%), Positives = 17/40 (42%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+++  K   + K KK  +         K++ EE+   E 
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 26.5 bits (59), Expect = 8.1
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
           K K EK  +  KKKK K K K N       +++
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 26.2 bits (58), Expect = 9.6
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +++K +   + + KK  K K +     K++K E+   E E
Sbjct: 46  EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 13/41 (31%), Positives = 22/41 (53%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + K++ +K+ EEE K K   + K+     KKK E E + + 
Sbjct: 143 KAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183



 Score = 28.3 bits (63), Expect = 2.7
 Identities = 11/46 (23%), Positives = 20/46 (43%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + +   + K + E E EKK K + K +     K K   E +++  
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAA 169



 Score = 26.7 bits (59), Expect = 7.9
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 142 RKKKKVEKEEEEEKKKKNK-RKGKRNNNNKKKKNEEEEEEEE 182
           ++ ++ +K+ EE K K+    K K     +KK  EE +++ E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAE 153


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.8 bits (65), Expect = 1.4
 Identities = 20/56 (35%), Positives = 38/56 (67%), Gaps = 4/56 (7%)

Query: 124 VRLFNQHKIYTTSLRRRMRKKKKVEKE----EEEEKKKKNKRKGKRNNNNKKKKNE 175
           V++ +  K+   +L+R+ ++KKK +KE    +E+ +KKK +R+ KR  N KK+K++
Sbjct: 135 VKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDD 190



 Score = 26.1 bits (58), Expect = 9.9
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +++  K E+ + +KK KRK  +   + +K   EE + EE
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + E   K  + KRK K   N +K++ E E E +E
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKE 298



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRK 162
           K  T + R + +++K++E+E +EEK+ K K  
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 28.0 bits (62), Expect = 1.5
 Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K+K E  EE+E  K     G  N  N  ++ ++E EEEE
Sbjct: 17  KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEE 55



 Score = 26.8 bits (59), Expect = 3.5
 Identities = 13/42 (30%), Positives = 22/42 (52%)

Query: 141 MRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +++KK+V +E+E  K           N  ++  +E EEEEE 
Sbjct: 16  LKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEV 57


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 27.7 bits (62), Expect = 1.5
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 12/35 (34%)

Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
               EEE++++            +++ EEEEE EE
Sbjct: 72  AAAAEEEEEEE------------EEEEEEEEESEE 94


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 29.2 bits (65), Expect = 1.6
 Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 9/35 (25%)

Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           E+EEEEE++++ +         ++++ EEEEEEEE
Sbjct: 864 EEEEEEEEEEEEE---------EEEEEEEEEEEEE 889


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.2 bits (63), Expect = 1.6
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           + E+E+  E K ++K K KR     K + E+ + E
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 28.0 bits (63), Expect = 1.7
 Identities = 10/45 (22%), Positives = 20/45 (44%), Gaps = 8/45 (17%)

Query: 143 KKKKVEKEEEE--------EKKKKNKRKGKRNNNNKKKKNEEEEE 179
              K + E+EE        +++   +RK +R    K++  +  EE
Sbjct: 52  SDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREE 96



 Score = 26.1 bits (58), Expect = 7.8
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 131 KIYTTSLRRRM--RKKKKVEKEEEEEKKKKNKRKGK 164
           +IY  S+  RM  R+KK+ E++E+EE +K  +   K
Sbjct: 65  RIYE-SIDERMDERRKKRREQKEKEEIEKYREENPK 99


>gnl|CDD|178689 PLN03144, PLN03144, Carbon catabolite repressor protein 4 homolog;
           Provisional.
          Length = 606

 Score = 28.9 bits (65), Expect = 1.8
 Identities = 8/13 (61%), Positives = 11/13 (84%)

Query: 7   YTPSEDDIAHLLK 19
           YTP+ DD+ H+LK
Sbjct: 183 YTPTADDVGHVLK 195


>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
           hydroxylase/glutamate synthase subunit beta;
           Provisional.
          Length = 752

 Score = 28.9 bits (65), Expect = 1.9
 Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 12/66 (18%)

Query: 127 FNQHKIYTTSLRRRMRKKKKVEKEE-----------EEEKKKKNKRKGKRNNNNKKKKNE 175
           F+ H +    + +RM   K +E EE             E +K  K K +R    +    E
Sbjct: 243 FDGHLVDFDEMLKRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPK-ERTAIERVPMPE 301

Query: 176 EEEEEE 181
            + E  
Sbjct: 302 LDPEYR 307


>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region.  Family of
           proteins with a conserved region found in PAPA-1, a
           PAP-1 binding protein.
          Length = 89

 Score = 27.3 bits (61), Expect = 1.9
 Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 5/46 (10%)

Query: 142 RKKKKVEKEEEEEKKKKNKR-----KGKRNNNNKKKKNEEEEEEEE 182
           R+K + EK  EEEK +   +       ++   +K KK +E++ +E+
Sbjct: 12  RRKNQSEKRAEEEKMETINKLLKKQARRKKREDKIKKGDEKKAQEK 57


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 28.6 bits (64), Expect = 2.2
 Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 7/39 (17%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            ++ + EEE E+K+K KRK       + KK  E+E+E+E
Sbjct: 278 GERSDSEEETEEKEKEKRK-------RLKKMMEDEDEDE 309


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 28.6 bits (64), Expect = 2.2
 Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 7/64 (10%)

Query: 125 RLFNQHKIYT------TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
           RL N   +          L + +  + K +K + ++K KK  +K +   ++ K   EE E
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKK-SKKKEAKESSDKDDEEESE 176

Query: 179 EEEE 182
            E+E
Sbjct: 177 SEDE 180


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 28.6 bits (65), Expect = 2.3
 Identities = 8/34 (23%), Positives = 15/34 (44%)

Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
           +++ EEE    K+           +K+K  E+ E
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|239618 cd03556, L-fucose_isomerase, L-fucose isomerase (FucIase); FucIase
           converts L-fucose, an aldohexose, to its ketose form,
           which prepares it for aldol cleavage (similar to the
           isomerization of glucose during glycolysis). L-fucose
           (or 6-deoxy-L-galactose) is found in blood group
           determinants as well as in various oligo- and
           polysaccharides, and glycosides in mammals, bacteria and
           plants.
          Length = 584

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 6/51 (11%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEEE 181
           T + RRM   +++   EE EK     K+N ++G+  N    +K  E+++++
Sbjct: 205 TEIIRRM--DEEIYDPEEYEKALAWTKENCKEGEDENEKNDQKKREQKDKD 253


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 28.5 bits (63), Expect = 2.4
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            + +   KK+KV++EEEE+K K      K+    KK K    ++EE+
Sbjct: 524 DVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQ 570



 Score = 26.6 bits (58), Expect = 9.5
 Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 6/47 (12%)

Query: 139 RRMRKKKKVEKEEEEEKKKK------NKRKGKRNNNNKKKKNEEEEE 179
            + + KK+   EEEEEKK K       ++K  +       K EE+ E
Sbjct: 526 NKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAE 572


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 2.6
 Identities = 10/46 (21%), Positives = 25/46 (54%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           L++ +   + +   + + K K  +   K +   K+KK E+++EE++
Sbjct: 296 LKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 27.8 bits (62), Expect = 2.6
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 7/50 (14%)

Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKR-------NNNNKKKKNEEEEEEEE 182
           + ++K  +  EEEE+  +K     K+       + + + +K+EEEEEEEE
Sbjct: 134 KFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE 183


>gnl|CDD|220644 pfam10238, Eapp_C, E2F-associated phosphoprotein.  This entry
           represents the conserved C-terminal portion of an E2F
           binding protein. E2F transcription factors play an
           essential role in cell proliferation and apoptosis and
           their activity is frequently deregulated in human
           cancers. E2F activity is regulated by a variety of
           mechanisms, frequently mediated by proteins binding to
           individual members or a subgroup of the family. EAPP
           interacts with a subset of E2F factors and influences
           E2F-dependent promoter activity. EAPP is present
           throughout the cell cycle but disappears during mitosis.
          Length = 128

 Score = 27.3 bits (61), Expect = 2.6
 Identities = 13/54 (24%), Positives = 23/54 (42%)

Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +H+IY    R       KV++E+    K  N++K +RN+   +         E 
Sbjct: 45  RHEIYKNQYRAMFAINCKVDEEKVLYSKPINRKKRRRNSKKNELNPNNTATSEI 98


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           + R ++++  E+E EE KK + KR+G+R    + ++  E+++EEE
Sbjct: 15  QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59


>gnl|CDD|219543 pfam07733, DNA_pol3_alpha, Bacterial DNA polymerase III alpha
           subunit. 
          Length = 384

 Score = 28.2 bits (64), Expect = 2.7
 Identities = 11/19 (57%), Positives = 12/19 (63%)

Query: 137 LRRRMRKKKKVEKEEEEEK 155
           LRR M KKK  E E+  EK
Sbjct: 357 LRRAMGKKKPEEMEKLREK 375


>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245).  This is
           a family of proteins conserved in fungi. The function is
           not known, and there is no S. cerevisiae member.
          Length = 145

 Score = 27.6 bits (61), Expect = 2.8
 Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 7/53 (13%)

Query: 137 LRRRMRKK--KKVEKEEEEEKKKKNKRKGKR-----NNNNKKKKNEEEEEEEE 182
           LRR++  K  KKV  E+E+ +    KRK          ++K   +++++++E+
Sbjct: 76  LRRQLLGKNYKKVMAEKEKAEGGPVKRKAAVVAKEAKQSSKGVGDDDDDDDED 128


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 27.7 bits (61), Expect = 2.9
 Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 138 RRRMRKKKKVEKEEEEEK--KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +RR +KK K +K E+  K  +K      + + N +K+  EE +  EE
Sbjct: 87  KRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEE 133


>gnl|CDD|236637 PRK09970, PRK09970, xanthine dehydrogenase subunit XdhA;
           Provisional.
          Length = 759

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 12/64 (18%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 105 KGINLDPY-VRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKV----EKEEEEEKKKKN 159
             + +DP   R  +          +  +IY+  L   + K +K+    ++  E + ++ N
Sbjct: 378 TALGIDPVEFRLRNAAREGDANPLSGKRIYSAGLPECLEKGRKIFEWDKRRAECKNQQGN 437

Query: 160 KRKG 163
            R+G
Sbjct: 438 LRRG 441


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 27.4 bits (61), Expect = 3.1
 Identities = 9/44 (20%), Positives = 22/44 (50%)

Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           ++ +KK+     +    KK  K+  ++    +KK   ++++ EE
Sbjct: 49  KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEE 92



 Score = 27.4 bits (61), Expect = 3.6
 Identities = 12/47 (25%), Positives = 24/47 (51%)

Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           S ++  +K+    ++    KK K K K K     KK ++++++ EE 
Sbjct: 47  SPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEEN 93


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 27.8 bits (62), Expect = 3.3
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           K K+  K E++ KK K K K K     K K   + ++   
Sbjct: 83  KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS 122


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 27.4 bits (61), Expect = 3.8
 Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 141 MRKKKKVEKEEEEEKKKKN--KRKGKRNNNNKKKKNEEEEEEE 181
           +RKK+K +KE+ + +KK++  ++ G   + +K  K     + E
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSE 203



 Score = 26.6 bits (59), Expect = 7.6
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 151 EEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           EE EK ++ ++K K    +     + E+E+EE
Sbjct: 24  EEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55


>gnl|CDD|233536 TIGR01695, mviN, integral membrane protein MviN.  This model
           represents MviN, a family of integral membrane proteins
           predicted to have ten or more transmembrane regions.
           Although frequently listed as a virulence protein, it is
           not restricted to pathogens and it is an essential
           protein in Sinorhizobium meliloti. In a number of
           species its gene is adjacent to that of the
           uridylyltransferase GlnD , the signal-transducing enzyme
           that performs the key modification to the nitrogen
           regulatory protein PII [Unknown function, General].
          Length = 502

 Score = 27.7 bits (62), Expect = 4.4
 Identities = 23/75 (30%), Positives = 33/75 (44%), Gaps = 3/75 (4%)

Query: 18  LKEFTVDFLLKGYGRLVSQLYDLLVTDLHSQIDTSHFFWLVTYFLKFVVQLELDIDHVS- 76
           LK F   FL    G   SQ+  L+ T L S ++      L  Y+   + QL L I  +S 
Sbjct: 222 LKRFLKLFLPTTLGSSASQITLLINTALASFLEIGSVSAL--YYANRIYQLPLGIFGISL 279

Query: 77  SVISFDIISYLSFEG 91
           S +    +S  + EG
Sbjct: 280 STVLLPKLSRHASEG 294


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score = 27.4 bits (61), Expect = 4.5
 Identities = 27/128 (21%), Positives = 44/128 (34%), Gaps = 21/128 (16%)

Query: 65  VVQLELDIDHVSSVISFDI----ISYLSFEG-----------INLCEHLQIVKQHKGINL 109
           V    LDI  VS VI++D+      Y+   G           I+     + VK+ K I  
Sbjct: 332 VAARGLDIPDVSHVINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKRI-- 389

Query: 110 DPYVRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNN 169
               +RL   + + V L    +     L +  R   + E +  +E KK    K       
Sbjct: 390 ---EKRLERKLPSAVLLPLD-EPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGL 445

Query: 170 KKKKNEEE 177
             +    +
Sbjct: 446 GVRFTLSK 453


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 26.3 bits (58), Expect = 4.5
 Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 11/35 (31%)

Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
             EE+ E+K++           +KKK EE+EEEEE
Sbjct: 73  AAEEKAEEKEE-----------EKKKEEEKEEEEE 96


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 27.5 bits (61), Expect = 4.8
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKR 165
               +  +  KEE E+K+K  K+  KR
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKR 367


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 26.6 bits (59), Expect = 4.9
 Identities = 12/45 (26%), Positives = 20/45 (44%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
             R  ++++K+E  EE+ KK   +      N N  +   EE   E
Sbjct: 6   EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNRLEEGTVE 50


>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein.  Members of this family are
           part of the Paf1/RNA polymerase II complex. The Paf1
           complex probably functions during the elongation phase
           of transcription. The Leo1 subunit of the yeast
           Paf1-complex binds RNA and contributes to complex
           recruitment. The subunit acts by co-ordinating
           co-transcriptional chromain modifications and helping
           recruitment of mRNA 3prime-end processing factors.
          Length = 312

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 16/68 (23%)

Query: 128 NQHKIYTTSLRRRMRKKKKV----------------EKEEEEEKKKKNKRKGKRNNNNKK 171
             H+  T +L  R RK   V                EK+EE++ + + +R+ +    NK 
Sbjct: 207 FTHRKLTRNLATRQRKSGGVKITIVGKDPEHEKREREKKEEQKLRARRRRQNREKMKNKP 266

Query: 172 KKNEEEEE 179
                   
Sbjct: 267 PNRPGHGS 274


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 27.3 bits (61), Expect = 5.0
 Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
           R +KK + +EE +KKK  + KG +     KK   +
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVK---ALKKVVAK 278


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.1 bits (60), Expect = 5.1
 Identities = 13/35 (37%), Positives = 17/35 (48%)

Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
           K+V +  E EK K NK KG     NK    ++  E
Sbjct: 10  KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIE 44


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 27.0 bits (60), Expect = 5.1
 Identities = 9/45 (20%), Positives = 23/45 (51%), Gaps = 4/45 (8%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            +++  ++++VE+  E   +   + + K     K K+ ++ E EE
Sbjct: 120 QKQKEEEERRVERRRELGLEDPEQLRLK----QKAKEEQKAESEE 160


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 27.5 bits (61), Expect = 5.2
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 3/54 (5%)

Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNK---RKGKRNNNNKKKKNEEEEEEE 181
           K Y   L+   +K  +V+ E +  ++ + K    K K+  N KK+K + +E++E
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 26.1 bits (58), Expect = 5.3
 Identities = 8/34 (23%), Positives = 17/34 (50%), Gaps = 11/34 (32%)

Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
               EE+K+++           ++++ +EE EEE
Sbjct: 74  AAAAEEKKEEE-----------EEEEEKEESEEE 96


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 27.5 bits (61), Expect = 5.6
 Identities = 12/52 (23%), Positives = 22/52 (42%)

Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
           N  K +   L   + +K K + E+ ++K   NK   K+       K+E  + 
Sbjct: 101 NNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 26.6 bits (59), Expect = 5.6
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +  R K K +KEE E++K+K +R  K     +K++ E E+++ E
Sbjct: 70  EKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114



 Score = 26.6 bits (59), Expect = 5.7
 Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKG----KRNNNNKKKKNEEEEEEEE 182
           LRR   K KK E E+E+EK+++  +      K     +KKK E +  +EE
Sbjct: 72  LRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEE 121


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 27.0 bits (60), Expect = 5.9
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 149 KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
              E  K KK K+K K+    +K  +E+E E E 
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEA 301


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 27.4 bits (61), Expect = 6.0
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
               RR++ KK   E +     +KK K K  + +  K KK   +
Sbjct: 750 KGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAK 793


>gnl|CDD|233534 TIGR01693, UTase_glnD, [Protein-PII] uridylyltransferase.  This
          model describes GlnD, the
          uridylyltransferase/uridylyl-removing enzyme for the
          nitrogen regulatory protein PII. Not all homologs of
          PII share the property of uridylyltransferase
          modification on the characteristic Tyr residue (see
          Prosite pattern PS00496 and document PDOC00439), but
          the modification site is preserved in the PII homolog
          of all species with a member of this family [Central
          intermediary metabolism, Nitrogen metabolism,
          Regulatory functions, Protein interactions].
          Length = 850

 Score = 27.4 bits (61), Expect = 6.2
 Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 7/50 (14%)

Query: 29 GYGR----LVSQLYDLLVTDLHSQIDTSHFFWLVTYFLKFVVQLELDIDH 74
          GYGR      S + DLL   LH           +  FL  +  L  ++ H
Sbjct: 51 GYGRGELAPYSDI-DLLF--LHDGKPAEEVEPKIERFLYPLWDLGFEVGH 97


>gnl|CDD|224652 COG1738, yhhQ, Uncharacterized member of the PurR regulon [General
           function prediction only].
          Length = 233

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 13/103 (12%)

Query: 1   MKALLHYTPSEDDIA--HLLKEFTVDFLL---KGYGRLVSQLYDLLVTDLHSQIDTSHFF 55
            +  LH+ PS  D A   L   F+    +        +VSQL D+ V +   +       
Sbjct: 99  TQIALHFPPSGSDEAQEALAALFSFVPRIALASLLAYIVSQLLDVWVFNRLKRRTGGKSL 158

Query: 56  WLVTYFLKFVVQLELDIDHVSSVISFDIISYLSFEG--INLCE 96
           WL       V QL   ID   +VI F I  Y+S +G      E
Sbjct: 159 WLRPNASTLVSQL---ID---TVIFFFIAFYVSPDGFMAEWIE 195


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 149 KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
           + EE+EK+KK K++GK       +    EE E
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 27.1 bits (60), Expect = 6.3
 Identities = 13/61 (21%), Positives = 35/61 (57%)

Query: 122 TMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
            +V  +N+ +    S +R   ++KK E+++ EE ++K   + +R    +K++   +E+++
Sbjct: 59  AVVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKK 118

Query: 182 E 182
           +
Sbjct: 119 Q 119


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 26.9 bits (59), Expect = 6.4
 Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 20/64 (31%)

Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNN--------------------NKKKKNEEEE 178
            + +     E++E+  + +K +  G R                       +K+K NEE+E
Sbjct: 166 EKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEEDE 225

Query: 179 EEEE 182
           E+ E
Sbjct: 226 EDAE 229


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 27.2 bits (60), Expect = 6.5
 Identities = 9/47 (19%), Positives = 22/47 (46%)

Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            +     +K K ++EE    +++ K + +     +    +EE+ +EE
Sbjct: 788 KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEE 834


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 27.1 bits (60), Expect = 6.9
 Identities = 13/26 (50%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKR 165
             RKKKK  K+ +E+KKKKN+   K+
Sbjct: 682 GSRKKKKK-KKSKEKKKKKNREASKQ 706


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 25.8 bits (57), Expect = 6.9
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 150 EEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           EEEE+ + + K+  +   N +    EEE EEEE
Sbjct: 43  EEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEE 75


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 25.7 bits (57), Expect = 7.6
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 137 LRRRMRKKKKVEKEEEEE-KKKKNKRKGKRNNNNKK 171
           L  +  K+ K + +E     KKK KRK K+  +   
Sbjct: 29  LEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 26.0 bits (58), Expect = 7.6
 Identities = 10/41 (24%), Positives = 22/41 (53%), Gaps = 2/41 (4%)

Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGK--RNNNNKKK 172
             +L+   +K +  ++ + E+K ++NK  G+     N KK+
Sbjct: 71  QKALQAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 26.8 bits (60), Expect = 7.8
 Identities = 13/45 (28%), Positives = 20/45 (44%)

Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
                K+ + EKE  E +KK  ++K K+     K +    EE  E
Sbjct: 74  YEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118


>gnl|CDD|233221 TIGR00985, 3a0801s04tom, mitochondrial import receptor subunit
           translocase of outer membrane 20 kDa subunit.
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 148

 Score = 26.0 bits (57), Expect = 7.9
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKK 158
           +   LRRR +K+   EK+     K+K
Sbjct: 37  FRKKLRRRRKKQAGAEKQYGGLAKEK 62


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 26.8 bits (60), Expect = 8.3
 Identities = 13/50 (26%), Positives = 22/50 (44%)

Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           Y  S    +  K +     E E +   K + ++  NN  +  + EE+EEE
Sbjct: 829 YKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE 878


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
           KK+K  +E   +  K+ K K K N   +K  
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKIY 35


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 26.6 bits (59), Expect = 8.7
 Identities = 14/40 (35%), Positives = 26/40 (65%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
            K+K E +EEEE + + + K ++  N   K++E+ ++EEE
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 26.6 bits (59), Expect = 8.9
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 147 VEKEEEEEKKKKNKRKGKRNNNNKKKK 173
            E+ E   +KKK + K K + N+KK +
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLR 172


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 26.0 bits (58), Expect = 9.2
 Identities = 10/45 (22%), Positives = 23/45 (51%), Gaps = 3/45 (6%)

Query: 139 RRMRKKKKVEKEEEEEKK---KKNKRKGKRNNNNKKKKNEEEEEE 180
           + +++  +    +++E K   K  K+  K     +KKK ++EE +
Sbjct: 52  KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 26.0 bits (58), Expect = 9.3
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
            K  EK E++ +K K K + K+    + K 
Sbjct: 68  LKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 26.0 bits (58), Expect = 9.7
 Identities = 10/53 (18%), Positives = 23/53 (43%)

Query: 113 VRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKR 165
           +  L+     +        +    L+   + +KK EK + + +KKK K++  +
Sbjct: 44  ITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 26.2 bits (57), Expect = 9.6
 Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 9/73 (12%)

Query: 110 DPYVRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNN 169
            P+  RLH          N  ++  +SL      + K  K ++E+   K +   + +   
Sbjct: 166 QPWPERLH---------NNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTE 216

Query: 170 KKKKNEEEEEEEE 182
           +K+  EE+EEEE 
Sbjct: 217 EKQPQEEQEEEEV 229


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 26.3 bits (58), Expect = 9.7
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           KK +++  E   K  K     + + NN  K N E ++EE 
Sbjct: 36  KKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEEN 75


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 26.3 bits (59), Expect = 9.8
 Identities = 7/40 (17%), Positives = 19/40 (47%)

Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
           + M+++ K    +++E K  +K + K+     K    + +
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 26.4 bits (58), Expect = 9.9
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 153 EEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
           +E+ + +  K   N  +KKK+ EE+  EE+
Sbjct: 13  DEQSQISPEKQTPNKKDKKKEEEEQLSEED 42


>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
           This family represents homologues of COX16 which has
           been shown to be involved in assembly of cytochrome
           oxidase. Protein in this family are typically between
           106 and 134 amino acids in length.
          Length = 79

 Score = 24.8 bits (55), Expect = 10.0
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRK 162
           T LR   R +K  + EEEE  K   KR+
Sbjct: 22  TQLRYERRDRKVQQLEEEEALKLLKKRR 49


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 26.2 bits (58), Expect = 10.0
 Identities = 9/21 (42%), Positives = 17/21 (80%)

Query: 138 RRRMRKKKKVEKEEEEEKKKK 158
           R   R+K++ +++EEEE+K+K
Sbjct: 199 REEERRKQRKKQQEEEERKQK 219


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.137    0.386 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,641,424
Number of extensions: 937985
Number of successful extensions: 6644
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5369
Number of HSP's successfully gapped: 759
Length of query: 182
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 91
Effective length of database: 6,901,388
Effective search space: 628026308
Effective search space used: 628026308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.2 bits)