RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6462
         (234 letters)



>gnl|CDD|114751 pfam06047, SynMuv_product, Ras-induced vulval development
           antagonist.  This family is from synthetic multi-vulval
           genes which encode chromatin-associated proteins
           involved in transcriptional repression. This protein has
           a role in antagonising Ras-induced vulval development.
          Length = 105

 Score =  175 bits (445), Expect = 1e-56
 Identities = 79/103 (76%), Positives = 90/103 (87%)

Query: 126 QKDYGKALLPGEGAAMAAYVAEGKRIPRRGEIGLTSDEIERYEAVGYVMSGSRHRRMEAV 185
              YG ALLPGEGAAMA YV +GKRIPRRGEIGLTS+EIE +E VGYVMSGSRH+RM AV
Sbjct: 3   PLSYGGALLPGEGAAMAQYVQKGKRIPRRGEIGLTSEEIENFEKVGYVMSGSRHKRMNAV 62

Query: 186 RIRKENQIYSADEKRALAMFSKEERQKRENRILTQFKEMVSSK 228
           RIRKENQ+YSA+EKRALAMF+ EER KREN+IL+QFKE++  K
Sbjct: 63  RIRKENQVYSAEEKRALAMFNYEERAKRENKILSQFKELIDRK 105


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 37.8 bits (88), Expect = 0.001
 Identities = 23/53 (43%), Positives = 29/53 (54%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
           RK    S     KK KK +H   +  K   +KKR++   SSSSSS SSSSS +
Sbjct: 68  RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120



 Score = 30.5 bits (69), Expect = 0.29
 Identities = 23/62 (37%), Positives = 28/62 (45%), Gaps = 14/62 (22%)

Query: 45  SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
           S    KK+K  +    +S K    KK++ SSSSSS SSSSS              SS S 
Sbjct: 75  SQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSS--------------SSSSE 120

Query: 105 DS 106
            S
Sbjct: 121 SS 122



 Score = 29.3 bits (66), Expect = 0.69
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 8/50 (16%)

Query: 47  KKSKKKKHKKDKKK--------KSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
           ++   + HKK +KK          K S K++++  SSSSSS SSSSS  E
Sbjct: 71  QQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSE 120


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 38.1 bits (89), Expect = 0.003
 Identities = 17/75 (22%), Positives = 28/75 (37%), Gaps = 8/75 (10%)

Query: 36   KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
            K    S  ++++K K K   K           R+ +K  S SSS+    SE +       
Sbjct: 1322 KKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVD------- 1374

Query: 96   KSSDSSHSSDSDNSD 110
              S+     D ++ D
Sbjct: 1375 -DSEDEDDEDDEDDD 1388



 Score = 37.3 bits (87), Expect = 0.006
 Identities = 26/85 (30%), Positives = 36/85 (42%), Gaps = 2/85 (2%)

Query: 35   RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
            R  S T G  S  +  K K K+  KKK K S  + KK     +S    S  + +  DK  
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKE--KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214

Query: 95   VKSSDSSHSSDSDNSDRDTVGPAPK 119
             K S+SS S   D+ ++ T      
Sbjct: 1215 NKKSNSSGSDQEDDEEQKTKPKKSS 1239



 Score = 36.2 bits (84), Expect = 0.012
 Identities = 18/67 (26%), Positives = 25/67 (37%)

Query: 46   SKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSD 105
             KK KK + K  +KKKSK   K+     SS           D   +       D S   D
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDED 1380

Query: 106  SDNSDRD 112
             ++ + D
Sbjct: 1381 DEDDEDD 1387



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 16/78 (20%), Positives = 26/78 (33%)

Query: 33   SGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDK 92
               +++  S   S  K K KK  +      KK KK +KK      S      +      +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351

Query: 93   SAVKSSDSSHSSDSDNSD 110
               +       S S++ D
Sbjct: 1352 LLRRPRKKKSDSSSEDDD 1369


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 36.5 bits (85), Expect = 0.009
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSS 75
           E  +K K+K   K + + +K   KR+K   +S
Sbjct: 401 EKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 36.1 bits (84), Expect = 0.012
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
           K    S  ++ K SKK   K+ +KK+ +K K + KK++  + 
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 36.1 bits (84), Expect = 0.013
 Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK--KKYSSSSSSDS 81
           PS +K    S    +K+++KK+ +K+K K  K+ +  K   K+   S +S+ 
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 34.1 bits (79), Expect = 0.056
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 39  PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           P +   S KK+ K   K   K+  KK K+++K K
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411



 Score = 32.2 bits (74), Expect = 0.20
 Identities = 13/57 (22%), Positives = 25/57 (43%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
           P  +  S     + SKK   K+ +K +K+K K   K++ +   +       S + +E
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 35.4 bits (82), Expect = 0.014
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 21  LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           L+  N+      S     PT   +SS  S+K+K  K   KK   +KKRK+K+      + 
Sbjct: 198 LLKLNDNDKTAASSASVVPTP--QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFNM 255

Query: 81  SSSSSED 87
            S +++D
Sbjct: 256 VSDNTDD 262


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 35.5 bits (82), Expect = 0.022
 Identities = 21/90 (23%), Positives = 29/90 (32%), Gaps = 1/90 (1%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSS-SSSDSSSSSEDEWV 90
               +  P S   S ++                S  R     SSS  SS SS+SS  E  
Sbjct: 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESS 336

Query: 91  DKSAVKSSDSSHSSDSDNSDRDTVGPAPKQ 120
             +AV    S   S S +       P+  +
Sbjct: 337 RGAAVSPGPSPSRSPSPSRPPPPADPSSPR 366



 Score = 27.4 bits (61), Expect = 8.4
 Identities = 12/48 (25%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query: 74  SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQH 121
            S+S+S  + +      D +   SSDSS S  S           P   
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSG-CGWGPENECPLPR 259


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.9 bits (78), Expect = 0.041
 Identities = 17/49 (34%), Positives = 22/49 (44%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSS 83
             A      E  ++ KK+K KK + KK KK KK KK+K      S    
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187



 Score = 29.7 bits (67), Expect = 1.00
 Identities = 16/41 (39%), Positives = 20/41 (48%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
               K       +  K+ K KK K  + K SKK KK+KKKK
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 27.4 bits (61), Expect = 6.2
 Identities = 15/48 (31%), Positives = 23/48 (47%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSS 82
            K +    +E  +K KKK+ KK+KK+K  K +K  + K S        
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 27.0 bits (60), Expect = 7.2
 Identities = 16/60 (26%), Positives = 25/60 (41%)

Query: 38  SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKS 97
           S TS  E++ K +K+   ++++KK KK KK  KK+                   K   K 
Sbjct: 132 SETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 33.3 bits (76), Expect = 0.049
 Identities = 17/26 (65%), Positives = 22/26 (84%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           KK KKKKH+ DK++K KK +K+KKKK
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKK 167



 Score = 31.0 bits (70), Expect = 0.30
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 39  PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
            T  D    K  +KKHKK K +  K+ KK+KK+K
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.7 bits (78), Expect = 0.063
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 45  SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
            SKK+ KK  K  +K + K+ +++K+KK  + +        E+E   K   K  +   + 
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 105 DSDNSDRDTVGPAPKQHVTL 124
           +      +      K+  TL
Sbjct: 464 EEK----EEEEEKKKKQATL 479


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 33.1 bits (76), Expect = 0.099
 Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 5/67 (7%)

Query: 9   EKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKD---KKKKSKKS 65
           ++  K L   EF  LD  L       R+       E+++   +KK K     KKKK K+S
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETE--EDVEAAESRAEKKRKSKEEIKKKKPKES 266

Query: 66  KKRKKKK 72
           K  K  K
Sbjct: 267 KGVKALK 273


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 32.6 bits (75), Expect = 0.14
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
             P       KK +K+K KK +KKK KK KK+ +K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 32.2 bits (74), Expect = 0.23
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           K+ +KK  K+ KKKK KK  K++KKK
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 30.3 bits (69), Expect = 0.77
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
            K K+++ K  K+KK KK KK+ KK+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 29.9 bits (68), Expect = 1.1
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           KK ++KK +K KKKK +K K +K+KK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           ++ KK + +K KKK+ KK KKRKKK 
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 29.6 bits (67), Expect = 1.7
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 48  KSKKKKHKKDKKKKSKKSKKRKKKK 72
           K  KKK ++ K +K KK KKRKKK 
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKG 405



 Score = 29.2 bits (66), Expect = 1.7
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           KK +++K  + +KKK K+ KK KK+K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 28.8 bits (65), Expect = 2.9
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKK 72
           ++ K +K KK KK+K K  K++KK +
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKKGR 413


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 32.1 bits (73), Expect = 0.15
 Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 10/76 (13%)

Query: 45  SSKKSKKKKHKKDKKKKSKKSKKRKKKKY----------SSSSSSDSSSSSEDEWVDKSA 94
           S K SKKKK    KKK +KK+KK+KK+K           S S  ++ +    D  + K  
Sbjct: 47  SKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQ 106

Query: 95  VKSSDSSHSSDSDNSD 110
             +++ +H    D SD
Sbjct: 107 NTAAEVNHEDVIDLSD 122


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 32.9 bits (75), Expect = 0.16
 Identities = 23/85 (27%), Positives = 34/85 (40%), Gaps = 3/85 (3%)

Query: 29  SVLPSGRKASPTSGDESS--KKSKKKKHK-KDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
           S    G  A     DE    KK KK + K +DK  K     K   KK S  S  D+++  
Sbjct: 427 STSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIP 486

Query: 86  EDEWVDKSAVKSSDSSHSSDSDNSD 110
            +EWV K  ++         +++  
Sbjct: 487 PEEWVMKKILEWVPDLEEDGTEDPG 511



 Score = 27.9 bits (62), Expect = 6.4
 Identities = 21/82 (25%), Positives = 28/82 (34%), Gaps = 2/82 (2%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWV--DKS 93
           +AS      SS+KS            S K  K+KK K +S+    + S  +DE     K 
Sbjct: 390 QASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKG 449

Query: 94  AVKSSDSSHSSDSDNSDRDTVG 115
                     S    SD    G
Sbjct: 450 KKNQKKGRDKSSKVPSDSKAGG 471


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 32.7 bits (75), Expect = 0.16
 Identities = 7/44 (15%), Positives = 22/44 (50%)

Query: 44 ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
            +K+++++K  K++ K +  +   K +   S+++ D +     
Sbjct: 17 LKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQ 60



 Score = 28.0 bits (63), Expect = 5.1
 Identities = 13/55 (23%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKKY---SSSSSSDSSSSSEDEWVDKSAVKS 97
          SK + KK+ K  + ++ K +K+  K      ++   S  S++++DE +D +  ++
Sbjct: 9  SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRA 63


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.3 bits (72), Expect = 0.19
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 46 SKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           KK  K   K +KK +  K+K  KKK 
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKA 90



 Score = 30.6 bits (70), Expect = 0.35
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKK 70
          D+   K+ +K  KK +K K+K  KK+ K
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 28.6 bits (65), Expect = 1.6
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 48 KSKKKKHKKDKKKKSKKSKKRKKKK 72
          K++KK  K   K + KK+KK + K 
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKP 97


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
          family is found in eukaryotes, and is typically between
          96 and 116 amino acids in length.
          Length = 106

 Score = 30.7 bits (70), Expect = 0.23
 Identities = 11/25 (44%), Positives = 11/25 (44%)

Query: 49 SKKKKHKKDKKKKSKKSKKRKKKKY 73
              K KKD     KK KK   KKY
Sbjct: 21 DYIDKLKKDPNIDKKKFKKIIFKKY 45


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 31.8 bits (72), Expect = 0.34
 Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 34  GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
            ++ S T+ ++SS K K+K HK D++    +SKK++ ++ SSS S  SSSS      +K 
Sbjct: 733 KKQTSKTASEKSSSKGKRK-HKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKE 791

Query: 94  AVKSSDSS 101
           + KSS + 
Sbjct: 792 SRKSSRNK 799


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 30.4 bits (69), Expect = 0.41
 Identities = 17/29 (58%), Positives = 19/29 (65%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           +   K KK+K KK KKKK K SKK  KKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 28.1 bits (63), Expect = 2.1
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           E +K+ K+KK KK KKKK    K  KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 48  KSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
           K   K+  K KK+K KK KK+KKK    ++ 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
            K K K+ K+ KKKK KK KK  KK      
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.7 bits (62), Expect = 2.8
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           K+  K+K K+ K+KK KK KK+KK    ++    
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 31.2 bits (71), Expect = 0.47
 Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 7/69 (10%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS-------EDE 88
           K      +  SKK  +KK KK K+KK  KS+   K  + +   S  +          ED 
Sbjct: 56  KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDV 115

Query: 89  WVDKSAVKS 97
               + +  
Sbjct: 116 DNAFNKIAE 124



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 17/50 (34%), Positives = 26/50 (52%)

Query: 28 FSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSS 77
            +L +  +        S+KK KK+    + KKKS+K KK+KK+K    S
Sbjct: 36 KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85



 Score = 28.9 bits (65), Expect = 2.1
 Identities = 16/60 (26%), Positives = 26/60 (43%)

Query: 40  TSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSD 99
           T  D+   K+ + K K +KKKK KK KK  K +  +     +   S+         K ++
Sbjct: 54  TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNE 113


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 31.2 bits (70), Expect = 0.47
 Identities = 27/93 (29%), Positives = 35/93 (37%), Gaps = 14/93 (15%)

Query: 30  VLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD----SSSSS 85
           V PS          + S   K KK    +K    K+ K KK K  S  +       SS S
Sbjct: 638 VTPSKTFYKRDKKKDISTNRKVKKRTAKRKTVGYKTDKSKKIKSDSLPTDTNVIVISSES 697

Query: 86  EDEWVDKSAVKSS----------DSSHSSDSDN 108
           EDE    + +K S           S  SS+SD+
Sbjct: 698 EDEEDGFNIIKKSQLKKKIKSELKSESSSESDD 730


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 30.8 bits (70), Expect = 0.51
 Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 6/67 (8%)

Query: 6   RRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
           R   K+E+  + Q        + S +        +  +  S ++   K        +KK 
Sbjct: 41  REKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYKR------TTKKM 94

Query: 66  KKRKKKK 72
           KK+KKKK
Sbjct: 95  KKKKKKK 101


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 31.0 bits (71), Expect = 0.55
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 43 DESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
          D+   K+  K  KK +K+++K +K + K +
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.1 bits (68), Expect = 0.56
 Identities = 9/53 (16%), Positives = 23/53 (43%)

Query: 27  LFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS 79
           +F  L S       + +ES+ +++++   + K +  +K K+   K  +     
Sbjct: 87  IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.4 bits (69), Expect = 0.63
 Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 4/69 (5%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
           +E +KK KKKK KK KK  +  S        +  +   S +    E    +       S 
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE----AKKLKKKRSI 233

Query: 103 SSDSDNSDR 111
           + D++ S+ 
Sbjct: 234 APDNEKSEV 242


>gnl|CDD|216818 pfam01970, TctA, Tripartite tricarboxylate transporter TctA family.
            This family, formerly known as DUF112, is a family of
           bacterial and archaeal tripartite tricarboxylate
           transporters of the extracytoplasmic solute binding
           receptor-dependent transporter group of families,
           distinct from the ABC and TRAP-T families. TctA is part
           of the tripartite TctABC system which, as characterized
           in S. typhimurium, is a secondary carrier that depends
           for activity on the extracytoplasmic
           tricarboxylate-binding receptor TctC as well as two
           integral membrane proteins, TctA and TctB. complete
           three-component systems are found only in bacteria. TctA
           is a large transmembrane protein with up to 12 predicted
           membrane spanning regions in bacteria and up to 11 such
           in archaea, with the N-terminal within the cytoplasm.
           TctA is thought to be a permease, and in most other
           bacteria functions without TctB and TctC molecules.
          Length = 418

 Score = 30.5 bits (70), Expect = 0.66
 Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 2/27 (7%)

Query: 132 ALLPGEGAAMAAYVA--EGKRIPRRGE 156
            +LPG GA +AA++A    KR  +  E
Sbjct: 244 GILPGAGATIAAFLAYGLAKRTSKNPE 270


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.8 bits (70), Expect = 0.66
 Identities = 20/61 (32%), Positives = 26/61 (42%), Gaps = 8/61 (13%)

Query: 10   KKEKELDYQEFLMLDNE----LFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
            ++EKE   Q  L  D E    L SVL +  K      +E   +S  KK K  K+ KS   
Sbjct: 1525 QEEKEPAGQGELESDKEKKGNLESVLSNQEKNI----EEDYAESDIKKRKNKKQYKSNTE 1580

Query: 66   K 66
             
Sbjct: 1581 A 1581


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 30.6 bits (69), Expect = 0.69
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 45  SSKKSKKKKHKKDKKKKSKKSKKR 68
           S KK KKKK K+ KKKK++++ K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|218662 pfam05623, DUF789, Protein of unknown function (DUF789).  This
           family consists of several plant proteins of unknown
           function.
          Length = 289

 Score = 30.4 bits (69), Expect = 0.75
 Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 74  SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVG 115
           S  +SS+ SS   + W  +S    SD S SSD        +G
Sbjct: 102 SRDTSSEGSSDCSESWDSRSD-GDSDGSFSSDESELSSQRLG 142


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.4 bits (68), Expect = 0.82
 Identities = 19/87 (21%), Positives = 38/87 (43%), Gaps = 2/87 (2%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
            SP    +  ++      ++ ++  +KKSK ++  + +S S  +     E E  D  +V 
Sbjct: 74  TSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVN 133

Query: 97  SSDSSHSSDSDNSDRDTVG--PAPKQH 121
              SS   D D  +R +    P+P+ +
Sbjct: 134 EEGSSDPKDIDQDNRSSSPSIPSPQDN 160


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 30.2 bits (68), Expect = 0.85
 Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 10/75 (13%)

Query: 45  SSKKSKKK--KHKKDKKKKSKKSKKRKKKKYSSSSSSDS--------SSSSEDEWVDKSA 94
           ++KK  K   K+  D++K+  +SKK    K  +S + +         + + E      S 
Sbjct: 7   ATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSD 66

Query: 95  VKSSDSSHSSDSDNS 109
           V SS S    D   S
Sbjct: 67  VDSSSSEEEEDGSES 81



 Score = 28.7 bits (64), Expect = 3.3
 Identities = 15/54 (27%), Positives = 23/54 (42%)

Query: 35 RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
          RK    S      K++    +   +  +  + K K+K     S  DSSSS E+E
Sbjct: 23 RKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEE 76


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 30.1 bits (68), Expect = 1.1
 Identities = 24/97 (24%), Positives = 39/97 (40%), Gaps = 7/97 (7%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSK----KRKKKKYSSSSSSDSSSSSEDEWVDK 92
            S    D  ++ SK+ K    K   S  +K     ++      +  + S+S    +  D 
Sbjct: 154 DSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASD--DT 211

Query: 93  SAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
           +  KSS   + S SD S  D++     +    TQKDY
Sbjct: 212 ANQKSSSKDNQSMSD-SALDSILDQYSEDAKKTQKDY 247



 Score = 28.6 bits (64), Expect = 3.0
 Identities = 15/70 (21%), Positives = 29/70 (41%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
            SPT+  +  +K    K      KKS   +  K      +  +D++++S  +  DK    
Sbjct: 21  TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST 80

Query: 97  SSDSSHSSDS 106
              S+  S++
Sbjct: 81  IDSSTSDSNN 90



 Score = 28.2 bits (63), Expect = 4.1
 Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 7/74 (9%)

Query: 62  SKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSD-------NSDRDTV 114
            K +    K   SS  +   + SS+ +  D     SS+++  S S+        +  +  
Sbjct: 143 EKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQS 202

Query: 115 GPAPKQHVTLTQKD 128
              P    T  QK 
Sbjct: 203 NSQPASDDTANQKS 216


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 30.0 bits (67), Expect = 1.2
 Identities = 18/60 (30%), Positives = 29/60 (48%)

Query: 50  KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNS 109
           + K   K+    +   K  KK+K++  SSS SS SS    +   +  +S  S  S+S +S
Sbjct: 97  RSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSS 156



 Score = 29.2 bits (65), Expect = 2.5
 Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%)

Query: 46  SKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDS-SHSS 104
           +++SK      D      K+ K++K    SSSSS  SSS  +     S        S+S 
Sbjct: 95  NERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSL 154

Query: 105 DSDNSDR 111
            S +S +
Sbjct: 155 SSKHSPK 161


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 29.9 bits (67), Expect = 1.2
 Identities = 14/52 (26%), Positives = 19/52 (36%)

Query: 38  SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEW 89
           +P    E  K  K +  K  K  K KK KK+      S         S+ +W
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191



 Score = 29.1 bits (65), Expect = 2.1
 Identities = 15/50 (30%), Positives = 17/50 (34%)

Query: 39  PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
           P S      K  KK  K    K  K  K +K KK  S S+        D 
Sbjct: 136 PISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDP 185



 Score = 27.9 bits (62), Expect = 4.6
 Identities = 14/49 (28%), Positives = 19/49 (38%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           P  ++       +S K  K  K KK KKK S  ++  K       S  D
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 74  SSSSSSD---SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVG 115
           SSS+ S     + S E  WVD S   S D + S D D+ DRD+ G
Sbjct: 11  SSSNGSGGLVGTDSGESRWVDGSEDDSEDPA-SLDDDDDDRDSGG 54


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
           KK+K KK     K K+ K+ K+K K  S  ++
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTT 780



 Score = 27.9 bits (62), Expect = 5.5
 Identities = 11/45 (24%), Positives = 16/45 (35%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           K +      +S K K  K  K K K   K   +K+     + S  
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793



 Score = 27.5 bits (61), Expect = 6.7
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query: 51  KKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSS 83
           +KK K  KK  S K K  K  K  S + S  ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTT 780


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 42  GDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
            D   + ++K K KK K KK+KK KK  KK 
Sbjct: 90  DDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 26.0 bits (57), Expect = 9.2
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 39  PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           P+  D+   +  +K  KK KK K  K  K+  KK
Sbjct: 86  PSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 17/73 (23%), Positives = 29/73 (39%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDS 106
           KK KKKK K  K+  +     +K     + S      + E  + +    +S  SS  + +
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQNSN 68

Query: 107 DNSDRDTVGPAPK 119
            N +   +   PK
Sbjct: 69  KNEENIVLEDFPK 81


>gnl|CDD|215953 pfam00500, Late_protein_L1, L1 (late) protein. 
          Length = 500

 Score = 29.6 bits (67), Expect = 1.3
 Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 14  ELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK 70
           E+D +E   LD + F   P GRK    +G       K +K        + +S KRK+
Sbjct: 447 EVDLKEKFSLDLDQF---PLGRKFLLQAGLRQRPSKKTRKRSATSSSTTSRSAKRKR 500


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 18/73 (24%), Positives = 27/73 (36%)

Query: 21  LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           L +D++  +   S   +   +G  +    +K K K +K KK     K   KK     SS 
Sbjct: 87  LHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSK 146

Query: 81  SSSSSEDEWVDKS 93
                 D    KS
Sbjct: 147 DELIKYDNNKPKS 159



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 19/80 (23%), Positives = 32/80 (40%)

Query: 9   EKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKR 68
           +KK+ +         DN   S   S + A P +     K   K +  K K   +K + K+
Sbjct: 79  QKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKK 138

Query: 69  KKKKYSSSSSSDSSSSSEDE 88
           KKK  SS        +++ +
Sbjct: 139 KKKVLSSKDELIKYDNNKPK 158



 Score = 29.0 bits (65), Expect = 2.8
 Identities = 17/50 (34%), Positives = 21/50 (42%)

Query: 36 KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
          K      D+ +KK  K    KD KK   K KK+ KKK       D+   S
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDS 99


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 29.6 bits (66), Expect = 1.7
 Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 11/76 (14%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
           P+    + +  D+  +  +    + D +  S  S       YS++SS D S S++D W D
Sbjct: 444 PAHADRADSDEDDMDE-QESGDERADGEDDSDSSYS-----YSTTSSEDESDSADDSWGD 497

Query: 92  KSAVKSSDSSHSSDSD 107
                 SDS    D  
Sbjct: 498 -----ESDSGIEHDDG 508


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.5 bits (66), Expect = 1.7
 Identities = 16/67 (23%), Positives = 32/67 (47%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHS 103
             +++  ++K KK++K K K+ KK K  +  + +   +  +S+   V K + K S     
Sbjct: 12  ILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDV 71

Query: 104 SDSDNSD 110
            D +  D
Sbjct: 72  EDENPED 78



 Score = 29.1 bits (65), Expect = 2.6
 Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 10/76 (13%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSK---KRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSS- 98
            E   + KKKK +K K+K+ KK K   K  K K  +  +SD ++  +     KS  KS  
Sbjct: 14  TEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK-----KSEKKSRK 68

Query: 99  -DSSHSSDSDNSDRDT 113
            D    +  D  D DT
Sbjct: 69  RDVEDENPEDFIDPDT 84


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 29.2 bits (65), Expect = 1.8
 Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 3/54 (5%)

Query: 60  KKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSS---HSSDSDNSD 110
           K+SKK+ +R ++K +++SS      +E +   +  VK  D       S+S+ SD
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161



 Score = 28.9 bits (64), Expect = 2.9
 Identities = 21/111 (18%), Positives = 39/111 (35%), Gaps = 3/111 (2%)

Query: 9   EKKEKELDYQEFLMLD---NELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKS 65
           E  E   +  E ++ +    +  S     R     +   S  + +K + K  K++K KK 
Sbjct: 87  EISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKM 146

Query: 66  KKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGP 116
            +  + + S S  SD   S     ++  + +  D       D S      P
Sbjct: 147 DEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGEDISHTYGWPP 197


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 15/42 (35%), Positives = 19/42 (45%)

Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS 79
          +P   +    K KK K  K      +K KK KKK   SS +S
Sbjct: 20 APGLNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKIIISSIAS 61


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 19/82 (23%), Positives = 32/82 (39%), Gaps = 4/82 (4%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
           DE  +  K K  KK K+     + +  KK    + SS +SS  +         KS++S+ 
Sbjct: 39  DELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPA----KPRKSAESTR 94

Query: 103 SSDSDNSDRDTVGPAPKQHVTL 124
           SS   +    T     ++    
Sbjct: 95  SSHPKSKAPSTESEEEEEPEET 116


>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
          Length = 349

 Score = 29.1 bits (65), Expect = 1.9
 Identities = 26/90 (28%), Positives = 37/90 (41%), Gaps = 11/90 (12%)

Query: 65  SKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTL 124
           +K+ +    SS SSSDSS+SS  +   +         + +D     RD   P   Q    
Sbjct: 263 AKEGRSSSSSSRSSSDSSTSSSSDGYGQFG------DYEADPGGVCRDIFFPGDCQESVR 316

Query: 125 TQKDYGKALLPGEGAAMAAYVAEGKRIPRR 154
                 +AL  G G A+A  V EG+  P  
Sbjct: 317 R---LAEAL--GLGEALAKRVREGEAAPET 341


>gnl|CDD|227651 COG5347, COG5347, GTPase-activating protein that regulates ARFs
           (ADP-ribosylation factors), involved in ARF-mediated
           vesicular transport [Intracellular trafficking and
           secretion].
          Length = 319

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 19/106 (17%), Positives = 34/106 (32%), Gaps = 4/106 (3%)

Query: 25  NELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKK----SKKSKKRKKKKYSSSSSSD 80
           N  +      +   P      S  +KK   KK + KK    S           SS+ + D
Sbjct: 85  NRFYEKNLLDQLLLPIKAKYDSSVAKKYIRKKYELKKFIDDSSSPSDFSSFSASSTRTVD 144

Query: 81  SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQ 126
           S     D      +  +S  + +   D  + ++      +  +LT 
Sbjct: 145 SVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSKPRSLTS 190


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.2 bits (65), Expect = 2.0
 Identities = 28/109 (25%), Positives = 36/109 (33%), Gaps = 2/109 (1%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
           +E   + +    KK KK K  K KK    K  S S  DS  S  D     S V +     
Sbjct: 324 EEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKE 383

Query: 103 SSDSDNSDRDTVGPAP--KQHVTLTQKDYGKALLPGEGAAMAAYVAEGK 149
               +  D +   P        +   KD GK     E +   A V   K
Sbjct: 384 PKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKK 432


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 33  SGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
            G      +   ++ K+KK   KK   KK+ K+KK  KK
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296


>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
          Length = 259

 Score = 28.9 bits (64), Expect = 2.1
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 39  PTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSS 78
           P S  + +  SKK K      K S+ SKKR+ +    S++
Sbjct: 216 PVSPTKKAALSKKSKWTG--TKSSQSSKKRRGRSTGRSTT 253


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.5 bits (64), Expect = 2.1
 Identities = 19/67 (28%), Positives = 27/67 (40%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSH 102
            E  +K KK+  +K K K  KK  K+KK K            SE +   ++  K  D + 
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127

Query: 103 SSDSDNS 109
           S     S
Sbjct: 128 SYSETLS 134


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 29.3 bits (65), Expect = 2.1
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 3/90 (3%)

Query: 40  TSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSD 99
            + +E+ K  K K ++ +    S   K +   K  S S+S    S+  +    SA  S  
Sbjct: 124 ANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDE---SATDSGK 180

Query: 100 SSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
           +S S        +   PAPK  +T  ++ Y
Sbjct: 181 ASASVAGIVGADEEAPPAPKNTLTPLEELY 210


>gnl|CDD|217363 pfam03094, Mlo, Mlo family.  A family of plant integral membrane
           proteins, first discovered in barley. Mutants lacking
           wild-type Mlo proteins show broad spectrum resistance to
           the powdery mildew fungus, and dysregulated cell death
           control, with spontaneous cell death in response to
           developmental or abiotic stimuli. Thus wild-type Mlo
           proteins are thought to be inhibitors of cell death
           whose deficiency lowers the threshold required to
           trigger the cascade of events that result in plant cell
           death. Mlo proteins are localised in the plasma membrane
           and possess seven transmembrane regions; thus the Mlo
           family is the only major higher plant family to possess
           7 transmembrane domains. It has been suggested that Mlo
           proteins function as G-protein coupled receptors in
           plants; however the molecular and biological functions
           of Mlo proteins remain to be fully determined.
          Length = 481

 Score = 29.3 bits (66), Expect = 2.2
 Identities = 16/52 (30%), Positives = 21/52 (40%)

Query: 34  GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSS 85
           G        DE  +K+ KK HK  KKKK  K   +       SS  ++ S  
Sbjct: 414 GSSMKKAVFDEQVQKALKKWHKTAKKKKKHKRSVKSGSTTPGSSRDETPSRG 465


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 29.3 bits (65), Expect = 2.2
 Identities = 15/51 (29%), Positives = 25/51 (49%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSS 82
           P   K +     + S K KKKK +K+K++K KK K    + + S   ++  
Sbjct: 263 PDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313



 Score = 28.9 bits (64), Expect = 2.9
 Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 4/79 (5%)

Query: 38  SPTSGD----ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
           SP  GD    E   K  KKK KK+K+K+  K KK++ + + S   +   S +    V ++
Sbjct: 186 SPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEA 245

Query: 94  AVKSSDSSHSSDSDNSDRD 112
              S  ++ S  + +S+ D
Sbjct: 246 DEASLANTVSGTAPDSEPD 264



 Score = 27.3 bits (60), Expect = 7.4
 Identities = 16/75 (21%), Positives = 38/75 (50%), Gaps = 5/75 (6%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE-----WVDKSAVKS 97
           D+  K  ++++H++  +K  ++ KKR+K+K          + S+++      VD    + 
Sbjct: 79  DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEM 138

Query: 98  SDSSHSSDSDNSDRD 112
            +++  SD D+ D +
Sbjct: 139 PENALPSDEDDKDPN 153



 Score = 27.0 bits (59), Expect = 9.8
 Identities = 16/57 (28%), Positives = 25/57 (43%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
           P   +A  T      KK K++K K++KKKK K    R       +     + + E+E
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 2.2
 Identities = 19/78 (24%), Positives = 31/78 (39%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
           +K++    +  S    KK  +   K +   S    +KK +  S+S S SS+E        
Sbjct: 244 KKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQSPSSAEPTSESPQT 303

Query: 95  VKSSDSSHSSDSDNSDRD 112
             +S  S   D  +SD  
Sbjct: 304 AGNSSLSSLGDYSDSDES 321


>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
           This is a family of conserved proteins of from 500 - 600
           residues found from worms to humans. Its function is not
           known.
          Length = 515

 Score = 29.0 bits (65), Expect = 2.4
 Identities = 18/74 (24%), Positives = 30/74 (40%)

Query: 45  SSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSS 104
             ++ + K+  + ++  +++   RK  K SS      SS   D     S+  SS  S SS
Sbjct: 35  FLRRRRGKRKMEGEQLGTRRPLSRKIGKCSSRRVRSPSSKPNDTLSGASSKLSSKHSGSS 94

Query: 105 DSDNSDRDTVGPAP 118
            S  S  D    + 
Sbjct: 95  HSLASVSDRNSSSS 108


>gnl|CDD|153366 cd07682, F-BAR_srGAP2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Protein 2.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs. srGAP2 is expressed in zones
           of neuronal differentiation. It plays a role in the
           regeneration of neurons and axons. srGAP2 contains an
           N-terminal F-BAR domain, a Rho GAP domain, and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 263

 Score = 28.5 bits (63), Expect = 2.5
 Identities = 26/89 (29%), Positives = 51/89 (57%), Gaps = 4/89 (4%)

Query: 9   EKKEKELDYQEFLM-LDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKK 67
           + KE  L  QE LM + NEL++V+   +     + D  S +SK K+ +K ++K+  +S +
Sbjct: 116 KSKEVGLQLQEDLMKVLNELYTVM---KTYHMYNADSISAQSKLKEAEKQEEKQMSRSVR 172

Query: 68  RKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
           ++ ++   S  S ++   E++ V +S+VK
Sbjct: 173 QEDRQTPRSPDSTTNIRIEEKHVRRSSVK 201


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 28.1 bits (63), Expect = 2.7
 Identities = 17/62 (27%), Positives = 27/62 (43%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDS 106
           +K+ KK+ K+ KKK+ KK KK+ KK         S SS E    ++   +          
Sbjct: 81  EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIM 140

Query: 107 DN 108
           + 
Sbjct: 141 EK 142


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 28.7 bits (64), Expect = 2.8
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKK 71
           K    +K K D+K  S K+KK  KK
Sbjct: 353 KAEAARKRKGDRKGVSHKAKKGGKK 377


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 29.0 bits (66), Expect = 2.8
 Identities = 9/42 (21%), Positives = 15/42 (35%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSS 84
            E  +       K    KK+     RK K+   +S  D  ++
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859



 Score = 27.8 bits (63), Expect = 6.6
 Identities = 6/30 (20%), Positives = 11/30 (36%)

Query: 43  DESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
            E  +K +       K    KK+  +  +K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844


>gnl|CDD|114659 pfam05950, Orthopox_A36R, Orthopoxvirus A36R protein.  This family
           consists of several Orthopoxvirus A36R proteins. The
           A36R protein is predicted to be a type Ib membrane
           protein.
          Length = 158

 Score = 27.8 bits (61), Expect = 3.1
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 67  KRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTV 114
           K KK K S+SS S  S+ SE +W D      S    ++D DN  R+ +
Sbjct: 42  KLKKIKSSNSSKSSKSTDSESDWED----HCSAMEQNNDVDNISRNEI 85


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 13/38 (34%), Positives = 18/38 (47%)

Query: 50 KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
           KKK KK KKK    +   K KK ++   S+ +   E 
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60



 Score = 25.8 bits (57), Expect = 8.3
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 50 KKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
            KK KK KKKK++ +    K K +++      +   + 
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 27.0 bits (60), Expect = 3.3
 Identities = 15/61 (24%), Positives = 28/61 (45%)

Query: 11 KEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKK 70
          KE++ + +      +++FS +   +K    +   +  K+ KK  KKDKKK        + 
Sbjct: 1  KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 71 K 71
          K
Sbjct: 61 K 61


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.7 bits (64), Expect = 3.4
 Identities = 9/46 (19%), Positives = 23/46 (50%)

Query: 44   ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEW 89
            E  +++  +      KK S+++ K+   K ++  +S+S ++ E   
Sbjct: 1180 EKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYG 1225



 Score = 28.7 bits (64), Expect = 3.8
 Identities = 25/91 (27%), Positives = 37/91 (40%), Gaps = 16/91 (17%)

Query: 36   KASPTSGDESSKKSKKKKHKK-DKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
            K +   G    KK +K +    +KK  S   +    K+  SS +   SSSSE + +D SA
Sbjct: 1372 KPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVSA 1431

Query: 95   ---------------VKSSDSSHSSDSDNSD 110
                           +  S+S  + DSD  D
Sbjct: 1432 KPRPQRANRKQTTYVLSDSESESADDSDFDD 1462



 Score = 28.3 bits (63), Expect = 4.0
 Identities = 11/77 (14%), Positives = 29/77 (37%), Gaps = 8/77 (10%)

Query: 44   ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHS 103
            E +++  ++   + +   +KK  ++  KK +   ++  +S SE                S
Sbjct: 1176 EEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT--------EETYGSS 1227

Query: 104  SDSDNSDRDTVGPAPKQ 120
            +    +  + V P  + 
Sbjct: 1228 AMETENVAEVVKPKGRA 1244


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug). 
          All proteins in this family for whcih functions are
          known are G-T or G-U mismatch glycosylases that
          function in base excision repair. This family is based
          on the phylogenomic analysis of JA Eisen (1999, Ph.D.
          Thesis, Stanford University). Used 2pf model [DNA
          metabolism, DNA replication, recombination, and
          repair].
          Length = 328

 Score = 28.5 bits (63), Expect = 3.6
 Identities = 14/41 (34%), Positives = 19/41 (46%)

Query: 31 LPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKK 71
           P  RK  P S +       KK     K  KS KSK++++K
Sbjct: 56 APKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEK 96


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 28.2 bits (63), Expect = 3.8
 Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 8/81 (9%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKY-----SSSSSSDSSSSSEDEWVDKSAVKSSDSS 101
           ++ K +K ++ +  +  +  KR    Y     S + SS+   S     +D S V +S  +
Sbjct: 45  EQLKDEKRRRRQTPRQSRRSKRAAHAYPSPERSPALSSERLLSPSPSVLDLSPVLASPQT 104

Query: 102 ---HSSDSDNSDRDTVGPAPK 119
                S S + D D      K
Sbjct: 105 GKRRRSSSPSDDEDEAERPSK 125


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 28.3 bits (63), Expect = 3.8
 Identities = 21/86 (24%), Positives = 30/86 (34%), Gaps = 2/86 (2%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
                ++ +S    S  SK  K  K   +    S  +     + S  S S S +E +   
Sbjct: 310 SKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESK--Q 367

Query: 92  KSAVKSSDSSHSSDSDNSDRDTVGPA 117
           K A KSS  +  S  D S     G  
Sbjct: 368 KRASKSSSGARDSKKDASGMSANGTV 393



 Score = 28.0 bits (62), Expect = 5.3
 Identities = 45/206 (21%), Positives = 83/206 (40%), Gaps = 26/206 (12%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD-----SSSSSEDEWVD 91
                 D   +  +  ++  +K + S  S    K+  S+SSS+      S SS   +  +
Sbjct: 278 HHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSN 337

Query: 92  KSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDYGKALLPGEGAAMAAYVAE---- 147
           ++   SS  SHSS +        G  P   V+  +    +A     GA  +   A     
Sbjct: 338 RNKSNSSPKSHSSAN--------GSVPSSSVSDNESKQKRASKSSSGARDSKKDASGMSA 389

Query: 148 -GKRIPRRGEIGLTS-DEIERYEAVGYVMSG--SRHRRMEAVRIRKENQIYSADEKRALA 203
            G       E  +++   IER E     +     + R+ E+  +R +  + ++ E+   +
Sbjct: 390 NGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNES-ELRNQISLLTSLER---S 445

Query: 204 MFSKEERQKRENRIL-TQFKEMVSSK 228
           + S   + K+EN +L T+   MVS+K
Sbjct: 446 LKSDLGQLKKENDMLQTKLNSMVSAK 471


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 29  SVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKK 72
           S     +KA P +  + ++K+KK   K  K   + K+K+  KKK
Sbjct: 767 SAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKK 810



 Score = 27.8 bits (62), Expect = 6.0
 Identities = 9/42 (21%), Positives = 20/42 (47%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
           +K  K K  K   +K+KK   + +K  +++ +  ++     E
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 26.8 bits (60), Expect = 4.1
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 47 KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSE 86
          ++ K+ K K D+   S K KK++K K     +  +S+++E
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71


>gnl|CDD|227371 COG5038, COG5038, Ca2+-dependent lipid-binding protein, contains C2
           domain [General function prediction only].
          Length = 1227

 Score = 28.2 bits (63), Expect = 4.3
 Identities = 18/111 (16%), Positives = 35/111 (31%), Gaps = 13/111 (11%)

Query: 32  PSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSS--DSSSSSEDE- 88
            + ++   +S + S  +      K  + +  +  KK ++++            +    + 
Sbjct: 2   STKQQHYRSSDNYSGNRPIPTIPKFFRSRGQRAEKKEEEQEMQPEDEKLFAPIAQRTVQI 61

Query: 89  ----------WVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQKDY 129
                       D S      S  SS  + SD DT    P     L + DY
Sbjct: 62  ADVNFQGAKGIDDLSFTVPKQSIESSSPEKSDVDTSNTRPSVSRELHKDDY 112


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.0 bits (63), Expect = 4.4
 Identities = 13/77 (16%), Positives = 26/77 (33%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
                +  +  KK  K   K    K   K K + +   S  +   ++   +D+      +
Sbjct: 65  DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124

Query: 96  KSSDSSHSSDSDNSDRD 112
              + +   D D+ D D
Sbjct: 125 DVLNQADDDDDDDDDDD 141



 Score = 28.0 bits (63), Expect = 4.4
 Identities = 15/86 (17%), Positives = 25/86 (29%)

Query: 29  SVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
             +           D++++    KK  K   K +      KKK      SS  +      
Sbjct: 53  DQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112

Query: 89  WVDKSAVKSSDSSHSSDSDNSDRDTV 114
             D       D    + +D+ D D  
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDD 138



 Score = 28.0 bits (63), Expect = 5.0
 Identities = 16/75 (21%), Positives = 28/75 (37%)

Query: 36  KASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
              P    +++ K+   K    KK K +    +K +K ++    D  +  +D  V   A 
Sbjct: 72  SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQAD 131

Query: 96  KSSDSSHSSDSDNSD 110
              D     D D+ D
Sbjct: 132 DDDDDDDDDDLDDDD 146


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 26.5 bits (59), Expect = 4.4
 Identities = 16/41 (39%), Positives = 20/41 (48%)

Query: 74  SSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTV 114
           +S S S S S+S     + S   SS SS S DS +  R T 
Sbjct: 22  TSGSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTRITT 62


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKR 68
           RK + T+  E  KK K KK+K   KK   K K R
Sbjct: 82  RKPATTA-VEKKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|217667 pfam03666, NPR3, Nitrogen Permease regulator of amino acid
           transport activity 3.  This family, also known in yeasts
           as Rmd11, complexes with NPR2, pfam06218. This complex
           heterodimer is responsible for inactivating TORC1. an
           evolutionarily conserved protein complex that controls
           cell size via nutritional input signals, specifically,
           in response to amino acid starvation.
          Length = 424

 Score = 28.1 bits (63), Expect = 4.5
 Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 2/64 (3%)

Query: 63  KKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAPKQHV 122
           +K KK+K+KK   +  +D    S D     S  KSS  S S    NSD  +   +     
Sbjct: 45  RKKKKKKQKKSDRADPNDDREPSVDSEDSSS--KSSSKSESGSLANSDPASDPSSEDSKS 102

Query: 123 TLTQ 126
           ++T 
Sbjct: 103 SMTM 106



 Score = 28.1 bits (63), Expect = 4.6
 Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 7/89 (7%)

Query: 21  LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           L +D+  F   P          + + +K KKKK K  K  ++  +  R+    S  SSS 
Sbjct: 24  LKIDDLRFVGHPIHVDE-----NGTWRKKKKKKQK--KSDRADPNDDREPSVDSEDSSSK 76

Query: 81  SSSSSEDEWVDKSAVKSSDSSHSSDSDNS 109
           SSS SE   +  S   S  SS  S S  +
Sbjct: 77  SSSKSESGSLANSDPASDPSSEDSKSSMT 105


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 27.5 bits (61), Expect = 5.2
 Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 6/93 (6%)

Query: 35  RKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSA 94
           R+A    G ++ KK  +     +++ K KK KK KKKK      S  S       V++S 
Sbjct: 64  RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKK------SPKSLLLNFVTVEESK 117

Query: 95  VKSSDSSHSSDSDNSDRDTVGPAPKQHVTLTQK 127
                    S++ +   +    +  +++  T K
Sbjct: 118 PDDEVHEQKSETLSLPIELEEQSMGRYLPTTFK 150


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 28.0 bits (63), Expect = 5.2
 Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 34  GRKASPTSGDESSKKSKK-KKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDK 92
           G +    +G+ S  + KK +K ++  +KK++K +  K      + ++   +   D    K
Sbjct: 396 GEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
           family contains the human KIAA1549 protein which has
           been found to be fused fused to BRAF gene in many cases
           of pilocytic astrocytomas. The fusion is due mainly to a
           tandem duplication of 2 Mb at 7q34. Although nothing is
           known about the function of KIAA1549 protein, the BRAF
           protein is a well characterized oncoprotein. It is a
           serine/threonine protein kinase which is implicated in
           MAP/ERK signalling, a critical pathway for the
           regulation of cell division, differentiation and
           secretion.
          Length = 684

 Score = 28.0 bits (62), Expect = 5.5
 Identities = 20/98 (20%), Positives = 35/98 (35%), Gaps = 4/98 (4%)

Query: 21  LMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
            M+  E   + P  +++ P    E        K K  +   S K ++R +K  S   S  
Sbjct: 338 EMIIQEPAPLPPLKKESLPIEDAEVPTP----KSKSSQDGSSNKKRRRGRKSPSDGDSEG 393

Query: 81  SSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAP 118
           SS  S     +KS   S+  S ++    +  +      
Sbjct: 394 SSVISNRSSREKSGRPSTTPSVTAQQKPTKEEGRKKPA 431


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 12/54 (22%), Positives = 21/54 (38%)

Query: 38 SPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVD 91
          S     E S +   KK  K    K++K ++    K +++S  D        + D
Sbjct: 1  SSQEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASNYGD 54


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 27.4 bits (61), Expect = 6.0
 Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 10/80 (12%)

Query: 34  GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKS 93
            +KA+  S   SS   K  K K   KK  KK +K                  +D  +D S
Sbjct: 181 AKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRKW----------DLDGDEDDDAVLDYS 230

Query: 94  AVKSSDSSHSSDSDNSDRDT 113
           A  ++D +  +  D  + D 
Sbjct: 231 APDANDENADAPEDVEEVDQ 250


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 27.7 bits (61), Expect = 6.3
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 1/62 (1%)

Query: 41  SGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDS 100
           S D S  KS++     D +  S K+ +RKK+K  ++ S    +             SSDS
Sbjct: 532 SSDSSGDKSQRSPETDDTEDGSDKNGRRKKRKTQTADSEIGRNGLSKF-GGPQVSSSSDS 590

Query: 101 SH 102
            H
Sbjct: 591 FH 592


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 27.3 bits (61), Expect = 6.9
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 29  SVLPSGRKASPTSGDESSKK-SKKKKHKKDKKKKSKKSKKRK 69
           +  P+    SP +  E     S K+K + DKK+K KKSKK K
Sbjct: 333 TPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 27.3 bits (61), Expect = 7.2
 Identities = 8/37 (21%), Positives = 16/37 (43%)

Query: 52  KKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDE 88
                +KKK+ KK ++++ K+       D     + E
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGE 213


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 27.1 bits (60), Expect = 7.2
 Identities = 20/60 (33%), Positives = 25/60 (41%), Gaps = 17/60 (28%)

Query: 5  WRRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKKKHKKDKKKKSKK 64
          ++  EK+E EL  Q+F  L  EL                 S KK K KK K D + K  K
Sbjct: 44 YKVYEKREFELKKQQFEQLKAEL-----------------SKKKKKFKKEKVDVRVKVVK 86


>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
           Provisional.
          Length = 513

 Score = 27.6 bits (61), Expect = 7.5
 Identities = 16/63 (25%), Positives = 25/63 (39%)

Query: 51  KKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSD 110
           K + K   K K +     ++ K S   SS  S  +  E  ++     S SS  S S + +
Sbjct: 246 KYRKKLPGKHKQRNGAVSRRAKSSKEMSSSESEENHGEISEEDEEDLSFSSIPSGSSDEN 305

Query: 111 RDT 113
            D 
Sbjct: 306 EDI 308


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 27.2 bits (60), Expect = 7.5
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 51  KKKHKKDKKKKSKKSKKRKKKKYSS 75
           K K+   KK KSKK KK++ K+ ++
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 27.5 bits (61), Expect = 7.6
 Identities = 16/118 (13%), Positives = 41/118 (34%), Gaps = 5/118 (4%)

Query: 6   RRGEKKEKELDYQEFLMLDNELFSVLPSGRKASPTSGDESSKKSKKK-----KHKKDKKK 60
           +   K+EK+   ++      +      +  K  P   ++  +K  ++     + KK ++ 
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169

Query: 61  KSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSSHSSDSDNSDRDTVGPAP 118
           ++K   K+  KK   +   +     +     + AVK        + +    +  G   
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDR 227


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 27.4 bits (61), Expect = 7.7
 Identities = 15/44 (34%), Positives = 18/44 (40%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
           +    S+  K KK KKKK KK K+RK               S D
Sbjct: 265 DYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSD 308


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 27.3 bits (61), Expect = 8.0
 Identities = 14/41 (34%), Positives = 17/41 (41%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSED 87
           KK  KK + K K KK KK         S +S    SS  + 
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKK 390



 Score = 27.3 bits (61), Expect = 8.2
 Identities = 11/40 (27%), Positives = 16/40 (40%)

Query: 47  KKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSE 86
             +KK   K + K K KK KK+        S +   + S 
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSS 386


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 26.9 bits (60), Expect = 8.0
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 44  ESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDS 81
           +  K +  ++ KK KKKKS K KK KK       +  S
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKAISS 208


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 27.1 bits (60), Expect = 8.1
 Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 4/69 (5%)

Query: 40  TSGDESSKK----SKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAV 95
           T  D++  K    S   K KK   K   ++ K  +    + + S  S       V    V
Sbjct: 86  TFEDKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEV 145

Query: 96  KSSDSSHSS 104
           K+   + + 
Sbjct: 146 KAPAPALTP 154


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 27.4 bits (61), Expect = 8.7
 Identities = 16/68 (23%), Positives = 27/68 (39%)

Query: 42  GDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVKSSDSS 101
              S K+   KK   D+K K+  S+         +SS D+S +S    +D     ++  S
Sbjct: 627 DALSPKEGDGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQS 686

Query: 102 HSSDSDNS 109
               +  S
Sbjct: 687 VPEAALAS 694


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 27.2 bits (60), Expect = 8.9
 Identities = 10/47 (21%), Positives = 23/47 (48%)

Query: 34  GRKASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSD 80
           GRK  PTS    +K ++++  K D++   + +++ ++         D
Sbjct: 70  GRKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDD 116


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 26.7 bits (59), Expect = 9.0
 Identities = 13/22 (59%), Positives = 13/22 (59%)

Query: 48  KSKKKKHKKDKKKKSKKSKKRK 69
           K  K   KKDKK K KK KKR 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|119000 pfam10480, ICAP-1_inte_bdg, Beta-1 integrin binding protein.
           ICAP-1 is a serine/threonine-rich protein that binds to
           the cytoplasmic domains of beta-1 integrins in a highly
           specific manner, binding to a NPXY sequence motif on the
           beta-1 integrin. The cytoplasmic domains of integrins
           are essential for cell adhesion, and the fact that
           phosphorylation of ICAP-1 by interaction with the
           cell-matrix implies an important role of ICAP-1 during
           integrin-dependent cell adhesion. Overexpression of
           ICAP-1 strongly reduces the integrin-mediated cell
           spreading on extracellular matrix and inhibits both
           Cdc42 and Rac1. In addition, ICAP-1 induces release of
           Cdc42 from cellular membranes and prevents the
           dissociation of GDP from this GTPase. An additional
           function of ICAP-1 is to promote differentiation of
           osteoprogenitors by supporting their condensation
           through modulating the integrin high affinity state,.
          Length = 200

 Score = 27.0 bits (59), Expect = 9.0
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 10/57 (17%)

Query: 67  KRKKKKYSSSSSSDSSSSSEDEWVDKS----------AVKSSDSSHSSDSDNSDRDT 113
           ++ KK++SSSSS  S  S++ + VD S          A   +DS+ SS   N++ D 
Sbjct: 3   RKGKKRHSSSSSQSSEISTKSKSVDSSLGGLSRSSTVASLDTDSTKSSGQSNANSDA 59


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 27.2 bits (60), Expect = 9.5
 Identities = 15/60 (25%), Positives = 26/60 (43%)

Query: 37  ASPTSGDESSKKSKKKKHKKDKKKKSKKSKKRKKKKYSSSSSSDSSSSSEDEWVDKSAVK 96
           A   +   ++K   KK   K K    K SKK+  K   S+  +  +++   +   K+A K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.304    0.120    0.316 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,202,972
Number of extensions: 1020309
Number of successful extensions: 5055
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3444
Number of HSP's successfully gapped: 640
Length of query: 234
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 140
Effective length of database: 6,768,326
Effective search space: 947565640
Effective search space used: 947565640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 57 (25.8 bits)