RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8548
         (102 letters)



>gnl|CDD|222120 pfam13426, PAS_9, PAS domain. 
          Length = 101

 Score = 58.2 bits (141), Expect = 1e-12
 Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 2/65 (3%)

Query: 16 YPIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIF 75
            IVY +D    L G+ R +++ K  + + L+GP   EE   ++ +AL +  E+++E+  
Sbjct: 8  GRIVYANDAALRLLGYTREELLGK--SIRDLFGPGDDEEAVARLREALRNGGEVEVELEL 65

Query: 76 YKKNG 80
           +K+G
Sbjct: 66 RRKDG 70


>gnl|CDD|237427 PRK13559, PRK13559, hypothetical protein; Provisional.
          Length = 361

 Score = 58.7 bits (142), Expect = 1e-11
 Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 3/75 (4%)

Query: 6   VLGNAQVPSLYPIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALES 65
            + +   P L PIV  +  F +L+G+A  +++  G  C+FL G  T      +I  A+ +
Sbjct: 57  CITDPHQPDL-PIVLANQAFLDLTGYAAEEVV--GRNCRFLQGAATDPIAVAKIRAAIAA 113

Query: 66  KTELKLEVIFYKKNG 80
           + E+ +E++ Y+K+G
Sbjct: 114 EREIVVELLNYRKDG 128


>gnl|CDD|237425 PRK13557, PRK13557, histidine kinase; Provisional.
          Length = 540

 Score = 58.5 bits (142), Expect = 2e-11
 Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 2/64 (3%)

Query: 17  PIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFY 76
           PIV+ +  F E++G+A  +I+  G  C+FL GP+T      ++  A+  + E+  E++ Y
Sbjct: 54  PIVFANRAFLEMTGYAAEEII--GNNCRFLQGPETDRATVAEVRDAIAERREIATEILNY 111

Query: 77  KKNG 80
           +K+G
Sbjct: 112 RKDG 115


>gnl|CDD|237426 PRK13558, PRK13558, bacterio-opsin activator; Provisional.
          Length = 665

 Score = 47.9 bits (114), Expect = 9e-08
 Identities = 19/64 (29%), Positives = 41/64 (64%), Gaps = 2/64 (3%)

Query: 17  PIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFY 76
           P++Y +D F  ++G++  +++  G  C+FL G DT+EE   ++ +A++ +    +E+  Y
Sbjct: 172 PLIYINDAFERITGYSPDEVL--GRNCRFLQGEDTNEERVAELREAIDEERPTSVELRNY 229

Query: 77  KKNG 80
           +K+G
Sbjct: 230 RKDG 233


>gnl|CDD|238075 cd00130, PAS, PAS domain; PAS motifs appear in archaea,
          eubacteria and eukarya. Probably the most surprising
          identification of a PAS domain was that in EAG-like
          K+-channels. PAS domains have been found to bind
          ligands, and to act as sensors for light and oxygen in
          signal transduction.
          Length = 103

 Score = 41.1 bits (96), Expect = 5e-06
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
          I+Y +    +L G++  +++ K      L  P+  EE + ++E  L     + LEV   +
Sbjct: 14 ILYANPAAEQLLGYSPEELIGKSLL--DLIHPEDREELRERLENLLSGGEPVTLEVRLRR 71

Query: 78 KNG 80
          K+G
Sbjct: 72 KDG 74


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 34.2 bits (78), Expect = 0.007
 Identities = 14/17 (82%), Positives = 17/17 (100%)

Query: 83  EEEEEEEEEEEKEEDKN 99
           EEEEEEEEEEE+EE++N
Sbjct: 874 EEEEEEEEEEEEEEEEN 890



 Score = 32.3 bits (73), Expect = 0.033
 Identities = 14/19 (73%), Positives = 17/19 (89%)

Query: 83  EEEEEEEEEEEKEEDKNLL 101
           EEEEEEEEEEE+EE++  L
Sbjct: 876 EEEEEEEEEEEEEENEEPL 894



 Score = 31.9 bits (72), Expect = 0.040
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 870 EEEEEEEEEEEEEEEE 885



 Score = 31.9 bits (72), Expect = 0.041
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 872 EEEEEEEEEEEEEEEE 887



 Score = 31.9 bits (72), Expect = 0.044
 Identities = 14/19 (73%), Positives = 17/19 (89%)

Query: 80  GYHEEEEEEEEEEEKEEDK 98
           G  EEEEEEEEEEE+EE++
Sbjct: 861 GDSEEEEEEEEEEEEEEEE 879



 Score = 31.9 bits (72), Expect = 0.045
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 873 EEEEEEEEEEEEEEEE 888



 Score = 31.5 bits (71), Expect = 0.048
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 871 EEEEEEEEEEEEEEEE 886



 Score = 31.5 bits (71), Expect = 0.049
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 867 EEEEEEEEEEEEEEEE 882



 Score = 31.5 bits (71), Expect = 0.053
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 866 EEEEEEEEEEEEEEEE 881



 Score = 31.5 bits (71), Expect = 0.053
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 869 EEEEEEEEEEEEEEEE 884



 Score = 31.1 bits (70), Expect = 0.066
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 868 EEEEEEEEEEEEEEEE 883



 Score = 31.1 bits (70), Expect = 0.084
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEE 880



 Score = 29.6 bits (66), Expect = 0.23
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 79  NGYHEEEEEEEEEEEKEEDK 98
           +G   EEEEEEEEEE+EE++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEE 878


>gnl|CDD|232930 TIGR00342, TIGR00342, tRNA sulfurtransferase ThiI.  Members of this
           protein family are "ThiI", a sulfurtransferase involved
           in 4-thiouridine modification of tRNA. This protein
           often is bifunctional, with genetically separable
           activities, where the C-terminal rhodanese-like domain
           (residues 385 to 482 in E. coli ThiI), a domain not
           included in this model, is sufficient to synthesize the
           thiazole moiety of thiamine (see TIGR04271). Note that
           ThiI, because of its role in tRNA modification, may
           occur in species (such as Mycoplasma genitalium) that
           lack de novo thiamine biosynthesis [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Thiamine,
           Protein synthesis, tRNA and rRNA base modification].
          Length = 371

 Score = 31.2 bits (71), Expect = 0.055
 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 5/61 (8%)

Query: 22  SDGFCELSGFARAQIMQKGC---ACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKK 78
           S G    S  A   +M++GC   A  F   P  SE+ + ++E+   S  E    V  Y  
Sbjct: 180 SGGID--SPVAAFMMMKRGCRVVAVHFFNEPAASEKAREKVERLANSLNETGGSVKLYVF 237

Query: 79  N 79
           +
Sbjct: 238 D 238


>gnl|CDD|225112 COG2202, AtoS, FOG: PAS/PAC domain [Signal transduction
           mechanisms].
          Length = 232

 Score = 30.6 bits (67), Expect = 0.11
 Identities = 12/67 (17%), Positives = 26/67 (38%)

Query: 18  IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
           I+Y +    EL G++  + + +G +       +   E +     A      L++E    +
Sbjct: 134 ILYANPAAEELLGYSPEEELGRGLSDLIHPEDEERRELELARALAEGRGGPLEIEYRVRR 193

Query: 78  KNGYHEE 84
           K+G    
Sbjct: 194 KDGERVR 200


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 29.8 bits (68), Expect = 0.17
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEEEEEEEE  EE+
Sbjct: 306 EEEEEEEEEEPSEEE 320



 Score = 29.1 bits (66), Expect = 0.30
 Identities = 13/16 (81%), Positives = 14/16 (87%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EEEEEEEEEEE+EE  
Sbjct: 302 EEEEEEEEEEEEEEPS 317



 Score = 29.1 bits (66), Expect = 0.30
 Identities = 12/16 (75%), Positives = 14/16 (87%)

Query: 83  EEEEEEEEEEEKEEDK 98
            EEEEEEEEEE+EE+ 
Sbjct: 301 AEEEEEEEEEEEEEEP 316



 Score = 29.1 bits (66), Expect = 0.33
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEEEEEEEEE  E+
Sbjct: 305 EEEEEEEEEEEPSEE 319



 Score = 27.9 bits (63), Expect = 0.95
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEEEEEEEEE+  +
Sbjct: 304 EEEEEEEEEEEEPSE 318



 Score = 26.8 bits (60), Expect = 2.0
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 83  EEEEEEEEEEEKEEDK 98
             EEEEEEEEE+EE++
Sbjct: 300 AAEEEEEEEEEEEEEE 315


>gnl|CDD|214512 smart00091, PAS, PAS domain.  PAS motifs appear in archaea,
          eubacteria and eukarya. Probably the most surprising
          identification of a PAS domain was that in EAG-like
          K+-channels.
          Length = 67

 Score = 27.7 bits (62), Expect = 0.32
 Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALE 64
          I+Y +    EL G++  +++ K  +   L  P+  E  +  +++ L 
Sbjct: 23 ILYANPAAEELLGYSPEELIGK--SLLELIHPEDRERVQEALQRLLS 67


>gnl|CDD|232884 TIGR00229, sensory_box, PAS domain S-box.  The PAS domain was
          previously described. This sensory box, or S-box domain
          occupies the central portion of the PAS domain but is
          more widely distributed. It is often tandemly repeated.
          Known prosthetic groups bound in the S-box domain
          include heme in the oxygen sensor FixL, FAD in the
          redox potential sensor NifL, and a 4-hydroxycinnamyl
          chromophore in photoactive yellow protein. Proteins
          containing the domain often contain other regulatory
          domains such as response regulator or sensor histidine
          kinase domains. Other S-box proteins include
          phytochromes and the aryl hydrocarbon receptor nuclear
          translocator [Regulatory functions, Small molecule
          interactions].
          Length = 124

 Score = 28.8 bits (65), Expect = 0.32
 Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELK-LEVIFY 76
          I+Y +  F E+ G++  +++ +      L   +  EE + +IE+ LE + E    E    
Sbjct: 25 ILYVNPAFEEIFGYSAEELIGRNVL--ELIPEEDREEVRERIERRLEGEREPVSEERRVR 82

Query: 77 KKNG 80
          +K+G
Sbjct: 83 RKDG 86


>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL.  All proteins in
           this family for which the functions are known are
           involved in the process of generalized mismatch repair.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 312

 Score = 28.4 bits (64), Expect = 0.54
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 6   VLGNAQVPSLYPIVYCSDGFCELSGF 31
           V G A +  L P+    D   +L GF
Sbjct: 215 VFGTAVLRKLIPLDEWEDLDLQLEGF 240


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 27.2 bits (61), Expect = 0.70
 Identities = 10/15 (66%), Positives = 15/15 (100%)

Query: 83 EEEEEEEEEEEKEED 97
          EE++EEEEEEE+++D
Sbjct: 68 EEKKEEEEEEEEDDD 82



 Score = 26.8 bits (60), Expect = 0.85
 Identities = 11/15 (73%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
           EEE++EEEEE+EED
Sbjct: 66 AEEEKKEEEEEEEED 80



 Score = 26.8 bits (60), Expect = 1.2
 Identities = 11/15 (73%), Positives = 15/15 (100%)

Query: 83 EEEEEEEEEEEKEED 97
          EEE++EEEEEE+E+D
Sbjct: 67 EEEKKEEEEEEEEDD 81



 Score = 25.3 bits (56), Expect = 3.7
 Identities = 9/15 (60%), Positives = 13/15 (86%)

Query: 83 EEEEEEEEEEEKEED 97
            EEE++EEEE+EE+
Sbjct: 65 AAEEEKKEEEEEEEE 79


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 27.9 bits (62), Expect = 0.72
 Identities = 10/14 (71%), Positives = 11/14 (78%)

Query: 82  HEEEEEEEEEEEKE 95
           H  +EEEEEEEE E
Sbjct: 93  HTRQEEEEEEEENE 106



 Score = 27.5 bits (61), Expect = 0.89
 Identities = 10/14 (71%), Positives = 13/14 (92%)

Query: 83  EEEEEEEEEEEKEE 96
           +EEEEEEEE EK++
Sbjct: 96  QEEEEEEEENEKQQ 109



 Score = 25.2 bits (55), Expect = 5.2
 Identities = 9/15 (60%), Positives = 11/15 (73%)

Query: 79  NGYHEEEEEEEEEEE 93
            G+  +EEEEEEEE 
Sbjct: 91  TGHTRQEEEEEEEEN 105



 Score = 24.8 bits (54), Expect = 8.3
 Identities = 9/14 (64%), Positives = 13/14 (92%)

Query: 83  EEEEEEEEEEEKEE 96
           EEEEEEEE E++++
Sbjct: 97  EEEEEEEENEKQQQ 110



 Score = 24.8 bits (54), Expect = 8.5
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 83  EEEEEEEEEEEKEEDKN 99
           EEEEEEE E++++ D+ 
Sbjct: 98  EEEEEEENEKQQQSDEA 114


>gnl|CDD|235206 PRK04031, PRK04031, DNA primase; Provisional.
          Length = 408

 Score = 27.9 bits (63), Expect = 0.76
 Identities = 17/46 (36%), Positives = 26/46 (56%), Gaps = 2/46 (4%)

Query: 56  KTQIEKALESKTELK--LEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
           K +I KAL +K  ++  LE +  K     E+ +EEEE+ EKE  + 
Sbjct: 253 KKEIAKALRNKVPVEQYLEELGKKAQKAAEKVKEEEEKPEKEPAEQ 298


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 27.8 bits (62), Expect = 0.92
 Identities = 12/15 (80%), Positives = 15/15 (100%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEE+EEEEEE++ED
Sbjct: 391 EEEEDEEEEEEEDED 405



 Score = 27.4 bits (61), Expect = 1.4
 Identities = 11/15 (73%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
           EEE+EEEEEEE E++
Sbjct: 392 EEEDEEEEEEEDEDE 406



 Score = 26.6 bits (59), Expect = 2.3
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 50  DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEED 97
              E  +  I  A  +   L++E    +++   EE+EEEEEEE+++E 
Sbjct: 363 KAKETEQDYILDAFSA---LEIEDANTERDDEEEEDEEEEEEEDEDEG 407


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 27.2 bits (61), Expect = 0.95
 Identities = 12/15 (80%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          EEEEEEEE+EE EE+
Sbjct: 82 EEEEEEEEKEESEEE 96



 Score = 26.8 bits (60), Expect = 1.1
 Identities = 10/16 (62%), Positives = 15/16 (93%)

Query: 83 EEEEEEEEEEEKEEDK 98
          EE++EEEEEEE++E+ 
Sbjct: 78 EEKKEEEEEEEEKEES 93



 Score = 26.1 bits (58), Expect = 2.6
 Identities = 9/16 (56%), Positives = 15/16 (93%)

Query: 83 EEEEEEEEEEEKEEDK 98
           EE++EEEEEE+E+++
Sbjct: 77 AEEKKEEEEEEEEKEE 92



 Score = 25.7 bits (57), Expect = 3.3
 Identities = 10/15 (66%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          +EEEEEEEE+E+ E+
Sbjct: 81 KEEEEEEEEKEESEE 95



 Score = 24.9 bits (55), Expect = 5.2
 Identities = 9/15 (60%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          ++EEEEEEEE++E +
Sbjct: 80 KKEEEEEEEEKEESE 94



 Score = 24.9 bits (55), Expect = 5.9
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 83 EEEEEEEEEEEKEEDK 98
             EE++EEEE+EE+K
Sbjct: 75 AAAEEKKEEEEEEEEK 90



 Score = 24.5 bits (54), Expect = 8.3
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 83 EEEEEEEEEEEKEED 97
          EEEEEEE+EE +EE 
Sbjct: 83 EEEEEEEKEESEEEA 97



 Score = 24.5 bits (54), Expect = 9.5
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 83 EEEEEEEEEEEKEEDK 98
            EE++EEEEE+EE +
Sbjct: 76 AAEEKKEEEEEEEEKE 91


>gnl|CDD|205914 pfam13740, ACT_6, ACT domain.  ACT domains bind to amino acids
          and regulate associated enzyme domains.
          Length = 76

 Score = 26.3 bits (59), Expect = 1.2
 Identities = 9/26 (34%), Positives = 13/26 (50%), Gaps = 1/26 (3%)

Query: 52 SEEHKTQIEKALESK-TELKLEVIFY 76
            +   ++EKAL     EL L+V F 
Sbjct: 51 PWDAIARLEKALLFLAQELGLQVRFK 76


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.3 bits (60), Expect = 1.3
 Identities = 13/45 (28%), Positives = 29/45 (64%)

Query: 56  KTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
           + QIEKA + + EL+  + + ++    +E E++EE+E + E++ +
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 27.6 bits (61), Expect = 1.3
 Identities = 12/14 (85%), Positives = 14/14 (100%)

Query: 83  EEEEEEEEEEEKEE 96
           E+EEEEEEEEE+EE
Sbjct: 230 EDEEEEEEEEEEEE 243



 Score = 26.8 bits (59), Expect = 1.9
 Identities = 12/15 (80%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
           +EEEEEEEEEE+E D
Sbjct: 231 DEEEEEEEEEEEEID 245



 Score = 26.8 bits (59), Expect = 2.0
 Identities = 12/15 (80%), Positives = 15/15 (100%)

Query: 83  EEEEEEEEEEEKEED 97
           EE+EEEEEEEE+EE+
Sbjct: 229 EEDEEEEEEEEEEEE 243



 Score = 25.3 bits (55), Expect = 7.1
 Identities = 11/15 (73%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
           E EE+EEEEEE+EE+
Sbjct: 227 ESEEDEEEEEEEEEE 241



 Score = 24.9 bits (54), Expect = 8.3
 Identities = 11/15 (73%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
            EE+EEEEEEE+EE+
Sbjct: 228 SEEDEEEEEEEEEEE 242


>gnl|CDD|224221 COG1302, COG1302, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 131

 Score = 26.9 bits (60), Expect = 1.3
 Identities = 14/54 (25%), Positives = 32/54 (59%), Gaps = 3/54 (5%)

Query: 47  YG---PDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEED 97
           YG   P+ +E  + ++++ +E+ T LK+  +     G   ++E++EEEE  +++
Sbjct: 77  YGVKIPEVAENIQERVKEEVENMTGLKVVEVNVHVVGVKVKKEQKEEEESFDDE 130


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 27.3 bits (61), Expect = 1.4
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 79  NGYHEEEEEEEEEEEKEED 97
             +  EEEEEEEEEE+EE+
Sbjct: 400 ERWAAEEEEEEEEEEEEEE 418



 Score = 26.2 bits (58), Expect = 3.3
 Identities = 13/14 (92%), Positives = 14/14 (100%)

Query: 83  EEEEEEEEEEEKEE 96
           EEEEEEEEEEE+EE
Sbjct: 406 EEEEEEEEEEEEEE 419


>gnl|CDD|153040 pfam12606, RELT, Tumour necrosis factor receptor superfamily
          member 19.  This family of proteins is found in
          bacteria and eukaryotes. Proteins in this family are
          typically between 49 and 288 amino acids in length.
          There are two completely conserved residues (K and Y)
          that may be functionally important. The members of
          tumor necrosis factor receptor (TNFR) superfamily have
          been designated as the "guardians of the immune system"
          due to their roles in immune cell proliferation,
          differentiation, activation, and death (apoptosis). The
          messenger RNA of RELT is especially abundant in
          hematologic tissues such as spleen, lymph node, and
          peripheral blood leukocytes as well as in leukemias and
          lymphomas. RELT is able to activate the NF-kappaB
          pathway and selectively binds tumor necrosis factor
          receptor-associated factor 1.
          Length = 50

 Score = 25.7 bits (57), Expect = 1.4
 Identities = 10/26 (38%), Positives = 16/26 (61%), Gaps = 3/26 (11%)

Query: 77 KKNGYH---EEEEEEEEEEEKEEDKN 99
          KK GY    +  E+E EE+++EE + 
Sbjct: 25 KKKGYRCTVDPLEDEPEEKKEEEKEE 50


>gnl|CDD|234575 PRK00016, PRK00016, metal-binding heat shock protein; Provisional.
          Length = 159

 Score = 27.1 bits (61), Expect = 1.4
 Identities = 10/20 (50%), Positives = 12/20 (60%), Gaps = 1/20 (5%)

Query: 79  NGY-HEEEEEEEEEEEKEED 97
            GY H E+EE EE    EE+
Sbjct: 125 LGYDHIEDEEAEEMFGLEEE 144


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 27.1 bits (60), Expect = 1.5
 Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 50  DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
           D  EE     EK+L   +E    +    ++   E++EE +EE EK++   
Sbjct: 315 DIHEEIPKLDEKSL---SEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361


>gnl|CDD|219822 pfam08401, DUF1738, Domain of unknown function (DUF1738).  This
           region is found in a number of bacterial hypothetical
           proteins. Some members are annotated as being similar to
           replication primases, and in fact this region is often
           found together with the Toprim domain (pfam01751).
          Length = 123

 Score = 26.8 bits (60), Expect = 1.5
 Identities = 9/26 (34%), Positives = 19/26 (73%), Gaps = 2/26 (7%)

Query: 73  VIFYKKNGYHEEEEEEEEEEEKEEDK 98
           V+++K   + +EEE+E+ EE + E++
Sbjct: 89  VVYWKP--FEKEEEDEDGEEGEPEER 112


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 27.3 bits (61), Expect = 1.6
 Identities = 13/20 (65%), Positives = 13/20 (65%)

Query: 83  EEEEEEEEEEEKEEDKNLLK 102
           EEEEEE EE E EE   LL 
Sbjct: 359 EEEEEEPEEPEPEEGPPLLT 378



 Score = 26.2 bits (58), Expect = 3.5
 Identities = 11/15 (73%), Positives = 15/15 (100%)

Query: 83  EEEEEEEEEEEKEED 97
           EEE+E+EEEEE+EE+
Sbjct: 350 EEEQEDEEEEEEEEE 364



 Score = 25.0 bits (55), Expect = 7.8
 Identities = 11/16 (68%), Positives = 15/16 (93%)

Query: 83  EEEEEEEEEEEKEEDK 98
           EE+E+EEEEEE+EE +
Sbjct: 351 EEQEDEEEEEEEEEPE 366



 Score = 25.0 bits (55), Expect = 8.5
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 77  KKNGYHEEEEEEEEEEEKEEDKN 99
           +K+   E+E+EEEEEEE+E ++ 
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEP 368


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 26.9 bits (60), Expect = 1.7
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEE  EEEEE+EE+
Sbjct: 311 EEEEVPEEEEEEEEE 325



 Score = 26.1 bits (58), Expect = 3.7
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEEE  EEEE+EE+
Sbjct: 310 EEEEEVPEEEEEEEE 324



 Score = 25.8 bits (57), Expect = 4.4
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 83  EEEEEEEEEEEKEEDKNLL 101
           EE  EEEEEEE+EE++   
Sbjct: 313 EEVPEEEEEEEEEEERTFE 331



 Score = 25.4 bits (56), Expect = 7.6
 Identities = 11/14 (78%), Positives = 12/14 (85%)

Query: 83  EEEEEEEEEEEKEE 96
           EEEEEE  EEE+EE
Sbjct: 309 EEEEEEVPEEEEEE 322



 Score = 25.0 bits (55), Expect = 9.6
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 83  EEEEEEEEEEEKEED 97
           EEEEEEEEE  +EE+
Sbjct: 306 EEEEEEEEEVPEEEE 320


>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
           Provisional.
          Length = 310

 Score = 27.0 bits (61), Expect = 1.7
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 83  EEEEEEEEEEEKEEDKNLLK 102
           EE EEE EEEE E D  LL+
Sbjct: 230 EEAEEEVEEEEPEFDPILLR 249


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 26.9 bits (59), Expect = 2.0
 Identities = 10/19 (52%), Positives = 12/19 (63%)

Query: 81  YHEEEEEEEEEEEKEEDKN 99
            + EE E EEE E  ED+N
Sbjct: 378 RNPEESEHEEEVEDYEDEN 396


>gnl|CDD|165423 PHA03152, PHA03152, hypothetical protein; Provisional.
          Length = 138

 Score = 26.1 bits (57), Expect = 2.6
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 5  FVLGNAQVPSLYPI--VYCSDGFCELSGFARAQIMQKGCA 42
          F L    +PS+Y I  +Y +D      GF+ AQ+ Q+  A
Sbjct: 30 FALNETLIPSIYDINWIYITDPH-TCGGFSIAQVFQRRSA 68


>gnl|CDD|182533 PRK10542, PRK10542, glutathionine S-transferase; Provisional.
          Length = 201

 Score = 26.2 bits (58), Expect = 2.8
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 46  LYGPDTSEEHKTQIEKALESK 66
           L+ PDT EE+K  +   LE K
Sbjct: 112 LFRPDTPEEYKPTVRAQLEKK 132


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 26.4 bits (59), Expect = 2.8
 Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 49  PDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
           P  +E  + + E    S  E K      +K     E++ E+EE EK   K 
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKK----LRKKQRKAEKKAEKEEAEKAAAKK 437


>gnl|CDD|223396 COG0319, COG0319, Predicted metal-dependent hydrolase [General
           function prediction only].
          Length = 153

 Score = 26.1 bits (58), Expect = 2.9
 Identities = 14/19 (73%), Positives = 16/19 (84%), Gaps = 1/19 (5%)

Query: 80  GY-HEEEEEEEEEEEKEED 97
           GY HEE+EEEEE EE EE+
Sbjct: 121 GYDHEEDEEEEEMEELEEE 139


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 26.3 bits (58), Expect = 2.9
 Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 46  LYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
           LY    +EE  T I++  E   E    V+  ++    EEEEEEEEEEE  E +
Sbjct: 370 LYDKIATEEDATTIDELREFLKEKGHPVV--ERWAEEEEEEEEEEEEEAAEAE 420


>gnl|CDD|216228 pfam00989, PAS, PAS fold.  The PAS fold corresponds to the
          structural domain that has previously been defined as
          PAS and PAC motifs. The PAS fold appears in archaea,
          eubacteria and eukarya.
          Length = 113

 Score = 25.8 bits (57), Expect = 3.3
 Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 6/53 (11%)

Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKF--LYGPDTSEEHKTQIEKALESKTE 68
          I+Y +    EL G +R +++ K        L   D   E    + +AL    E
Sbjct: 23 ILYVNAAAEELLGLSREEVIGK----SLLDLIPEDDDAEVAELLRQALLQGEE 71


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
          represents the eukaryotic large ribosomal protein P1.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P1 is located in the L12 stalk, with proteins
          P2, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers) and bacteria may have four or six
          copies (two or three homodimers), depending on the
          species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plant
          species have a third P-protein, called P3, which is not
          homologous to P1 and P2. In humans, P1 and P2 are
          strongly autoimmunogenic. They play a significant role
          in the etiology and pathogenesis of systemic lupus
          erythema (SLE). In addition, the ribosome-inactivating
          protein trichosanthin (TCS) interacts with human P0,
          P1, and P2, with its primary binding site located in
          the C-terminal region of P2. TCS inactivates the
          ribosome by depurinating a specific adenine in the
          sarcin-ricin loop of 28S rRNA.
          Length = 103

 Score = 25.7 bits (57), Expect = 3.4
 Identities = 9/15 (60%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          +EE++EEEEEE ++D
Sbjct: 83 KEEKKEEEEEESDDD 97


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 26.0 bits (58), Expect = 3.6
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 53  EEHKTQIEKA----LESKTELKLEVIFYKKNG---YHEEEEEEEEEEEKEEDKN 99
           +E+K          LE+   L+LE  F +K     ++    E E+ EEKEE+ N
Sbjct: 827 KEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELN 880


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
          [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 25.4 bits (56), Expect = 3.9
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 83 EEEEEEEEEEEKEED 97
          E EEEE+EEE +EE 
Sbjct: 85 EAEEEEKEEEAEEES 99


>gnl|CDD|239210 cd02844, PAZ_CAF_like, PAZ domain, CAF_like subfamily. CAF (for
           carpel factory) is a plant homolog of Dicer. CAF has
           been implicated in flower morphogenesis and in early
           Arabidopsis development and might function through
           posttranscriptional regulation of specific mRNA
           molecules. PAZ domains are named after the proteins
           Piwi, Argonaut, and Zwille. PAZ is found in two families
           of proteins that are essential components of
           RNA-mediated gene-silencing pathways, including RNA
           interference, the Piwi and Dicer families. PAZ functions
           as a nucleic-acid binding domain, with a strong
           preference for single-stranded nucleic acids (RNA or
           DNA) or RNA duplexes with single-stranded 3' overhangs.
           It has been suggested that the PAZ domain provides a
           unique mode for the recognition of the two 3'-terminal
           nucleotides in single-stranded nucleic acids and buries
           the 3' OH group, and that it might recognize
           characteristic 3' overhangs in siRNAs within RISC
           (RNA-induced silencing) and other complexes.
          Length = 135

 Score = 25.8 bits (57), Expect = 3.9
 Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 4/82 (4%)

Query: 18  IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
             Y   G  +L+  +            +    +  +E K  I     ++  LK + IF  
Sbjct: 43  RFYVISGILDLNANSS---FPGKEGLGYATYAEYFKE-KYGIVLNHPNQPLLKGKQIFNL 98

Query: 78  KNGYHEEEEEEEEEEEKEEDKN 99
            N  H   EE+ E EEKE+D+ 
Sbjct: 99  HNLLHNRFEEKGESEEKEKDRY 120


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 25.4 bits (56), Expect = 4.0
 Identities = 13/15 (86%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          EEEEEEEEEEE EE+
Sbjct: 81 EEEEEEEEEEESEEE 95



 Score = 25.4 bits (56), Expect = 4.3
 Identities = 13/15 (86%), Positives = 14/15 (93%)

Query: 83 EEEEEEEEEEEKEED 97
          EEEEEEEEEEE+EE 
Sbjct: 78 EEEEEEEEEEEEEES 92



 Score = 24.6 bits (54), Expect = 6.8
 Identities = 13/16 (81%), Positives = 16/16 (100%)

Query: 83 EEEEEEEEEEEKEEDK 98
          EEEEEEEEEEE+EE++
Sbjct: 76 EEEEEEEEEEEEEEEE 91



 Score = 24.6 bits (54), Expect = 7.0
 Identities = 13/16 (81%), Positives = 15/16 (93%)

Query: 83 EEEEEEEEEEEKEEDK 98
          EEEEEEEEEEE+EE+ 
Sbjct: 77 EEEEEEEEEEEEEEES 92



 Score = 24.6 bits (54), Expect = 8.0
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 83 EEEEEEEEEEEKEEDK 98
            EEEEEEEEE+EE++
Sbjct: 74 AAEEEEEEEEEEEEEE 89


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 25.6 bits (57), Expect = 4.1
 Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 3/57 (5%)

Query: 44  KFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
           K L       E + +I+K  E K EL+  V   +      E  E+ EEEE++ ++  
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAK---LEAIEKREEEERQIEEKR 167


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 25.8 bits (56), Expect = 4.5
 Identities = 10/50 (20%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 52  SEEHKTQ-IEKALESKTELKLEV-IFYKKNGYHEEEEEEEEEEEKEEDKN 99
           SE + T+  ++ +  ++E++          G   + EE++++ E+E++K+
Sbjct: 81  SEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKS 130


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 26.0 bits (57), Expect = 4.5
 Identities = 10/28 (35%), Positives = 17/28 (60%), Gaps = 4/28 (14%)

Query: 79  NGYHEEEEEEEEEEE----KEEDKNLLK 102
           +    +E+E + E+E    +EEDKN L+
Sbjct: 374 DEDDSDEDEVDYEKERKRRREEDKNFLR 401


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 25.6 bits (57), Expect = 4.9
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 9/48 (18%)

Query: 53  EEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
           ++ K  +E AL  K ELK          + + E++ E+ + K E K  
Sbjct: 52  KDLKESLEAALLDKKELKA---------WEKAEKKAEKAKAKAEKKKA 90


>gnl|CDD|182840 PRK10921, PRK10921, twin-arginine protein translocation system
           subunit TatC; Provisional.
          Length = 258

 Score = 25.5 bits (56), Expect = 5.0
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 75  FYKKNGYHEEEEEEEEEEEKEEDK 98
           FY   G   EEE + E E ++ ++
Sbjct: 235 FYVGKGRRREEENDAEAESEKTEE 258


>gnl|CDD|119297 pfam10777, YlaC, sigma70 family sigma factor YlaC.  Members of the
           sigma70 family of sigma factors are components of the
           RNA polymerase holoenzyme that direct bacterial or
           plastid core RNA polymerase to specific promoter
           elements. This domain is an inner membrane protein of
           unknown function.
          Length = 156

 Score = 25.4 bits (56), Expect = 5.4
 Identities = 8/32 (25%), Positives = 18/32 (56%)

Query: 44  KFLYGPDTSEEHKTQIEKALESKTELKLEVIF 75
           + L  P+  +E K Q++K + +K E+    ++
Sbjct: 117 EILNSPNVPDEQKAQLKKIIATKGEIDFYDVY 148


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 25.4 bits (56), Expect = 5.7
 Identities = 7/15 (46%), Positives = 8/15 (53%)

Query: 83  EEEEEEEEEEEKEED 97
                  EEEE+EED
Sbjct: 288 AAAAAPAEEEEEEED 302



 Score = 25.4 bits (56), Expect = 6.9
 Identities = 6/15 (40%), Positives = 9/15 (60%)

Query: 83  EEEEEEEEEEEKEED 97
                EEEEEE+++ 
Sbjct: 290 AAAPAEEEEEEEDDM 304



 Score = 25.0 bits (55), Expect = 9.1
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query: 83  EEEEEEEEEEEKEED 97
                 EEEEE+E+D
Sbjct: 289 AAAAPAEEEEEEEDD 303


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 25.5 bits (56), Expect = 5.8
 Identities = 13/50 (26%), Positives = 24/50 (48%)

Query: 50  DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
               E   + EK   +  + + +    K++   EEE EEE+EE  +E++ 
Sbjct: 59  TAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108


>gnl|CDD|219844 pfam08447, PAS_3, PAS fold.  The PAS fold corresponds to the
          structural domain that has previously been defined as
          PAS and PAC motifs. The PAS fold appears in archaea,
          eubacteria and eukarya.
          Length = 90

 Score = 24.7 bits (54), Expect = 5.9
 Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 1/64 (1%)

Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEK-ALESKTELKLEVIFY 76
          I+Y S  F E+ G+   ++         L  P+  E  +  +++ +L+       E    
Sbjct: 1  IIYWSPRFEEILGYTPEELKSSYEGWLDLVHPEDRERVRRALQEFSLKKGEPYSGEYRIR 60

Query: 77 KKNG 80
          +K+G
Sbjct: 61 RKDG 64


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 25.5 bits (57), Expect = 6.9
 Identities = 12/20 (60%), Positives = 17/20 (85%)

Query: 83  EEEEEEEEEEEKEEDKNLLK 102
           +EEE+EE+EE +EE K LL+
Sbjct: 480 DEEEKEEKEEAEEEFKPLLE 499


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 25.4 bits (56), Expect = 7.1
 Identities = 11/47 (23%), Positives = 21/47 (44%)

Query: 52  SEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
           S+ H+  +EK  E    LK ++   KK     E   + + + K + +
Sbjct: 272 SKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFE 318


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This
          is a family of uncharacterized proteins.
          Length = 449

 Score = 25.3 bits (55), Expect = 7.1
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 77 KKNGYHEEEEEEEEEEEKEEDKN 99
          +K G   EEE  EEE++ EED +
Sbjct: 39 RKLGKEAEEEAMEEEDDDEEDDD 61


>gnl|CDD|221096 pfam11368, DUF3169, Protein of unknown function (DUF3169).  Some
           members in this family of proteins are annotated as
           membrane proteins however this cannot be confirmed.
           Currently there is no known function.
          Length = 248

 Score = 25.4 bits (56), Expect = 7.1
 Identities = 8/26 (30%), Positives = 16/26 (61%), Gaps = 1/26 (3%)

Query: 78  KNGYHEEEEEEEEEE-EKEEDKNLLK 102
           K  Y +EE+E+  +E  ++ ++NL  
Sbjct: 76  KALYDKEEDEDAADELYRQANRNLEY 101


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 25.0 bits (55), Expect = 7.2
 Identities = 7/17 (41%), Positives = 16/17 (94%)

Query: 83  EEEEEEEEEEEKEEDKN 99
           E++EE+++E+E EE+++
Sbjct: 124 EDDEEDDDEDESEEEES 140



 Score = 24.6 bits (54), Expect = 8.5
 Identities = 9/15 (60%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
           E+EEEE++EE+ +ED
Sbjct: 119 EDEEEEDDEEDDDED 133



 Score = 24.6 bits (54), Expect = 9.6
 Identities = 9/22 (40%), Positives = 15/22 (68%)

Query: 76  YKKNGYHEEEEEEEEEEEKEED 97
             +    +++EE+EEEE+ EED
Sbjct: 108 ASEEDESDDDEEDEEEEDDEED 129



 Score = 24.6 bits (54), Expect = 9.8
 Identities = 8/15 (53%), Positives = 14/15 (93%)

Query: 83  EEEEEEEEEEEKEED 97
           EE+EEEE++EE +++
Sbjct: 118 EEDEEEEDDEEDDDE 132


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 25.4 bits (55), Expect = 7.9
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 8/58 (13%)

Query: 50  DTSEEHKTQIEKALESKTELKLEVIFYKKNGYH--------EEEEEEEEEEEKEEDKN 99
           D  ++   + E + ES  E  L+ IFY  +G            EE+E  EEEKEE++N
Sbjct: 729 DAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEEN 786


>gnl|CDD|220001 pfam08748, DUF1789, Domain of unknown function (DUF1789).
          Putative uncharacterized domain found in phage-related
          conserved hypothetical protein from Bordetella.
          Length = 125

 Score = 24.7 bits (54), Expect = 7.9
 Identities = 8/55 (14%), Positives = 21/55 (38%)

Query: 44 KFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
          KF+ G   + +   +I +      +++       +    E  ++  E + K  +K
Sbjct: 5  KFVLGQLPTFKLPVKIPRLGGEDAKVEFTFKHMDRKELAELYDKWGESDAKLGEK 59


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 25.1 bits (55), Expect = 8.4
 Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 45  FLYGPDTSEEHKTQIEKALESKTELK-LEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
           F +  D S   K           E K  + +  K     ++E+++EEE ++EE  
Sbjct: 711 FFFSFDISGLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The
          CAF-1 or chromatin assembly factor-1 consists of three
          subunits, and this is the first, or A. The A domain is
          uniquely required for the progression of S phase in
          mouse cells, independent of its ability to promote
          histone deposition but dependent on its ability to
          interact with HP1 - heterochromatin protein 1-rich
          heterochromatin domains next to centromeres that are
          crucial for chromosome segregation during mitosis. This
          HP1-CAF-1 interaction module functions as a built-in
          replication control for heterochromatin, which, like a
          control barrier, has an impact on S-phase progression
          in addition to DNA-based checkpoints.
          Length = 76

 Score = 24.1 bits (53), Expect = 8.5
 Identities = 7/15 (46%), Positives = 13/15 (86%)

Query: 83 EEEEEEEEEEEKEED 97
          E+EE+EEE+++ + D
Sbjct: 60 EDEEDEEEDDDDDMD 74


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 25.1 bits (55), Expect = 8.6
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 11/48 (22%)

Query: 51  TSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
           +++E K +       K   +L+ I      + E +E+E+EEEE+  D+
Sbjct: 348 STKESKMR------DKRRARLDPI-----DFEEVDEDEDEEEEQRSDE 384


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 25.0 bits (55), Expect = 8.7
 Identities = 11/53 (20%), Positives = 26/53 (49%)

Query: 50  DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNLLK 102
           +  EE   +IE   E+K   + +    +++     EE+E+ ++ ++ DK  + 
Sbjct: 234 EREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286


>gnl|CDD|218778 pfam05852, DUF848, Gammaherpesvirus protein of unknown function
           (DUF848).  This family consists of several
           uncharacterized proteins from the Gammaherpesvirinae.
          Length = 146

 Score = 24.6 bits (54), Expect = 8.8
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 59  IEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKE 95
            ++  E K EL+ E+   +  G  +EE   EEEE ++
Sbjct: 95  TDRVEELKEELEFELERLEAAGGEQEELSGEEEEPED 131


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 25.0 bits (54), Expect = 9.2
 Identities = 13/17 (76%), Positives = 15/17 (88%)

Query: 83  EEEEEEEEEEEKEEDKN 99
           EEEEEEEEEEE E ++N
Sbjct: 139 EEEEEEEEEEEAEVEEN 155


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 24.8 bits (54), Expect = 9.2
 Identities = 12/17 (70%), Positives = 16/17 (94%)

Query: 83  EEEEEEEEEEEKEEDKN 99
           +EEEEE EEEE+EED++
Sbjct: 268 DEEEEETEEEEEEEDED 284


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 24.9 bits (55), Expect = 9.3
 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 44  KFLYGPDTSEE--HKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
           +FL G   + +   K   +   + + E K +          EEEEEEE+E+++EE + 
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457


>gnl|CDD|204616 pfam11240, DUF3042, Protein of unknown function (DUF3042).  This
          family of proteins with unknown function appears to be
          restricted to Firmicutes.
          Length = 54

 Score = 23.5 bits (51), Expect = 9.8
 Identities = 9/22 (40%), Positives = 11/22 (50%)

Query: 73 VIFYKKNGYHEEEEEEEEEEEK 94
          V  +KK      EE+EE  EE 
Sbjct: 23 VFAFKKKVIEPIEEKEEFIEEN 44


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 25.0 bits (54), Expect = 9.9
 Identities = 5/15 (33%), Positives = 15/15 (100%)

Query: 83  EEEEEEEEEEEKEED 97
           ++E+E+E+++++E+D
Sbjct: 156 DDEDEDEDDDDEEDD 170


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.131    0.371 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,066,640
Number of extensions: 425579
Number of successful extensions: 3250
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2542
Number of HSP's successfully gapped: 480
Length of query: 102
Length of database: 10,937,602
Length adjustment: 68
Effective length of query: 34
Effective length of database: 7,921,530
Effective search space: 269332020
Effective search space used: 269332020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.0 bits)