RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4652
         (291 letters)



>gnl|CDD|224953 COG2042, COG2042, Uncharacterized conserved protein [Function
           unknown].
          Length = 179

 Score =  179 bits (456), Expect = 2e-56
 Identities = 86/251 (34%), Positives = 113/251 (45%), Gaps = 87/251 (34%)

Query: 1   MWDMKHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELL 60
           ++    CDPKKC+G+KL R  L ++L++G +F G                          
Sbjct: 16  IYHAGQCDPKKCTGKKLERFGLARLLRVGGKFPG-------------------------- 49

Query: 61  DLYKACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKT 120
                           + LTP   + +S AD DI++  G  VVDCSW ++E   F K++ 
Sbjct: 50  ----------------VVLTPFAEKALSPADRDIVERFGITVVDCSWNRVERV-FKKLRG 92

Query: 121 NHPRLLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNE 180
              R LPFL+AANP+NYGKP KLS  EALAA   I GF  EA                  
Sbjct: 93  REHRRLPFLVAANPVNYGKPFKLSSAEALAAALYIVGFKDEA-----------------S 135

Query: 181 ELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVV 240
           EL                           L KFKWG TFL LN+ELL+ Y   + S +VV
Sbjct: 136 EL---------------------------LSKFKWGHTFLELNKELLEEYSNAEDSAEVV 168

Query: 241 KVQNEYLKKIE 251
           ++Q EYL+K E
Sbjct: 169 EIQEEYLEKGE 179


>gnl|CDD|112833 pfam04034, DUF367, Domain of unknown function (DUF367). 
          Length = 128

 Score =  172 bits (438), Expect = 2e-54
 Identities = 69/114 (60%), Positives = 83/114 (72%)

Query: 77  ICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPRLLPFLIAANPIN 136
           + LTP     +S  D DI++ +G AVVDCSWA++EE PF K++  H RLLPFL+AANP+N
Sbjct: 1   LVLTPFAEIALSPEDKDIVERSGIAVVDCSWARLEEVPFKKIRGRHERLLPFLVAANPVN 60

Query: 137 YGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECD 190
           YGKPCKLSCVEALAA   I GF  EA   L KFKWG TFL LN+ELL+ Y  C+
Sbjct: 61  YGKPCKLSCVEALAAALYIVGFKDEASELLSKFKWGHTFLELNKELLEAYSNCE 114



 Score = 50.0 bits (120), Expect = 7e-08
 Identities = 20/34 (58%), Positives = 23/34 (67%)

Query: 34  GFPKEAKFYLDKFKWGRTFLTLNEELLDLYKACD 67
           GF  EA   L KFKWG TFL LN+ELL+ Y  C+
Sbjct: 81  GFKDEASELLSKFKWGHTFLELNKELLEAYSNCE 114


>gnl|CDD|235024 PRK02287, PRK02287, hypothetical protein; Provisional.
          Length = 171

 Score =  168 bits (427), Expect = 3e-52
 Identities = 80/247 (32%), Positives = 106/247 (42%), Gaps = 85/247 (34%)

Query: 5   KHCDPKKCSGRKLARHNLIKILKLGERFRGFPKEAKFYLDKFKWGRTFLTLNEELLDLYK 64
              DPKKC+ RKL R  L ++++     R  P+ +                         
Sbjct: 10  NQDDPKKCTARKLVRFGLARLVRSI---RKIPRGS------------------------- 41

Query: 65  ACDTVLFIFLMRICLTPVGVQCVSKADHDIIQENGAAVVDCSWAKIEETPFNKMKTNHPR 124
                       I L P   + +S AD DI+++ G   +DCSW + E   F K+   H R
Sbjct: 42  ------------IVLNPFAEKALSPADRDIVEKRGIVALDCSWNEAERV-FFKLLRGHHR 88

Query: 125 LLPFLIAANPINYGKPCKLSCVEALAATFIIAGFPKEAKFYLDKFKWGRTFLTLNEELLD 184
            LPFL+AANP+NYGKP KLS VEALAA   I GF +EA                 E++  
Sbjct: 89  RLPFLVAANPVNYGKPFKLSSVEALAAALYILGFKEEA-----------------EKI-- 129

Query: 185 LYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQN 244
                                    L KFKWG TFL LN+E L+ Y     S ++V++Q 
Sbjct: 130 -------------------------LSKFKWGHTFLELNKEPLEAYARAKDSEEIVEIQK 164

Query: 245 EYLKKIE 251
           EYL KI 
Sbjct: 165 EYLGKIL 171


>gnl|CDD|217870 pfam04068, RLI, Possible Fer4-like domain in RNase L inhibitor,
          RLI.  Possible metal-binding domain in endoribonuclease
          RNase L inhibitor. Found at the N-terminal end of RNase
          L inhibitor proteins, adjacent to the 4Fe-4S binding
          domain, fer4, pfam00037. Also often found adjacent to
          the DUF367 domain pfam04034 in uncharacterized
          proteins. The RNase L system plays a major role in the
          anti-viral and anti-proliferative activities of
          interferons, and could possibly play a more general
          role in the regulation of RNA stability in mammalian
          cells. Inhibitory activity requires
          concentration-dependent association of RLI with RNase
          L.
          Length = 35

 Score = 49.4 bits (119), Expect = 1e-08
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 1  MWDMKHCDPKKCSGRKLARHNLIKILKLGER 31
          + D   CDPKKC+GRK AR   +++++ G++
Sbjct: 5  IVDFDQCDPKKCTGRKCARFCPVRLVRTGKK 35


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           P   +    E E  EEEEEEEEEE++  E+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 36.9 bits (86), Expect = 0.010
 Identities = 17/37 (45%), Positives = 19/37 (51%)

Query: 254 RNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
           R  P  P  P  E EEEEE  EEEEEE++  E     
Sbjct: 296 RRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332



 Score = 31.5 bits (72), Expect = 0.50
 Identities = 17/34 (50%), Positives = 20/34 (58%)

Query: 254 RNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           R     P   P E EEEEEEEEE  EE++  E+E
Sbjct: 292 RRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEE 325



 Score = 27.7 bits (62), Expect = 7.9
 Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 8/46 (17%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAI--------EDEL 288
           E E       + P  E EEEEEEE   EEE +A         E+EL
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRATVAEAIRLLEEEL 349


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 38.8 bits (90), Expect = 0.003
 Identities = 17/23 (73%), Positives = 20/23 (86%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           SE EEEEEEEEEEEEE++  E+E
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEE 885



 Score = 37.7 bits (87), Expect = 0.006
 Identities = 16/22 (72%), Positives = 20/22 (90%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           +SEEEEEEEEEEEEE++  E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEE 883



 Score = 37.3 bits (86), Expect = 0.010
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+E
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENE 891



 Score = 36.9 bits (85), Expect = 0.011
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEE 888



 Score = 36.9 bits (85), Expect = 0.011
 Identities = 16/23 (69%), Positives = 19/23 (82%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
           E EEEEEEEEEEEEE++  E+ L
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPL 894



 Score = 36.9 bits (85), Expect = 0.012
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEE 887



 Score = 36.9 bits (85), Expect = 0.012
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+E
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEE 889



 Score = 36.5 bits (84), Expect = 0.014
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+E
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEE 886



 Score = 36.1 bits (83), Expect = 0.023
 Identities = 15/22 (68%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++  E+ 
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEEN 890



 Score = 36.1 bits (83), Expect = 0.023
 Identities = 15/22 (68%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEEE++   +E
Sbjct: 871 EEEEEEEEEEEEEEEEEEENEE 892



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 14/22 (63%), Positives = 17/22 (77%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E EEEEEEEEEEEE ++ +  E
Sbjct: 876 EEEEEEEEEEEEEENEEPLSLE 897



 Score = 30.7 bits (69), Expect = 1.0
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEE 279
           EEE       +    E EEEEEEEEE EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           S+  + EEEEEEEEEE++  E+E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEE 880


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 35.8 bits (83), Expect = 0.020
 Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIE 285
           +L  +  T  DV  ++   LKK  +E+  P V  +   E EEEEEEEEEEEE    + 
Sbjct: 370 ELRDKIATEEDVKDIEE--LKKFLKEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVM 425


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 33.4 bits (77), Expect = 0.023
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 263 PPSESEEEEEEEEEEEEED 281
             +E E++EEEEEEEE++D
Sbjct: 64  AAAEEEKKEEEEEEEEDDD 82



 Score = 32.6 bits (75), Expect = 0.044
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 263 PPSESEEEEEEEEEEEEED 281
             +  EE++EEEEEEEE+D
Sbjct: 63  AAAAEEEKKEEEEEEEEDD 81



 Score = 31.4 bits (72), Expect = 0.094
 Identities = 12/19 (63%), Positives = 16/19 (84%)

Query: 263 PPSESEEEEEEEEEEEEED 281
             + +EEE++EEEEEEEED
Sbjct: 62  AAAAAEEEKKEEEEEEEED 80



 Score = 30.3 bits (69), Expect = 0.26
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + +    EEE++EEEE++  +D++G
Sbjct: 58  AAAAAAAAAEEEKKEEEEEEEEDDDMG 84



 Score = 26.0 bits (58), Expect = 8.0
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             + +     EEE++EEE++  ED+
Sbjct: 57  AAAAAAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.4 bits (77), Expect = 0.027
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + + EE++EEEEEEEE +  E+E  
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEEAA 98



 Score = 33.4 bits (77), Expect = 0.029
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 259 VPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
                 + +  EE++EEEEEEE+K   +E
Sbjct: 67  AAAAAAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 32.2 bits (74), Expect = 0.068
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
              E +EEEEEEEE+EE ++     LG
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAAGLG 102



 Score = 31.8 bits (73), Expect = 0.094
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + +EE++EEEEEEEE++++ E+   
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEEAAA 99


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 35.3 bits (81), Expect = 0.030
 Identities = 14/18 (77%), Positives = 15/18 (83%)

Query: 263 PPSESEEEEEEEEEEEEE 280
             SE +EEEEEEEEEEEE
Sbjct: 226 SESEEDEEEEEEEEEEEE 243



 Score = 34.9 bits (80), Expect = 0.039
 Identities = 15/21 (71%), Positives = 15/21 (71%)

Query: 261 DFPPSESEEEEEEEEEEEEED 281
           D    E EEEEEEEEEEEE D
Sbjct: 225 DSESEEDEEEEEEEEEEEEID 245



 Score = 30.3 bits (68), Expect = 1.1
 Identities = 13/17 (76%), Positives = 16/17 (94%)

Query: 265 SESEEEEEEEEEEEEED 281
           S+SE EE+EEEEEEEE+
Sbjct: 224 SDSESEEDEEEEEEEEE 240



 Score = 28.4 bits (63), Expect = 4.7
 Identities = 10/17 (58%), Positives = 10/17 (58%)

Query: 260 PDFPPSESEEEEEEEEE 276
                 E EEEEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 35.2 bits (81), Expect = 0.034
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIE 285
           DLY +  T  D   +  + L++  +E+  P V  +   E EEEEEEEEE  E +  +E
Sbjct: 369 DLYDKIATEEDATTI--DELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPME 424


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 34.7 bits (80), Expect = 0.047
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDE 287
             P + +EEEE+E+EEEEE++   +E
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEE 367



 Score = 32.0 bits (73), Expect = 0.37
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDELGR 290
             E E+E+EEEEEEEEE +  E E G 
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGP 374



 Score = 27.7 bits (62), Expect = 8.5
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 261 DFPPSESEEEEEEEEEEEEE 280
           +    E EEEEEE EE E E
Sbjct: 352 EQEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 34.5 bits (80), Expect = 0.048
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + +EEEEEEEEEEEEE+ + E+   
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEEAAA 323



 Score = 34.5 bits (80), Expect = 0.051
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + + EEEEEEEEEEEE++  E+E  
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEAA 322



 Score = 32.1 bits (74), Expect = 0.25
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
              E EEEEEEEEEEE  ++     LG
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEAAAGLG 326



 Score = 28.7 bits (65), Expect = 3.7
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             +++   EEEEEEEEEE++    E
Sbjct: 294 AQAQAAAAEEEEEEEEEEEEEEPSE 318



 Score = 27.9 bits (63), Expect = 7.2
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEE 279
           E LK++   +      +    E EEEEEEE  EEE
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 35.0 bits (80), Expect = 0.051
 Identities = 18/68 (26%), Positives = 31/68 (45%)

Query: 221  TLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEE 280
            +LNE+L D   E D      K   +     E +       +    + + +E+E+EEE  +
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961

Query: 281  DKAIEDEL 288
            D  I+DE+
Sbjct: 3962 DVGIDDEI 3969



 Score = 28.8 bits (64), Expect = 5.1
 Identities = 17/75 (22%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 220  LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEE 279
            L L+E+  D+ K+ D     ++  +E  ++ + E++ P + D  P E     +E+ ++++
Sbjct: 3993 LKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEP-MQDEDPLEENNTLDEDIQQDD 4051

Query: 280  -----EDKAIEDELG 289
                 ED    +E G
Sbjct: 4052 FSDLAEDDEKMNEDG 4066


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 34.3 bits (78), Expect = 0.055
 Identities = 14/22 (63%), Positives = 19/22 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           ES+E+EEEEEEEEEE+  +E+ 
Sbjct: 134 ESDEDEEEEEEEEEEEAEVEEN 155



 Score = 32.0 bits (72), Expect = 0.31
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDELG 289
              E+EEEEEEEEEEE +  E+E G
Sbjct: 134 ESDEDEEEEEEEEEEEAEVEENEQG 158



 Score = 30.0 bits (67), Expect = 1.4
 Identities = 12/22 (54%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           +  +E+EEEEEEEEE++A  +E
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEE 154



 Score = 27.7 bits (61), Expect = 8.0
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E E +E+EEEEEEEE++  E E
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVE 153



 Score = 27.4 bits (60), Expect = 8.8
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 266 ESEEEEEEEEEEEEEDKAI 284
           E EEEEEEE E EE ++  
Sbjct: 141 EEEEEEEEEAEVEENEQGT 159


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 32.3 bits (74), Expect = 0.075
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             + + EEEEEEEEEEEE++   +E
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEEESEE 94



 Score = 31.6 bits (72), Expect = 0.11
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDELG 289
            +  EEEEEEEEEEEEE+++ E+ + 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 31.6 bits (72), Expect = 0.12
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             + +  EEEEEEEEEEE++  E E
Sbjct: 69  AAAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 30.8 bits (70), Expect = 0.24
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             + +EEEEEEEEEEEEE++  E+E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 30.4 bits (69), Expect = 0.32
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
             + +   EEEEEEEEEE++  E+ 
Sbjct: 68  AAAAAAAAEEEEEEEEEEEEEEEES 92



 Score = 29.6 bits (67), Expect = 0.63
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDELG 289
           E EEEEEEEEEEEE ++     LG
Sbjct: 78  EEEEEEEEEEEEEESEEEAMAGLG 101



 Score = 28.1 bits (63), Expect = 2.3
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 250 IEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           I++    P       + +    EEEEEEEEE++  E+E
Sbjct: 54  IKKAAAAPVAAAAAAAAAAAAAEEEEEEEEEEEEEEEE 91


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 32.5 bits (74), Expect = 0.089
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           +    +EEEEEEEE E +   DE
Sbjct: 91  TGHTRQEEEEEEEENEKQQQSDE 113


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 32.7 bits (75), Expect = 0.095
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           S+ +EE+EEEE++EE+D   E E
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESE 136



 Score = 31.9 bits (73), Expect = 0.17
 Identities = 11/22 (50%), Positives = 20/22 (90%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E +EEEE++EE+++ED++ E+E
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEE 139



 Score = 30.8 bits (70), Expect = 0.41
 Identities = 12/23 (52%), Positives = 19/23 (82%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
            ES+++EE+EEEE++E+   EDE
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDE 134



 Score = 28.4 bits (64), Expect = 2.2
 Identities = 9/23 (39%), Positives = 18/23 (78%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
            + E+EEEE++EE++++   E+E
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEE 138



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E E +++EE+EEEE+D+  +DE
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDE 132



 Score = 27.3 bits (61), Expect = 6.1
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
             ++EE+EEEE++EED   ++  
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESE 136



 Score = 26.9 bits (60), Expect = 7.8
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           SE +E +++EE+EEEED   +D+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDD 131


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 33.5 bits (76), Expect = 0.13
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 244 NEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEE--EEEDKAIEDELGRI 291
           +    +I+ E   P VPD    E EE E EE+EE  E+E   + DE    
Sbjct: 120 SIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEE 169


>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655).  This
           domain family is found in viruses, and is approximately
           70 amino acids in length. The family is found in
           association with pfam08716, pfam01661, pfam05409,
           pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
           pfam08715, pfam08710.
          Length = 70

 Score = 29.6 bits (66), Expect = 0.29
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDELG 289
           +PP E E+ EE E EEEE ++  E E G
Sbjct: 6   YPPDEEEDCEEYECEEEEIEETCEHEYG 33


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 30.5 bits (69), Expect = 0.29
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 256 MPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
            P       +E + EE+EEE+++EE+K  E+E
Sbjct: 65  APAAAAAAAAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 29.8 bits (67), Expect = 0.49
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIE 285
            +    + EE+++EEE+EEEE++A+ 
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 29.4 bits (66), Expect = 0.89
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
              E  EE+EEE+++EEE +  E+E
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 28.2 bits (63), Expect = 1.9
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDELG 289
             ++EE+EEE+++EEE+++  E+ L 
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 27.8 bits (62), Expect = 2.5
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
              E EEE+++EEE+EEE++     LG
Sbjct: 77  KAEEKEEEKKKEEEKEEEEEEALAGLG 103


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 30.4 bits (69), Expect = 0.32
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             +E++E EEEE+EEE E+++ +D L 
Sbjct: 79  AAAEADEAEEEEKEEEAEEESDDDMLF 105



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
            + +E +E EEEE+EEE +   D+
Sbjct: 78  EAAAEADEAEEEEKEEEAEEESDD 101



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
            +E+  E +E EEEE+E++A E+ 
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEES 99



 Score = 27.8 bits (62), Expect = 3.1
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             +   +E EEEE+EEE ++  +D++ 
Sbjct: 78  EAAAEADEAEEEEKEEEAEEESDDDML 104


>gnl|CDD|240630 cd05305, L-AlaDH, Alanine dehydrogenase NAD-binding and catalytic
           domains.  Alanine dehydrogenase (L-AlaDH) catalyzes the
           NAD-dependent conversion of pyruvate to L-alanine via
           reductive amination. Like formate dehydrogenase and
           related enzymes, L-AlaDH is comprised of 2 domains
           connected by a long alpha helical stretch, each
           resembling a Rossmann fold NAD-binding domain. The
           NAD-binding domain is inserted within the linear
           sequence of the more divergent catalytic domain. Ligand
           binding and active site residues are found in the cleft
           between the subdomains. L-AlaDH is typically hexameric
           and is critical in carbon and nitrogen metabolism in
           micro-organisms.
          Length = 359

 Score = 32.0 bits (74), Expect = 0.33
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 76  RICLTPVGVQCVSKADHDIIQENGA 100
           R+ LTP GV  +  A H+++ E GA
Sbjct: 15  RVALTPAGVAELVAAGHEVLVEKGA 39


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 32.2 bits (74), Expect = 0.36
 Identities = 24/136 (17%), Positives = 44/136 (32%), Gaps = 11/136 (8%)

Query: 160 KEAKFYLDKFKWGRTFLTLNEELLDLYKECDTIFYEDLDGERFPKEAKFYLDKFKWGRTF 219
              K  L   +     L  NE+ +      + +     D E  P E    +         
Sbjct: 747 APLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEVTAVVGGA----EL 802

Query: 220 LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDF----PPSESEEEEEEEE 275
                 L+DL  E      + K   +  K+I+      +   F    P    E+E+E+  
Sbjct: 803 FLPLAGLIDLAAELA---RLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLA 859

Query: 276 EEEEEDKAIEDELGRI 291
           E + +   +E+ L  +
Sbjct: 860 EYQVKLAKLEERLAVL 875


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 32.1 bits (73), Expect = 0.37
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 252 EERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           E +  P  P   P E EEE +E ++ + +D    D++
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQI 306



 Score = 30.2 bits (68), Expect = 1.3
 Identities = 11/38 (28%), Positives = 16/38 (42%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           E E   P  P  P     EEE +E ++ + D   E + 
Sbjct: 268 EPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQ 305


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 31.9 bits (72), Expect = 0.42
 Identities = 11/52 (21%), Positives = 29/52 (55%)

Query: 236 STDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
             D++ ++    + + EE   P        + E+E+E++++EE++++  E+E
Sbjct: 126 DLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177



 Score = 29.6 bits (66), Expect = 2.7
 Identities = 14/22 (63%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           + EE++EEEEEEEEE K  +DE
Sbjct: 165 DDEEDDEEEEEEEEEIKGFDDE 186



 Score = 28.1 bits (62), Expect = 7.0
 Identities = 9/23 (39%), Positives = 20/23 (86%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
           E E+E++++EE++EE++  E+E+
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEI 180


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 32.0 bits (73), Expect = 0.43
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 255 NMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           +     D    E EEEEEEE+E+E   K   D+
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD 415


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 30.1 bits (67), Expect = 0.47
 Identities = 11/25 (44%), Positives = 20/25 (80%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDELG 289
           S+S+EEE++++E+EE+D   +DE  
Sbjct: 14  SDSDEEEDDDDEDEEDDDEDDDEDD 38



 Score = 29.4 bits (65), Expect = 0.79
 Identities = 8/22 (36%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           +S+ + +EEE++++ED+  +DE
Sbjct: 11  DSDSDSDEEEDDDDEDEEDDDE 32



 Score = 28.2 bits (62), Expect = 2.1
 Identities = 9/23 (39%), Positives = 19/23 (82%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           S+S+ +EEE++++E+E+   ED+
Sbjct: 12  SDSDSDEEEDDDDEDEEDDDEDD 34



 Score = 27.4 bits (60), Expect = 4.0
 Identities = 8/23 (34%), Positives = 19/23 (82%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDEL 288
           E ++++E+EE+++E+D   +DE+
Sbjct: 19  EEDDDDEDEEDDDEDDDEDDDEV 41


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.8 bits (72), Expect = 0.51
 Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 248 KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA---IEDELG 289
           + +EEE       +    ESEEEE E+EEEEEE +A    E+E+ 
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           P     ESEEEE  EEEEEEE++  E+E
Sbjct: 432 PSMASQESEEEESVEEEEEEEEEEEEEE 459



 Score = 29.5 bits (66), Expect = 2.4
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 231 KECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
           K   TS +   + ++  ++ E         +    E +E EEEE E+EEE++ +E + G
Sbjct: 423 KASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.5 bits (71), Expect = 0.59
 Identities = 14/40 (35%), Positives = 19/40 (47%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
            EER     P+ P     E++E+ EE EEE    E  L +
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSK 336


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 30.0 bits (68), Expect = 0.60
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDELGR 290
           SES +EEEEEE EEEE +  + E G 
Sbjct: 52  SESSDEEEEEEAEEEEAETGDVEAGS 77


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 31.0 bits (70), Expect = 0.64
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
           +E+E E+E  EEEEEED  +E EL
Sbjct: 65  TEAEVEDEGGEEEEEEDADVETEL 88


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 29.7 bits (67), Expect = 0.66
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 3/49 (6%)

Query: 234 DTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDK 282
           D   ++ + Q E  +K E E       +   +  E   EEEE EEEEDK
Sbjct: 32  DEIAELTEEQGEEEEKNEVEEKQAVEEE---ANKEGVVEEEEVEEEEDK 77


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 30.7 bits (70), Expect = 0.77
 Identities = 13/39 (33%), Positives = 15/39 (38%)

Query: 253 ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
               P  P+      E  E EE  EEE     E+EL  I
Sbjct: 26  PPPEPPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAI 64


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 30.6 bits (69), Expect = 0.80
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
             E +E+EEEEE EEEE++  EDE
Sbjct: 262 EKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 30.4 bits (69), Expect = 0.90
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
           EEE       DF  SE +E E ++EEE E++   E+ L +
Sbjct: 47  EEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKK 86


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 29.2 bits (65), Expect = 0.97
 Identities = 12/19 (63%), Positives = 16/19 (84%)

Query: 263 PPSESEEEEEEEEEEEEED 281
           P  ++EE +EEEEEEE+ED
Sbjct: 81  PAEKAEEAKEEEEEEEDED 99


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 30.4 bits (69), Expect = 0.99
 Identities = 12/56 (21%), Positives = 19/56 (33%)

Query: 226 LLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEED 281
           L  +  E   +  + +   E L                 + +     EEEEEEE+D
Sbjct: 248 LAAIGLESGFTFPLAEKIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDD 303



 Score = 27.3 bits (61), Expect = 9.3
 Identities = 7/18 (38%), Positives = 11/18 (61%)

Query: 264 PSESEEEEEEEEEEEEED 281
            + +    EEEEEEE++ 
Sbjct: 287 AAAAAAPAEEEEEEEDDM 304


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 30.0 bits (68), Expect = 0.99
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDK 282
           P V D P  E E+ E+++E EE E K
Sbjct: 139 PTVDDKPKIEEEKAEKDQEPEESETK 164


>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
           TFIIE alpha.  TFIIE is compiled of two subunits, alpha
           and beta. This family of proteins are the C terminal
           domain of the alpha subunit of the protein which is the
           largest subunit and contains several functional domains
           which are important for basal transcription and cell
           growth. The C terminal end of the protein binds directly
           to the amino-terminal PH domain of p62/Tfb1 (of IIH)
           which is involved in the recruitment of the general
           transcription factor IIH to the transcription
           preinitiation complex. P53 competes for the same binding
           site as TFIIE alpha which shows their structural
           similarity. Like p53, TFIIE alpha 336-439 can activate
           transcription in vivo.
          Length = 86

 Score = 28.6 bits (64), Expect = 0.99
 Identities = 12/36 (33%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
           E + + P+ P+ P  +S +EEE++E++E E+   E 
Sbjct: 7   ESDDDSPS-PEPPAGQSVDEEEDDEDDEFEEVEDEP 41


>gnl|CDD|225411 COG2855, COG2855, Predicted membrane protein [Function unknown].
          Length = 334

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 15/77 (19%), Positives = 25/77 (32%), Gaps = 9/77 (11%)

Query: 22  LIKILKLGERFRGFPKEAKFYLDKFKWG-------RTFLTLNEELLDLYKACDTVLFIFL 74
           L  IL+         K  K  +  F  G        +   +  E++       T L    
Sbjct: 232 LSLILRRQSGAGSGSKGLKITIPWFILGFLLVALLNSLGLIPAEVVSALVTLSTFLLAMA 291

Query: 75  M-RICLTPVGVQCVSKA 90
           M  + LT   ++ + KA
Sbjct: 292 MAALGLT-THIKALKKA 307


>gnl|CDD|219735 pfam08170, POPLD, POPLD (NUC188) domain.  This domain is found in
           POP1-like nucleolar proteins.
          Length = 92

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 247 LKKIEEERNMPNVP-DFPPSESEEEEEEEEEEEEEDK 282
           L++I  E   P  P DFP +E+   EE EE EE E K
Sbjct: 31  LRQIHFESGQPFFPGDFPDTEAGWLEELEEREELEKK 67


>gnl|CDD|217634 pfam03601, Cons_hypoth698, Conserved hypothetical protein 698. 
          Length = 305

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 12/50 (24%), Positives = 18/50 (36%), Gaps = 8/50 (16%)

Query: 39  AKFYLDKFKWG-------RTFLTLNEELLDLYKACDTVLFIFLMR-ICLT 80
            +  +  F  G        T + L  E++   K   T L +  M  I LT
Sbjct: 239 IRITIPWFVLGFLLVALLNTLVLLPPEVVSALKTLSTFLLLMAMAAIGLT 288


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 28.7 bits (64), Expect = 1.2
 Identities = 11/17 (64%), Positives = 17/17 (100%)

Query: 265 SESEEEEEEEEEEEEED 281
           +E+++EE++EEEEEEED
Sbjct: 88  AEAKKEEKKEEEEEEED 104



 Score = 28.3 bits (63), Expect = 2.0
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 265 SESEEEEEEEEEEEEED 281
              +EE++EEEEEEE+D
Sbjct: 89  EAKKEEKKEEEEEEEDD 105



 Score = 27.2 bits (60), Expect = 5.0
 Identities = 12/23 (52%), Positives = 19/23 (82%), Gaps = 3/23 (13%)

Query: 267 SEEEEEEEEEEEEEDKAIEDELG 289
           +E ++EE++EEEEE+   ED+LG
Sbjct: 88  AEAKKEEKKEEEEEE---EDDLG 107


>gnl|CDD|132829 cd07296, PX_PLD1, The phosphoinositide binding Phox Homology domain
           of Phospholipase D1.  The PX domain is a
           phosphoinositide binding module present in many proteins
           with diverse functions such as cell signaling, vesicular
           trafficking, protein sorting, and lipid modification,
           among others. Phospholipase D (PLD) catalyzes the
           hydrolysis of the phosphodiester bond of
           phosphatidylcholine to generate membrane-bound
           phosphatidic acid and choline. PLDs are implicated in
           many cellular functions like signaling, cytoskeletal
           reorganization, vesicular transport, stress responses,
           and the control of differentiation, proliferation, and
           survival. PLD1 contains PX and Pleckstrin Homology (PH)
           domains in addition to the catalytic domain. It acts as
           an effector of Rheb in the signaling of the mammalian
           target of rapamycin (mTOR), a serine/threonine protein
           kinase that transduces nutrients and other stimuli to
           regulate many cellular processes. PLD1 also regulates
           the secretion of the procoagulant von Willebrand factor
           (VWF) in endothelial cells. The PX domain is involved in
           targeting of proteins to PI-enriched membranes, and may
           also be involved in protein-protein interaction. The PX
           domain of PLD1 specifically binds to
           phosphatidylinositol-3,4,5-trisphosphate [PI(3,4,5)P3],
           which enables PLD1 to mediate signals via the ERK1/2
           pathway.
          Length = 135

 Score = 29.1 bits (65), Expect = 1.3
 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 5/43 (11%)

Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
           K  E R+MP++P        EEE  EE+     K +ED L ++
Sbjct: 79  KRGEPRHMPSLP-----RGAEEEAREEQFSSRRKQLEDYLSKL 116


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 9/19 (47%), Positives = 15/19 (78%)

Query: 263 PPSESEEEEEEEEEEEEED 281
             ++ EE++EEEEEE ++D
Sbjct: 79  AEAKKEEKKEEEEEESDDD 97



 Score = 27.3 bits (61), Expect = 3.5
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 263 PPSESEEEEEEEEEEEEED 281
                +EE++EEEEEE +D
Sbjct: 78  AAEAKKEEKKEEEEEESDD 96



 Score = 27.3 bits (61), Expect = 3.8
 Identities = 8/27 (29%), Positives = 19/27 (70%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDELG 289
             + +  E ++EE++EEE++  +D++G
Sbjct: 73  AAAAAAAEAKKEEKKEEEEEESDDDMG 99


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 29.1 bits (65), Expect = 1.6
 Identities = 18/60 (30%), Positives = 31/60 (51%)

Query: 228 DLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           D+ KE +   D  +  +E  ++ +EE            E  E+EEEEEE+EE++  ++D 
Sbjct: 35  DIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 14/22 (63%), Positives = 18/22 (81%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDE 287
           E +EEEEEEEEEE+ED   +D+
Sbjct: 173 EKDEEEEEEEEEEDEDFDDDDD 194



 Score = 28.6 bits (64), Expect = 3.3
 Identities = 16/54 (29%), Positives = 29/54 (53%)

Query: 234 DTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
              T+  +  +E L  +E++       D    + ++EEEEEEEEEE++   +D+
Sbjct: 140 GLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 29.8 bits (67), Expect = 1.8
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 15/65 (23%)

Query: 224 EELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
           EE  D  +E D   D    + E ++  E E           SES+ E+ EE+E+E++ +A
Sbjct: 649 EEDEDDLEETDDDED----ECEAIEDSESE-----------SESDGEDGEEDEQEDDAEA 693

Query: 284 IEDEL 288
            E  +
Sbjct: 694 NEGVV 698


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 6/46 (13%)

Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEED------KAIEDEL 288
               + N    P    SE EE +++E+EEEEED       A E EL
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216



 Score = 28.2 bits (64), Expect = 6.8
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
                 P        E +++E+EEEEE+E D ++  +   +
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216


>gnl|CDD|216284 pfam01074, Glyco_hydro_38, Glycosyl hydrolases family 38 N-terminal
           domain.  Glycosyl hydrolases are key enzymes of
           carbohydrate metabolism.
          Length = 269

 Score = 29.5 bits (67), Expect = 2.1
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 4/46 (8%)

Query: 223 NEELLDLYKECDTSTDVVKVQ----NEYLKKIEEERNMPNVPDFPP 264
            +ELL+     +    + KVQ    ++Y   +EE        DFP 
Sbjct: 222 TDELLEYINRWNELPGLPKVQFGTPSDYFDALEEATWPTKSGDFPT 267


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 29.2 bits (66), Expect = 2.1
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 8/46 (17%)

Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
           E L+K EEE            +  E E+E+EE+E+E++  +D+ G 
Sbjct: 281 ELLEKAEEEEE--------EDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           +EE     +      E E++  E E+E+EED+  E+E
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEE 311


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 7/36 (19%), Positives = 16/36 (44%)

Query: 256 MPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
            P  P+  P       E++++  E+++  +D   R 
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRA 296



 Score = 29.6 bits (67), Expect = 2.4
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
             E   P  P  P  + +  E+EEE+++ ED+A+E+
Sbjct: 264 PPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 28.5 bits (64), Expect = 5.2
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
              +     P  PP   E++++  E+EEE+D A +  L  I
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.2 bits (65), Expect = 2.4
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           P  PD    E E  EE +  EEEED+ +  +
Sbjct: 169 PAQPDVDNEEEERLEESDGREEEEDEEVGSD 199


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 29.1 bits (66), Expect = 2.5
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 238 DVVKVQNEYLKKIEE---ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
           D ++ +N++L KI+E   E+     P  P S   EEE  E   EEE K   +E G
Sbjct: 233 DFIRQKNKWLAKIKEWVGEKGG--GPIIPYSAEFEEELAELGSEEERKEYLEEYG 285


>gnl|CDD|217192 pfam02703, Adeno_E1A, Early E1A protein.  This is a family of
           adenovirus early E1A proteins. The E1A protein is 32 kDa
           it can however be cleaved to yield the 28 kDa protein.
           The E1A protein is responsible for the transcriptional
           activation of the early genes with in the viral genome
           at the start of the infection process as well as some
           cellular genes.
          Length = 264

 Score = 28.9 bits (65), Expect = 2.5
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 262 FPPSESEEEEEEEEEEEEEDKA 283
           FPPS+SE+EE E+    E    
Sbjct: 117 FPPSDSEDEEREQGTSFESAAN 138


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 263 PPSESEEEEEEEEEEEEEDKAIEDE 287
           P    E +E+E+EEEE+     E+E
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEE 388



 Score = 28.2 bits (63), Expect = 5.9
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 261 DFPPSESEEEEEEEEEEEEEDKAIEDE 287
           D    E  +E+E+EEEE+  D+  E+E
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEE 389



 Score = 27.8 bits (62), Expect = 8.4
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 249 KIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           K+ ++R     P       E+E+EEEE+  +E +  E E
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGE 391



 Score = 27.8 bits (62), Expect = 8.5
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 262 FPPSESEEEEEEEEEEEEEDKAIEDELG 289
               E +E+E+EEEE+  ++   E+   
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGED 392



 Score = 27.4 bits (61), Expect = 9.2
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 267 SEEEEEEEEEEEEEDKAIEDELGR 290
             EE +E+E+EEEE ++ E E   
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEE 389


>gnl|CDD|217714 pfam03755, YicC_N, YicC-like family, N-terminal region.  Family of
           bacterial proteins. Although poorly characterized, the
           members of this protein family have been demonstrated to
           play a role in stationary phase survival. These proteins
           are not essential during stationary phase.
          Length = 159

 Score = 28.3 bits (64), Expect = 2.8
 Identities = 21/75 (28%), Positives = 27/75 (36%), Gaps = 19/75 (25%)

Query: 220 LTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPD------FPPSESEEEEEE 273
           L +NE LL  Y                LK++ EE  +            P     EEEE+
Sbjct: 77  LKVNEALLKAYLAA-------------LKELAEELGLAAPISLDDLLRLPGVLEIEEEED 123

Query: 274 EEEEEEEDKAIEDEL 288
           EEE E    A+   L
Sbjct: 124 EEELEALWAALLAAL 138


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)

Query: 254 RNMPNV----PDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           RN+P +     +   S+ EEEE EEE  +E ++    E
Sbjct: 79  RNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPE 116


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 29.2 bits (65), Expect = 2.9
 Identities = 21/91 (23%), Positives = 32/91 (35%), Gaps = 13/91 (14%)

Query: 201 RFPKEAKFYLDKFKWGRT-FLTLNEELLDLYKECDTSTDVVKVQNEYLKK---IEEERNM 256
           RF     F     K+    +  LN E            +    + +Y+     +  E+  
Sbjct: 314 RFLDNLDFLQFLSKYIGCLWRNLNNE---------KWGNEEAERKDYILDSSSVPLEKQF 364

Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
            ++  F   E E    EE E EEE +  EDE
Sbjct: 365 DDILYFEKMEIENRNPEESEHEEEVEDYEDE 395


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 29.0 bits (65), Expect = 2.9
 Identities = 11/48 (22%), Positives = 23/48 (47%)

Query: 240 VKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           +K   EY ++ E+ +    +      E+ + E EE  E E+ +  E++
Sbjct: 370 IKDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEED 417


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 27.6 bits (62), Expect = 3.0
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 263 PPSESEEEEEEEEEEEEED 281
             +  +EE++EE EEE +D
Sbjct: 83  AAAAKKEEKKEESEEESDD 101



 Score = 27.2 bits (61), Expect = 4.0
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 260 PDFPPSESEEEEEEEEEEEEED 281
                ++ EE++EE EEE ++D
Sbjct: 81  AAAAAAKKEEKKEESEEESDDD 102


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.0 bits (65), Expect = 3.0
 Identities = 12/43 (27%), Positives = 27/43 (62%)

Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           +Y KK+ E ++  +  +   ++++EE+ ++EE E++ K   DE
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 28.3 bits (64), Expect = 3.2
 Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 8/61 (13%)

Query: 235 TST-------DVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           TST        V K+ + Y K+ ++++           E +++    EE +E+ KA++  
Sbjct: 62  TSTKVDISGVKVPKLNDAYFKRKKKKKK-KKKEGEFFEEKKKKYVVSEERKEDQKAVDAA 120

Query: 288 L 288
           L
Sbjct: 121 L 121


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 28.6 bits (64), Expect = 3.2
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 258 NVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
            V D P +E E  +EE+E E+  +KA E E  +I
Sbjct: 24  QVTDVPETEQELADEEQEFEQILEKA-EAEAAQI 56


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 15/67 (22%)

Query: 230 YKECDTSTDVVKVQNEYL---------KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEE 280
           Y E     ++VK  +E++         K++E+E     VPD    E E+EE++EEEE+  
Sbjct: 2   YLEERRIKELVKKHSEFINYPIYLWVEKEVEKE-----VPD-EEEEEEKEEKKEEEEKTT 55

Query: 281 DKAIEDE 287
           DK  E +
Sbjct: 56  DKEEEVD 62


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 28.3 bits (64), Expect = 3.5
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDELG 289
           E  E++++++E+E+ED   E +LG
Sbjct: 51  EDLEDDDDDDEDEDEDDEEEADLG 74


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 28.6 bits (64), Expect = 3.7
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 9/60 (15%)

Query: 241 KVQNEYLKKIEEERN----MPNVPDFPPSESEEEE-----EEEEEEEEEDKAIEDELGRI 291
           K + + LKK+ E+ +    M  VP+ P  E E EE       ++EEE+E+  +  + GR 
Sbjct: 292 KEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351



 Score = 28.3 bits (63), Expect = 5.9
 Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 2/45 (4%)

Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEED--KAIEDE 287
           E     E+E      P    S+SEEE EE+E+E+ +   K +EDE
Sbjct: 261 EPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDE 305


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 26.8 bits (60), Expect = 3.7
 Identities = 8/16 (50%), Positives = 15/16 (93%)

Query: 266 ESEEEEEEEEEEEEED 281
           ESE+EE+EEE+++++ 
Sbjct: 58  ESEDEEDEEEDDDDDM 73


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.0 bits (64), Expect = 3.7
 Identities = 16/50 (32%), Positives = 27/50 (54%)

Query: 241  KVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGR 290
            K++ E LKK EEE+          +E +++ EE ++ EEE+K    E  +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668



 Score = 27.8 bits (61), Expect = 8.2
 Identities = 19/68 (27%), Positives = 34/68 (50%)

Query: 224  EELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
            E L    +E   + ++ K + E  KK EE +           E+++E EE++++ EE K 
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751

Query: 284  IEDELGRI 291
             E+E  +I
Sbjct: 1752 DEEEKKKI 1759


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 28.1 bits (63), Expect = 3.9
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 256 MPNVPDFPPSESEEEEEEEEEEEEED 281
           M +  D    ES++++ EE EEEEED
Sbjct: 1   MEDSDDIIDYESDDDDSEEYEEEEED 26


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.8 bits (65), Expect = 3.9
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
              E+EEEEE+  ++ED+  +D+
Sbjct: 318 QGEEDEEEEEDGVDDEDEEDDDD 340



 Score = 28.0 bits (63), Expect = 7.9
 Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 14/47 (29%)

Query: 245 EYLKKIEEER--NMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELG 289
           E LKK+E ER   M              EEE++EEEE+ K   D+L 
Sbjct: 269 ERLKKLEAERLRRM------------RGEEEDDEEEEDSKESADDLD 303


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 28.5 bits (64), Expect = 4.0
 Identities = 14/20 (70%), Positives = 14/20 (70%)

Query: 264 PSESEEEEEEEEEEEEEDKA 283
           P E EEE EE EEE EED A
Sbjct: 150 PEEEEEEPEEMEEELEEDAA 169


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 27.6 bits (62), Expect = 4.5
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 248 KKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           KK+ +   +         E E E EE +EEE+ D+ +E EL
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128


>gnl|CDD|146371 pfam03700, Sorting_nexin, Sorting nexin, N-terminal domain.  These
           proteins bins to the cytoplasmic domain of plasma
           membrane receptors. and are involved in endocytic
           protein trafficking. The N-terminal domain appears to be
           specific to sorting nexins 1 and 2.
          Length = 127

 Score = 27.6 bits (61), Expect = 4.5
 Identities = 12/24 (50%), Positives = 13/24 (54%)

Query: 252 EERNMPNVPDFPPSESEEEEEEEE 275
            ER  P  PD  P E EE EE E+
Sbjct: 2   AEREPPPFPDGEPEEPEELEEGED 25


>gnl|CDD|222049 pfam13330, Mucin2_WxxW, Mucin-2 protein WxxW repeating region.
           This family is repeating region found on mucins 2 and 5.
           The function is not known, but the repeat can be present
           in up to 32 copies, as in a member from Branchiostoma
           floridae. The region carries a highly conserved WxxW
           sequence motif and also has at least six well conserved
           cysteine residues.
          Length = 88

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 7/32 (21%), Positives = 9/32 (28%), Gaps = 1/32 (3%)

Query: 76  RICLTPVGVQC-VSKADHDIIQENGAAVVDCS 106
             C  P  ++C V           G   V C 
Sbjct: 29  SFCEMPTDIECRVVGTPQVPASMTGQNFVTCD 60


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 28.5 bits (63), Expect = 4.9
 Identities = 15/84 (17%), Positives = 34/84 (40%)

Query: 200 ERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNV 259
           E F +E    L +    +      EE  +   E D      +++N+ + + ++E+    V
Sbjct: 242 ENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEV 301

Query: 260 PDFPPSESEEEEEEEEEEEEEDKA 283
            +    + E + +  +E  EE + 
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEE 325


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 28.2 bits (63), Expect = 4.9
 Identities = 10/43 (23%), Positives = 21/43 (48%)

Query: 245 EYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           E   + E   +  N  +  P E E++++ EEEE     ++ ++
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253



 Score = 27.9 bits (62), Expect = 6.0
 Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 16/82 (19%)

Query: 224 EELLDLYKE---------CDTSTDVVKVQ-------NEYLKKIEEERNMPNVPDFPPSES 267
           E+L+DL++           D     V  Q        + L  ++    + + P+   SE 
Sbjct: 164 EKLVDLWRPWIEDKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSED 223

Query: 268 EEEEEEEEEEEEEDKAIEDELG 289
            E+E++ +E+E++D+  E+E G
Sbjct: 224 NEDEDDPKEDEDDDQGEEEESG 245



 Score = 27.5 bits (61), Expect = 8.1
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 261 DFPPSESEEEEEEEEEEEEEDKAIEDE 287
           D P  + ++++ EEEE    D   ED 
Sbjct: 228 DDPKEDEDDDQGEEEESGSSDSLSEDS 254


>gnl|CDD|227277 COG4941, COG4941, Predicted RNA polymerase sigma factor containing
           a TPR repeat domain [Transcription].
          Length = 415

 Score = 28.6 bits (64), Expect = 4.9
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDELGRI 291
            S+E+EE EE E  +D+ I D+  R+
Sbjct: 87  LSDEDEEMEEAEALDDEHIRDDRLRL 112


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 28.7 bits (64), Expect = 5.0
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
           P  +E+ E   EE E  D++ EDE
Sbjct: 241 PDNNEDSEAGREESEGSDESEEDE 264


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 27.8 bits (62), Expect = 5.0
 Identities = 11/47 (23%), Positives = 22/47 (46%)

Query: 242 VQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           ++ +  K+ E +       +    E  E+EE  E+ EE ++   +EL
Sbjct: 13  MEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59


>gnl|CDD|237061 PRK12327, nusA, transcription elongation factor NusA; Provisional.
          Length = 362

 Score = 28.3 bits (64), Expect = 5.1
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDE 287
           SESE EEE   EEE E+  +E+E
Sbjct: 340 SESEAEEEGAIEEEVEEDVVEEE 362


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 27.7 bits (62), Expect = 5.2
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 264 PSESEEEEEEEEEEEEEDKAIEDE 287
           P E E++E  +EEE +     +DE
Sbjct: 17  PEEDEDDESSDEEEVDLPDDEQDE 40


>gnl|CDD|220438 pfam09845, DUF2072, Zn-ribbon containing protein (DUF2072).  This
           archaeal protein has no known function.
          Length = 121

 Score = 27.0 bits (60), Expect = 5.3
 Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 251 EEERNMPNVP-DFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
           E +R  P+VP + P  E E E +EE  E ++D+   +E  RI
Sbjct: 35  ERKRPKPDVPEEKPVEEIERETDEELLEVKQDREQGEEGERI 76


>gnl|CDD|129772 TIGR00689, rpiB_lacA_lacB, sugar-phosphate isomerase,
           RpiB/LacA/LacB family.  Proteins of known function in
           this family act as sugar (pentose and/or
           hexose)-phosphate isomerases, including the LacA and
           LacB subunits of galactose-6-phosphate isomerases from
           Gram-positive bacteria and RpiB. RpiB is the second
           ribose phosphate isomerase of E. coli. It lacks homology
           to RpiA, its inducer is unknown (but is not ribose), and
           it can be replaced by the homologous
           galactose-6-phosphate isomerase of Streptococcus mutans,
           all of which suggests that the ribose phosphate
           isomerase activity of RpiB is a secondary function. On
           the other hand, there appear to be a significant number
           of species which contain rpiB, lack rpiA and seem to
           require rpi activity in order to complete the pentose
           phosphate pathway.
          Length = 144

 Score = 27.3 bits (61), Expect = 5.4
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 16  KLAR-HNLIKILKLGERFRGFPKEAKFYLDKF 46
            LAR HN   +L LG R  G  + A   +D F
Sbjct: 91  ALARQHNDANVLCLGSRVVG-VELALSIVDAF 121


>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain.  This domain
           is found in the MAP2/Tau family of proteins which
           includes MAP2, MAP4, Tau, and their homologs. All
           isoforms contain a conserved C-terminal domain
           containing tubulin-binding repeats (pfam00418), and a
           N-terminal projection domain of varying size. This
           domain has a net negative charge and exerts a long-range
           repulsive force. This provides a mechanism that can
           regulate microtubule spacing which might facilitate
           efficient organelle transport.
          Length = 1134

 Score = 28.6 bits (63), Expect = 5.4
 Identities = 23/67 (34%), Positives = 31/67 (46%), Gaps = 16/67 (23%)

Query: 232 ECDTSTD-VVKVQNEYLK-KIEEERNMPNVPDFP--------------PSESEEEEEEEE 275
           E D+S D   +VQ E+++   EE    P++P  P              P+E   EEEE E
Sbjct: 811 EADSSIDEKAEVQMEFIQLPKEESTETPDIPAIPSDVTQPQPEAIVSEPAEVRGEEEEIE 870

Query: 276 EEEEEDK 282
            E E DK
Sbjct: 871 AEGEYDK 877


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 27.3 bits (59), Expect = 5.4
 Identities = 16/53 (30%), Positives = 29/53 (54%)

Query: 235 TSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           ++ D +K  +E++KK  E+   P  P+    +  E+ E E+E++ E    EDE
Sbjct: 47  SNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDE 99


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 25.4 bits (56), Expect = 5.9
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDELGR 290
           E E+ E+EEE E+EE+   + E+ R
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 26.8 bits (59), Expect = 5.9
 Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
           E E N     D    E EE +E++EEEE E +  E E
Sbjct: 38  ENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74


>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
          Length = 381

 Score = 27.9 bits (62), Expect = 6.1
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 260 PDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           P+F P E E E E+EE+ E    A  D++
Sbjct: 351 PEFGPPEPELEMEKEEDRELPSSAGVDDI 379


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 16/48 (33%), Positives = 26/48 (54%)

Query: 240 VKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
             V+    K+ EE++           + EEEEEE+E++EEE +  E+E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 26.6 bits (59), Expect = 6.3
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 266 ESEEEEEEEEEEEEEDKAIEDELGRI 291
           E ++EEEEEEEEE++ + + DE   I
Sbjct: 7   EVDDEEEEEEEEEDDLEDLSDEDEFI 32



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
            + EEEEEEEEE++ ED + EDE 
Sbjct: 8   VDDEEEEEEEEEDDLEDLSDEDEF 31


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 27.8 bits (62), Expect = 7.2
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 11/42 (26%)

Query: 247 LKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           L+KI++ER             E+E EEEE+  EE+KA E+E+
Sbjct: 151 LEKIKKER-----------AEEKEREEEEKAAEEEKAREEEI 181



 Score = 27.4 bits (61), Expect = 8.4
 Identities = 8/27 (29%), Positives = 19/27 (70%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDELGRI 291
           S+S +++ ++++ E+E  A+  EL +I
Sbjct: 128 SDSSDDDSDDDDSEDETAALLRELEKI 154


>gnl|CDD|222891 PHA02585, 16, small terminase protein; Provisional.
          Length = 161

 Score = 27.1 bits (60), Expect = 7.2
 Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 5/69 (7%)

Query: 221 TLNEELLDLYKECD--TSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEE 278
             N+E+L ++KE    T       +    +   ++ N+ N   F  S +E  +E  +  E
Sbjct: 94  NTNKEILKIHKEMKDITEEQTGTKKG---EPQGDQMNIENATVFMGSPTELMDEVGDAYE 150

Query: 279 EEDKAIEDE 287
           EE+K I  E
Sbjct: 151 EEEKVINGE 159


>gnl|CDD|224250 COG1331, COG1331, Highly conserved protein containing a thioredoxin
           domain [Posttranslational modification, protein
           turnover, chaperones].
          Length = 667

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 28/99 (28%), Positives = 40/99 (40%), Gaps = 14/99 (14%)

Query: 193 FYEDLDGERFPKEAKFYLDKFKWGRTFLTLNEELLDLYKECDTSTDVVKVQNEYLKKIEE 252
           FY  LD +   +E K+Y     W  +   L E L +  +      DV +  N       E
Sbjct: 316 FYSSLDADSDGEEGKYYT----W--SVEELKEVLGEDAELACKYFDVSEEGN------FE 363

Query: 253 ERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDELGRI 291
            RN+ +VP   P E   EE EE+ E   +K +     R 
Sbjct: 364 GRNVLHVP--GPLEEAIEEAEEKLERAREKLLAAREKRK 400


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.1 bits (63), Expect = 7.4
 Identities = 6/57 (10%), Positives = 21/57 (36%), Gaps = 2/57 (3%)

Query: 231  KECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDE 287
            K    +    K +    +    + +       P  +  +   E++++ E D + +++
Sbjct: 1326 KSEKKTARKKKSKTRVKQASASQSSRLLRR--PRKKKSDSSSEDDDDSEVDDSEDED 1380


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 26.5 bits (59), Expect = 7.5
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query: 257 PNVPDFPPSESEEEEEEEEEEEEEDKAIED 286
             + D    + E+EEE E + + E + I++
Sbjct: 61  VPLDDVGDGDEEDEEEREAKRKLEQEKIDN 90


>gnl|CDD|218562 pfam05342, Peptidase_M26_N, M26 IgA1-specific Metallo-endopeptidase
           N-terminal region.  These peptidases, which cleave
           mammalian IgA, are found in Gram-positive bacteria.
           Often found associated with pfam00746, they may be
           attached to the cell wall.
          Length = 252

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 17/61 (27%), Positives = 23/61 (37%), Gaps = 20/61 (32%)

Query: 225 ELLDLYKECDTSTDVVKVQNEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAI 284
           E LD YKE   ST +V  + E                     SE E  EE+  + + K +
Sbjct: 1   ENLDYYKEYTLSTTMVYDRGE--------------------GSETETLEEKPVQLDLKKV 40

Query: 285 E 285
           E
Sbjct: 41  E 41


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 26.8 bits (59), Expect = 8.7
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 244 NEYLKKIEEERNMPNVPDFPPSESEEEEEEEEEEEEEDKAIEDEL 288
           NEY+K      N     +    E +EE+EE+  E+   K +E+E 
Sbjct: 88  NEYVKFTPRNNNAIQEEEDD-MEQQEEKEEKGREKGRKKNVEEEF 131


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 26.8 bits (59), Expect = 9.7
 Identities = 10/24 (41%), Positives = 19/24 (79%)

Query: 265 SESEEEEEEEEEEEEEDKAIEDEL 288
           S++EE EE+EEE E+ + ++++E 
Sbjct: 26  SQAEEVEEDEEEMEDWEDSLDEED 49



 Score = 26.8 bits (59), Expect = 10.0
 Identities = 14/33 (42%), Positives = 16/33 (48%)

Query: 251 EEERNMPNVPDFPPSESEEEEEEEEEEEEEDKA 283
           E+E  M +  D    E EE EE EEE     KA
Sbjct: 33  EDEEEMEDWEDSLDEEDEEAEEVEEETAASSKA 65


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.139    0.429 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,608,325
Number of extensions: 1559347
Number of successful extensions: 7162
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5727
Number of HSP's successfully gapped: 430
Length of query: 291
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 195
Effective length of database: 6,679,618
Effective search space: 1302525510
Effective search space used: 1302525510
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.6 bits)