RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10021
         (716 letters)



>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score =  259 bits (664), Expect = 8e-80
 Identities = 130/368 (35%), Positives = 181/368 (49%), Gaps = 63/368 (17%)

Query: 342 SLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKL 401
           SL   +      D  E      E    S  +   + + +E E                  
Sbjct: 2   SLVDFEPFPSDADALEEELLISEGGVSSALEVNLELELIEGE-----------------K 44

Query: 402 KFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTI 461
           K      DT +  A S  +   +   +      E       EL    +D   + + +   
Sbjct: 45  KLELVVCDTKLGNADSAFEIPSEVGEDLRENAEE----ALGELSS--SDEDDYYRMLL-- 96

Query: 462 GLAHSLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTD 521
           GL H LSR K++   DK D +I+QA+  LDD+DKE+N   MR REWYGWHFPEL  +V D
Sbjct: 97  GLGHELSRIKVREEVDKEDKLIIQAIEALDDIDKEINLLAMRLREWYGWHFPELSSLVPD 156

Query: 522 NLK----------RNDNSTRDKTSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENI 571
           N +          R + +   K S  DL   LP D+  K+ EAA+ SMG ++S++DI+NI
Sbjct: 157 NEQYAKLVSALGNRENIN---KESLKDLGFALP-DIAIKIAEAAKDSMGADLSEEDIDNI 212

Query: 572 LLLCDQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHPAS 631
             L + +LE+   R QL +Y++S+M  +APNLT L+G ++GARL++ AG L  LAK PAS
Sbjct: 213 RELAEIILELYELREQLEEYIESKMSEIAPNLTALVGPVLGARLISHAGGLTRLAKMPAS 272

Query: 632 TVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIGQS 691
           T+Q+LGAE                        KALFRALKT   TPKYG+IY S LI +S
Sbjct: 273 TIQVLGAE------------------------KALFRALKTGAKTPKYGVIYQSPLIQKS 308

Query: 692 STKNKGKM 699
               +GK+
Sbjct: 309 PPWQRGKI 316



 Score = 58.6 bits (142), Expect = 6e-09
 Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 6/93 (6%)

Query: 211 LIGQSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRR 270
           LI +S    +GK+AR LAAK A+A R+DA   +     G   R +LE ++  L+E   + 
Sbjct: 304 LIQKSPPWQRGKIARALAAKLAIAARIDAFSGEPD---GISLREELEKRIEKLKEKPPKP 360

Query: 271 LSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEE 303
               TKAK + +K       ++K K +   +E 
Sbjct: 361 ---PTKAKPERDKKERPGRYRRKKKEKKAKSER 390


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score =  183 bits (466), Expect = 3e-51
 Identities = 98/257 (38%), Positives = 140/257 (54%), Gaps = 49/257 (19%)

Query: 466 SLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTD---- 521
            L+R KL+ +  K D + +QA+  +DD+DK +N +  R REWY  HFPEL ++V      
Sbjct: 124 ELTRRKLRSAAQKRDKLAIQAIRAIDDIDKTINLFSERLREWYSLHFPELDELVKKHEEY 183

Query: 522 -----NLKRNDNSTRDKTSASDLSEI-LPEDVEEKVKEAAEISMGTEISDDDIENILLLC 575
                 L   +N TR+K     L ++ LPE+   K+ EAA+ SMG ++S+ D+E I  L 
Sbjct: 184 VKLVSELGDRENYTREK-----LKKLGLPENKARKIAEAAKKSMGADLSEFDLEAIKKLA 238

Query: 576 DQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHPASTVQI 635
           +++L++   R +L DYL++ M  VAPNLT L+G  +GARL++ AG L  LAK PAST+Q+
Sbjct: 239 NEILDLYKLREELEDYLETVMKEVAPNLTALVGPSLGARLISLAGGLEELAKMPASTIQV 298

Query: 636 LGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIGQSSTKN 695
           LGAE                        KALFR L+T    PK+G+I+    I       
Sbjct: 299 LGAE------------------------KALFRHLRTGAKPPKHGVIFQYPAI------- 327

Query: 696 KGKMGSYGWQPGSASRA 712
               GS  WQ G  +RA
Sbjct: 328 ---HGSPWWQRGKIARA 341



 Score = 79.6 bits (197), Expect = 1e-15
 Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 9/151 (5%)

Query: 328 VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE-----EEVSKKKKKKKKKDVED 382
           +K    DE    +   S+K ++   +   EP  + GE       E+  +    +  D E+
Sbjct: 56  LKELGPDEVVVENEEESRKLQELGYRVTVEPPNKIGEFLRENLPELGVEYGFFE--DEEE 113

Query: 383 ETDPLNHVGLAHSLSRYKLKFSPDKVDTMIVQAVSLLDDLDKELNNYMMRCREWYGWHFP 442
             + L+   +   L+R KL+ +  K D + +QA+  +DD+DK +N +  R REWY  HFP
Sbjct: 114 FYEKLHEWSVE--LTRRKLRSAAQKRDKLAIQAIRAIDDIDKTINLFSERLREWYSLHFP 171

Query: 443 ELGKIVTDNVAFVKTIKTIGLAHSLSRYKLK 473
           EL ++V  +  +VK +  +G   + +R KLK
Sbjct: 172 ELDELVKKHEEYVKLVSELGDRENYTREKLK 202



 Score = 34.2 bits (79), Expect = 0.27
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 220 KGKMARMLAAKAALATRVDAL-GEDSSIELGTDHRAKL-EIKLRLLEEGNLRRLSGTTKA 277
           +GK+AR LAAK A+A RVD   G     EL  +   ++ EIK     +   ++       
Sbjct: 335 RGKIARALAAKLAIAARVDYFSGRYIGDELKEELNKRIEEIK-EKYPKPPKKKREEKKPQ 393

Query: 278 KAKLEKYHGKRLEKKKFK 295
           K K +K   K+ +K+K K
Sbjct: 394 KRKKKKKRKKKGKKRKKK 411


>gnl|CDD|216709 pfam01798, Nop, Putative snoRNA binding domain.  This family
           consists of various Pre RNA processing
           ribonucleoproteins. The function of the aligned region
           is unknown however it may be a common RNA or snoRNA or
           Nop1p binding domain. Nop5p (Nop58p) from yeast is the
           protein component of a ribonucleoprotein protein
           required for pre-18s rRNA processing and is suggested to
           function with Nop1p in a snoRNA complex. Nop56p and
           Nop5p interact with Nop1p and are required for ribosome
           biogenesis. Prp31p is required for pre-mRNA splicing in
           S. cerevisiae.
          Length = 149

 Score =  161 bits (409), Expect = 2e-46
 Identities = 68/133 (51%), Positives = 87/133 (65%), Gaps = 24/133 (18%)

Query: 570 NILLLCDQVLEISSYRGQLYDYLKSRMMAVAPNLTILMGELVGARLVAQAGSLLNLAKHP 629
           NIL  C+++L ++ YR QL +Y++S+M  +APNLT L+GELVGARL++ AGSL NLAK P
Sbjct: 1   NILEFCERILSLAEYRKQLEEYIESKMSKIAPNLTALVGELVGARLISHAGSLTNLAKLP 60

Query: 630 ASTVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALKTKRDTPKYGLIYHSQLIG 689
           AST+Q+LGAE                        KALFRALKTK  TPKYGLIYHS LI 
Sbjct: 61  ASTIQVLGAE------------------------KALFRALKTKAKTPKYGLIYHSPLIQ 96

Query: 690 QSSTKNKGKMGSY 702
           ++  K +GK+   
Sbjct: 97  KAPPKVRGKIARM 109



 Score = 68.8 bits (169), Expect = 8e-14
 Identities = 27/57 (47%), Positives = 36/57 (63%)

Query: 210 QLIGQSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEG 266
            LI ++  K +GK+ARMLAAK ALA R+DA  E+     G + R +LE +L  LEE 
Sbjct: 93  PLIQKAPPKVRGKIARMLAAKLALAARIDAFSEEPDGSFGIELREELEKRLEKLEEK 149


>gnl|CDD|203845 pfam08060, NOSIC, NOSIC (NUC001) domain.  This is the central
           domain in Nop56/SIK1-like proteins.
          Length = 52

 Score = 99.5 bits (249), Expect = 1e-25
 Identities = 30/51 (58%), Positives = 39/51 (76%)

Query: 412 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTIG 462
           I+QA+ LLDD+DKE+N + MR REWY WHFPEL K+V D + + K +K IG
Sbjct: 1   IIQAIELLDDIDKEINLFHMRLREWYSWHFPELEKLVPDPIDYAKVVKLIG 51



 Score = 88.3 bits (220), Expect = 9e-22
 Identities = 26/41 (63%), Positives = 33/41 (80%)

Query: 483 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNL 523
           I+QA+ LLDD+DKE+N + MR REWY WHFPEL K+V D +
Sbjct: 1   IIQAIELLDDIDKEINLFHMRLREWYSWHFPELEKLVPDPI 41


>gnl|CDD|197999 smart00931, NOSIC, NOSIC (NUC001) domain.  This is the central
           domain in Nop56/SIK1-like proteins.
          Length = 52

 Score = 96.1 bits (240), Expect = 2e-24
 Identities = 29/51 (56%), Positives = 40/51 (78%)

Query: 412 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVAFVKTIKTIG 462
           I+QA++LLDD+DKE+N + MR REWY WHFPEL K+V D + + K +K +G
Sbjct: 1   IIQAINLLDDIDKEINLFHMRVREWYSWHFPELEKLVPDPLDYAKVVKLLG 51



 Score = 85.3 bits (212), Expect = 1e-20
 Identities = 27/42 (64%), Positives = 34/42 (80%)

Query: 483 IVQAVSLLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNLK 524
           I+QA++LLDD+DKE+N + MR REWY WHFPEL K+V D L 
Sbjct: 1   IIQAINLLDDIDKEINLFHMRVREWYSWHFPELEKLVPDPLD 42


>gnl|CDD|219731 pfam08156, NOP5NT, NOP5NT (NUC127) domain.  This N terminal
          domain is found in RNA-binding proteins of the NOP5
          family.
          Length = 67

 Score = 86.9 bits (216), Expect = 5e-21
 Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 1  MLVLFETPAGYAFFKLLDEKKLQ-EADNLYESFQTPQGANKVLKLKHFEKFIDTTEALAA 59
          MLVLFET AGYA FK+ DEK L  +   + +SFQ  +  +K++KLK F+ F    +AL  
Sbjct: 1  MLVLFETAAGYALFKVKDEKDLIGDDKEVQKSFQDLEKFSKMVKLKAFQPFKSAADALEN 60

Query: 60 TTAAVEG 66
            A  EG
Sbjct: 61 ANAISEG 67


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 42.3 bits (99), Expect = 8e-04
 Identities = 26/105 (24%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 291 KKKFKTEFDAAEETPATPDTSSSGKKKKKNKNL--DTSIVKAEPEDEPAA----ADVSLS 344
            +K      A +E P      S  K  +  K     T +  +EP +E +      D +L 
Sbjct: 48  SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107

Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK----KKDVEDETD 385
           K+ KK   +   +      + EE   +KK +K    KK  ED  D
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVED 152



 Score = 33.9 bits (77), Expect = 0.34
 Identities = 31/145 (21%), Positives = 56/145 (38%), Gaps = 13/145 (8%)

Query: 245 SIELGTDHRAKLEIKLRLLEEGN--LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAE 302
              L    R K+E  L  ++E       +S     ++       KR  KK      +  E
Sbjct: 41  KFLLAAGSRKKIESAL-AVDEEPDENGAVSKKKPTRSV------KRATKKTVVEISEPLE 93

Query: 303 ETPA--TPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
           E       + ++  K+ KK          A   D       +  K +K++K +  +E VE
Sbjct: 94  EGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEK--TEKKVRKRRKVKKMDEDVE 151

Query: 361 EGGEEEEVSKKKKKKKKKDVEDETD 385
           + G E EVS  ++ +    +E+E++
Sbjct: 152 DQGSESEVSDVEESEFVTSLENESE 176


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 39.8 bits (93), Expect = 0.004
 Identities = 25/60 (41%), Positives = 36/60 (60%), Gaps = 4/60 (6%)

Query: 329 KAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLN 388
           K  P++E         KK++++KT D+EE V+E  E+EE  KKKK KK K+   E + LN
Sbjct: 33  KEVPDEEEEEEKEE--KKEEEEKTTDKEEEVDEEEEKEE--KKKKTKKVKETTTEWELLN 88



 Score = 30.1 bits (68), Expect = 5.3
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 351 KTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETD 385
           K  ++E P EE  EE+E  K++++K     E+  +
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDE 63


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 39.5 bits (92), Expect = 0.006
 Identities = 21/106 (19%), Positives = 38/106 (35%), Gaps = 3/106 (2%)

Query: 273 GTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEP 332
               +K    K    +  K +   E +  +E           KK+K  +       K E 
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK---KKEKPKEEPKDRKPKEEA 137

Query: 333 EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
           +++    +    K+KK ++ +D EE  +      +   KK  KKK 
Sbjct: 138 KEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183



 Score = 34.1 bits (78), Expect = 0.24
 Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 7/105 (6%)

Query: 288 RLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKK 347
              K K   E           +     ++KKK K         E   +    + +  K+ 
Sbjct: 88  PAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK-----EKPKEEPKDRKPKEEAKEKRP 142

Query: 348 KKKKTQDEEEPVEEGGEEEEVSK--KKKKKKKKDVEDETDPLNHV 390
            K+K +++E+ VEE  + EE  K  + + K +     +  P N  
Sbjct: 143 PKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187



 Score = 34.1 bits (78), Expect = 0.27
 Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 3/95 (3%)

Query: 295 KTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQD 354
           K   D A +      +     K K  K       K E +++    +    KK+K K+   
Sbjct: 70  KLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129

Query: 355 E---EEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
           +   +E  +E    +E  K+K+KK ++  + E + 
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEK 164



 Score = 32.9 bits (75), Expect = 0.65
 Identities = 25/128 (19%), Positives = 45/128 (35%), Gaps = 13/128 (10%)

Query: 272 SGTTKAKAKLEKYHGKRLEKKKFKTEFDA-----AEETPATPDTSSSGKKKKKNKNLDTS 326
           SG  + K K +    K+ +K+K K E          +    P      K+KK  +  D  
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161

Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEP--------VEEGGEEEEVSKKKKKKKKK 378
             K        +      KKK   K ++  E             G+ EE    ++++K++
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221

Query: 379 DVEDETDP 386
           D   + + 
Sbjct: 222 DDGKDRET 229


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 38.6 bits (89), Expect = 0.013
 Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 2/104 (1%)

Query: 276  KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
            +A     +      E  + K E    ++  A     ++ KK ++ K  D +  KAE ED+
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE-EDK 1404

Query: 336  PAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
              A ++       KKK  + ++  EE  + +E  KK ++ KK D
Sbjct: 1405 KKADELK-KAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447



 Score = 36.3 bits (83), Expect = 0.071
 Identities = 25/114 (21%), Positives = 47/114 (41%), Gaps = 4/114 (3%)

Query: 268  LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSI 327
             ++ +   K KA   K   K+ E+ K   E   AE   A  D + + ++K +        
Sbjct: 1320 AKKKAEEAKKKADAAK---KKAEEAKKAAEAAKAEAE-AAADEAEAAEEKAEAAEKKKEE 1375

Query: 328  VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
             K + +     A+      + KKK +++++  +E  +     KK  + KKK  E
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429



 Score = 35.5 bits (81), Expect = 0.12
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 4/107 (3%)

Query: 278  KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
            K K ++   K  E KK       AEE     D +   KKK +         KAE E    
Sbjct: 1301 KKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA---KKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 338  AADVSLSKKKK-KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             A+ +  K +  +KK ++ ++  +   ++ E  KK  + KKK  ED+
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDK 1404



 Score = 35.1 bits (80), Expect = 0.14
 Identities = 24/109 (22%), Positives = 42/109 (38%), Gaps = 5/109 (4%)

Query: 278  KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEP----- 332
              K ++   K  E KK         E       ++  + +      + +  KAE      
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373

Query: 333  EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
            E+    AD +  K ++KKK  + ++  EE  ++ +  KK    KKK  E
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422



 Score = 35.1 bits (80), Expect = 0.15
 Identities = 32/118 (27%), Positives = 51/118 (43%), Gaps = 7/118 (5%)

Query: 264  EEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNL 323
            EE   +  +   KA+   +     + E +    E +AAEE     +      KKK     
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA---- 1380

Query: 324  DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
            D +  KAE   E   AD +  K ++ KK  DE +      ++ + +KKK ++KKK  E
Sbjct: 1381 DAAKKKAE---EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435



 Score = 34.7 bits (79), Expect = 0.19
 Identities = 33/127 (25%), Positives = 62/127 (48%), Gaps = 1/127 (0%)

Query: 253  RAKLEIKLRLLEEGNLRRLSGTTKAK-AKLEKYHGKRLEKKKFKTEFDAAEETPATPDTS 311
             A++E  ++L EE    +     KA+ AK++    K+ E++K K E    +E        
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 312  SSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKK 371
               K +++NK       K   ED+  A +   +++ +KK  +  ++  EE  + EE+ KK
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710

Query: 372  KKKKKKK 378
            + ++KKK
Sbjct: 1711 EAEEKKK 1717



 Score = 34.7 bits (79), Expect = 0.24
 Identities = 39/167 (23%), Positives = 72/167 (43%), Gaps = 11/167 (6%)

Query: 226  MLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLR--RLSGTTKAKAKLEK 283
            M   KA  A + +    +  ++L  + +     + +  EE  ++   L    + K K+E+
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ 1637

Query: 284  YHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSL 343
               K  E+KK   E   AEE       ++   KK +         K   EDE  AA+   
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKI--KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695

Query: 344  SKKKKKKKTQ-------DEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             + ++ KK +       +E++  EE  + EE +K K ++ KK+ E++
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742



 Score = 34.3 bits (78), Expect = 0.29
 Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 5/102 (4%)

Query: 278  KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
            K K ++   K  E KK   E   A       D +   KK ++ K  D +  KAE   E  
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAK--KKAEEKKKADEAKKKAE---EAK 1444

Query: 338  AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
             AD +  K ++ KK ++ ++  EE  + +E  KK ++ KK D
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486



 Score = 34.0 bits (77), Expect = 0.36
 Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 6/94 (6%)

Query: 288  RLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKK 347
            + E+ +   E   AEE     +   + +KKK ++       K + E+   A +     ++
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADE------AKKKAEEAKKADEAKKKAEE 1326

Query: 348  KKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
             KKK    ++  EE  +  E +K + +    + E
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360



 Score = 33.6 bits (76), Expect = 0.48
 Identities = 29/117 (24%), Positives = 52/117 (44%), Gaps = 10/117 (8%)

Query: 276  KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAE---- 331
            KA+   +    K+ E+KK   E   AEE     +   + + KK  ++ + ++ KAE    
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588

Query: 332  ------PEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
                   E      +    K ++ KK ++ +   EE  + EE  KK ++ KKK+ E+
Sbjct: 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645



 Score = 32.4 bits (73), Expect = 1.0
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 1/107 (0%)

Query: 276  KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
            +AK K E+   K  E KK       A+E     +      + KK         +A+ + E
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454

Query: 336  PAAADVSLSKKKKKKKTQDE-EEPVEEGGEEEEVSKKKKKKKKKDVE 381
             A       KK ++ K  DE ++  EE  + +E  KK ++ KKK  E
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADE 1501



 Score = 32.0 bits (72), Expect = 1.5
 Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 1/142 (0%)

Query: 242  EDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAA 301
            ED ++ L     AK   + R+ E   L       KA+ + +K    +++ ++ K   +  
Sbjct: 1574 EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAKIKAEELKKAEEEK 1632

Query: 302  EETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEE 361
            ++        +  KKK +         K +  +E   A+    K ++ KK +++E+   E
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692

Query: 362  GGEEEEVSKKKKKKKKKDVEDE 383
              ++E    KK ++ KK   +E
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEE 1714



 Score = 31.3 bits (70), Expect = 2.2
 Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 12/105 (11%)

Query: 275  TKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED 334
             KA+AK      + L+      +FDA E+  A   T  +  K ++ K  +T   KAE   
Sbjct: 1058 GKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETG--KAEEA- 1114

Query: 335  EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
                      K ++ KK  ++    EE  + E+  K ++ +K +D
Sbjct: 1115 ---------RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAED 1150



 Score = 31.3 bits (70), Expect = 2.8
 Identities = 24/106 (22%), Positives = 41/106 (38%)

Query: 278  KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
            K K ++       KKK       AEE     +     ++ KK         +A+  +E  
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAK 1463

Query: 338  AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
                   K  + KK  +E +  +E  ++ E +KKK  + KK  E +
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509



 Score = 30.9 bits (69), Expect = 3.0
 Identities = 28/123 (22%), Positives = 47/123 (38%), Gaps = 11/123 (8%)

Query: 270  RLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVK 329
            R     KA+   +    K+ E+KK   E   AEE     +     ++ KK         K
Sbjct: 1267 RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA-----K 1321

Query: 330  AEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK-----KKKKDVEDET 384
             + E+    AD +  KK ++ K   E    E     +E    ++K     KKK++ + + 
Sbjct: 1322 KKAEEAKKKAD-AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380

Query: 385  DPL 387
            D  
Sbjct: 1381 DAA 1383



 Score = 30.9 bits (69), Expect = 3.3
 Identities = 26/105 (24%), Positives = 42/105 (40%)

Query: 277  AKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEP 336
            AK K ++   K  EKKK       AEE     +     ++ KK +       +A+  DE 
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475

Query: 337  AAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
                    K  + KK  +E +   +  ++   +KKK  + KK  E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520



 Score = 30.5 bits (68), Expect = 3.8
 Identities = 27/104 (25%), Positives = 47/104 (45%), Gaps = 4/104 (3%)

Query: 276  KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDE 335
             A+ K E    K+ E KK   + DAA++       +   KKK +        +K +    
Sbjct: 1361 AAEEKAEAAEKKKEEAKK---KADAAKKKAEEKKKADEAKKKAEEDKKKADELK-KAAAA 1416

Query: 336  PAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
               AD +  K ++KKK  + ++  EE  + +E  KK ++ KK +
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460



 Score = 30.5 bits (68), Expect = 3.8
 Identities = 29/131 (22%), Positives = 53/131 (40%), Gaps = 6/131 (4%)

Query: 257  EIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKK 316
            E K +  E+          +   K ++   K  E KK +     AEE     +     ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 317  KKK----NKNLDTSIVKAEP--EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSK 370
             KK     K  + +  KA+   +   A      +KK ++ K  DE +  EE  + +E  K
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541

Query: 371  KKKKKKKKDVE 381
             ++KKK  +++
Sbjct: 1542 AEEKKKADELK 1552



 Score = 29.7 bits (66), Expect = 7.5
 Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 278  KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
              K ++   K  E KK   E   A E     D +   ++ KK      +    + ++   
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE--- 1538

Query: 338  AADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
                  +KK ++KK  DE +  EE  + EE  K ++ KK ++
Sbjct: 1539 ------AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 38.5 bits (90), Expect = 0.014
 Identities = 27/125 (21%), Positives = 50/125 (40%), Gaps = 5/125 (4%)

Query: 261 RLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFK-TEFDAAEETPATPDTSSSGKKKKK 319
            L  E      S   + +   +K  G+R    KF     +   E+      + +  K+KK
Sbjct: 396 ELRRELEGEEESDEEENEEPSKKNVGRR----KFGPENGEKEAESKKLKKENKNEFKEKK 451

Query: 320 NKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
             + +  +   E       A+  L + +K +K ++EEE  EE    +  S   K  KK+D
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511

Query: 380 VEDET 384
            + ++
Sbjct: 512 SKKKS 516



 Score = 33.9 bits (78), Expect = 0.32
 Identities = 34/180 (18%), Positives = 65/180 (36%), Gaps = 15/180 (8%)

Query: 222 KMARMLAAKAAL--ATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKA 279
            M  M  A+A        +       +E   +   +   +      G  RR  G    + 
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVG--RRKFGPENGEK 432

Query: 280 KLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAA 339
           + E    K+  K +FK + ++ EE     +  +  +K        +   + E E+E    
Sbjct: 433 EAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492

Query: 340 D-----------VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLN 388
           +            S  K+  KKK+  + +       +  V  KKKKKK+K ++ + D ++
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLID 552


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 38.1 bits (89), Expect = 0.018
 Identities = 23/186 (12%), Positives = 51/186 (27%), Gaps = 13/186 (6%)

Query: 214 QSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSG 273
           ++  K K   A+  +        +    E          +  + +   + +  +      
Sbjct: 15  EAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDI 74

Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
             K      K      +    K   D  + +      ++  K    N   D  ++    +
Sbjct: 75  PKKKTKTAAK--AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132

Query: 334 DEPAAADVSLSKK-----------KKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
           D+    D  L               +     D ++  EE  E +E+ K          ED
Sbjct: 133 DDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192

Query: 383 ETDPLN 388
           +++ L 
Sbjct: 193 DSEALR 198



 Score = 36.5 bits (85), Expect = 0.050
 Identities = 22/136 (16%), Positives = 47/136 (34%)

Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
              A  +  K   K+L  K     F   EE     ++     ++     +  S +  + +
Sbjct: 8   AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67

Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLA 393
           D   +       K   K    +    ++  +E + SKK +KK   D +D+ + +  + + 
Sbjct: 68  DATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVL 127

Query: 394 HSLSRYKLKFSPDKVD 409
           +           D +D
Sbjct: 128 NQADDDDDDDDDDDLD 143



 Score = 36.1 bits (84), Expect = 0.068
 Identities = 25/145 (17%), Positives = 49/145 (33%), Gaps = 4/145 (2%)

Query: 244 SSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEE 303
           +  EL  +  AK ++K +L  +   +      + K  LE       +  +         +
Sbjct: 6   TKAELAAEEEAKKKLK-KLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVK 64

Query: 304 TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGG 363
                  S   KKK K          A+   +    D   S KK +KK   +++      
Sbjct: 65  DTDDATESDIPKKKTKTAAKAA---AAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYV 121

Query: 364 EEEEVSKKKKKKKKKDVEDETDPLN 388
           ++ +V  +       D +D+ D  +
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDD 146



 Score = 29.6 bits (67), Expect = 6.5
 Identities = 17/137 (12%), Positives = 34/137 (24%), Gaps = 3/137 (2%)

Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
                     AAEE         + K K K       I +A    +     +        
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60

Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFSPDKVD 409
              +D ++  E    +++     K    K    +           +  +  L    D   
Sbjct: 61  GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120

Query: 410 TMIVQAVSLLDDLDKEL 426
              +    +L+  D + 
Sbjct: 121 VKDID---VLNQADDDD 134


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 36.1 bits (82), Expect = 0.018
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 2/91 (2%)

Query: 298 FDAAEETPATPDTSSSGKKK--KKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDE 355
           F AA + PA    +  GK K  K  K         +P+DEP   +    +++ +++ + E
Sbjct: 24  FAAAADEPADKGITKGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPE 83

Query: 356 EEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
           EE  EE   EE   ++ + + + + E E +P
Sbjct: 84  EEGEEEPEPEETGEEEPEPEPEPEPEPEPEP 114


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 37.1 bits (86), Expect = 0.027
 Identities = 31/114 (27%), Positives = 47/114 (41%), Gaps = 18/114 (15%)

Query: 290 EKKKFKTEFDAAEETPATPD----TSSSGKKKKKNKNLDTSIVKAEPEDEPAAA------ 339
           EK + KT   AA      P      SS  KKK K K       ++  ++E          
Sbjct: 194 EKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253

Query: 340 --DVSLSKKKKKKKTQDEEEPVEEG------GEEEEVSKKKKKKKKKDVEDETD 385
             +   ++     + +DE+EP   G       E EE  K+K+K+ KK +EDE +
Sbjct: 254 ILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDE 307



 Score = 30.6 bits (69), Expect = 3.5
 Identities = 15/81 (18%), Positives = 21/81 (25%)

Query: 301 AEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
                 T                 T+  K      P +      K K  KKTQD  +   
Sbjct: 133 PNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETT 192

Query: 361 EGGEEEEVSKKKKKKKKKDVE 381
               E + S K    K+   +
Sbjct: 193 TEKTEGKTSVKAASLKRNPPK 213



 Score = 29.8 bits (67), Expect = 5.4
 Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 5/85 (5%)

Query: 300 AAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADV-SLSKKKKKKKTQDEEEP 358
             +    T DTS     +K            +      +  + S  KKK K+K + +E  
Sbjct: 177 KVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEAS 236

Query: 359 VEEGGEEEEVSKKKKKKKKKDVEDE 383
                EE E    ++  K+  + ++
Sbjct: 237 ESTVKEESE----EESGKRDVILED 257


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 36.6 bits (85), Expect = 0.037
 Identities = 28/115 (24%), Positives = 46/115 (40%), Gaps = 4/115 (3%)

Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
            K +  +   +     +    S+  +++ K     TS  K + ED+   +     KKKKK
Sbjct: 19  SKLQPISYIYSNVLVLSKEILSTFSEEENKVAT--TSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFS 404
           KK  +++EP  EG  +      KK KK K    +  P   V  A +      + S
Sbjct: 77  KK--EKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129



 Score = 32.4 bits (74), Expect = 0.82
 Identities = 15/76 (19%), Positives = 33/76 (43%), Gaps = 8/76 (10%)

Query: 310 TSSSGKKKKK--------NKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEE 361
           + ++ K K +           L   I+    E+E   A  S  K KK+ K  + ++  E+
Sbjct: 13  SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72

Query: 362 GGEEEEVSKKKKKKKK 377
             ++++  K+ K + +
Sbjct: 73  KKKKKKEKKEPKSEGE 88


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.9 bits (86), Expect = 0.049
 Identities = 19/146 (13%), Positives = 45/146 (30%), Gaps = 10/146 (6%)

Query: 241  GEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDA 300
             + SS++     +          +E +   LS   K K   ++    +          D 
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294

Query: 301  AEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVE 360
                 + P + +  K KK+ +    ++ K +  ++  A       + K+           
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRL-- 1352

Query: 361  EGGEEEEVSKKKKKKKKKDVEDETDP 386
                     ++ +KKK     ++ D 
Sbjct: 1353 --------LRRPRKKKSDSSSEDDDD 1370



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 16/107 (14%), Positives = 33/107 (30%), Gaps = 7/107 (6%)

Query: 269  RRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIV 328
            +        +   E   G +      K      E + A        KKK + K       
Sbjct: 1282 QYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAAL----KKKKKSEKKTARKKKS 1337

Query: 329  KAEPED---EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKK 372
            K   +      ++  +   +KKK   + ++++  E    E+E  +  
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDD 1384



 Score = 30.0 bits (68), Expect = 5.2
 Identities = 22/170 (12%), Positives = 46/170 (27%), Gaps = 13/170 (7%)

Query: 214  QSSTKNKGKMARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLEEGNLRRLSG 273
            ++S     K       +       +     S  +   D     E +    ++ +++RL  
Sbjct: 1191 KASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD-----EEQKTKPKKSSVKRLKS 1245

Query: 274  TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
                 +K  + + +       K           +    S       +K  D         
Sbjct: 1246 KKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYS---PPPPSKRPDGESNGGSKP 1302

Query: 334  DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
              P        K KK+ +        ++  E++   KKK K + K     
Sbjct: 1303 SSPTKK-----KVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 36.8 bits (85), Expect = 0.055
 Identities = 35/139 (25%), Positives = 51/139 (36%), Gaps = 13/139 (9%)

Query: 268  LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSI 327
            L+ L    K   KL+K   K  E ++      A  E+ A    S    KK   K      
Sbjct: 1155 LKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTT--- 1211

Query: 328  VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
             K   E E        S      +T++  E V+  G     +KKK     K+ E+E + L
Sbjct: 1212 -KKASESETTEETYGSS----AMETENVAEVVKPKGRAG--AKKKAPAAAKEKEEEDEIL 1264

Query: 388  NHVGLAHSLSRYKLKFSPD 406
            +   L   L+ Y L  +P 
Sbjct: 1265 D---LKDRLAAYNLDSAPA 1280


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 35.1 bits (81), Expect = 0.066
 Identities = 24/99 (24%), Positives = 45/99 (45%), Gaps = 6/99 (6%)

Query: 304 TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGG 363
           TP      +  KKKKK    +   VK E E++          KKKK K +D+++      
Sbjct: 50  TPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKD------ 103

Query: 364 EEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLK 402
           ++++   +KK +K+ + + E    ++     +LS  K +
Sbjct: 104 DKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPR 142


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 35.7 bits (83), Expect = 0.076
 Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 3/79 (3%)

Query: 314 GKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKK 373
           G KK   K     + KAE + E         KK    K ++EEE  E+  +EEE  ++++
Sbjct: 404 GSKKATKKIKKI-VEKAEKKREEEK--KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460

Query: 374 KKKKKDVEDETDPLNHVGL 392
           + +++  E+E        L
Sbjct: 461 EAEEEKEEEEEKKKKQATL 479



 Score = 35.3 bits (82), Expect = 0.11
 Identities = 19/82 (23%), Positives = 40/82 (48%)

Query: 297 EFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEE 356
           E    E    T    ++ K KK  +  +    + + E +  A      +++++++ + +E
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452

Query: 357 EPVEEGGEEEEVSKKKKKKKKK 378
           E  EE  EE E  K+++++KKK
Sbjct: 453 EEKEEEEEEAEEEKEEEEEKKK 474



 Score = 34.5 bits (80), Expect = 0.19
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 3/81 (3%)

Query: 263 LEEGNLRRLSGTTKAKAKLEKYHG---KRLEKKKFKTEFDAAEETPATPDTSSSGKKKKK 319
           L E  +  L+G+ KA  K++K      K+ E++K + +  A        +     +KK++
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453

Query: 320 NKNLDTSIVKAEPEDEPAAAD 340
            K  +    + E E+E     
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474



 Score = 33.7 bits (78), Expect = 0.32
 Identities = 19/72 (26%), Positives = 40/72 (55%)

Query: 312 SSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKK 371
            S K  KK K +     K   E++      + + KKK+++ ++E+E  EE  EEEE   +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 372 KKKKKKKDVEDE 383
           ++K+++++ + +
Sbjct: 464 EEKEEEEEKKKK 475



 Score = 32.6 bits (75), Expect = 0.76
 Identities = 19/70 (27%), Positives = 40/70 (57%)

Query: 310 TSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVS 369
           T S    KK  K ++ +  K E E +         KKK++++ +++E+  EE  EEEE +
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 370 KKKKKKKKKD 379
           +++K+++++ 
Sbjct: 463 EEEKEEEEEK 472



 Score = 31.8 bits (73), Expect = 1.4
 Identities = 16/44 (36%), Positives = 21/44 (47%)

Query: 343 LSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
              KK  KK +   E  E+  EEE+  KKKK    K  E+E + 
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 34.3 bits (79), Expect = 0.11
 Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 6/86 (6%)

Query: 294 FKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQ 353
           F T + A +  P+   + S   +K+    ++      E E +       + K+KK+KK +
Sbjct: 114 FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173

Query: 354 DEEEPVEEGGEEEEVSKKKKKKKKKD 379
            E+    +G      SKKKKKKKKK 
Sbjct: 174 KEKMVEPKG------SKKKKKKKKKK 193


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 33.3 bits (77), Expect = 0.20
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 340 DVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
           D    +KKKKKK + E E  EE  ++  VS+++K+ +K 
Sbjct: 78  DAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 34.3 bits (79), Expect = 0.23
 Identities = 19/64 (29%), Positives = 33/64 (51%)

Query: 316 KKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKK 375
           KK K   ++     +E ++  A  + + +  +K +  +  EE  EE  EE  +  KKKK+
Sbjct: 373 KKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKR 432

Query: 376 KKKD 379
           KKK+
Sbjct: 433 KKKE 436


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 34.1 bits (79), Expect = 0.24
 Identities = 21/99 (21%), Positives = 41/99 (41%), Gaps = 17/99 (17%)

Query: 329 KAEPEDEPAAADVSLSKKKK----KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDET 384
               E+E    ++S +++KK    ++K + + E  E      +   +   KK K  + ET
Sbjct: 394 AEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453

Query: 385 DPLNHVGLAHSLSRYK-------------LKFSPDKVDT 410
             ++   L   L+R +             L+ SPD ++T
Sbjct: 454 KKVDPDPLGEKLARTEDPLEEAMKFLKPLLQLSPDNIET 492


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 33.6 bits (77), Expect = 0.25
 Identities = 19/63 (30%), Positives = 29/63 (46%), Gaps = 4/63 (6%)

Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
           +K K NK         E  ++  + D ++ K K   K Q  E   +   +E++  KKK K
Sbjct: 19  EKYKANK----DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74

Query: 375 KKK 377
           KKK
Sbjct: 75  KKK 77


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 34.4 bits (79), Expect = 0.25
 Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 7/80 (8%)

Query: 302 EETPATPDTSSSGKKKKKNKNLDTSIVKAE--PEDEPAAADVSLSKKKKKKKTQDEEEPV 359
            E+      S  G KKKK K+  T    AE  P+DE    D     KK +KK +D+   V
Sbjct: 407 NESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDE---EDAPKKGKKNQKKGRDKSSKV 463

Query: 360 EEGGEEEEVSKKKKKKKKKD 379
               + +   KK+  K ++D
Sbjct: 464 P--SDSKAGGKKESVKSQED 481


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.9 bits (75), Expect = 0.30
 Identities = 23/74 (31%), Positives = 33/74 (44%), Gaps = 16/74 (21%)

Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLS----KKKKKKKTQDEEEPVEEGGEEEEVSKKK 372
           KKK+K+        +P  E   +D        KK KKKK +D++E            +KK
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKE------------RKK 157

Query: 373 KKKKKKDVEDETDP 386
           KKK+KK  +    P
Sbjct: 158 KKKEKKKKKKRHSP 171



 Score = 29.8 bits (67), Expect = 3.6
 Identities = 14/47 (29%), Positives = 19/47 (40%)

Query: 335 EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
            P        KK + +    EE P +  G +    K KKKK + D E
Sbjct: 108 PPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKE 154


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 32.8 bits (75), Expect = 0.47
 Identities = 13/72 (18%), Positives = 29/72 (40%), Gaps = 5/72 (6%)

Query: 303 ETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKK----TQDEEEP 358
           E   + D  ++  ++ K +  + +    + ED+  A       +++ ++    + DE E 
Sbjct: 50  EAKKSDDQETAEIEEVKEEEKEAA-NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108

Query: 359 VEEGGEEEEVSK 370
             E   E  V K
Sbjct: 109 ETEEKTESNVEK 120



 Score = 30.9 bits (70), Expect = 1.9
 Identities = 15/54 (27%), Positives = 21/54 (38%)

Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
               AA      KK   +   E E V+E  +E   S+ K+ K   + EDE    
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93



 Score = 28.9 bits (65), Expect = 8.4
 Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 15/86 (17%)

Query: 315 KKKKKNKNLDTSI---------------VKAEPEDEPAAADVSLSKKKKKKKTQDEEEPV 359
           K++K N+ L+ +I                 + P D+ AA +    K   ++  + EE   
Sbjct: 8   KRRKINRVLNIAIGIVVVLIIIVAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKE 67

Query: 360 EEGGEEEEVSKKKKKKKKKDVEDETD 385
           EE        K+ K   +K+ E+  +
Sbjct: 68  EEKEAANSEDKEDKGDAEKEDEESEE 93


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 32.5 bits (74), Expect = 0.59
 Identities = 20/98 (20%), Positives = 37/98 (37%), Gaps = 15/98 (15%)

Query: 304 TPATPDTSSSGKKKKKNKNL----------DTSIVKAEPEDEPAAADVS-----LSKKKK 348
            P  P+     K    N +               ++   E E + A  S          K
Sbjct: 184 GPGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVK 243

Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
            KK+ DEE+  ++G   EE   K+ +++++  E+E + 
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEE 281


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.0 bits (75), Expect = 0.66
 Identities = 19/63 (30%), Positives = 28/63 (44%)

Query: 324 DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
           D+S    +PE+        +  K + ++ +D EE  EE  EEE    KK KK KK    +
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348

Query: 384 TDP 386
              
Sbjct: 349 NGL 351



 Score = 30.7 bits (69), Expect = 3.1
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 4/57 (7%)

Query: 329 KAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETD 385
           K  PE           ++ +  +  +EE+  EEGG  ++  K KK K KK+  D+ D
Sbjct: 303 KLSPEIPAKPEI----EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 32.7 bits (75), Expect = 0.70
 Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 3/63 (4%)

Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
           K KK    LD+     +   + +    S +K KKKKK        +    +E  S  KKK
Sbjct: 334 KTKKTQTRLDSFFTATKKPIKKSN---SKAKLKKKKKKAGASAVPKSETSQEAKSSGKKK 390

Query: 375 KKK 377
            KK
Sbjct: 391 VKK 393


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 32.7 bits (74), Expect = 0.79
 Identities = 30/122 (24%), Positives = 52/122 (42%), Gaps = 14/122 (11%)

Query: 264 EEGNLRRLSGTTKAKAKLEKYH-------GKRLEKKKFKTEFDAAEETPATPDTSSSGKK 316
           E+G++  +   +K   K EK          K+ E + FK+   A +++PA+   S +   
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASA-ASVAEAD 246

Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK 376
           +    N  +        DEP  A+   +KK  K K + + +      E+EE  KKKK   
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRK------EKEEKKKKKKHHH 300

Query: 377 KK 378
            +
Sbjct: 301 HR 302


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.6 bits (74), Expect = 0.92
 Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 1/79 (1%)

Query: 92  DTKRAAYPSLGTT-VLLYPYSWSGSSSNAAAKHFHNSFVINIKKPKKSSSSSKHHHHATS 150
           +   A+   L +T  L+   +  G+ S + + +     +   +K K    S  H HH+  
Sbjct: 215 EKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIG 274

Query: 151 KKNASVTHATPSVETGHTK 169
             N    HA   ++T    
Sbjct: 275 INNHHSKHADSKLQTIEVI 293


>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
          Length = 233

 Score = 31.7 bits (72), Expect = 0.92
 Identities = 14/49 (28%), Positives = 20/49 (40%)

Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE 365
           K  NK L+   V+   E+ P +     SKKK +   + E E   E    
Sbjct: 183 KSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSL 231


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 32.0 bits (73), Expect = 0.94
 Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKK 374
           +PEDE   +  SL  ++ KKK   EE   ++  E EE+   +KK
Sbjct: 73  KPEDEKELSASSLEAEQAKKK---EEAEAKKKKEMEELKAVQKK 113


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 29/150 (19%), Positives = 55/150 (36%), Gaps = 13/150 (8%)

Query: 231 AALATRVDALGEDSSIEL--GTDHRAKLEIKLRLLEEGNLRRLSGTTKAKAKLEKYHGKR 288
            A  +R     ++   +L   T++ A+L I L  L+             + KL++   K 
Sbjct: 162 EAAGSREKRKKKERLKKLIEETENLAELIIDLEELK-----------LQELKLKEQAKKA 210

Query: 289 LEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKK 348
           LE  + K + +  EE     D     +++           + E E      +       +
Sbjct: 211 LEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270

Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
             K   EEE  ++  EEE     K++++ K
Sbjct: 271 VLKENKEEEKEKKLQEEELKLLAKEEEELK 300


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 32.3 bits (73), Expect = 1.3
 Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 317 KKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKK-KTQDEEEPVEEGGEEEEVSKKKKKK 375
           K +  +LD  I+K       A  D+S           +DE+E  ++  ++EE  +++++ 
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180

Query: 376 KKKDVEDETDP 386
           K  D EDE D 
Sbjct: 181 KGFDDEDEEDE 191


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 32.0 bits (73), Expect = 1.3
 Identities = 29/125 (23%), Positives = 39/125 (31%), Gaps = 11/125 (8%)

Query: 267 NLRRLSGTTKAKAKLEKYHGKRLEKKKFKT-EFDAAEETPATPDTSSSG-----KKKKKN 320
           NL  L        + EK   +  +K K      D         D          KK KK 
Sbjct: 222 NLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKK 281

Query: 321 KNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDV 380
           K       K   EDE       L             + VEE     E S KK+K++++D 
Sbjct: 282 KKKKKKRRKDLDEDELEPEAEGLGSSDSGS-----RKDVEEENARLEDSPKKRKEEQEDD 336

Query: 381 EDETD 385
           +   D
Sbjct: 337 DFVED 341



 Score = 30.1 bits (68), Expect = 4.3
 Identities = 25/104 (24%), Positives = 37/104 (35%), Gaps = 13/104 (12%)

Query: 287 KRLEKKKFKTEFDAAEETPATP-------DTSSSGKKKKKNKNLDTSIVK-----AEPED 334
           K LE KK K ++D  ++            D    GKKKK +              AE + 
Sbjct: 181 KNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKR 240

Query: 335 EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
           +     + ++       +  E    +     E V  KK KKKKK
Sbjct: 241 QEVKKKLKINNVSLDDDS-TETPASDYYDVSEMVKFKKPKKKKK 283


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.0 bits (73), Expect = 1.5
 Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 11/80 (13%)

Query: 311  SSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK----KKTQDEEEPVEEGGEEE 366
             S+ KK  + +N +      + E EPA      S K+KK        ++E+ +EE   E 
Sbjct: 1511 PSNEKKSIEIENRN------QEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 367  EVSKKKKKKK-KKDVEDETD 385
            ++ K+K KK+ K + E E D
Sbjct: 1565 DIKKRKNKKQYKSNTEAELD 1584


>gnl|CDD|149180 pfam07960, CBP4, CBP4.  The CBP4 in S. cerevisiae is essential for
           the expression and activity of ubiquinol-cytochrome c
           reductase. This family appears to be fungal specific.
          Length = 128

 Score = 30.1 bits (68), Expect = 1.7
 Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 315 KKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEE-EEVSKKKK 373
              K+    D  I K  P + P     ++ + +K K  + ++E +E   EE EE   + +
Sbjct: 64  TILKETSKSDKPIWKTGPIESPWEKKRNVREFQKTKAEEAQKEELERIREELEEARAQSE 123

Query: 374 KKKKK 378
           + +K+
Sbjct: 124 EMRKE 128


>gnl|CDD|166134 PLN02493, PLN02493, probable peroxisomal (S)-2-hydroxy-acid
           oxidase.
          Length = 367

 Score = 31.2 bits (70), Expect = 1.8
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 2/75 (2%)

Query: 608 GELVGARLVAQAGSLLNLAKHPASTVQILGAEKALFRSL-LNLAKHPASTVQILG-AEKA 665
           GE   AR  + AG+++ L+    S+V+ + +     R   L + K+     Q++  AE+A
Sbjct: 88  GEYATARAASAAGTIMTLSSWATSSVEEVASTGPGIRFFQLYVYKNRNVVEQLVRRAERA 147

Query: 666 LFRALKTKRDTPKYG 680
            F+A+    DTP+ G
Sbjct: 148 GFKAIALTVDTPRLG 162


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 31.4 bits (71), Expect = 1.8
 Identities = 18/67 (26%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 113 SGSSSNAAAKHFHNSFVINIKKPKK-----SSSSSKHHHHATSKKNASVTHATPSVETGH 167
           SG  S ++ K    S   +  K +K     SS+S +H   +    N + +   P +  G 
Sbjct: 836 SGPFSASSTKSSSKS--SSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGS 893

Query: 168 TKPLRAK 174
           +KP R K
Sbjct: 894 SKPRRPK 900


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 31.7 bits (71), Expect = 1.9
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 3/66 (4%)

Query: 297 EFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEE 356
           E  AA+E+PA P          + + L  S    EP D P   D    K  K+K ++D  
Sbjct: 750 EPSAAQESPANP--WPRAPPCDEQEPLSVSPYGPEP-DRPPDDDFETRKGLKRKSSEDHA 806

Query: 357 EPVEEG 362
           +P+ EG
Sbjct: 807 DPIPEG 812


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 31.6 bits (72), Expect = 1.9
 Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 9/166 (5%)

Query: 223 MARMLAAKAALATRVDALGEDSSIELGTDHRAKLEIKLRLLE---EGNLRRLSGTTKAKA 279
           +  +      L  ++  LGE+  + +      +LE ++  LE       R L     A+ 
Sbjct: 267 LEEIEQLLEELNKKIKDLGEEEQLRVKEK-IGELEAEIASLERSIAEKERELE---DAEE 322

Query: 280 KLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAA 339
           +L K   +  +      E +   E         + +  +  + L+    + E  D+  A 
Sbjct: 323 RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAE 382

Query: 340 --DVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             D     ++K +K + E   ++   +  +   ++  ++  D+   
Sbjct: 383 TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 12/39 (30%), Positives = 17/39 (43%)

Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
           K K+ K  ++     +  G E    K K KK KKD +  
Sbjct: 28  KPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 32/117 (27%), Positives = 45/117 (38%), Gaps = 24/117 (20%)

Query: 300 AAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPV 359
            AE+   T +     KKK                 E  A +  L K K  +K    +   
Sbjct: 7   EAEKKILTEEELERKKKK-----------------EEKAKEKELKKLKAAQKEAKAKLQA 49

Query: 360 EEGGEEEEVSKKKKKK-KKKDVEDET-----DPLNHVGLAHSLSRYKLK-FSPDKVD 409
           ++  +   V KK +KK +K+DVEDE      DP    G    LS    K +SP  V+
Sbjct: 50  QQASDGTNVPKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVE 106


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 30.8 bits (69), Expect = 2.3
 Identities = 24/117 (20%), Positives = 35/117 (29%), Gaps = 2/117 (1%)

Query: 274 TTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPE 333
           T + +  LE+       +   K    A  E     +      +KK  K       K   E
Sbjct: 115 TAEQRQLLEQMQAD--MRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVE 172

Query: 334 DEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHV 390
            E  A+ V  +K+K+K   +   E        E   K  K  K K    E       
Sbjct: 173 TEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEKAERAAA 229


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 31.2 bits (70), Expect = 2.3
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 10/74 (13%)

Query: 321 KNLDTSIVKAEPEDE---PAAADVSLSKKKKKKKTQDEEEP-------VEEGGEEEEVSK 370
           +NL +SI   E + E   P   D+ L ++++ +  +DEE         V+E  EEEE  +
Sbjct: 115 ENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174

Query: 371 KKKKKKKKDVEDET 384
            K  K   D+E +T
Sbjct: 175 LKTMKDFIDLESQT 188


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 9/58 (15%)

Query: 329 KAEPEDEPAAAD----VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
           + E +++ A  D    V+++KK ++ KT  +E         +   K+KKKKKKK++++
Sbjct: 37  EKEEKEKEAEPDEDGWVTVTKKGRRPKTARKE-----SVAAKAAEKEKKKKKKKELKN 89


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.7 bits (69), Expect = 2.7
 Identities = 19/93 (20%), Positives = 39/93 (41%), Gaps = 6/93 (6%)

Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
           ++ + + +   A+  P   DTSS  + K+  +N    I KA+ E +    +        K
Sbjct: 246 QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEA------LK 299

Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVED 382
            K     +  +E    E+ ++ K+ + +K  E 
Sbjct: 300 AKDHKAFDLKQESKASEKEAEDKELEAQKKREP 332


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 328 VKAEP-EDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
           VK E  ED+    +    + ++ +  +DEEE VE+  EEEE   ++     KD+E +   
Sbjct: 43  VKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE--DEEDNVDLKDIEKKNIN 100


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 29.9 bits (67), Expect = 3.0
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
           +VKA+ E + AA   +L+K K+      E E     G+  EV+   ++ + +
Sbjct: 142 VVKAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGE 193


>gnl|CDD|223928 COG0859, RfaF, ADP-heptose:LPS heptosyltransferase [Cell envelope
           biogenesis, outer membrane].
          Length = 334

 Score = 30.4 bits (69), Expect = 3.6
 Identities = 10/59 (16%), Positives = 20/59 (33%), Gaps = 2/59 (3%)

Query: 149 TSKKNASVTHATPSVETGHTKPLRAKRKHQLKLKRFKQTLCRRDF--CSNTQILMKIIL 205
             K  A +    P ++       + K     +     +TL +  +    + Q L+K  L
Sbjct: 38  VPKGFAPILKLNPEIDKVIIIDKKKKGLGLKERLALLRTLRKERYDAVIDLQGLLKSAL 96


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 13/64 (20%)

Query: 314 GKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKK 373
           G  KK+ +  +   V+ E  DE    D  L K+  K K              E+  + ++
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLK-------------REKRRENER 141

Query: 374 KKKK 377
           K+K+
Sbjct: 142 KQKE 145


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.0 bits (68), Expect = 4.2
 Identities = 22/90 (24%), Positives = 29/90 (32%), Gaps = 20/90 (22%)

Query: 290 EKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKK 349
             K+        E  P   D      KKKK +           ED  AA       + +K
Sbjct: 207 LYKELLKSLLIPEFKPL--DKYLKESKKKKRET---------EEDVEAA-----ESRAEK 250

Query: 350 KKTQDEEEPVEEGGEEEEVSKKKKKKKKKD 379
           K+   EE   ++  E    SK  K  KK  
Sbjct: 251 KRKSKEEIKKKKPKE----SKGVKALKKVV 276


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 28.4 bits (64), Expect = 4.4
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 345 KKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
           ++KKKKK + E E   E  E++EVS+++K  +K 
Sbjct: 39  REKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|166620 PLN02979, PLN02979, glycolate oxidase.
          Length = 366

 Score = 30.1 bits (67), Expect = 4.4
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 2/75 (2%)

Query: 608 GELVGARLVAQAGSLLNLAKHPASTVQILGAEKALFRSL-LNLAKHPASTVQILG-AEKA 665
           GE   AR  + AG+++ L+    S+V+ + +     R   L + K+     Q++  AE+A
Sbjct: 87  GEYATARAASAAGTIMTLSSWATSSVEEVASTGPGIRFFQLYVYKNRNVVEQLVRRAERA 146

Query: 666 LFRALKTKRDTPKYG 680
            F+A+    DTP+ G
Sbjct: 147 GFKAIALTVDTPRLG 161


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 30.1 bits (68), Expect = 4.6
 Identities = 18/108 (16%), Positives = 27/108 (25%), Gaps = 19/108 (17%)

Query: 298 FDAAEETPATPDTSSSGKKKKKNKNLDTSI-------------VKAEPEDEPAAADVSLS 344
            D+  +  A       G   K +   D  I                +   E +      +
Sbjct: 5   LDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRN 64

Query: 345 KKKKKKKTQDEEEPVEEGG------EEEEVSKKKKKKKKKDVEDETDP 386
            KK  K T+      EE         E   S   K K      +E + 
Sbjct: 65  PKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEE 112


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 29.6 bits (67), Expect = 4.9
 Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 8/82 (9%)

Query: 278 KAKLEKYHGKRLEKKKFKTE-FDAAEETPATPDTSSSGKKKKKNKNLDTSIVKA------ 330
           KA+LE+   K+ +KKK K    + A  + A    SS+   +  +        K       
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232

Query: 331 -EPEDEPAAADVSLSKKKKKKK 351
             P++E +    SL    KK+K
Sbjct: 233 IAPDNEKSEVYKSLFTSHKKEK 254


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 29.9 bits (68), Expect = 5.5
 Identities = 25/134 (18%), Positives = 48/134 (35%), Gaps = 9/134 (6%)

Query: 278 KAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPA 337
           +A+LE+    R  + K   E  AA++  A     +  K KK        I      D  A
Sbjct: 459 QARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSA 518

Query: 338 AADVSLSKKKKKKKTQDEEEPVEEGGEEEEV---------SKKKKKKKKKDVEDETDPLN 388
                 ++K + +  Q E++       ++           +KK  ++      +E     
Sbjct: 519 VIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPK 578

Query: 389 HVGLAHSLSRYKLK 402
              +A +++R K K
Sbjct: 579 KAAVAAAIARAKAK 592


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.1 bits (68), Expect = 5.5
 Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 11/88 (12%)

Query: 295 KTEFDAAEE-TPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKKKTQ 353
           KT  + A++              KK N   D++    +     AA   +   KK   KTQ
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQ 795

Query: 354 DEEEPVEEGGEEEEVSKKKKKKKKKDVE 381
                     +    +K K+  KKK  E
Sbjct: 796 ----------KIAAATKAKRAAKKKVAE 813


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.8 bits (67), Expect = 5.7
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 343 LSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             K  + K   D  E  E   +EE++ +++ +KK+K+  DE
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422


>gnl|CDD|178608 PLN03046, PLN03046, D-glycerate 3-kinase; Provisional.
          Length = 460

 Score = 29.9 bits (67), Expect = 5.7
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 3/64 (4%)

Query: 107 LYPYSWSGSSSNAAAKHFHNSFVINIKKPKKSSSSSKHHHHATSKKNASVTHATPSVETG 166
            YP      +S AA      S   N+K+   S SS K +    +  ++  +    S +T 
Sbjct: 26  DYPIP---CASAAAFNFNPISNPFNLKRRPYSPSSHKFNDRVAASCSSYPSSKLSSRKTH 82

Query: 167 HTKP 170
            + P
Sbjct: 83  LSCP 86


>gnl|CDD|178937 PRK00227, glnD, PII uridylyl-transferase; Provisional.
          Length = 693

 Score = 29.7 bits (67), Expect = 5.7
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 609 ELVGARLVAQAGSLLNLAKHPASTVQILGAE 639
           E VGA +VA+A   + L     + VQ L AE
Sbjct: 423 EQVGAEMVARAARRMGLNLRDRAVVQTLVAE 453


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 29.4 bits (66), Expect = 6.1
 Identities = 9/63 (14%), Positives = 18/63 (28%), Gaps = 6/63 (9%)

Query: 324 DTSIVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
           D   ++  P      A   L          +E +P  E  EE+     + +   +    +
Sbjct: 30  DFVGIELVPLAVFLLAAKVLEAP------TEEPQPEPEPPEEQPKPPTEPETPPEPTPPK 83

Query: 384 TDP 386
              
Sbjct: 84  PKE 86


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.2 bits (66), Expect = 6.7
 Identities = 22/103 (21%), Positives = 40/103 (38%), Gaps = 10/103 (9%)

Query: 282 EKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED------- 334
           EK    +  K+    E    +E  A  +  +  +K+K ++   T I+   P D       
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILT--PADFAKIQEL 194

Query: 335 -EPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKK 376
                 D +L  K K++     E   +E  + +++    KKKK
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPERHSDELVDADDIEGPAKKKK 237


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 29.0 bits (65), Expect = 7.0
 Identities = 20/93 (21%), Positives = 42/93 (45%)

Query: 291 KKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSKKKKKK 350
           +K F  +    +E            K+++ K       K +   EP   D++  ++K++ 
Sbjct: 28  QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87

Query: 351 KTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             ++E+E   E  +++EV   K+KK  K++  E
Sbjct: 88  AQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLE 120


>gnl|CDD|235205 PRK04028, PRK04028, glutamyl-tRNA(Gln) amidotransferase subunit E;
           Validated.
          Length = 630

 Score = 29.8 bits (68), Expect = 7.0
 Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 532 DKTSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENILLLCDQVLEISSYRGQLYDY 591
            K +  ++ + L E+  +  +EAAE      +S++++E I    D+++E      +  D+
Sbjct: 537 AKEAIEEILKELAENPGKSAEEAAEELGLKGLSEEEVEKI---IDEIVE------ENIDF 587

Query: 592 LKSR-MMAVAPNLTILMGELVG 612
           +K + M A  P +  +M EL G
Sbjct: 588 VKEKGMGAFGPLMGEVMAELRG 609


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.8 bits (65), Expect = 7.2
 Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 276 KAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTS-SSGKKKKKNKNLDTSIV---KAE 331
           + K +  K   K+  K+  K E     E     +    S KK    +N + +IV      
Sbjct: 12  RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEF 71

Query: 332 PEDEPAAADVSLSKKKKKKK 351
            + E A  D+ L KKKKKKK
Sbjct: 72  ADGEQAKKDLKLKKKKKKKK 91


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 29.6 bits (66), Expect = 7.2
 Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 327 IVKAEPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
           +V+   E E    D   +KK  +   + + E V+EG E     KKKK KK K+V  E
Sbjct: 206 MVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG----KKKKTKKVKEVTKE 258


>gnl|CDD|240059 cd04708, BAH_plantDCM_II, BAH, or Bromo Adjacent Homology domain,
           second copy present in DNA
           (Cytosine-5)-methyltransferases (DCM) from plants. DNA
           methylation, or the covalent addition of a methyl group
           to cytosine within the context of the CpG dinucleotide,
           has profound effects on the genome. These effects
           include transcriptional repression via inhibition of
           transcription factor binding, the recruitment of
           methyl-binding proteins and their associated chromatin
           remodeling factors, X chromosome inactivation,
           imprinting, and the suppression of parasitic DNA
           sequences. DNA methylation is also essential for proper
           embryonic development and is an important player in both
           DNA repair and genome stability. BAH domains are found
           in a variety of proteins playing roles in
           transcriptional silencing and the remodeling of
           chromatin. It is assumed that in most or all of these
           instances the BAH domain mediates protein-protein
           interactions.
          Length = 202

 Score = 29.0 bits (65), Expect = 7.3
 Identities = 9/37 (24%), Positives = 14/37 (37%)

Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEEEPVEEGGEEEE 367
           + E     A  S  +K+K K   D E   E   +  +
Sbjct: 150 KEEAYSTGASDSALRKRKGKGKGDSESDSEAPVKAPK 186


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 29.2 bits (65), Expect = 7.7
 Identities = 25/111 (22%), Positives = 38/111 (34%), Gaps = 8/111 (7%)

Query: 276 KAKAKLE-KYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPED 334
           KAKA    K     L K+K +   +  EE  A     ++   K K   L     +   E 
Sbjct: 94  KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153

Query: 335 EPAAADVSLSKKKKKKKTQDE-------EEPVEEGGEEEEVSKKKKKKKKK 378
                +    K K K     +       ++   E GE  E   +++K K K
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAK 204


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 7.9
 Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 9/92 (9%)

Query: 299 DAAEETPATPDTSSSGKKKKKNKNLDTSIV---KAEPEDEPAAADVSLSKKKKKKKTQDE 355
           D  E  P  P   +  K K  +++ D  +      E ++E       ++ K K KK    
Sbjct: 5   DDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKA--- 61

Query: 356 EEPVEEGGEEEEVSKKKKKKKKKDVEDETDPL 387
              ++   EE+E +K++K++K     +E  P 
Sbjct: 62  ---LKAKIEEKEKAKREKEEKGLRELEEDTPE 90


>gnl|CDD|214617 smart00323, RasGAP, GTPase-activator protein for Ras-like GTPases. 
           All alpha-helical domain that accelerates the GTPase
           activity of Ras, thereby "switching" it into an "off"
           position. Improved domain limits from structure.
          Length = 344

 Score = 29.2 bits (66), Expect = 7.9
 Identities = 12/51 (23%), Positives = 27/51 (52%), Gaps = 7/51 (13%)

Query: 539 LSEILPEDVEEKVKEAAEIS---MGTEISDDDIENILLLCDQVLE--ISSY 584
           L  +L + VE K  ++ E+    +  E  + ++EN+L   +++ +  I+S 
Sbjct: 119 LKPVLKKIVESK--KSCEVDPAKLEGEDLETNLENLLQYVERLFDAIINSS 167


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 29.3 bits (66), Expect = 7.9
 Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 6/71 (8%)

Query: 331 EPEDEPAAADVSLSKKKKKKKTQDEE------EPVEEGGEEEEVSKKKKKKKKKDVEDET 384
             + +         KK KK K   ++      E + E  E E     KKKKKKK  + + 
Sbjct: 10  VKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKK 69

Query: 385 DPLNHVGLAHS 395
           +      LA+ 
Sbjct: 70  NLGEAYDLAYD 80


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 28.9 bits (65), Expect = 8.0
 Identities = 29/118 (24%), Positives = 42/118 (35%), Gaps = 8/118 (6%)

Query: 268 LRRLSGTTKAKAKLEKYHGKRLEKKKFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTS- 326
           LR L   +K +AK  K      E KK K   D+  E     +     K++KK +   TS 
Sbjct: 94  LRELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKP-KEPGLKRKQRKKAQESATSP 152

Query: 327 ---IVKAEPEDEPAAADVSLSK---KKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKK 378
                       P+   +  +K    ++ KK          G E+   SK   KK  K
Sbjct: 153 ESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGK 210


>gnl|CDD|239124 cd02659, peptidase_C19C, A subfamily of Peptidase C19. Peptidase
           C19 contains ubiquitinyl hydrolases. They are
           intracellular peptidases that remove ubiquitin molecules
           from polyubiquinated peptides by cleavage of isopeptide
           bonds. They hydrolyze bonds involving the carboxyl group
           of the C-terminal Gly residue of ubiquitin. The purpose
           of the de-ubiquitination is thought to be editing of the
           ubiquitin conjugates, which could rescue them from
           degradation, as well as recycling of the ubiquitin. The
           ubiquitin/proteasome system is responsible for most
           protein turnover in the mammalian cell, and with over 50
           members, family C19 is one of the largest families of
           peptidases in the human genome.
          Length = 334

 Score = 29.1 bits (66), Expect = 8.3
 Identities = 12/65 (18%), Positives = 21/65 (32%), Gaps = 13/65 (20%)

Query: 655 STVQIL----GAEKALFRALKTKRDTPKYGLIYHSQLI---------GQSSTKNKGKMGS 701
           S +Q L        A++    T+ D     +    Q +            +T+   K  S
Sbjct: 16  SLLQQLYMTPEFRNAVYSIPPTEDDDDNKSVPLALQRLFLFLQLSESPVKTTELTDKTRS 75

Query: 702 YGWQP 706
           +GW  
Sbjct: 76  FGWDS 80


>gnl|CDD|227753 COG5466, COG5466, Predicted small metal-binding protein [Function
           unknown].
          Length = 59

 Score = 26.7 bits (59), Expect = 8.4
 Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 4/38 (10%)

Query: 534 TSASDLSEILPEDVEEKVKEAAEISMGTEISDDDIENI 571
             A   +E++   VE   +   E    TEI +D I  I
Sbjct: 19  ARADSEAEVMRRIVEHAKEAHGE----TEIREDMINKI 52


>gnl|CDD|187818 cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily
           protein Cas7.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas7 is a RAMP superfamily protein; Subunit of the
           Cascade complex; also known as Cst2/DevR family.
          Length = 302

 Score = 28.9 bits (65), Expect = 9.3
 Identities = 11/60 (18%), Positives = 20/60 (33%)

Query: 349 KKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLKFSPDKV 408
            K     EE         E    K++K+K+   +    + ++      +R     SP  V
Sbjct: 155 GKFEVGIEEKKCNPRGRLEYRLPKEEKEKRRANNLLKAIRNLSGGAKQARRLEDLSPKFV 214


>gnl|CDD|199835 cd10277, PQQ_ADH_I, Ethanol dehydrogenase, a bacterial quinoprotein
           (PQQ-dependent type I alcohol dehydrogenase).  This
           bacterial family of homodimeric ethanol dehydrogenases
           utilize pyrroloquinoline quinone (PQQ) as a cofactor. It
           represents proteins whose expression may be induced by
           ethanol, and which are similar to quinoprotein methanol
           dehydrogenases, but have higher specificities for
           ethanol and other primary and secondary alcohols.
           Dehydrogenases with PQQ cofactors, such as ethanol,
           methanol, and membrane-bound glucose dehydrogenases,
           form an 8-bladed beta-propeller.
          Length = 529

 Score = 29.2 bits (66), Expect = 9.4
 Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 418 LLDDLDKELNNYMMRCREWYGWHFPELGKIVTDNVA 453
           LL+D  KE  N++   R + G  +  L +I TDNV 
Sbjct: 1   LLND-AKETGNWLTYGRGYNGQRYSPLKQINTDNVK 35


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.0 bits (65), Expect = 9.5
 Identities = 17/117 (14%), Positives = 38/117 (32%), Gaps = 25/117 (21%)

Query: 287 KRLEKK-KFKTEFDAAEETPATPDTSSSGKKKKKNKNLDTSIVKAEPEDEPAAADVSLSK 345
           KR  ++ K +       + P+ P+            ++   I K + +            
Sbjct: 290 KRTTRRVKMRPVRAKPSDEPSLPE-----------SDIHEEIPKLDEKSLSEFLGYMGGI 338

Query: 346 KKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDPLNHVGLAHSLSRYKLK 402
            +  +   DEE   E   +++   K +K+K                 +++  R KL+
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPV-------------SNNFRRLKLR 382


>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
           This family represents homologues of COX16 which has
           been shown to be involved in assembly of cytochrome
           oxidase. Protein in this family are typically between
           106 and 134 amino acids in length.
          Length = 79

 Score = 26.7 bits (60), Expect = 9.5
 Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 8/43 (18%)

Query: 341 VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDE 383
             L  +++ +K Q  EE        EE  K  KK++K D+E+E
Sbjct: 22  TQLRYERRDRKVQQLEE--------EEALKLLKKRRKVDLEEE 56


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 27.0 bits (60), Expect = 9.6
 Identities = 14/46 (30%), Positives = 24/46 (52%)

Query: 341 VSLSKKKKKKKTQDEEEPVEEGGEEEEVSKKKKKKKKKDVEDETDP 386
           + + KKKKKKK +++ +      +EEE     +   K+  EDE + 
Sbjct: 17  IDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNE 62


>gnl|CDD|218242 pfam04751, DUF615, Protein of unknown function (DUF615).  This
           family of bacterial proteins has no known function.
          Length = 153

 Score = 28.3 bits (64), Expect = 9.8
 Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 9/60 (15%)

Query: 614 RLVAQAGSLLN--LAKHPASTVQILGAEKALFRSLLNLAKHPASTVQILGAEKALFRALK 671
           RL+A   S L   L ++P +  Q L       R L+  A+   +  +   A + LF+ L+
Sbjct: 100 RLIADGDSALEEFLEEYPDADRQQL-------RQLIRNARKEKAANKPPKAYRELFQYLR 152


>gnl|CDD|204610 pfam11207, DUF2989, Protein of unknown function (DUF2989).  Some
           members in this bacterial family of proteins are
           annotated as lipoproteins however this cannot be
           confirmed.
          Length = 203

 Score = 28.4 bits (64), Expect = 10.0
 Identities = 30/99 (30%), Positives = 42/99 (42%), Gaps = 11/99 (11%)

Query: 27  NLYESFQTPQGANKVLKLKHFEKFIDTTEALAATTAAVEGKLCKKLK-KVLKSLVSSDVQ 85
           + YE  + P   NK   L   EK+    E LAA    +E    K+ K   +++L+ S   
Sbjct: 41  HRYELKKNPTDKNKYKLLIALEKYSKCIE-LAAG---IEHIKLKERKTDRVRALLHSIQN 96

Query: 86  ENLLVADTKRAAYPSLGTTVLLYPYSWSGSSSNAAAKHF 124
              L  +TK +  P      LLY Y WS     AA + F
Sbjct: 97  LERLQEETKDSNDPY-----LLY-YHWSRFGDQAALRQF 129


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.128    0.357 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,327,010
Number of extensions: 3484050
Number of successful extensions: 6242
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5145
Number of HSP's successfully gapped: 351
Length of query: 716
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 612
Effective length of database: 6,324,786
Effective search space: 3870769032
Effective search space used: 3870769032
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.2 bits)