RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11719
         (657 letters)



>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
            NSA2 appears to be a protein required for the
           maturation of 27S pre-rRNA in yeast; it has been
           characterized in mammalian cells as a nucleolar protein
           that might play a role in the regulation of the cell
           cycle and in cell proliferation.
          Length = 257

 Score =  559 bits (1444), Expect = 0.0
 Identities = 200/256 (78%), Positives = 227/256 (88%), Gaps = 1/256 (0%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
           NEYIE HRK +GRRLD+EERKRK+EAREPHKR++KA+ +R +KAKLYNK+R  EKIQMKK
Sbjct: 1   NEYIELHRKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKK 60

Query: 395 KIKAHEEKLTKQKHEKKE-EGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
            IK HEE+  KQK + K  EGA+P YLLDR+ QSRAKVLSNMIKQKRKEKAGKW VP+PK
Sbjct: 61  TIKMHEERNVKQKVDDKVPEGAVPAYLLDREGQSRAKVLSNMIKQKRKEKAGKWSVPLPK 120

Query: 454 VRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHV 513
           VR  ++ E+FK+V++GK K+K+WKRM+TKV FVGE FTRKPPK+ERFIRPM LRF KAHV
Sbjct: 121 VRGISEDEMFKVVKTGKRKKKSWKRMVTKVTFVGEGFTRKPPKYERFIRPMGLRFKKAHV 180

Query: 514 THPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQ 573
           THPELKATF LPIIGVKKNP SP+YT LGVITKGT+IEVN+SELGLVTQ GKVVWGKYAQ
Sbjct: 181 THPELKATFCLPIIGVKKNPQSPLYTQLGVITKGTIIEVNVSELGLVTQGGKVVWGKYAQ 240

Query: 574 VTNNPENDGCINAVLL 589
           VTNNPENDGCINAVLL
Sbjct: 241 VTNNPENDGCINAVLL 256


>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
          Length = 223

 Score =  429 bits (1104), Expect = e-148
 Identities = 165/222 (74%), Positives = 192/222 (86%), Gaps = 1/222 (0%)

Query: 369 KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK-HEKKEEGALPVYLLDRDVQS 427
           KA+ +R IKAKL+NK+R  EK +MKK IKAHEEK  K+K  EK  +GA+P YLLDR+  +
Sbjct: 1   KAQKLRGIKAKLFNKKRYAEKAEMKKTIKAHEEKDVKEKVPEKVPDGAVPSYLLDREQVN 60

Query: 428 RAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVG 487
           RAKVLSNM+KQKRKEKAGKW+VPIPKV+   + E+FK+V+SGK K+K+WKR++ KV FVG
Sbjct: 61  RAKVLSNMLKQKRKEKAGKWDVPIPKVKAMNEDEMFKVVKSGKRKKKSWKRLVNKVTFVG 120

Query: 488 ENFTRKPPKFERFIRPMALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKG 547
           E+FTRKPPK+ERFIRP  LRF KAHVTHPELK TFYL IIGVKKNP S +YT+LGVITKG
Sbjct: 121 EDFTRKPPKYERFIRPTGLRFKKAHVTHPELKTTFYLDIIGVKKNPQSNLYTSLGVITKG 180

Query: 548 TVIEVNISELGLVTQAGKVVWGKYAQVTNNPENDGCINAVLL 589
           T+IEVN+SELGLVTQ GKVVWGKYAQVTNNPENDGCINAVLL
Sbjct: 181 TIIEVNVSELGLVTQTGKVVWGKYAQVTNNPENDGCINAVLL 222


>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal protein
           S8e and similar proteins.  This family contains the
           eukaryotic/archaeal ribosomal protein S8, a component of
           the small ribosomal subunits, as well as the NSA2 gene
           product.
          Length = 138

 Score =  230 bits (589), Expect = 8e-73
 Identities = 73/140 (52%), Positives = 91/140 (65%), Gaps = 2/140 (1%)

Query: 450 PIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFN 509
           P  K + +A    FK+VR  +      K   TK+   GE FTRK       I+P ALR +
Sbjct: 1   PQGKHKRKATGGKFKVVRKKRKYELGRKPANTKLG--GERFTRKVRVRGGNIKPRALRLD 58

Query: 510 KAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWG 569
           KA+V+HPEL  T  L I+GVKKNP++  Y   GVITKGT+IE ++S LGLVT+ GKVVWG
Sbjct: 59  KANVSHPELGKTKKLKILGVKKNPSNNEYVRRGVITKGTIIETDVSPLGLVTEEGKVVWG 118

Query: 570 KYAQVTNNPENDGCINAVLL 589
            YAQVT+ P  DGC+NAVLL
Sbjct: 119 GYAQVTSRPGQDGCVNAVLL 138


>gnl|CDD|224918 COG2007, RPS8A, Ribosomal protein S8E [Translation, ribosomal
           structure and biogenesis].
          Length = 127

 Score =  125 bits (317), Expect = 5e-34
 Identities = 42/113 (37%), Positives = 51/113 (45%), Gaps = 26/113 (23%)

Query: 477 KRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHVTHPELKATFYLPIIGVKKNPNSP 536
           KR   KV   G NF  +           ALR   A+V+ PE   T    IIGV +NP +P
Sbjct: 40  KRKRKKVRTRGGNFKVR-----------ALRAKTANVSDPETGKTKKAKIIGVVENPANP 88

Query: 537 MYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQVTNNPENDGCINAVLL 589
            Y    +ITKG +IE  I                YA VT+ P  DG +NAVLL
Sbjct: 89  EYVRRNIITKGAIIETEIG---------------YAVVTSRPGQDGVVNAVLL 126



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREA-REP--HKRAEKARSVRKIKAKLYNKQRR 386
                  RK  G +     +KRK E  REP   K  EK R  +K++ +  N + R
Sbjct: 3   IWQGRSKRKPTGGKRHQARKKRKFELGREPTETKIGEK-RKRKKVRTRGGNFKVR 56


>gnl|CDD|201659 pfam01201, Ribosomal_S8e, Ribosomal protein S8e. 
          Length = 129

 Score =  121 bits (306), Expect = 1e-32
 Identities = 57/255 (22%), Positives = 76/255 (29%), Gaps = 129/255 (50%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
           N+ I   RK  G RLD+  +KR                                      
Sbjct: 4   NQGIRSKRKRTGGRLDHHRKKR-------------------------------------- 25

Query: 395 KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
                          K E G  P            K+  N +KQ R              
Sbjct: 26  ---------------KFELGRPPAP---------TKLGKNRVKQVR-------------- 47

Query: 455 RTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHVT 514
                      VR G  K +A                              LR +KA+V+
Sbjct: 48  -----------VRGGNRKVRA------------------------------LRLDKANVS 66

Query: 515 HPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQV 574
            P    T  + I+GV  NP +  Y    +ITKG +IE ++SELG             A+V
Sbjct: 67  WPSEGTTRKVRILGVVYNPANNEYVRRNIITKGAIIETDVSELGK------------ARV 114

Query: 575 TNNPENDGCINAVLL 589
           T+ P  DG +N VLL
Sbjct: 115 TSRPGQDGRVNGVLL 129


>gnl|CDD|235214 PRK04049, PRK04049, 30S ribosomal protein S8e; Validated.
          Length = 127

 Score = 61.4 bits (150), Expect = 2e-11
 Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 15/85 (17%)

Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
            LR N A+V  P+   T  + I+ V +NP +P Y    +ITKG +IE         T+ G
Sbjct: 56  LLRANYANVADPKTGKTKKVKILTVVENPANPHYARRNIITKGAIIE---------TELG 106

Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
           K      A+VT+ P  DG +NAVL+
Sbjct: 107 K------AKVTSRPGQDGVVNAVLI 125


>gnl|CDD|211394 cd11382, Ribosomal_S8e, Eukaryotic/archaeal ribosomal protein S8e
           (RPS8).  The eukaryotic/archaeal ribosomal protein S8 is
           a component of the small (40S in eukaryotes, 30S in
           archaea) ribosomal subunits and interacts tightly with
           18S rRNA (16S rRNA in archaea, presumably).
          Length = 122

 Score = 53.3 bits (129), Expect = 1e-08
 Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 15/85 (17%)

Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
           ALR +  +V+ P    T  + I+ V  NP +        +TKG +IE    + GL     
Sbjct: 53  ALRLDTGNVSWPSEGKTKKVKILDVVYNPANNELVRRNTLTKGAIIET---DRGL----- 104

Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
                  A+VT+ P  DG +N VLL
Sbjct: 105 -------ARVTSRPGQDGVVNGVLL 122


>gnl|CDD|129407 TIGR00307, S8e, ribosomal protein S8.e.  Archaeal and eukaryotic
           ribosomal protein S8. This model could easily have been
           split into two models, one for eukaryotic S8 and one for
           archaeal S8; eukaryotic forms invariably have in insert
           of about 80 residues that archaeal forms of S8 do not
           [Protein synthesis, Ribosomal proteins: synthesis and
           modification].
          Length = 127

 Score = 50.2 bits (120), Expect = 1e-07
 Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 15/85 (17%)

Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
            LR   A+V+ PE      + I GV +NP +  Y    VITKG ++E +I          
Sbjct: 56  LLRDETANVSDPETGKVKKVEIAGVVENPANNHYVRRNVITKGAIVETDIG--------- 106

Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
                 YA+VT+ P  DG +N VL+
Sbjct: 107 ------YARVTSRPGQDGVVNGVLI 125


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 43.9 bits (104), Expect = 3e-05
 Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 348 RLDYEERK-RKREAREP--HKRAEKARSVRKIKAKLYNK-QRRNEKIQMKKKIKAHEEKL 403
           RL+  + K +K    E    KR EK R   +  AK   K Q++ +K + KKK K   +K 
Sbjct: 51  RLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110

Query: 404 TKQKHEKKEEGA 415
            K+  +  EE +
Sbjct: 111 EKEGSKSSEESS 122


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 43.9 bits (103), Expect = 5e-05
 Identities = 32/126 (25%), Positives = 59/126 (46%), Gaps = 24/126 (19%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK---K 395
           E  R+    RL  EE  R++E     ++ EKA+   + + K   +Q   E+IQ +K   +
Sbjct: 59  ELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEK--QEQEEQERIQKQKEEAE 116

Query: 396 IKAHEE----KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPI 451
            +A EE    +L ++KH ++ E         ++   R K L  ++K+ RK +        
Sbjct: 117 ARAREEAERMRLEREKHFQQIE---------QERLERKKRLEEIMKRTRKSEVS------ 161

Query: 452 PKVRTQ 457
           P+V+ +
Sbjct: 162 PQVKKE 167


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 30/124 (24%), Positives = 55/124 (44%), Gaps = 22/124 (17%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
           ++ RK Y    DY    R++E  E  K  +K +     K +L    +  +  ++K     
Sbjct: 48  DKFRKNYKFLDDY----REKEIEELEKALKKTKDSE-EKEELKRTLQSMKS-RLKTLKNK 101

Query: 399 HEEKLTKQKHEKKE-----EGALPVYLLDRDVQ-----------SRAKVLSNMIKQKRKE 442
             E+   ++H+K+E     EG  P YL   +++            ++K L   +++KRK+
Sbjct: 102 DREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKK 161

Query: 443 KAGK 446
            AGK
Sbjct: 162 NAGK 165


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 43.4 bits (103), Expect = 3e-04
 Identities = 23/118 (19%), Positives = 46/118 (38%), Gaps = 28/118 (23%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYN----------------------KQRRNEK 389
            ER+ +REA    ++ EK R V +++A+                         K+R+ EK
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222

Query: 390 IQMKKKIKAHEE---KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
            + +K+ +  +E      +Q  EK+E           +   R ++L    + +  E+ 
Sbjct: 223 EEAEKRRRQKQELQRAREEQIEEKEERLQE---ERAEEEAERERMLEKQAEDEELEQE 277



 Score = 38.0 bits (89), Expect = 0.013
 Identities = 22/102 (21%), Positives = 45/102 (44%), Gaps = 14/102 (13%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK---------IK 397
           R    EE++ + +     + AE+ R + K       +Q   EK +MK+          I+
Sbjct: 239 REEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIE 298

Query: 398 AHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK 439
             EE+   ++ E+ EEG     L + + + +A++     +Q+
Sbjct: 299 EKEERRAAEREEELEEGER---LREEEAERQARI--EEERQR 335



 Score = 37.6 bits (88), Expect = 0.015
 Identities = 25/121 (20%), Positives = 44/121 (36%), Gaps = 24/121 (19%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARS---VRKIKAKLYNKQRRNEKIQM 392
           E  E+  K   ++   EE     E R   K  EK R      KI      K  R E+ + 
Sbjct: 115 EAQEKREK---QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREA 171

Query: 393 KKKIKAHE---------EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
           +++ +  E          +  + + E++E   L     D   +   +      K+++KEK
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDEL---RADLYQEEYER------KERQKEK 222

Query: 444 A 444
            
Sbjct: 223 E 223



 Score = 36.4 bits (85), Expect = 0.041
 Identities = 24/129 (18%), Positives = 51/129 (39%), Gaps = 28/129 (21%)

Query: 334 HNEYIERHRKLYGRRLD-YEERKRKREAREPHKRAEKARSVRKIKAKLY-------NKQR 385
             + I+   K   RR+D   E +R +   E  +  E+ R   + + +          ++R
Sbjct: 28  EKKRIKAEEKEEERRIDEMMEEERLKALAE-EEERERKRKEERREGRAVLQEQIEEREKR 86

Query: 386 R--------NEKIQMK---KKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSN 434
           R         E+ QM    ++I+  +E   ++K EK+++       L  ++         
Sbjct: 87  RQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKK-------LREEID-EFNEERI 138

Query: 435 MIKQKRKEK 443
             K++ KE+
Sbjct: 139 ERKEEEKER 147



 Score = 35.7 bits (83), Expect = 0.070
 Identities = 17/129 (13%), Positives = 49/129 (37%), Gaps = 14/129 (10%)

Query: 352 EERKRKREAREPHKR-AEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
           E+++ K E +E  +R  E     R        ++   E+ + +++ +     L +Q  E+
Sbjct: 28  EKKRIKAEEKEEERRIDEMMEEERLKALAEEEER---ERKRKEERREGRAV-LQEQIEER 83

Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGK 470
           ++          ++ +   +++  + ++   E   K E    + + + + + F       
Sbjct: 84  EKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK---QKKLREEIDEF------N 134

Query: 471 TKRKAWKRM 479
            +R   K  
Sbjct: 135 EERIERKEE 143



 Score = 35.3 bits (82), Expect = 0.082
 Identities = 22/112 (19%), Positives = 45/112 (40%), Gaps = 26/112 (23%)

Query: 348 RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEK----- 402
           R D  + + +R+ R+     EK  + ++ + K   ++ R E+I+ K++    E       
Sbjct: 205 RADLYQEEYERKERQ----KEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAE 260

Query: 403 ----LTKQKHEKKEEGALPVYLLDRDVQSRAK------VLSNMIKQKRKEKA 444
               L KQ  +++ E            + R K       L   I++K + +A
Sbjct: 261 RERMLEKQAEDEELEQE-------NAEKRRMKRLEHRRELEQQIEEKEERRA 305



 Score = 35.3 bits (82), Expect = 0.093
 Identities = 26/129 (20%), Positives = 51/129 (39%), Gaps = 31/129 (24%)

Query: 338 IERHRKLYGRRLDYEERKRKREAR------------EPHKRAEKARSVRKIKA--KLYNK 383
           IE   K   R+ +YEER   +E              +  +  EK    +K++     +N+
Sbjct: 80  IEEREKR--RQEEYEERL--QEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNE 135

Query: 384 QRRNEKIQMKKKIKAHEEKL---TKQKHEKKEEGALPVYLLDRDVQSRAK-----VLSNM 435
           +R   K + K++ +  E K+    ++K E++EE        +R  +   K      L   
Sbjct: 136 ERIERKEEEKEREREEELKILEYQREKAEREEE-----REAERRERKEEKEREVARLRAQ 190

Query: 436 IKQKRKEKA 444
            ++   E+ 
Sbjct: 191 QEEAEDERE 199



 Score = 30.3 bits (69), Expect = 3.7
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
           EE++ +R A    +R E+     +++ +   +Q R E+ +  + +K H E L
Sbjct: 298 EEKEERRAA----EREEELEEGERLREEEAERQARIEEERQ-RLLKEHAEAL 344


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 41.0 bits (97), Expect = 0.002
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 6/87 (6%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-----RSVRKIKAKLYNKQRR 386
            L  E  E+  KL     D    + ++EA++  K A+K      + +R+++   Y   + 
Sbjct: 548 KLKEELEEKKEKLQEEE-DKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA 606

Query: 387 NEKIQMKKKIKAHEEKLTKQKHEKKEE 413
           +E I+ +K++    EK  K+K ++KE+
Sbjct: 607 HELIEARKRLNKANEKKEKKKKKQKEK 633


>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
           structure and biogenesis].
          Length = 150

 Score = 38.4 bits (90), Expect = 0.003
 Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 9/71 (12%)

Query: 339 ERH-RKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI------Q 391
           +RH +K  GRR     RK  + AR P K     R +R ++ +L  K R + KI      +
Sbjct: 65  KRHAQKKKGRRRGPGSRKGTKGARMPSKERWIKR-IRALRREL-RKLRDDGKIDKHTYRK 122

Query: 392 MKKKIKAHEEK 402
           + +  K    K
Sbjct: 123 LYRMAKGGAFK 133


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 40.3 bits (95), Expect = 0.003
 Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 20/101 (19%)

Query: 357 KREAREPHKRA--EKARSVRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQ------ 406
           K+EA    K A  E    V K++A+L    K+RRNE  ++++++   EE L ++      
Sbjct: 42  KKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDK 101

Query: 407 ---KHEKKEEGALPVYLLDR--DVQSRAKVLSNMIKQKRKE 442
                EKKE+      L ++  ++  + + L  +I ++R+E
Sbjct: 102 KEENLEKKEKE-----LSNKEKNLDEKEEELEELIAEQREE 137


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 40.0 bits (94), Expect = 0.004
 Identities = 22/60 (36%), Positives = 37/60 (61%), Gaps = 7/60 (11%)

Query: 347 RRLDYEE-RKRKRE---AREPHKRAE-KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
           ++L  EE + R+ E    RE   R E KA+ ++KIK+K Y++  + EK  +K+++K  EE
Sbjct: 192 KKLSPEEAKARRAELRKMRELLFREEAKAKRIKKIKSKKYHRVHKKEK--LKEELKEFEE 249


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 39.6 bits (92), Expect = 0.005
 Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 12/113 (10%)

Query: 355 KRKREAREPHKRAEKAR-----SVRKIKAKL-YNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
           K++ E R+  + A+        S  K   ++  N++R  EK Q++ K    E    K K 
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL--KAKD 302

Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
            K  +         +  +  A+      ++KR+  A   +   P+V  Q  + 
Sbjct: 303 HKAFDLKQES----KASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSL 351



 Score = 30.0 bits (67), Expect = 4.9
 Identities = 17/91 (18%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 356 RKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK--EE 413
           ++RE++E  KRA++ +     K    +K ++          K  +E   KQ+  K   + 
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263

Query: 414 GALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
                   D+ V    K      + + K+  
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKND 294


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 39.0 bits (91), Expect = 0.007
 Identities = 35/149 (23%), Positives = 59/149 (39%), Gaps = 25/149 (16%)

Query: 339 ERHRKLYGRRLDYEERKRK-----REAREPHKRAEKARSVRKIKAKLY--NKQRRNE--- 388
           ER ++L   RL  +E+K++     ++A    K+AE+A +     AK     + +R     
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAA 160

Query: 389 -KIQMKKKIKAHEEKLTKQKHE-KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
            K   + K KA  E   K   E KK+  A        + + +A+  +   K+K   +A K
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEA---KKKAAAEAKK 217

Query: 447 WEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
                     +A AE          + KA
Sbjct: 218 ----------KAAAEAKAAAAKAAAEAKA 236



 Score = 34.0 bits (78), Expect = 0.24
 Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 12/110 (10%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
           E+RK+K + +    + ++A    ++K     K+R     Q +KK      K    K ++ 
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQL--EKER--LAAQEQKKQAEEAAKQAALKQKQA 134

Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
           EE A           + A       + KR   A K      K + +A+A 
Sbjct: 135 EEAA--------AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAA 176


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 39.2 bits (91), Expect = 0.007
 Identities = 25/134 (18%), Positives = 51/134 (38%), Gaps = 16/134 (11%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQ---RRNEKIQMKKKIKAHEEKL 403
           + L+Y + K K E  E +        + + +  L  +     + E    K++++  EE L
Sbjct: 209 KALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEIL 268

Query: 404 TKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVF 463
            +   E KEE         +  +   K+L+   ++ + E          K+  +   +  
Sbjct: 269 AQVLKENKEEEK-----EKKLQEEELKLLAKEEEELKSELL--------KLERRKVDDEE 315

Query: 464 KIVRSGKTKRKAWK 477
           K+  S K  +K  K
Sbjct: 316 KLKESEKELKKLEK 329



 Score = 34.9 bits (80), Expect = 0.16
 Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
           +   +   + L+ EE    +  +E +K  EK + +++ + KL  K+    K ++ K  + 
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKE-NKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 309

Query: 399 HEEKLTKQKHEKKEEGAL 416
             +   K K  +KE   L
Sbjct: 310 KVDDEEKLKESEKELKKL 327



 Score = 33.4 bits (76), Expect = 0.47
 Identities = 16/94 (17%), Positives = 36/94 (38%), Gaps = 4/94 (4%)

Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
             E+++K+E  +      +  +   I  +    Q    K Q KK      E    ++  +
Sbjct: 166 SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKA----LEYYQLKEKLE 221

Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
            EE  L      +  + R  +L  +++ +++E  
Sbjct: 222 LEEENLLYLDYLKLNEERIDLLQELLRDEQEEIE 255


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 39.3 bits (91), Expect = 0.007
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 6/79 (7%)

Query: 336 EYIERHRKLYGRRLDYEE-RKRKRE---AREPHKRAE-KARSVRKIKAKLYNKQRRNEKI 390
             +++  +L   +L  EE  +R R+    RE   R E KA+ V KIK+K Y K R+N K 
Sbjct: 353 SALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKE 412

Query: 391 QMKKKIKAHEEKLTKQKHE 409
           + +  +    E L  +K E
Sbjct: 413 K-EMALIPKSEDLENEKSE 430


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 37.8 bits (88), Expect = 0.008
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 14/82 (17%)

Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
             E  E  R         EERK+  E RE  ++ E+     + K K   +++  E+    
Sbjct: 20  QREAEEEER---------EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE---- 66

Query: 394 KKIKAHEE-KLTKQKHEKKEEG 414
           +  K  EE +  K     +EEG
Sbjct: 67  QARKEQEEYEKLKSSFVVEEEG 88



 Score = 31.6 bits (72), Expect = 0.79
 Identities = 13/67 (19%), Positives = 31/67 (46%), Gaps = 5/67 (7%)

Query: 352 EERKRKREAREPHKRAEKA-----RSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
             +KR +   +  +R ++      R  RK   +    +R+ E+   +++ K  EE+  K+
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 407 KHEKKEE 413
           + E+  +
Sbjct: 64  REEQARK 70



 Score = 31.6 bits (72), Expect = 0.79
 Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
           EE++ +R+ RE  +   + R   + K +   K+   E  + ++K K  EE+  +++  +K
Sbjct: 12  EEKQARRQQREAEEEEREERKKLEEKREGERKEEE-ELEEEREKKKEEEERKEREEQARK 70

Query: 412 EE 413
           E+
Sbjct: 71  EQ 72


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 38.3 bits (89), Expect = 0.010
 Identities = 25/119 (21%), Positives = 51/119 (42%), Gaps = 19/119 (15%)

Query: 347 RRLDYEERKRKREAR----EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEK 402
           +R   + R+++ E R    +  K+AE+A    + K K   + +  +  + K K +A  EK
Sbjct: 83  QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEK 142

Query: 403 LTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
             K++ +K+ E            +      +   K+K  E   K E    + + +A+A+
Sbjct: 143 KAKEEAKKQAE------------EEAKAKAAAEAKKKAAEAKKKAE---AEAKAKAEAK 186



 Score = 37.5 bits (87), Expect = 0.019
 Identities = 27/127 (21%), Positives = 49/127 (38%), Gaps = 9/127 (7%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKI-- 396
            R ++L  R    +  K+  +A +  +  +K     K K     K +   + + K K   
Sbjct: 89  ARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148

Query: 397 --KAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
             +A EE   K   E K++ A      + +  ++AK      K K K +  K +    K 
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAE--AKAKAE---AKAKAKAEEAKAKAEAAKA 203

Query: 455 RTQADAE 461
           +  A+A 
Sbjct: 204 KAAAEAA 210



 Score = 31.3 bits (71), Expect = 1.7
 Identities = 20/112 (17%), Positives = 46/112 (41%), Gaps = 14/112 (12%)

Query: 352 EERKRKREAREPHKRAEKARS--VRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHE 409
           ++  R ++ ++P  + E+ R   + +   +   KQR  E+ + K+     E++   +K  
Sbjct: 50  QQANRIQQQKKPAAKKEQERQKKLEQQAEEA-EKQRAAEQARQKEL----EQRAAAEKAA 104

Query: 410 KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
           K+ E A       +     AK       ++  E   K E    + + + +A+
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAK------AKQAAEAKAKAE-AEAEKKAKEEAK 149



 Score = 30.6 bits (69), Expect = 2.9
 Identities = 22/109 (20%), Positives = 46/109 (42%), Gaps = 9/109 (8%)

Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
           E++R+++  +  + AEK R      A+   ++   ++   +K  K   E+  KQ  EK++
Sbjct: 66  EQERQKKLEQQAEEAEKQR-----AAEQARQKELEQRAAAEKAAKQ-AEQAAKQAEEKQK 119

Query: 413 EGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
           +         +             ++K KE+A K      K +  A+A+
Sbjct: 120 QAEEAKA---KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK 165


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 38.5 bits (90), Expect = 0.014
 Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 20/113 (17%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
           E  E + KL     +Y +  R+ E R   +  E+   + +   +L  K+ R E++  KKK
Sbjct: 290 EKAEEYIKLSEFYEEYLDELREIEKRL-SRLEEEINGIEERIKELEEKEERLEEL--KKK 346

Query: 396 IKAHEEKLT--KQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
           +K  E++L   +++HE  EE               AK     +++ +K   G 
Sbjct: 347 LKELEKRLEELEERHELYEE---------------AKAKKEELERLKKRLTGL 384



 Score = 37.7 bits (88), Expect = 0.019
 Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK-K 394
           E  +R  +L  R   YEE K K+E  E  K+     +  K++ +L   ++  E+I+ +  
Sbjct: 349 ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408

Query: 395 KIKAHEEKLTKQKHEKKE 412
           KI A   +L K+  E K+
Sbjct: 409 KITARIGELKKEIKELKK 426



 Score = 35.4 bits (82), Expect = 0.099
 Identities = 15/83 (18%), Positives = 36/83 (43%), Gaps = 7/83 (8%)

Query: 339 ERHRKLYGRRLDYE---ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ---- 391
           E + +L    L+        R        +R E  +++ K+K +L  +++  ++++    
Sbjct: 659 EEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEK 718

Query: 392 MKKKIKAHEEKLTKQKHEKKEEG 414
             ++++   EK+ K K   KE  
Sbjct: 719 ALERVEELREKVKKYKALLKERA 741



 Score = 34.7 bits (80), Expect = 0.16
 Identities = 19/82 (23%), Positives = 43/82 (52%), Gaps = 14/82 (17%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
             +NEY+E   K   + L+ EE++ K+   E          + K   +L   ++R E++ 
Sbjct: 599 PFYNEYLEL--KDAEKELEREEKELKKLEEE----------LDKAFEELAETEKRLEEL- 645

Query: 392 MKKKIKAHEEKLTKQKHEKKEE 413
            +K+++  E+K +++++E+  E
Sbjct: 646 -RKELEELEKKYSEEEYEELRE 666



 Score = 32.7 bits (75), Expect = 0.73
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 13/83 (15%)

Query: 346 GRRLDYEERKR---------KREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI----QM 392
           GR L  E RK          KR  +E  +  EK R +RK   +L    ++  ++    ++
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKEL 501

Query: 393 KKKIKAHEEKLTKQKHEKKEEGA 415
            +++K  EEKL K   E+ E+ A
Sbjct: 502 AEQLKELEEKLKKYNLEELEKKA 524



 Score = 31.6 bits (72), Expect = 1.5
 Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 15/125 (12%)

Query: 331 MSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKAR-SVRKIKAKLYNKQRRNEK 389
            S   E  E   KL     + EE K + E  E  K  E    S RK++ K+   + R E+
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELE--KELESLEGSKRKLEEKIRELEERIEE 270

Query: 390 IQMKKKIKAHEEK---LTKQKHEKKEEGALPVYLLD-----RDVQSRAKVLSNMIK--QK 439
             +KK+I+  EEK   L + K + +E   L  +  +     R+++ R   L   I   ++
Sbjct: 271 --LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEE 328

Query: 440 RKEKA 444
           R ++ 
Sbjct: 329 RIKEL 333



 Score = 31.2 bits (71), Expect = 2.3
 Identities = 17/82 (20%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 349 LDYEERKRK-REAREPHKRAEKARS-VRKIKAKLYNKQR-RNEKIQMKKKIKAHEEKLTK 405
            + EE ++K  E  +  ++  K +  ++ +K +L   +  + +  +++KK+   EE+L +
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAE 574

Query: 406 QKHEKKEEGALPVYLLDRDVQS 427
              E +E G   V  L+  ++ 
Sbjct: 575 LLKELEELGFESVEELEERLKE 596



 Score = 30.4 bits (69), Expect = 3.6
 Identities = 22/103 (21%), Positives = 40/103 (38%), Gaps = 25/103 (24%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREAR--------------EPHKRAE--KARS----VR 374
              IER  K   R  + EE  +++E                 P  R E  K       + 
Sbjct: 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE 234

Query: 375 KIKAKLYNKQRRNEKIQMKK-----KIKAHEEKLTKQKHEKKE 412
           ++K ++   ++  E ++  K     KI+  EE++ + K E +E
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277



 Score = 29.3 bits (66), Expect = 7.6
 Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 21/128 (16%)

Query: 333 LHNEYIERHRKLYGRRLDYEER---------------KRKREAREPHKRAEKARSVRKIK 377
            + EY++  R++  R    EE                +R  E ++  K  EK     + +
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360

Query: 378 AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK----EEGALPVYLLDRDVQSRAKVLS 433
            +LY + +   K +  +++K     LT +K EK+    E+    +      + +R   L 
Sbjct: 361 HELYEEAKA--KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELK 418

Query: 434 NMIKQKRK 441
             IK+ +K
Sbjct: 419 KEIKELKK 426


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 37.5 bits (87), Expect = 0.020
 Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 4/73 (5%)

Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA---HEEK 402
           GR+ ++E R+ KRE  +  K    A        +  ++ +R EK + KK       H   
Sbjct: 343 GRQSEWEARQAKREGGD-AKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSW 401

Query: 403 LTKQKHEKKEEGA 415
             K+K ++K+  A
Sbjct: 402 EAKKKAKEKKANA 414


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 36.3 bits (84), Expect = 0.025
 Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 350 DYEERKRKREAREPHKRAEKARSVRKIKAKL-YNKQRRNEKIQ--MKKKIKAHEEKLTKQ 406
           D+  +KRKRE +    +  K R V + +    Y    R+ + Q  +++K K  ++K+ + 
Sbjct: 139 DWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEA 198

Query: 407 KHEKKEE 413
           K  +KEE
Sbjct: 199 KRREKEE 205



 Score = 30.5 bits (69), Expect = 2.0
 Identities = 28/125 (22%), Positives = 55/125 (44%), Gaps = 21/125 (16%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKL--YNKQRRNEKI 390
           L  E  E +  L     +  E+++++EAR   + AE A+++ K+   +  +  Q+R    
Sbjct: 90  LEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKR---- 145

Query: 391 QMKKKIKAHEEKLTKQK--HEKKEEGALPV---------YLLDRDVQSRAKVLSNMIKQK 439
             K++ KA   K  K++   E +E     V          L  ++ + + KV     K++
Sbjct: 146 --KREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV--KEAKRR 201

Query: 440 RKEKA 444
            KE+ 
Sbjct: 202 EKEEK 206


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 37.4 bits (86), Expect = 0.030
 Identities = 24/117 (20%), Positives = 48/117 (41%), Gaps = 9/117 (7%)

Query: 339  ERHRKLYGRRLDYEERK--RKREAREPHKRAEKARSVRKIKAKLYNKQRRNE----KIQM 392
            +  +K   ++   E +K   K++A E  K+AE+A+   + K K    +++ +    K + 
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE 1340

Query: 393  KKK---IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
             KK       E +    + E  EE A        + + +A       ++K+K    K
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397



 Score = 37.0 bits (85), Expect = 0.042
 Identities = 24/97 (24%), Positives = 47/97 (48%)

Query: 347  RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
            ++ + +E+K     ++  + A+KA  ++K +A+   K    +K + + KIKA E K   +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740

Query: 407  KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
            + +KK E A       + +    K      ++ RKEK
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777



 Score = 36.7 bits (84), Expect = 0.052
 Identities = 31/121 (25%), Positives = 54/121 (44%), Gaps = 15/121 (12%)

Query: 339  ERHRKLYGRRLDYEERK----RKREAREPHKRAE--KARSVRKIKAKLYNKQRRNEKIQM 392
            E+      ++   E +K    +K+EA E  K  E  KA    KIKA+   K+   +K + 
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK-KK 1745

Query: 393  KKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIP 452
             ++ K  EE+  K  H KKEE        ++  +   K    +I+++  E+  K  + + 
Sbjct: 1746 AEEAKKDEEEKKKIAHLKKEE--------EKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797

Query: 453  K 453
            K
Sbjct: 1798 K 1798



 Score = 36.3 bits (83), Expect = 0.070
 Identities = 39/149 (26%), Positives = 67/149 (44%), Gaps = 14/149 (9%)

Query: 338  IERHRKLYG--RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI--QMK 393
            IE   KLY   +++  EE K+  EA+   +  +KA   +K   +L  K+   +K   ++K
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653

Query: 394  K-----KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRK--EKAGK 446
            K     KIKA EE    ++ +KK E A      + D +  A+ L    ++ +K  E   K
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKA---EEDEKKAAEALKKEAEEAKKAEELKKK 1710

Query: 447  WEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
                  K      AE    +++ + K++A
Sbjct: 1711 EAEEKKKAEELKKAEEENKIKAEEAKKEA 1739



 Score = 36.3 bits (83), Expect = 0.073
 Identities = 24/128 (18%), Positives = 56/128 (43%), Gaps = 12/128 (9%)

Query: 352  EERKRK----REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
            +E K+K    ++A E  K+AE+A+   + K K    ++ +E  +  ++ K   ++  K  
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506

Query: 408  HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
              KK+           +   +A       + K+ ++A K E    + +   + +  + ++
Sbjct: 1507 EAKKKADEAK----KAEEAKKADEAKKAEEAKKADEAKKAE----EKKKADELKKAEELK 1558

Query: 468  SGKTKRKA 475
              + K+KA
Sbjct: 1559 KAEEKKKA 1566



 Score = 35.9 bits (82), Expect = 0.086
 Identities = 27/128 (21%), Positives = 50/128 (39%), Gaps = 12/128 (9%)

Query: 348  RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
                EE K+K  A    K+AE+ +   + K K    +++ ++++     K   ++  K+ 
Sbjct: 1370 EKKKEEAKKK--ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKA 1427

Query: 408  HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
             EKK+         +      AK        K+ E+A K E    K      A+  K  +
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAK--------KKAEEAKKAEEAKKKAEEAKKADEAK--K 1477

Query: 468  SGKTKRKA 475
              +  +KA
Sbjct: 1478 KAEEAKKA 1485



 Score = 34.7 bits (79), Expect = 0.18
 Identities = 28/125 (22%), Positives = 53/125 (42%), Gaps = 22/125 (17%)

Query: 352  EERKRKREARE-PHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
            EE K+  E +    ++AE+A+   + + +   K    EK    ++ K  EE   K +  K
Sbjct: 1567 EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626

Query: 411  KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGK 470
            K E            + + KV    +K+K  E+  K E    +++    AE    +++ +
Sbjct: 1627 KAE------------EEKKKV--EQLKKKEAEEKKKAE----ELKK---AEEENKIKAAE 1665

Query: 471  TKRKA 475
              +KA
Sbjct: 1666 EAKKA 1670



 Score = 34.7 bits (79), Expect = 0.18
 Identities = 25/128 (19%), Positives = 52/128 (40%), Gaps = 16/128 (12%)

Query: 352  EERKRK----REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
            +E K+K    ++A E  K+AE+A+   + K K     ++ E+ + K +     ++  K+ 
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEA-KKAEEAKKKAEEAKKADEAKKKA 1479

Query: 408  HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
             E K+          +     AK  ++  K+  + K    E    +   +AD       +
Sbjct: 1480 EEAKKADEA------KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE-----AK 1528

Query: 468  SGKTKRKA 475
              +  +KA
Sbjct: 1529 KAEEAKKA 1536



 Score = 34.7 bits (79), Expect = 0.18
 Identities = 25/118 (21%), Positives = 48/118 (40%), Gaps = 8/118 (6%)

Query: 339  ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
            E  +K    +   E++       E  K+AE+AR    +K     K+ + E+ +  ++ K 
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 399  HEEKLTKQKHEKKEEGALPVYLLD--------RDVQSRAKVLSNMIKQKRKEKAGKWE 448
              E+L K + EKK+   L     +        +  +   K+ +    +K +E   K E
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678



 Score = 34.3 bits (78), Expect = 0.26
 Identities = 31/144 (21%), Positives = 56/144 (38%), Gaps = 19/144 (13%)

Query: 339  ERHRKLYGRRLDYEERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKK 395
                K+    L   E ++K+  +   K AE   KA  ++K + +   K     K   + K
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674

Query: 396  IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
             KA E K  ++  +K  E       L ++ +   K     +K+K  E+  K E    +++
Sbjct: 1675 KKAEEAKKAEEDEKKAAEA------LKKEAEEAKKA--EELKKKEAEEKKKAE----ELK 1722

Query: 456  TQADAEVFKIVRSGKT----KRKA 475
               +    K   + K     K+KA
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKA 1746



 Score = 34.0 bits (77), Expect = 0.35
 Identities = 30/147 (20%), Positives = 55/147 (37%), Gaps = 14/147 (9%)

Query: 339  ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIK----AKLYNKQRRNEKIQMKK 394
            E  +K    +   E +K   EA+    +AE+ R+  +I+    A++ +  RR   I+ ++
Sbjct: 1221 EDAKKAEAVKKAEEAKKDAEEAK----KAEEERNNEEIRKFEEARMAHFARRQAAIKAEE 1276

Query: 395  KIKAHE----EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVP 450
              KA E    E+  K    KK E         +  +   K  ++  K+K +E   K +  
Sbjct: 1277 ARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK--ADEAKKKAEEAKKKADAA 1334

Query: 451  IPKVRTQADAEVFKIVRSGKTKRKAWK 477
              K      A       +     +A  
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 33.6 bits (76), Expect = 0.46
 Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 4/81 (4%)

Query: 339  ERHRKLYGRRLDYEERKRKREAR---EPHKRAEKARSVRKIKAKLYNKQRRNE-KIQMKK 394
            E  +K   ++   E +K+  EA+   E  K+AE+A+   + K K    ++ +E K + ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 395  KIKAHEEKLTKQKHEKKEEGA 415
              KA E K   ++ +KK + A
Sbjct: 1482 AKKADEAKKKAEEAKKKADEA 1502



 Score = 33.2 bits (75), Expect = 0.58
 Identities = 24/119 (20%), Positives = 56/119 (47%), Gaps = 21/119 (17%)

Query: 352  EERKRKREAR---EPHKRAEKARSVRKIK----------AKLYNKQRRNEKIQMKKKIKA 398
            +E K+  EA+   +  K+AE+A+   + K          AK   ++++ ++++  +++K 
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559

Query: 399  HEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQ 457
             EEK   ++ +K EE        D+++  R    +   ++ R E+  K      K++ +
Sbjct: 1560 AEEKKKAEEAKKAEE--------DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE 1610



 Score = 32.8 bits (74), Expect = 0.74
 Identities = 40/173 (23%), Positives = 68/173 (39%), Gaps = 37/173 (21%)

Query: 339  ERHRKLYGRRLDYEERKRKREAR--EPHKRAEKARSVRKIK-----AKLYNKQRRNEKI- 390
            E  RK    R   EE ++  EAR  E  K+AE  +   + K     AK   ++R NE+I 
Sbjct: 1197 EDARKAEAAR-KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIR 1255

Query: 391  -----------QMKKKIKAHE----EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNM 435
                       + +  IKA E    ++L K + +KK + A       +  + + K     
Sbjct: 1256 KFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA-------KKAEEKKKADEAK 1308

Query: 436  IKQKRKEKAGKWEVPIPKVRTQADA------EVFKIVRSGKTKRKAWKRMITK 482
             K +  +KA + +    + + +ADA      E  K   + K + +A       
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 32.0 bits (72), Expect = 1.3
 Identities = 21/98 (21%), Positives = 41/98 (41%), Gaps = 2/98 (2%)

Query: 352  EERKRKREAREPHKRAEKARSVRKIK-AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
            E  ++  E R+  +   KA   RK + A+   ++R+ E+ +  +  K  E     ++ +K
Sbjct: 1179 EAARKAEEVRKAEE-LRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKK 1237

Query: 411  KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWE 448
              E A        + + R    + M    R++ A K E
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE 1275



 Score = 31.6 bits (71), Expect = 1.6
 Identities = 32/127 (25%), Positives = 56/127 (44%), Gaps = 9/127 (7%)

Query: 352  EERKRKREA--REPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK-KIKAHEEKLTKQKH 408
            EE K+  EA   E    A++A +  + KA+   K++   K +    K KA E+K   +  
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEE-KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397

Query: 409  EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
            +K EE       L +   ++ K  ++  K+K +EK  K +    K      A+  K  + 
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKK--ADEAKKKAEEKK-KADEAKKKAEEAKKADEAK--KK 1452

Query: 469  GKTKRKA 475
             +  +KA
Sbjct: 1453 AEEAKKA 1459



 Score = 31.6 bits (71), Expect = 1.9
 Identities = 28/127 (22%), Positives = 47/127 (37%), Gaps = 8/127 (6%)

Query: 352  EERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
            E      EA    ++AE   K +   K KA    K+   +K   + K KA E+K    + 
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410

Query: 409  EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
            +K            +  + + K       +K+ E+A K +    K      AE  K  + 
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADE---AKKKAEEAKKADEAKKKAEEAKKAEEAK--KK 1465

Query: 469  GKTKRKA 475
             +  +KA
Sbjct: 1466 AEEAKKA 1472



 Score = 31.3 bits (70), Expect = 2.2
 Identities = 32/143 (22%), Positives = 53/143 (37%), Gaps = 26/143 (18%)

Query: 339  ERHRKLYGRRLDYEERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKK 395
            E  RK    R+ +  R++     E  ++A+   KA   +  KA    K    +K    KK
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK--KADEAKKAEEKKKADEAKK 1309

Query: 396  IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
             KA E K   +  +K EE               AK  ++  K+K +E     E    +  
Sbjct: 1310 -KAEEAKKADEAKKKAEE---------------AKKKADAAKKKAEEAKKAAEAAKAEAE 1353

Query: 456  TQADAEVFKIVRSGKTKRKAWKR 478
              AD        + + K +A ++
Sbjct: 1354 AAADE-----AEAAEEKAEAAEK 1371



 Score = 30.5 bits (68), Expect = 3.9
 Identities = 27/129 (20%), Positives = 57/129 (44%), Gaps = 9/129 (6%)

Query: 352  EERKR----KREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAH-EEKLTKQ 406
            EE+K+    K++A E  K+A++ +     K K    +++ E+ +   + K   EE     
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447

Query: 407  KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIV 466
            + +KK E A       +  +   K      K +  +KA + +    + + +AD    +  
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKAD----EAK 1503

Query: 467  RSGKTKRKA 475
            ++ + K+KA
Sbjct: 1504 KAAEAKKKA 1512



 Score = 29.7 bits (66), Expect = 6.2
 Identities = 31/140 (22%), Positives = 59/140 (42%), Gaps = 7/140 (5%)

Query: 339  ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIK-AKLYNKQRRNEKIQMKKKIK 397
            E  +K    +   EE ++  EA++  + A KA   RK + A+   + R+ E  +  +  +
Sbjct: 1101 EEAKKTETGKA--EEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIAR 1158

Query: 398  AHEEKLTKQKHEKKEEGALPVYLLDRDVQSR-AKVLSNMIKQKRKEKAGKWE--VPIPKV 454
              E+   K +  +K E A       +  + R A+ L      ++ E A K E      + 
Sbjct: 1159 KAEDA-RKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEA 1217

Query: 455  RTQADAEVFKIVRSGKTKRK 474
            R   DA+  + V+  +  +K
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKK 1237



 Score = 29.7 bits (66), Expect = 6.4
 Identities = 20/79 (25%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 339  ERHRKLYGRRLDYEERKRKREAR----EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
            E  +K    +   EE K+  EA+    E  K+A++A+   + K K  ++ ++ E+ +   
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA-DEAKKAEEAKKAD 1525

Query: 395  KIKAHEEKLTKQKHEKKEE 413
            + K  EE     + +K EE
Sbjct: 1526 EAKKAEEAKKADEAKKAEE 1544


>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
          Length = 150

 Score = 35.2 bits (82), Expect = 0.032
 Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 3/53 (5%)

Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
           ERH K   GRR     RK K+ AR P K     R +R ++  L  + R   KI
Sbjct: 65  ERHEKRKKGRRRGPGSRKGKKGARTPKKERWINR-IRALRRYL-RELRDEGKI 115


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 36.3 bits (84), Expect = 0.051
 Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 14/89 (15%)

Query: 343 KLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK-----------IQ 391
           + + +R +  E KRKRE     ++  KAR  R+  +KL   +   E             +
Sbjct: 40  ENWKKRQEEAEAKRKREELR--EKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKK 97

Query: 392 MKKKIKAHE-EKLTKQKHEKKEEGALPVY 419
            KK+ K  E E+      ++KE+     Y
Sbjct: 98  SKKRQKKKEAERKKALLLDEKEKERAAEY 126


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 36.2 bits (82), Expect = 0.056
 Identities = 32/147 (21%), Positives = 70/147 (47%), Gaps = 7/147 (4%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
           ++ +K   +  + E++   ++A E   R  +A + ++   +  +KQ + E+ +++K+ + 
Sbjct: 76  KKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEK 135

Query: 399 HEEKLTKQKHEKKEEGALPVYLLDRDVQSR-----AKVLSNMIKQKRKEKAGKWEVPIPK 453
            E  L K + EKK+E A+     ++  +++     A V   ++ +K          P PK
Sbjct: 136 KEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK 195

Query: 454 VRTQADAEVFKIVR--SGKTKRKAWKR 478
             T   AE  + V+  +GK  +K  K+
Sbjct: 196 TPTNTPAEPAEQVQEITGKKNKKNKKK 222


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 35.9 bits (84), Expect = 0.072
 Identities = 23/101 (22%), Positives = 51/101 (50%), Gaps = 20/101 (19%)

Query: 357 KREAREPHKRAE-KARS-VRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQKH--EK 410
           K+EA    K A  +A+  + K++ +     ++RRNE  +++K++   EE L ++    EK
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEK 107

Query: 411 KEEGALPVYLLDR---------DVQSRAKVLSNMIKQKRKE 442
           +EE      L  +         +++ + + L  +I+++ +E
Sbjct: 108 REE-----ELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 32.4 bits (75), Expect = 0.91
 Identities = 32/150 (21%), Positives = 70/150 (46%), Gaps = 18/150 (12%)

Query: 341 HRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRN----EKIQMKKKI 396
            +K+   ++   E + KR   E  K AE  +    ++AK    + RN    E  + + ++
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84

Query: 397 KAHEEKLTKQKHE---KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
           +  E++L  QK E   +K E    +   + +++ + K L      +++++  K E  + +
Sbjct: 85  QKLEKRL-LQKEENLDRKLEL---LEKREEELEKKEKELE-----QKQQELEKKEEELEE 135

Query: 454 VRTQADAEVFKIVRSGKTKRKAWKRMITKV 483
           +  +   E+ +I  SG T  +A + ++ KV
Sbjct: 136 LIEEQLQELERI--SGLTAEEAKEILLEKV 163


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 35.4 bits (82), Expect = 0.11
 Identities = 28/124 (22%), Positives = 49/124 (39%), Gaps = 12/124 (9%)

Query: 332 SLHNEYIERHRKLYGRRLDYEER-KRKREAREPHKRAEKARS---------VRKIKAKLY 381
            L  EY E   +L   R + EE  K   E R+  K   +             R++     
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413

Query: 382 NKQRRNEKI-QMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKR 440
             QR +E++  +   I   E K+ + + E+KE+ AL +   +  ++  A  LS   ++  
Sbjct: 414 ELQRLSEELADLNAAIAGIEAKINELE-EEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

Query: 441 KEKA 444
             K 
Sbjct: 473 DLKE 476


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.1 bits (81), Expect = 0.12
 Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 20/124 (16%)

Query: 337 YIERHRKLYGR--RLDYEERKRKREAREPHKRAEKARS-VRKIKAKLYNKQRRNEKIQMK 393
           Y +R +KL     RL+ E  + KRE  E  +  EK  S + + + ++ +K R++ +I+ +
Sbjct: 420 YEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRAR 479

Query: 394 -KKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIP 452
            ++I    E+L K+  EKK+     V  L+R            +++ RK +      P+ 
Sbjct: 480 DRRI----ERLEKELEEKKKR----VEELERK--------LAELRKMRKLELSGKGTPVK 523

Query: 453 KVRT 456
            V  
Sbjct: 524 VVEK 527


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 34.5 bits (79), Expect = 0.18
 Identities = 17/63 (26%), Positives = 25/63 (39%), Gaps = 1/63 (1%)

Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
             K+K EA E    AEKA++  + KAK   K     + +   + K    K    K     
Sbjct: 191 AAKKKAEA-EAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAA 249

Query: 413 EGA 415
           + A
Sbjct: 250 KAA 252



 Score = 33.4 bits (76), Expect = 0.36
 Identities = 24/124 (19%), Positives = 56/124 (45%), Gaps = 12/124 (9%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
           ++  R +  +   K+ E+ R  ++ +     K ++  + +  K+++  +E+L  Q+ +K+
Sbjct: 62  QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLE--KERLKAQEQQKQ 119

Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
            E A     L++  Q      +   ++K+ E A        K +  A+A   K   + + 
Sbjct: 120 AEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAA--------KAKAAAEAAKLKA--AAEA 169

Query: 472 KRKA 475
           K+KA
Sbjct: 170 KKKA 173



 Score = 33.0 bits (75), Expect = 0.45
 Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 3/67 (4%)

Query: 352 EERKRKREAREPHKRAEKARSVR---KIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
            E  + + A E  K+AE+A       K KA+    +++ E        KA  E   K K 
Sbjct: 158 AEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKA 217

Query: 409 EKKEEGA 415
           EKK E A
Sbjct: 218 EKKAEAA 224



 Score = 31.8 bits (72), Expect = 1.2
 Identities = 30/132 (22%), Positives = 52/132 (39%), Gaps = 22/132 (16%)

Query: 353 ERKRKRE-------AREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
           E++R ++       A+E  K+AE+A    +++ K   +Q R    + KKK    E    K
Sbjct: 99  EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK---AEAAKAK 155

Query: 406 QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE--VF 463
              E  +  A           + AK  +    +  +E   K E    K + +A+A+    
Sbjct: 156 AAAEAAKLKA----------AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205

Query: 464 KIVRSGKTKRKA 475
           K     + K KA
Sbjct: 206 KAKAEAEAKAKA 217


>gnl|CDD|238268 cd00481, Ribosomal_L19e, Ribosomal protein L19e.  L19e is found in
           the large ribosomal subunit of eukaryotes and archaea.
           L19e is distinct from the ribosomal subunit L19, which
           is found in prokaryotes. It consists of two small
           globular domains connected by an extended segment. It is
           located toward the surface of the large subunit, with
           one exposed end involved in forming the intersubunit
           bridge with the small subunit.  The other exposed end is
           involved in forming the translocon binding site, along
           with L22, L23, L24, L29, and L31e subunits.
          Length = 145

 Score = 33.0 bits (76), Expect = 0.18
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 9/71 (12%)

Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI------Q 391
           +RH     GR      RK  + AR P K     R +R ++  L  K R + KI      +
Sbjct: 62  KRHEARRKGRHRGPGSRKGTKGARMPSKELWIRR-IRALRRLL-KKLRDSGKIDKHTYRE 119

Query: 392 MKKKIKAHEEK 402
           +  K K +  K
Sbjct: 120 LYLKAKGNVFK 130


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 34.2 bits (79), Expect = 0.18
 Identities = 14/87 (16%), Positives = 32/87 (36%), Gaps = 12/87 (13%)

Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA----RSVRKIKAKLYN-KQRRNE 388
            N   E +R     +L     K   E         +     R ++K   +    K++R E
Sbjct: 280 ENLLYENYRT---EKL--SGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLE 334

Query: 389 KI--QMKKKIKAHEEKLTKQKHEKKEE 413
           ++   + ++ K    KL + + + ++ 
Sbjct: 335 ELEQNLIEERKELNSKLEEIQKKLEDL 361


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 32.7 bits (75), Expect = 0.18
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 13/70 (18%)

Query: 354 RKRKREARE----PHKRAE------KARSVRKIKAKLYNKQR-RNEKIQMKKKIKAHEEK 402
           + +K+   E       R E      K +  R+ KA+   K++ R E+I+ +K+I+  EE+
Sbjct: 4   KAKKKGVEEVVFDKEARKEYLTGFHKRKQQRRKKAQEEAKEKEREERIEERKRIR--EER 61

Query: 403 LTKQKHEKKE 412
             + + + KE
Sbjct: 62  KQELEKQLKE 71



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 13/73 (17%)

Query: 348 RLDY----EERK--RKREAREPHKRAEKARSVRKIKAKLYNKQRRNE-KIQMKKKIKAHE 400
           R +Y     +RK  R+++A+E  K  E+     +I+ +   K+ R E K +++K++K  +
Sbjct: 20  RKEYLTGFHKRKQQRRKKAQEEAKEKEREE---RIEER---KRIREERKQELEKQLKERK 73

Query: 401 EKLTKQKHEKKEE 413
           E L   + E  +E
Sbjct: 74  EALKLLEEENDDE 86


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 32.0 bits (73), Expect = 0.23
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
           +E+R  KR  ++  K  EK     K   +    Q   E+   K++ K   EK+  + H K
Sbjct: 30  WEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEE-KERYEKMAAKMHAK 88

Query: 411 KEE 413
           K E
Sbjct: 89  KVE 91


>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal.  L19e
           is found in the large ribosomal subunit of eukaryotes
           and archaea. L19e is distinct from the ribosomal subunit
           L19, which is found in prokaryotes. It consists of two
           small globular domains connected by an extended segment.
           It is located toward the surface of the large subunit,
           with one exposed end involved in forming the
           intersubunit bridge with the small subunit.  The other
           exposed end is involved in forming the translocon
           binding site, along with L22, L23, L24, L29, and L31e
           subunits.
          Length = 145

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 19/53 (35%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
           ERH K   GRR     RK K+ AR P K       +R ++  L  + R   KI
Sbjct: 62  ERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKT-IRALRRYL-KELRDKGKI 112


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 32.3 bits (74), Expect = 0.29
 Identities = 11/62 (17%), Positives = 26/62 (41%), Gaps = 8/62 (12%)

Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
           GR +D EE++ ++E  +  K  ++ +         + K     + + ++K     EK   
Sbjct: 9   GRIIDIEEKREEKEREKEEKERKEEKEKE------WGKG--LVQKEEREKRLEELEKAKN 60

Query: 406 QK 407
           + 
Sbjct: 61  KP 62


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 33.5 bits (77), Expect = 0.32
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 16/77 (20%)

Query: 352 EERKRKREAREPHKR-----AEKARS---VRKIKAKLYNKQRRNEKIQMKKKIKA----- 398
           E RK  ++   P KR     AE  +S   ++K++ KL  +    +  +  +K +      
Sbjct: 88  EARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKKLLAEAAAKKASEEARKQRELKKFG 147

Query: 399 ---HEEKLTKQKHEKKE 412
                EKL ++  EKK+
Sbjct: 148 KQVQVEKLQERAKEKKD 164


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 32.3 bits (73), Expect = 0.35
 Identities = 16/61 (26%), Positives = 31/61 (50%), Gaps = 5/61 (8%)

Query: 374 RKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLS 433
           RK   K   + ++++K   KKK K  E+K   +K E+K  G +P +  +++V     ++ 
Sbjct: 14  RKESGK--TQSQKSDK---KKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIE 68

Query: 434 N 434
            
Sbjct: 69  K 69


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 33.1 bits (76), Expect = 0.36
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 16/73 (21%)

Query: 333 LHNEYIERHRKLYGRRLDYEERK-------RKREAREPHKRAEKARSVRKIKAKLYNKQR 385
           L  E +   R+   ++  +E+R+       ++ E  E  +R EK R       K  +K+R
Sbjct: 149 LEEEELAELRQ---QQRQFEQRRNAELAETQRLEEAERRRREEKERR------KKQDKER 199

Query: 386 RNEKIQMKKKIKA 398
           +  + +  +KI A
Sbjct: 200 KQREKETAEKIAA 212


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 33.4 bits (76), Expect = 0.41
 Identities = 37/161 (22%), Positives = 53/161 (32%), Gaps = 25/161 (15%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEK------------ARSVRKIKAKLYNKQRR 386
           E  RK    +   E  K + E RE  K   K            A +  K KA    KQ+R
Sbjct: 20  EEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKR 79

Query: 387 --------NEKIQMKKK----IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSN 434
                    EK + K K     KA    L KQK  +  E            ++ A   + 
Sbjct: 80  EGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQK-REGTEEVTEEEKAAAKAKAAAAAKAK 138

Query: 435 MIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
                ++++ G  EV   +  T  +    K   + K K  A
Sbjct: 139 AAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAA 179



 Score = 31.5 bits (71), Expect = 1.6
 Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 17/124 (13%)

Query: 354 RKRKREAREPHKRAEKARSVRKIK--AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
            K+KRE  E     EKA +  K    AK        +K +  +++   EE+  K+K + K
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168

Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
              A          +++A  L+   KQK  E     E    + + +A A   K   + K 
Sbjct: 169 AAAA---------AKAKAAALA---KQKAAEAGEGTEEVTEEEKAKAKA---KAAAAAKA 213

Query: 472 KRKA 475
           K  A
Sbjct: 214 KAAA 217


>gnl|CDD|233750 TIGR02151, IPP_isom_2, isopentenyl-diphosphate delta-isomerase,
           type 2.  Isopentenyl-diphosphate delta-isomerase (IPP
           isomerase) interconverts isopentenyl diphosphate and
           dimethylallyl diphosphate. This model represents the
           type 2 enzyme. FMN, NADPH, and Mg2+ are required by this
           form, which lacks homology to the type 1 enzyme
           (TIGR02150). IPP is precursor to many compounds,
           including enzyme cofactors, sterols, and isoprenoids
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Other].
          Length = 333

 Score = 33.0 bits (76), Expect = 0.43
 Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 5/41 (12%)

Query: 503 PMALRFNKAHVTHPELKATFYLPIIGVKK-NPNSPMYTNLG 542
           PM +   +A +  PE   TF +    V++  PN P+  N+G
Sbjct: 85  PMGVGSQRAALKDPETADTFEV----VREEAPNGPLIANIG 121


>gnl|CDD|223596 COG0522, RpsD, Ribosomal protein S4 and related proteins
           [Translation, ribosomal structure and biogenesis].
          Length = 205

 Score = 32.7 bits (75), Expect = 0.43
 Identities = 12/68 (17%), Positives = 25/68 (36%), Gaps = 7/68 (10%)

Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM-------KKKIKA 398
            R    +++  +R    P  +    +  +       + Q R +K+         K+K++A
Sbjct: 2   ARYPGPKKKLSRRLGFNPWLKERLCKERKLPYKPGQHGQERWKKLSDYGLQLREKQKLRA 61

Query: 399 HEEKLTKQ 406
               L KQ
Sbjct: 62  FYGVLEKQ 69


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 33.1 bits (76), Expect = 0.49
 Identities = 26/111 (23%), Positives = 39/111 (35%), Gaps = 14/111 (12%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHE-- 409
            + +   E  E            K K K    QR  EK + + + +A EEK  K+K    
Sbjct: 253 SDDESAWEGFESEYEPINKPVRPKRKTK---AQRNKEKRRKELEREAKEEKQLKKKLAQL 309

Query: 410 ------KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
                  KE          +  Q + +      K KR+ K GK + P P +
Sbjct: 310 ARLKEIAKEVAQKEKARARKKEQRKERGEK--KKLKRR-KLGKHKYPEPPL 357



 Score = 32.0 bits (73), Expect = 0.90
 Identities = 12/75 (16%), Positives = 34/75 (45%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
           E+R+++ E     ++  K +  +  + K   K+   ++    +K +  +E+  K+K +++
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRR 345

Query: 412 EEGALPVYLLDRDVQ 426
           + G         +V 
Sbjct: 346 KLGKHKYPEPPLEVV 360


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 32.7 bits (75), Expect = 0.49
 Identities = 22/114 (19%), Positives = 37/114 (32%), Gaps = 13/114 (11%)

Query: 374 RKIKAKLYNKQRRNEKIQMKKKIKAH-EEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVL 432
              K     KQ +  +I ++K   A   +K  K K E++EE                   
Sbjct: 11  LLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYA---------- 60

Query: 433 SNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFV 486
           S + K          +      RT       K ++  K K+K    M  K+ ++
Sbjct: 61  SIVSKITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKKV--VMSDKINYL 112


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 32.4 bits (74), Expect = 0.55
 Identities = 24/120 (20%), Positives = 44/120 (36%), Gaps = 14/120 (11%)

Query: 358 REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALP 417
           R+ +    +A +A +    + K   ++   +K Q KK        LTK   E        
Sbjct: 33  RDMQSELGKARQALAQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEE-------- 84

Query: 418 VYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKI--VRSGKTKRKA 475
              L R+  +    L    +    +   +    + ++R Q  A   KI  +++ KT  KA
Sbjct: 85  ---LAREALAEIATLEKQAEALETQ-LTQQRSAVEQLRKQLAALETKIQQLKAKKTALKA 140


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 33.2 bits (75), Expect = 0.59
 Identities = 24/102 (23%), Positives = 54/102 (52%), Gaps = 6/102 (5%)

Query: 345 YGRRLDYEERKRKR-EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
           YG R+D +  +R R E    H++A + + + K + +   ++ R E+++ ++  +   E+L
Sbjct: 426 YGGRVDKDHAERARIEKENAHRKALEMKILEKKRIERLEREER-ERLERERMERIERERL 484

Query: 404 TKQKHEKKEEGALPVYLLDRDVQSR-AKVLSNMIKQKRKEKA 444
            +++ E++    L    L+RD   R  +   + +++ R EKA
Sbjct: 485 ERERLERER---LERDRLERDRLDRLERERVDRLERDRLEKA 523


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 32.4 bits (74), Expect = 0.63
 Identities = 12/78 (15%), Positives = 38/78 (48%), Gaps = 2/78 (2%)

Query: 335 NEYIERHRK-LYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
              ++  R     +    ++ K++ E R+  K+    +  ++ +A    ++ + E+ +  
Sbjct: 272 ESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQL 331

Query: 394 KKIKAHEEKLTKQKHEKK 411
            K++  EEK  ++++E++
Sbjct: 332 AKLRK-EEKEREKEYEQE 348



 Score = 32.4 bits (74), Expect = 0.74
 Identities = 26/117 (22%), Positives = 52/117 (44%), Gaps = 22/117 (18%)

Query: 359 EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE----------EKLTKQKH 408
             RE  KR +K+    K++ +L   +++ E+ + KKK KA+           E++ +Q  
Sbjct: 12  TKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVPASVKLPLYEEIMEQNE 71

Query: 409 EKKEEGAL-----------PVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEV-PIPK 453
           E++EE              P    +R+ Q +A +   +     + +  K++  P+PK
Sbjct: 72  ERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPK 128



 Score = 30.5 bits (69), Expect = 2.8
 Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 3/63 (4%)

Query: 354 RKRKREAREPHKRAEKARSVRKIKAKLY---NKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
           R  ++E     ++ +K    RK K K      KQR       +K  +  +E+L K + E+
Sbjct: 279 RTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEE 338

Query: 411 KEE 413
           KE 
Sbjct: 339 KER 341


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.8 bits (75), Expect = 0.63
 Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 7/82 (8%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
            L  E IE   +      + +ER  + +AR     + +  +++  + +L   +R  E+++
Sbjct: 229 ELEQE-IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELE 287

Query: 392 MKKKIKAHEEKLTKQKHEKKEE 413
            K +      +L + + E +E 
Sbjct: 288 EKIE------RLEELEREIEEL 303



 Score = 30.9 bits (70), Expect = 2.9
 Identities = 20/128 (15%), Positives = 49/128 (38%), Gaps = 16/128 (12%)

Query: 329 TAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPH-KRAEKARSVRKIKAKLYNKQRRN 387
              +L  E  E+  KL     + EE K K + ++   +  +    ++++K  L   +   
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL---EELR 579

Query: 388 EKIQMKKKIKAHEEKLTK--------QKHEKKEEGALPVYLLD---RDVQSRAKVLSNMI 436
                K++++   E+L +        ++   + E  L    L     +++   + L +  
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES-E 638

Query: 437 KQKRKEKA 444
            +K   +A
Sbjct: 639 LEKLNLQA 646



 Score = 29.3 bits (66), Expect = 7.5
 Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 18/113 (15%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
            H + +    +L    L+ E  + K EA    +  E  + +R+++ +L       E  + 
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELI---ELLELEEA 526

Query: 393 -KKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
            K++++   EKL     E +E               + K+    +K++ ++  
Sbjct: 527 LKEELEEKLEKLENLLEELEEL--------------KEKLQLQQLKEELRQLE 565


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 31.8 bits (72), Expect = 0.64
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 13/90 (14%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAR-EPHKRAEKARSVRKIKA------KLYNKQRRNE 388
           +  ER R+    R    ER R+R +R     R+ ++RS R+ ++      +  +++R  +
Sbjct: 22  DRRERRRERSRSR----ERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERD 77

Query: 389 KIQMKKKIKAHEEKLTKQKHEKK--EEGAL 416
           K   + K +  ++ + ++  E K  EE  +
Sbjct: 78  KDAREPKKRERQKLIKEEDLEGKSDEEVEM 107


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 32.8 bits (75), Expect = 0.66
 Identities = 29/148 (19%), Positives = 55/148 (37%), Gaps = 14/148 (9%)

Query: 342 RKLYGRRLDYEERKRKREAREPHKRA--EKARSVRKIKAKLYNKQRRNEKIQMKKKIKAH 399
             L  R+   +E +R  +  +       E A    K        Q+R E  + +++++  
Sbjct: 23  VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82

Query: 400 EEKLTKQKHEKKEEGA----LPVYLLDRD--VQSRAKVLSNMIKQKR---KEKAGKWEVP 450
           EE+L +++ +          L   L +R+  + +R   L  + KQ        AG     
Sbjct: 83  EERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQ 142

Query: 451 IPKV---RTQADAEVFKIVRSGKTKRKA 475
             K+      A+ E  K  R  K + +A
Sbjct: 143 ARKLLLKLLDAELEEEKAQRVKKIEEEA 170



 Score = 32.0 bits (73), Expect = 1.3
 Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 4/75 (5%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ- 391
           L N+  ER + L  R L+ EE +++ +              RK+  KL + +   EK Q 
Sbjct: 103 LENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQR 162

Query: 392 ---MKKKIKAHEEKL 403
              ++++     E+ 
Sbjct: 163 VKKIEEEADLEAERK 177



 Score = 30.8 bits (70), Expect = 2.9
 Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 21/130 (16%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNK--QRRNEKIQMK 393
           E + R R    +    E  + +RE     ++ E              K     N+  + +
Sbjct: 59  ELLLRERNQQRQEARREREELQREEERLVQKEE-------QLDARAEKLDNLENQLEERE 111

Query: 394 KKIKAHEEKLTKQKHEKKEE------------GALPVYLLDRDVQSRAKVLSNMIKQKRK 441
           K + A E +L + + +   E              L + LLD +++         I+++  
Sbjct: 112 KALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEAD 171

Query: 442 EKAGKWEVPI 451
            +A +    I
Sbjct: 172 LEAERKAQNI 181


>gnl|CDD|201707 pfam01280, Ribosomal_L19e, Ribosomal protein L19e. 
          Length = 148

 Score = 31.4 bits (72), Expect = 0.66
 Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
           +R  K   GR      RK  + AR P K     R +R ++  L  + R   KI
Sbjct: 64  KRKEKRRKGRHRGPGSRKGTKGARMPKKELWIRR-IRALRRLL-RELRDAGKI 114


>gnl|CDD|153276 cd07592, BAR_Endophilin_A, The Bin/Amphiphysin/Rvs (BAR) domain of
           Endophilin-A.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Endophilins
           are accessory proteins, localized at synapses, which
           interact with the endocytic proteins, dynamin and
           synaptojanin. They are essential for synaptic vesicle
           formation from the plasma membrane. They interact with
           voltage-gated calcium channels, thus linking vesicle
           endocytosis to calcium regulation. They also play roles
           in virus budding, mitochondrial morphology maintenance,
           receptor-mediated endocytosis inhibition, and endosomal
           sorting. Endophilins contain an N-terminal N-BAR domain
           (BAR domain with an additional N-terminal amphipathic
           helix), followed by a variable region containing proline
           clusters, and a C-terminal SH3 domain. They are
           classified into two types, A and B. Vertebrates contain
           three endophilin-A isoforms. Endophilin-A proteins are
           enriched in the brain and play multiple roles in
           receptor-mediated endocytosis. They tubulate membranes
           and regulate calcium influx into neurons to trigger the
           activation of the endocytic machinery. They are also
           involved in the sorting of plasma membrane proteins,
           actin filament assembly, and the uncoating of
           clathrin-coated vesicles for fusion with endosomes. The
           BAR domains of endophilin-A1 and A3 form crescent-shaped
           dimers that can detect membrane curvature and drive
           membrane bending.
          Length = 223

 Score = 31.9 bits (73), Expect = 0.69
 Identities = 14/23 (60%), Positives = 17/23 (73%), Gaps = 1/23 (4%)

Query: 338 IERHR-KLYGRRLDYEERKRKRE 359
           I  HR KL GRRLDY+ +KRK+ 
Sbjct: 130 INHHRKKLEGRRLDYDYKKRKQG 152


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.3 bits (72), Expect = 0.79
 Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 12/74 (16%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKR-----KREAREPHKRAEKARSVRKIKAKLYNKQRR 386
            L+ EY +    L    LD +E K      K+  +   K  +K     + K +L+     
Sbjct: 46  DLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRLF----- 100

Query: 387 NEKIQMKKKIKAHE 400
              +  K  I A E
Sbjct: 101 --VLDFKGDIDASE 112


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 32.0 bits (73), Expect = 0.82
 Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 7/85 (8%)

Query: 330 AMSLHNEYIERHRKLYGRRLDYEERKRKREA-REPHKRAEKARSVRKIKAKLYNKQRRNE 388
           A   H E I+          D E RK   E   E  + ++K   + +    L N+ R  E
Sbjct: 188 AQEYHEEMIK-----LFEEAD-ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELE 241

Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEE 413
           K     + K    K  +++ E KE 
Sbjct: 242 KKIKALRAKEKAAKRREKREELKER 266



 Score = 30.4 bits (69), Expect = 3.2
 Identities = 21/81 (25%), Positives = 41/81 (50%), Gaps = 8/81 (9%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
           LH E++E  +K+    L  E R  + E RE  K+      ++ ++AK    +RR ++ ++
Sbjct: 212 LHEEFVELSKKI--DELHEEFRNLQNELRELEKK------IKALRAKEKAAKRREKREEL 263

Query: 393 KKKIKAHEEKLTKQKHEKKEE 413
           K++ +   EK  + +    EE
Sbjct: 264 KERAEEIYEKFKRGEKLTTEE 284



 Score = 30.0 bits (68), Expect = 4.1
 Identities = 14/103 (13%), Positives = 43/103 (41%), Gaps = 19/103 (18%)

Query: 374 RKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKV 431
           RK   +     K++R+E  +   ++    ++L  +  E +E+         ++++     
Sbjct: 15  RKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKA--------QELREERDE 66

Query: 432 LSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRK 474
           ++  +++ ++++         ++  +   E+ K  R  K KR 
Sbjct: 67  INEEVQELKEKRD--------EINAKLQ-ELRKEYRELKEKRN 100



 Score = 28.5 bits (64), Expect = 9.8
 Identities = 22/115 (19%), Positives = 45/115 (39%), Gaps = 13/115 (11%)

Query: 335 NEYIERHRKLYGRRLDYEERKRK--REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
           +E  E  ++L  +R +   + ++  +E RE  ++  +     +    L  +  R EK Q 
Sbjct: 65  DEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQ 124

Query: 393 KKKI-KAHEEKLTKQKHEKKEEGALPVYLLDRDVQSR----AKVLSNMIKQKRKE 442
              +    E +L ++  E ++E      L D           K L   I + +K+
Sbjct: 125 TSVLTPEEERELVQKIKELRKE------LEDAKKALEENEKLKELKAEIDELKKK 173


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 31.6 bits (72), Expect = 0.89
 Identities = 14/60 (23%), Positives = 25/60 (41%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
              K      E     E+     K K K   K+++ +K + +K ++    K  K+K +KK
Sbjct: 134 TSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.6 bits (72), Expect = 0.91
 Identities = 24/126 (19%), Positives = 42/126 (33%), Gaps = 27/126 (21%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL--- 403
           + L  EER +K++  +     E  +  +K         +       KK  +         
Sbjct: 76  KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLD 135

Query: 404 ----------TKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
                     T Q  E   E      L +R+++ R K+ +   K+K K+K         K
Sbjct: 136 SPRRKSSRSSTVQNKEATHE-----RLKEREIR-RKKIQAKARKRKEKKK--------EK 181

Query: 454 VRTQAD 459
             TQ +
Sbjct: 182 ELTQEE 187


>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae 
           Rho GTPase activating protein Rgd1 and similar proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization.
           Saccharomyces cerevisiae Rgd1 is a GTPase activating
           protein (GAP) with activity towards Rho3p and Rho4p,
           which are involved in bud growth and cytokinesis,
           respectively. At low pH, S. cerevisiae Rgd1 is required
           for cell survival and the activation of the protein
           kinase C pathway, which is important in cell integrity
           and the maintenance of cell shape. It contains an
           N-terminal F-BAR domain and a C-terminal Rho GAP domain.
           The F-BAR domain of S. cerevisiae Rgd1 binds to
           phosphoinositides and plays an important role in the
           localization of the protein to the bud tip/neck during
           the cell cycle. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 234

 Score = 31.5 bits (72), Expect = 0.96
 Identities = 25/108 (23%), Positives = 41/108 (37%), Gaps = 18/108 (16%)

Query: 328 LTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-----RSVRKIKAKLYN 382
           L  M   +E     + +        E+ RK   +E  KRAEK       +  K KA+  +
Sbjct: 88  LNEM--SDELSSLAKTV--------EKSRK-SIKETGKRAEKKVQDAEAAAEKAKARYDS 136

Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQK-HEKKEEGAL-PVYLLDRDVQSR 428
                E+++     K  +  L   K   + E+  L  V   D+D  S+
Sbjct: 137 LADDLERVKTGDPGKKLKFGLKGNKSAAQHEDELLRKVQAADQDYASK 184


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 30.7 bits (70), Expect = 0.97
 Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 7/62 (11%)

Query: 357 KREAREPHKRAEKARSVRK----IKAKL-YNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
           KR  R+  K  +K     K    +K +   NKQ +  K + K+K +  +E+  + K +KK
Sbjct: 68  KRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEK--KKRSKEKKEEEKERKRQLKQQKK 125

Query: 412 EE 413
           + 
Sbjct: 126 KA 127


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 31.8 bits (71), Expect = 1.00
 Identities = 39/153 (25%), Positives = 62/153 (40%), Gaps = 17/153 (11%)

Query: 340 RHRKLYGRRLDYEERKRKREAREPHKR--AEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
           +  K  GR      R+  REAR P K     + R +R++  K   +++ +  I  +  +K
Sbjct: 67  KEAKSMGRHEGAGRREGTREARMPSKELWMRRLRILRRLLRKYREEKKIDRHIYRELYVK 126

Query: 398 AH----------EEKLTKQKHEKKEEGALPVYLLD---RDVQSRAKVLSNMIKQKRK--E 442
           A            E + K K+EKK+E  L   L     +D Q R K     ++++ K  E
Sbjct: 127 AKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREKDRE 186

Query: 443 KAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
           +A + +        Q  A       SGK   KA
Sbjct: 187 RARREDAAAAAAAKQKAAAKKAAAPSGKKSAKA 219


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 31.1 bits (71), Expect = 1.1
 Identities = 40/167 (23%), Positives = 68/167 (40%), Gaps = 41/167 (24%)

Query: 352 EERKRKREAREPHKRAE-----KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLT-- 404
           E+R+RKRE R+  K+ +     K    +K +A+   K   N+  +    I+  E  +   
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEE-VKNEENKSKKKAAPIENAEGNIVFS 67

Query: 405 ------------KQKHEKKEEGALPVY--LLDRDVQSRAKVLSNMIKQKRKE--KAGKWE 448
                         K +KK++     Y  LL + +++R K L  + + K  E  +  KW 
Sbjct: 68  KVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKK-LEARKKKLEELDEDKAAEIEEKEKWT 126

Query: 449 VPIPKVRTQA-------DAEVFK--IVRSGKTKRKA---WKRMITKV 483
               K   +A       D ++ K  + R  K K+K+   WK    KV
Sbjct: 127 ----KALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKV 169


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.1 bits (73), Expect = 1.1
 Identities = 25/136 (18%), Positives = 58/136 (42%), Gaps = 4/136 (2%)

Query: 341 HRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE 400
            ++L     + +ER+R ++     +  E  ++++K    +  ++ + +KI   K+IK+ E
Sbjct: 545 KKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESII-RELKEKKIHKAKEIKSIE 603

Query: 401 EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAG--KWEVPIPKVRTQA 458
           + L K K  K++    P       +  + ++     K K  +  G  KW V +  +R + 
Sbjct: 604 D-LVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKV 662

Query: 459 DAEVFKIVRSGKTKRK 474
                + +      +K
Sbjct: 663 HGSELEKINKAPPPKK 678


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 17/111 (15%), Positives = 42/111 (37%), Gaps = 7/111 (6%)

Query: 338 IERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
           IE  +       +  E   + +  E  +  E+   +    + L  ++R  E     ++++
Sbjct: 269 IEELKS---ELEELREELEELQE-ELLELKEEIEELEGEISLL--RERLEELENELEELE 322

Query: 398 AHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWE 448
              E+L K+K E  +E       L  +++     L    ++  ++ +   E
Sbjct: 323 ERLEEL-KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372


>gnl|CDD|220103 pfam09074, Mer2, Mer2.  Mer2 (Rec107) forms part of a complex that
           is required for meiotic double strand DNA break
           formation. Mer2 increases in abundance and is
           phosphorylated during the prophase phase of cell
           division. Blocking double strand break formation results
           in delayed dephosphorylation and dissociation of Mer2
           from the chromosome.
          Length = 190

 Score = 31.0 bits (70), Expect = 1.1
 Identities = 16/97 (16%), Positives = 38/97 (39%), Gaps = 7/97 (7%)

Query: 391 QMKKKIKAHEEKLTK----QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
              K++     +L +     +    EE    +Y L   + ++ + +   + +K K   G 
Sbjct: 55  NNSKELNKTNYQLNELLNSCEASVNEEIKTLIYQLGAQLNNQLEEVVIALSEKVKTINGN 114

Query: 447 ---WEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMI 480
               +VP  +V      E F I++S + ++    + +
Sbjct: 115 SMMGQVPPSEVERINSDEPFDIIKSFEARQLENNKSV 151


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 32.2 bits (73), Expect = 1.2
 Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 5/111 (4%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQR--RNEKIQMKKKI 396
           +R +    R+        +RE     +   + +++ +++ K  N+      E   +K K+
Sbjct: 85  KRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKL 144

Query: 397 KAHEEKLTKQKHEKKEEGALPVYLLDRDV---QSRAKVLSNMIKQKRKEKA 444
            A   KL  +K ++ +E    +  +  D+   Q RA+     +K    E  
Sbjct: 145 DAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELE 195


>gnl|CDD|225407 COG2851, CitM, H+/citrate symporter [Energy production and
           conversion].
          Length = 433

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 22/81 (27%), Positives = 29/81 (35%), Gaps = 24/81 (29%)

Query: 226 FLPIYLETGDNRLLL-LTPMVS----ELNPWSIHEPSYRVGPMPQTVSNL---------P 271
            LP+Y   G N  LL L   +S     + PW         GP  +  S L         P
Sbjct: 126 LLPLYKRLGMNPYLLALLATLSAGVMNMLPWG--------GPTARAASVLGVDPAELFVP 177

Query: 272 LQPSQSSGTLWLKAMCDALWL 292
           L P Q  G + +  +  A  L
Sbjct: 178 LIPIQIIGLVLV--LALAWLL 196


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 23/88 (26%), Positives = 30/88 (34%), Gaps = 32/88 (36%)

Query: 338 IERHRKLYGRRLDYEER----KRKREAREPHK--------------RAEKAR-------- 371
           IE   K   ++   EER    K  RE RE                   EKAR        
Sbjct: 229 IEGPAKK--KKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMTL 286

Query: 372 ---SVR-KIKAKLYNKQRRNEKIQMKKK 395
               VR K K  L +KQ+      +++K
Sbjct: 287 HKKRVRGKQKRSLRDKQKVLRAHILRQK 314


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 16/58 (27%), Positives = 32/58 (55%)

Query: 338 IERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
            E+ +KL  ++   ++RKR++  +E  KR ++   +   K K   ++RR +K+  K K
Sbjct: 68  FEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTK 125



 Score = 29.9 bits (67), Expect = 2.6
 Identities = 17/85 (20%), Positives = 38/85 (44%), Gaps = 12/85 (14%)

Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
             +  +  RK   ++   E+++  ++ +   +  E A+           +Q+  EKI++ 
Sbjct: 58  QVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAK-----------RQKELEKIELS 106

Query: 394 KKIKAHEEKLTKQKHEKKEEGALPV 418
           KK K  E +  ++K  KK +   P+
Sbjct: 107 KK-KQKERERRRKKLTKKTKSGQPL 130



 Score = 28.7 bits (64), Expect = 5.3
 Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 7/80 (8%)

Query: 340 RHRKLYGRRLDYE-----ERKRKREAREPHKRAEKARSVRKI--KAKLYNKQRRNEKIQM 392
           R +K Y + L+ E     E++   +  +  K   K    +K+  K ++  +++R ++ + 
Sbjct: 33  RLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKE 92

Query: 393 KKKIKAHEEKLTKQKHEKKE 412
             K +   EK+   K ++KE
Sbjct: 93  LAKRQKELEKIELSKKKQKE 112


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 13/105 (12%)

Query: 342 RKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
            +   +R+  E++ R +  +E  K   + + +++   KL   Q + EK +MK + +  E 
Sbjct: 50  LRRLPKRIRAEQKTRLKMFKESLK--IEKKELKQEVEKLPRFQEQ-EKKRMKAEKEEQE- 105

Query: 402 KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
               QKH+K+E        L +  ++  + L  +  +KR E A  
Sbjct: 106 ----QKHQKQERE-----FLAKQEENLEEALQQLQNEKRHELAET 141


>gnl|CDD|219523 pfam07697, 7TMR-HDED, 7TM-HD extracellular.  This entry represents
           the extracellular domain of the 7TM-HD (7TM Receptors
           with HD hydrolase).
          Length = 218

 Score = 31.1 bits (71), Expect = 1.4
 Identities = 19/107 (17%), Positives = 39/107 (36%), Gaps = 17/107 (15%)

Query: 393 KKKIKAHE------EKLTKQKHEKKEEGALPVYLLDRDVQSRAK-----VLSNMIKQKR- 440
              I+A        E+ T++K ++  E   PVY +D +V    +     +   + + K  
Sbjct: 8   PVDIRAPRDIEVVDEEATEEKRKEAAESVPPVYDIDPEVTQNLEEEINSLFDEIREVKAS 67

Query: 441 ---KEKAGKWEVPIPKVRTQADAEVFK-IVRSGKTKRKAWKRMITKV 483
               EKA + +  +     Q   E +  ++ +     K  +  I   
Sbjct: 68  AEKAEKADRLK-SLNLSTFQLSDEQWSTLLNADDEDLKLLEDAIRTA 113


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 31.3 bits (71), Expect = 1.6
 Identities = 20/114 (17%), Positives = 48/114 (42%), Gaps = 15/114 (13%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
            R++ EER RK    +  +  ++A+   ++  K Y K+   ++ Q ++ +K  EE + +Q
Sbjct: 87  ARVEAEER-RKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQ 145

Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADA 460
           +  ++               +  ++L    +   +E   + E    K+  +A  
Sbjct: 146 EAMRR--------------ATEEEILEMRRETIEEEAELERENIRAKIEAEARG 185


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 31.3 bits (71), Expect = 1.6
 Identities = 20/112 (17%), Positives = 44/112 (39%), Gaps = 5/112 (4%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
           N   E  ++   +R  Y   K+K  ++   +   +    +K +  L+N+Q +  ++ ++ 
Sbjct: 42  NTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQ----KKEQKNLFNEQIKQFELALQD 97

Query: 395 KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
           +I   E        + KE   L   L +   +   K L N  +   K++   
Sbjct: 98  EIAKLEALELLNLEKDKELELLEKELDELSKE-LQKQLQNTAEIIEKKRENN 148


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 19/120 (15%), Positives = 36/120 (30%), Gaps = 7/120 (5%)

Query: 324 DALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNK 383
           +  W    +L    IE  R L    L  EE   K++ +       +      + +     
Sbjct: 423 EEGWKYKSTLQFLRIEIDRLL----LLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEET 478

Query: 384 QRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
             R E  +  K   +   KL  +      +       L+    S  ++  N +     E+
Sbjct: 479 SDRVESEKASKLRSSASTKLNLRSSRSHSKFRDH---LNGSNSSTKELSLNQVDLAGSER 535


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 31.8 bits (72), Expect = 1.6
 Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEK--ARSVRKIKAKLYNKQRRNEKIQMKKK 395
           E+ R+L   +L  EE +R+R+A E  +R E+  A    + +AK   ++RR +   + KK
Sbjct: 253 EKRREL--EKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 4/101 (3%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARS-VRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
           +R  Y E++++++A     R  + R  +   +A   + +    K     +    E K  K
Sbjct: 290 QRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKN--EAKARK 347

Query: 406 QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
            K   ++  A    +     Q RA  ++    ++   KA K
Sbjct: 348 -KEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387



 Score = 29.0 bits (65), Expect = 8.0
 Identities = 22/115 (19%), Positives = 39/115 (33%), Gaps = 13/115 (11%)

Query: 364 HKRAEKARSVRKIKAKLYNKQR-RNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLD 422
            +RA+     ++ KA L+  +R R   I    +      +  + K  +  E         
Sbjct: 289 EQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAE--------- 339

Query: 423 RDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWK 477
              ++ AK     I QKR+    +      + R  A A       + K K+K   
Sbjct: 340 ---KNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391


>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease.  This
           model represents one subfamily of endonucleases
           containing the endo/excinuclease amino terminal domain,
           pfam01541 at its amino end. A distinct subfamily
           includes excinuclease abc subunit c (uvrC). Members of
           pfam01541 are often termed GIY-YIG endonucleases after
           conserved motifs near the amino end. This subfamily in
           This model is found in open reading frames of group I
           introns in both phage and mitochondria. The closely
           related endonucleases of phage T4: segA, segB, segC,
           segD and segE, score below the trusted cutoff for the
           family.
          Length = 214

 Score = 30.8 bits (70), Expect = 1.7
 Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 10/85 (11%)

Query: 346 GRRLDY---EERKRKREAREPHKRAEKARSVR---KIKAKLYNKQRRNEKIQMKKKIKAH 399
           G  L Y   EE K K       K+           + KAK+ +K +  E      K   H
Sbjct: 92  GSSLGYKHSEETKAKMSKLFSGKKNNPWYGKTHSEETKAKI-SKNKLGENNPFFGK--KH 148

Query: 400 EEKLTKQKHEKKEEGALPVYLLDRD 424
            E+ TK+K  +KE+GA  V + D++
Sbjct: 149 SEE-TKKKISEKEKGAKKVNVYDKN 172


>gnl|CDD|237791 PRK14701, PRK14701, reverse gyrase; Provisional.
          Length = 1638

 Score = 31.8 bits (72), Expect = 1.7
 Identities = 31/138 (22%), Positives = 47/138 (34%), Gaps = 29/138 (21%)

Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEEGALPVYL--------LDRDVQSRAKVLSNMIKQK- 439
           + ++  +       L       K E  L   +           DV+    +L +    K 
Sbjct: 414 RFRVDLEDPTIYRILGLLSEILKIEEELKEGIPIEGVLDVFPEDVEFLRSILKDEEVIKK 473

Query: 440 --------RKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITK-VCFVGENF 490
                    K++ GK+ + IP VRT   A       SG+T R  +   ITK    +  + 
Sbjct: 474 VAERPFVSLKKEEGKYYIEIPDVRTYIQA-------SGRTSR-LFAGGITKGASVLIVD- 524

Query: 491 TRKPPKFERFIRPMALRF 508
              P  F   IR M  RF
Sbjct: 525 --DPEIFNALIRQMRFRF 540


>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
           [Transcription].
          Length = 217

 Score = 30.9 bits (70), Expect = 1.8
 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 7/69 (10%)

Query: 342 RKLYG-RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE 400
              +G  R D EE   + E  E  K  +    + ++ ++L    R  E++   +      
Sbjct: 88  PNFFGAERFDLEEDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQT----- 142

Query: 401 EKLTKQKHE 409
            +L KQ +E
Sbjct: 143 -ELQKQLNE 150


>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
           bacterial DNA topoisomerase IV (C subunit, ParC),
           bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
           toposiomerases II. DNA topoisomerases are essential
           enzymes that regulate the conformational changes in DNA
           topology by catalysing the concerted breakage and
           rejoining of DNA strands during normal cellular growth.
          Length = 445

 Score = 31.0 bits (71), Expect = 1.9
 Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 26/101 (25%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSV-------RKIKAKLYNKQRRNE 388
           E+++   ++Y RR +YE  K +      H      +++         I++    K+   E
Sbjct: 323 EFLDHRLEVYTRRKEYELGKAEARL---HILEGLLKAILNIDEVINLIRSSDEAKKALIE 379

Query: 389 KIQMKK--KIKAH--------------EEKLTKQKHEKKEE 413
           +++     +I+A                EKL K+  E + E
Sbjct: 380 ELEKLGFSEIQADAILDMRLRRLTKLEREKLLKELKELEAE 420


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 31.0 bits (70), Expect = 2.0
 Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 13/77 (16%)

Query: 340 RHRKLYGRRLDYEERKRKREAREPHKRA---------EKARSVRKIKAKLYNKQRRNEKI 390
             R    ++L  EE+KRK+E  E   RA         E A S+++ K  L  +    +K+
Sbjct: 490 DSRVNAEKQLA-EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEI---KKL 545

Query: 391 QMKKKIKAHEEKLTKQK 407
           +   K+K  E ++ +++
Sbjct: 546 EHDLKLKEEECRMLEKE 562


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 31.4 bits (71), Expect = 2.1
 Identities = 22/90 (24%), Positives = 31/90 (34%), Gaps = 10/90 (11%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
           L  E        Y  +    E K +    E     EK   ++K K  L   +  N+ I  
Sbjct: 462 LGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYL-ESEEINKSINE 520

Query: 393 KKKIKAHEEKLT---------KQKHEKKEE 413
             KI++    L          K KH+K EE
Sbjct: 521 YNKIESARADLEDIKIKINELKDKHDKYEE 550


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
           This is a region of 120 amino acids that is conserved in
           a family of proteins found from plants to fungi. The
           function is not known.
          Length = 114

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 9/75 (12%)

Query: 338 IERHRKL--YGRRLDYEERKRKRE--AREPHK-----RAEKARSVRKIKAKLYNKQRRNE 388
           +E+ R L    + L+  ER++KR+  A++ H      R +  R +++++ KL       E
Sbjct: 24  VEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDEKE 83

Query: 389 KIQMKKKIKAHEEKL 403
              +K+++   E  L
Sbjct: 84  IKDLKEQLSKAEVDL 98



 Score = 28.3 bits (64), Expect = 5.3
 Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 11/72 (15%)

Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM-----------KKKIKAHEE 401
           ER  K++      R EK R++  +K +L + +RR ++ +M           +KK     +
Sbjct: 10  ERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLK 69

Query: 402 KLTKQKHEKKEE 413
           +L K+  E  +E
Sbjct: 70  QLEKKLEETSDE 81


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 23/95 (24%), Positives = 43/95 (45%), Gaps = 12/95 (12%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRA-EKARSVRKIK---AKLYNKQRRN 387
            L  +  ER R+    +++    +   EA+E  + A E A  + +     AK    ++ +
Sbjct: 29  VLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTD 88

Query: 388 E------KIQMK--KKIKAHEEKLTKQKHEKKEEG 414
           E      K+QM+  ++ +  EE+L +   E K EG
Sbjct: 89  EASSIIEKLQMQIEREQEEWEEELERLIEEAKAEG 123


>gnl|CDD|235353 PRK05157, PRK05157, pyrroloquinoline quinone biosynthesis protein
           PqqC; Provisional.
          Length = 246

 Score = 30.6 bits (70), Expect = 2.3
 Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 11/40 (27%)

Query: 303 AMCDALWLKAMCDTLWLTTMCDALWLTAMSLHNEYIERHR 342
              +AL  K   D LW  +M DAL++        Y+  H 
Sbjct: 208 RALEALQFK--LDVLW--SMLDALYM-------AYVLGHI 236


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 29.5 bits (67), Expect = 2.4
 Identities = 15/75 (20%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
           +R  +L  +  + +E   K +       A++ R+ +K + +   ++ + E+I   K++KA
Sbjct: 32  QREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEI---KELKA 88

Query: 399 HEEKLTKQKHEKKEE 413
             E+L  +  + +E+
Sbjct: 89  ELEELKAEIEKLEEK 103



 Score = 28.3 bits (64), Expect = 5.4
 Identities = 15/80 (18%), Positives = 38/80 (47%), Gaps = 10/80 (12%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK--IQMK 393
             +E+ R++   +L  + ++ +       +R E+    R  + +L  K+   ++  I+  
Sbjct: 1   RLLEKRREMEEVQLALDAKREE------FERREELLKQR--EEELEKKEEELQESLIKFD 52

Query: 394 KKIKAHEEKLTKQKHEKKEE 413
           K +K +E K  + + + +EE
Sbjct: 53  KFLKENEAKRRRAEKKAEEE 72


>gnl|CDD|214704 smart00512, Skp1, Found in Skp1 protein family.  Family of Skp1
           (kinetochore protein required for cell cycle
           progression) and elongin C (subunit of RNA polymerase II
           transcription factor SIII) homologues.
          Length = 104

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 4/49 (8%)

Query: 418 VYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIV 466
           V+ ++R+V  ++K +  MI+    +      +P+P V     +++   V
Sbjct: 13  VFEVEREVARQSKTIKAMIEDLGVDDENNNPIPLPNVT----SKILSKV 57


>gnl|CDD|172658 PRK14170, PRK14170, bifunctional 5,10-methylene-tetrahydrofolate
           dehydrogenase/ 5,10-methylene-tetrahydrofolate
           cyclohydrolase; Provisional.
          Length = 284

 Score = 30.4 bits (68), Expect = 2.6
 Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 388 EKIQMKKKIKAHEEKLTKQKHEKKEEGALP---VYLLDRDVQSRAKVLSNMIKQKRKEKA 444
           E I  KK  K  +EK+T++  E  +EG  P   V L+  +  SR  V +   KQKR E+A
Sbjct: 3   EIIDGKKLAKEIQEKVTREVAELVKEGKKPGLAVVLVGDNQASRTYVRN---KQKRTEEA 59

Query: 445 GKWEVPIPKVRTQADAEVFKIV 466
           G   V I       + ++  +V
Sbjct: 60  GMKSVLIELPENVTEEKLLSVV 81


>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
          Length = 175

 Score = 30.0 bits (68), Expect = 2.7
 Identities = 35/122 (28%), Positives = 51/122 (41%), Gaps = 20/122 (16%)

Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHK-------RA-----EKARSVRKIKAKLYN 382
             + E  RK  GR     +R+  REAR P K       R       K R+ +KI   +Y+
Sbjct: 62  RRFHEAKRK--GRHTGIGKRRGTREARMPTKVLWMRRQRVLRRLLRKYRAAKKIDRHMYH 119

Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKE 442
           +     K    K  +   E + K K+EK +E  +   L  R    RAK  +  ++ KRK 
Sbjct: 120 EFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEAR----RAK--AKALRNKRKA 173

Query: 443 KA 444
           K 
Sbjct: 174 KK 175


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 29.5 bits (67), Expect = 2.7
 Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 10/68 (14%)

Query: 354 RKRK----REAREPHKRAEKARSVRKIKAKL--YNKQRRNEKI----QMKKKIKAHEEKL 403
           RKRK    R      +R  K +  +K K     Y  Q R +K     +++KK +  ++++
Sbjct: 60  RKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKRI 119

Query: 404 TKQKHEKK 411
            + K  +K
Sbjct: 120 EQLKAARK 127


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 30.3 bits (68), Expect = 3.0
 Identities = 13/73 (17%), Positives = 29/73 (39%), Gaps = 4/73 (5%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK----IKAHEEK 402
           R+   E  K   E  + +   E A    K   K    ++ NE  +   +     KA  + 
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDD 205

Query: 403 LTKQKHEKKEEGA 415
           +T++ +++++   
Sbjct: 206 VTEEDYDEEDNPV 218


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 7/79 (8%)

Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
           L  +  +   K Y ++L    +K      E     E        +AK   ++ + E+ + 
Sbjct: 362 LSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEA-------EAKAKEEKLKQEENEK 414

Query: 393 KKKIKAHEEKLTKQKHEKK 411
           K+K +A E+K  +QK E+K
Sbjct: 415 KQKEQADEDKEKRQKDERK 433


>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region.  Remorins are
           plant-specific plasma membrane-associated proteins. In
           tobacco remorin co-purifies with lipid rafts. Most
           remorins have a variable, proline-rich C-half and a more
           conserved N-half that is predicted to form coiled coils.
           Consistent with this, circular dichroism studies have
           demonstrated that much of the protein is alpha-helical.
           Remorins exist in plasma membrane preparations as
           oligomeric structures and form filaments in vitro. The
           proteins can bind polyanions including the extracellular
           matrix component oligogalacturonic acid (OGA). In vitro,
           remorin in plasma membrane preparations is
           phosphorylated (principally on threonine residues) in
           the presence of OGA and thus co-purifies with a protein
           kinases(s). The biological functions of remorins are
           unknown but roles as components of the
           membrane/cytoskeleton are possible.
          Length = 112

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 19/112 (16%)

Query: 353 ERKRKREAREPH-KRAEKARSVRKIKAKL------YNKQRRNEKIQMKKKIKAHEEKLTK 405
            +K + E+R    + AEKA+   K + +        NK++   + ++KK     E KL K
Sbjct: 1   SKKSEAESRADAWEEAEKAKINNKYQREEAKIQAWENKKKAKAEAELKKI----ERKLEK 56

Query: 406 QKHEKKEEGALPVYLLDR--DVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
           +K E  E+      L ++      +A+      + KR E+  K E    K+R
Sbjct: 57  KKAEAAEK------LKNKLAAAHKKAEEKRATAEAKRGEEEAKAEEKAAKIR 102


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 10/65 (15%)

Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK--IKAHEEKLTKQKH 408
           Y E K+K++        E A  + K+K +   KQ+   K +  KK   K  ++K  K+  
Sbjct: 57  YTEAKKKKK--------ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108

Query: 409 EKKEE 413
           + +++
Sbjct: 109 KSEKK 113


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 12/70 (17%), Positives = 31/70 (44%), Gaps = 2/70 (2%)

Query: 345 YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRN--EKIQMKKKIKAHEEK 402
           Y       E K K+E  E     +KA  + +   +L  +   +  E+ +++ ++     +
Sbjct: 31  YEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAE 90

Query: 403 LTKQKHEKKE 412
           + K + E+++
Sbjct: 91  VAKLEEEREK 100


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 29.6 bits (67), Expect = 3.3
 Identities = 28/118 (23%), Positives = 42/118 (35%), Gaps = 27/118 (22%)

Query: 352 EERKRKREAREPHKRAEKARSVRK--IKAKLYNKQR------RNEKIQM-----KKKIKA 398
           E  K+  E  E   + EK   + K  I +   N+ R      R E +       K+++  
Sbjct: 31  EAEKKIEEIYE---KKEKQAEMEKQIIISNAKNEARLKVLNAREELLDSVFEEAKERLAN 87

Query: 399 HEE----------KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
             E           L KQ   K  E  + V   + D +     L    K++ KEK GK
Sbjct: 88  LSEDKDEYKDLLKDLIKQALVKLGEPKVIVRSREEDEELVKSALDE-AKEEYKEKTGK 144


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.6 bits (69), Expect = 3.3
 Identities = 24/131 (18%), Positives = 52/131 (39%), Gaps = 6/131 (4%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH--- 408
           E+ K + +  +  K+ +     +  K K   K++R  K + K+K K  EE   +++    
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166

Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
           E+    + P     +   ++ K      ++K+++ A +     P+     + E  K    
Sbjct: 167 ERVRAKSRPKKPPKKKPPNKKKEPPE--EEKQRQAAREAVKGKPE-EPDVNEEREKEEDD 223

Query: 469 GKTKRKAWKRM 479
           GK +      M
Sbjct: 224 GKDRETTTSPM 234



 Score = 29.9 bits (67), Expect = 4.8
 Identities = 22/129 (17%), Positives = 51/129 (39%), Gaps = 10/129 (7%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
            +  +    +E  K  E+ +  +K K +   ++ ++ K + + K K   ++  K+K +K 
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154

Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
           EE        DR+ + +       ++ K + K    + P  K +   + E  +       
Sbjct: 155 EEPR------DREEEKK----RERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204

Query: 472 KRKAWKRMI 480
           K K  +  +
Sbjct: 205 KGKPEEPDV 213


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 17/60 (28%)

Query: 354 RKRKREAREPHKRAEKARSVRKIKAKL--------------YNKQRRNEKIQMKKKIKAH 399
           RKR  + R   KRA+K  + +  + K               Y    R+E I++K++ K  
Sbjct: 6   RKRNEKLRA--KRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDE-IRLKREAKKK 62


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 6/62 (9%)

Query: 353 ERKRKREAREPHKRAE--KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
           E++  RE  +  KR    K R +     +    +R  E+ + +K+ + + EK  K++ ++
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRR----ERTKERAEKEKRTRKNREKKFKRRQKE 127

Query: 411 KE 412
           KE
Sbjct: 128 KE 129


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 29.6 bits (67), Expect = 3.9
 Identities = 14/44 (31%), Positives = 23/44 (52%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
           EE KR  E RE   RA K+RS +    +   + ++ E  +M+ +
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 30.2 bits (69), Expect = 4.1
 Identities = 16/108 (14%), Positives = 43/108 (39%), Gaps = 11/108 (10%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
           E+++   ++Y RR +Y   K + E    H       ++  I   +   +   +  +  K+
Sbjct: 330 EFLDHRLEVYTRRKEYLLGKLEAERL--HILEGLFIALSIIDEIIVLIRSSKDLAKEAKE 387

Query: 396 IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
                 +L++ + +          +LD  ++   K+    ++++ KE 
Sbjct: 388 KLMERFELSEIQAD---------AILDMRLRRLTKLEVEKLEKELKEL 426


>gnl|CDD|214429 MTH00139, COX2, cytochrome c oxidase subunit II; Provisional.
          Length = 226

 Score = 29.7 bits (67), Expect = 4.2
 Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 4/61 (6%)

Query: 141 MCDALWLKAMCDTLWLTTMCDALW--LTAMCLISLHISSLATNMLSKFKEILDTYLTDKV 198
               +  K    +L  +   + +W  L A  L+ L + SL   +L    E+ D YLT K 
Sbjct: 42  SLSLMSNKFTSRSLLESQEVETIWTVLPAFILLFLALPSL--RLLYLMDEVSDPYLTFKA 99

Query: 199 I 199
           +
Sbjct: 100 V 100


>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57.  The
           CLD or centrosome localisation domain of Cep57 is found
           at the N-terminus, and lies approximately between
           residues 58 and 239. This region lies within the first
           alpha-helical coiled-coil segment of Cep57, and
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57.
          Length = 178

 Score = 29.4 bits (66), Expect = 4.2
 Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 5/76 (6%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
           E  E+ R     +    +R+++ +  +   + EK   + K   +L   Q   E      K
Sbjct: 88  ENAEKERTAVLEQQVSLQREKEHDQMDVQAQLEKLEVLEKEYLRLTRTQSLAET-----K 142

Query: 396 IKAHEEKLTKQKHEKK 411
           I+  EEKL +++H++K
Sbjct: 143 IQQLEEKLQEEEHQRK 158


>gnl|CDD|130605 TIGR01542, A118_put_portal, phage portal protein, putative, A118
           family.  This model represents a family of phage minor
           structural proteins. The protein is suggested to be the
           head-tail connector, or portal protein, on the basis of
           its position in the phage gene order, its presence in
           mature phage, its size, and its conservation across a
           number of complete genomes of tailed phage that lack
           other candidate portal proteins. Several other known
           portal protein families lack clear homology to this
           family and to each other [Mobile and extrachromosomal
           element functions, Prophage functions].
          Length = 476

 Score = 30.3 bits (68), Expect = 4.3
 Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 2/91 (2%)

Query: 311 KAMCDTLWLTTMCDALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA 370
           + +  T  L  + D   + A     + I+  ++ Y  + DY E           KR   +
Sbjct: 1   RRVGLTKSLKDIKDHPKINANDEEYKRIDMWKRYY--QGDYAEVHNLNHNGNTVKRRLLS 58

Query: 371 RSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
            ++ K+ AK       NE++++    KA  E
Sbjct: 59  MNLPKVAAKKMASLLFNEQVKINVDDKAANE 89


>gnl|CDD|226108 COG3580, COG3580, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 351

 Score = 29.7 bits (67), Expect = 4.5
 Identities = 16/63 (25%), Positives = 29/63 (46%), Gaps = 14/63 (22%)

Query: 437 KQKRKEKAGKWEVPIPKV----------RTQADAEVFKIVRSGKTKR----KAWKRMITK 482
              +KE+AG+  + IP+V           T      F++V S K+ +    K  + + ++
Sbjct: 12  HSLKKEEAGRGTIGIPRVLNMYEYYPFFHTFFTELGFRVVLSPKSSKELYEKGIETIPSE 71

Query: 483 VCF 485
           VCF
Sbjct: 72  VCF 74


>gnl|CDD|162036 TIGR00784, citMHS, citrate transporter, CitMHS family.  This family
           includes two characterized citrate/proton symporters
           from Bacillus subtilis. CitM transports citrate
           complexed to Mg2+, while the CitH apparently transports
           citrate without Mg2+. The family also includes
           uncharacterized transporters, including a third paralog
           in Bacillus subtilis [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids].
          Length = 431

 Score = 29.8 bits (67), Expect = 4.5
 Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 13/74 (17%)

Query: 227 LPIYLETGDNRLLLLT-PMVS----ELNPWSIHEPSYRVGPMPQTVSNLPLQPSQSSGTL 281
           LP+Y   G N ++L    M+      + PW         GP  + +S L L PS+    L
Sbjct: 126 LPLYKRLGMNPMVLAGIAMLGLGIMNMIPWG--------GPTTRAISVLGLDPSEFFVPL 177

Query: 282 WLKAMCDALWLTAM 295
               +   L + A+
Sbjct: 178 IPTMLGGILCVIAV 191


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 29.1 bits (66), Expect = 4.5
 Identities = 18/71 (25%), Positives = 40/71 (56%), Gaps = 10/71 (14%)

Query: 352 EERKRKREAR--------EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
           E+RK++++AR        E  + AE+A++ +  + +  N+Q R  + + +K I A  ++L
Sbjct: 23  EKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQ-RQAEAE-QKAIVAQIKQL 80

Query: 404 TKQKHEKKEEG 414
            +Q    +++G
Sbjct: 81  IEQNRIDRKDG 91


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.0 bits (68), Expect = 4.9
 Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 18/103 (17%)

Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK----IQMKKKIKAHEEKLTKQ 406
            EE+KR   A    + A K   +   + +    ++++ K    + + ++ +  E +L + 
Sbjct: 637 LEEQKR---AEAEARTALKQARLDLQRLQ---NEQQSLKDKLELAIAERKQQAETQLRQL 690

Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEV 449
             + K         L    Q+  + L +  ++ R E+  KW+V
Sbjct: 691 DAQLK--------QLLEQQQAFLEALKDDFRELRTERLAKWQV 725


>gnl|CDD|201249 pfam00472, RF-1, RF-1 domain.  This domain is found in peptide
           chain release factors such as RF-1 and RF-2, and a
           number of smaller proteins of unknown function. This
           domain contains the peptidyl-tRNA hydrolase activity.
           The domain contains a highly conserved motif GGQ, where
           the glutamine is thought to coordinate the water that
           mediates the hydrolysis.
          Length = 114

 Score = 28.3 bits (64), Expect = 5.0
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 348 RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQR 385
           +L   E ++KRE  +P + ++  R  RK K + YN  +
Sbjct: 70  KLYEAELQKKREKTKPTRASQVRRGDRKEKIRTYNFPQ 107


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 28.5 bits (64), Expect = 5.2
 Identities = 30/138 (21%), Positives = 56/138 (40%), Gaps = 21/138 (15%)

Query: 309 WLKAMCDTLWLTTMCDALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAR-EPHKRA 367
           +  A    L             +S   + ++  +KL   +L   E+K KR+ R E  +  
Sbjct: 2   FADAFAKILASKLPASKRKDPILSRSKKLLKAKKKLKSEKL---EKKAKRQLRAEKRQAL 58

Query: 368 EKARSVRKIKAKLYNKQRRNEKI------QMKKKIKAHEEKLTKQKHEKKEEGALPVYLL 421
           EK R V+ +  +   K+RR  K+      ++   ++A +    K +   KE G       
Sbjct: 59  EKGR-VKPVLPEDLEKERRLRKVAQKGVVKLFNAVRAAQ---KKTEKAVKEAGK------ 108

Query: 422 DRDVQSRAKVLSNMIKQK 439
              V+ R ++L  + K+K
Sbjct: 109 KARVK-REELLKEVSKEK 125


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.1 bits (68), Expect = 5.2
 Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 5/66 (7%)

Query: 351  YEERKRKREAREPHKRAEKARSVRKI---KAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
             E   R +E +EP  + E      K    ++ L N+++  E+   +  IK  + K  KQ 
Sbjct: 1518 IEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNK--KQY 1575

Query: 408  HEKKEE 413
                E 
Sbjct: 1576 KSNTEA 1581


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 29.8 bits (68), Expect = 5.3
 Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 18/96 (18%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAR--EPHKRA-EKARSVRK-IKAKLYNKQRRNE--- 388
           E++E   ++Y RR +Y   K +      E   +A  K   V + I+  +  K+ + E   
Sbjct: 312 EFLEHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEVIRGSIDLKKAKKELIE 371

Query: 389 -----------KIQMKKKIKAHEEKLTKQKHEKKEE 413
                       +++++  K   EKL K+  E ++E
Sbjct: 372 ELSEIQADYLLDMRLRRLTKEEIEKLEKEIEELEKE 407


>gnl|CDD|107304 cd06309, PBP1_YtfQ_like, Periplasmic binding domain of ABC-type
           YtfQ-like transport systems.  Periplasmic binding domain
           of ABC-type YtfQ-like transport systems. The YtfQ
           protein from Escherichia coli is up-regulated under
           glucose-limited conditions and shares homology with a
           family of pentose/hexose sugar-binding proteins of the
           type I periplasmic binding protein superfamily. Members
           of this group are predicted to be involved in the
           transport of sugar-containing molecules across cellular
           and organellar membranes; however their ligand
           specificity is not determined experimentally.
          Length = 273

 Score = 29.5 bits (67), Expect = 5.4
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 9/45 (20%)

Query: 409 EKKEEGALPVYLLDRDVQSRAKVL------SNMIKQKRKEKAGKW 447
           E K  G +PV L+DR V  +   L      S+ +++ R  +A  W
Sbjct: 75  EAKAAG-IPVILVDRGVDVKDDSLYVTFIGSDFVEEGR--RAADW 116


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 29.7 bits (67), Expect = 5.5
 Identities = 25/119 (21%), Positives = 35/119 (29%), Gaps = 22/119 (18%)

Query: 356 RKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGA 415
            K       +   K  +    K K  +K   ++K   KKK K  E+K  K + E K    
Sbjct: 35  SKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG-- 92

Query: 416 LPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRK 474
                                K  +K K  K + P PK     D    KI    +    
Sbjct: 93  --------------------FKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 29.3 bits (66), Expect = 5.5
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 354 RKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
           R  + E        +K +S +K K KLY +    E+ + +KK+K  E++L  Q+   K+
Sbjct: 158 RISQLEKTSLVDEKQKKKSAKK-KRKLYKEL--KERKEREKKLKKVEQRLELQRELMKK 213


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 29.0 bits (66), Expect = 6.1
 Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 12/97 (12%)

Query: 357 KREAREPHKRA---EKARSVRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQ--KHE 409
           K+EA E  K+    E    + K++A+     K+RRNE  + +K++   EE L ++    E
Sbjct: 44  KKEA-EALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLE 102

Query: 410 KKEEGAL----PVYLLDRDVQSRAKVLSNMIKQKRKE 442
           KKEE        +    + ++ + + L  +I+++++E
Sbjct: 103 KKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139


>gnl|CDD|223368 COG0291, RpmI, Ribosomal protein L35 [Translation, ribosomal
           structure and biogenesis].
          Length = 65

 Score = 26.8 bits (60), Expect = 6.2
 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 11/57 (19%)

Query: 452 PKVRT-QADAEVFKIVRSGKTKRK-AWKRMITKVCFVGENFTRKPPKFERFIRPMAL 506
           PK++T +  A+ FKI  +GK KRK A KR I          T+K  K +R +R  A+
Sbjct: 2   PKMKTKKGAAKRFKITGTGKIKRKHAGKRHI---------LTKKSTKRKRHLRKTAV 49


>gnl|CDD|153298 cd07614, BAR_Endophilin_A2, The Bin/Amphiphysin/Rvs (BAR) domain of
           Endophilin-A2.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Endophilins
           are accessory proteins, localized at synapses, which
           interact with the endocytic proteins, dynamin and
           synaptojanin. They are essential for synaptic vesicle
           formation from the plasma membrane. They interact with
           voltage-gated calcium channels, thus linking vesicle
           endocytosis to calcium regulation. They also play roles
           in virus budding, mitochondrial morphology maintenance,
           receptor-mediated endocytosis inhibition, and endosomal
           sorting. Endophilins contain an N-terminal N-BAR domain
           (BAR domain with an additional N-terminal amphipathic
           helix), followed by a variable region containing proline
           clusters, and a C-terminal SH3 domain. They are
           classified into two types, A and B. Endophilin-A
           proteins are enriched in the brain and play multiple
           roles in receptor-mediated endocytosis. Endophilin-A2
           (or endophilin-2) is also referred to as SH3P8 (SH3
           domain containing protein 8) or SH3GL1 (SH3 domain
           containing Grb2-like protein 1). It localizes to
           presynaptic nerve terminals and forms heterodimers with
           endophilin-A1 through their BAR domains. Endophilin-A2
           binds dynamin 1, synaptojanin 1, and the
           beta1-adrenergic receptor cytoplasmic tail through its
           SH3 domain.
          Length = 223

 Score = 29.3 bits (65), Expect = 6.2
 Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 2/34 (5%)

Query: 338 IERH-RKLYGRRLDYEERKRKREAREPHKRAEKA 370
           I+ H +KL GRRLD+ + K+KR+ + P +   +A
Sbjct: 130 IQHHLKKLEGRRLDF-DYKKKRQGKIPDEELRQA 162


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 28.0 bits (62), Expect = 6.3
 Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 6/55 (10%)

Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK-RKE 442
           K Q+ K+ KA + K    K +KK           ++   RA  +   +  K RKE
Sbjct: 4   KSQLSKEKKAEKAKAGTAKDKKKWSKK-----KKKEEARRAVTVDEELLDKIRKE 53


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 28.3 bits (64), Expect = 6.7
 Identities = 14/79 (17%), Positives = 41/79 (51%), Gaps = 14/79 (17%)

Query: 374 RKIKAKL--YNKQRRNEKIQMKKKIKAHEEKLTKQ--------KHEKKEEGALPVYLLDR 423
           +  + +L    K+R+ E  +++K+++  +EKL K         + +K++E    +    +
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKE----LQKKVQ 72

Query: 424 DVQSRAKVLSNMIKQKRKE 442
           + Q + + L   ++++++E
Sbjct: 73  EFQRKQQKLQQDLQKRQQE 91


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 28.2 bits (63), Expect = 7.2
 Identities = 7/31 (22%), Positives = 18/31 (58%)

Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEE 413
           K R+ ++  ++K ++A E+    Q  ++ E+
Sbjct: 102 KARKRQRKVIRKLLEAEEQLREDQYDDEDED 132


>gnl|CDD|129363 TIGR00261, traB, pheromone shutdown-related protein TraB.  traB is
           a plasmid encoded gene that functions in the shutdown of
           the peptide sex pheromone cPD1 which is produced by the
           plasmid free recipient cell prior to conjugative
           transfer in Enterococcus faecalis. Once the recipient
           acquires the plasmid, production of cPD1 is shut down.
           The gene product may play another role in the other
           species in the family [Unknown function, General].
          Length = 380

 Score = 29.0 bits (65), Expect = 7.5
 Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 7/65 (10%)

Query: 380 LYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK 439
           L N Q+   K+  ++ IK   E  T    EK ++  +P+ L+DRD+++  K     I   
Sbjct: 74  LANFQK---KLGEEQGIKPGSEMKTAI--EKAKKHGIPLILIDRDIETTLKRA--WISIT 126

Query: 440 RKEKA 444
             EKA
Sbjct: 127 FFEKA 131


>gnl|CDD|153299 cd07615, BAR_Endophilin_A3, The Bin/Amphiphysin/Rvs (BAR) domain of
           Endophilin-A3.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Endophilins
           are accessory proteins localized at synapses that
           interacts with the endocytic proteins, dynamin and
           synaptojanin. They are essential for synaptic vesicle
           formation from the plasma membrane. They interact with
           voltage-gated calcium channels, thus linking vesicle
           endocytosis to calcium regulation. They also play roles
           in virus budding, mitochondrial morphology maintenance,
           receptor-mediated endocytosis inhibition, and endosomal
           sorting. Endophilins contain an N-terminal N-BAR domain
           (BAR domain with an additional N-terminal amphipathic
           helix), followed by a variable region containing proline
           clusters, and a C-terminal SH3 domain. They are
           classified into two types, A and B. Endophilin-A
           proteins are enriched in the brain and play multiple
           roles in receptor-mediated endocytosis. Endophilin-A3
           (or endophilin-3) is also referred to as SH3P13 (SH3
           domain containing protein 13) or SH3GL3 (SH3 domain
           containing Grb2-like protein 3). It regulates
           Arp2/3-dependent actin filament assembly during
           endocytosis. It binds N-WASP through its SH3 domain and
           enhances the ability of N-WASP to activate the Arp2/3
           complex. Endophilin-A3 co-localizes with the vesicular
           glutamate transporter 1 (VGLUT1), and may play an
           important role in the synaptic release of glutamate.
          Length = 223

 Score = 28.8 bits (64), Expect = 7.6
 Identities = 12/29 (41%), Positives = 20/29 (68%), Gaps = 1/29 (3%)

Query: 342 RKLYGRRLDYEERKRKREAREPHKRAEKA 370
           +KL GRRLD+ + K+KR+ + P +   +A
Sbjct: 135 KKLEGRRLDF-DYKKKRQGKIPDEEIRQA 162


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.8 bits (65), Expect = 8.4
 Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 6/46 (13%)

Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
           EE + K+E +   K+ E+   + K+  +   +QR+ E+ + KK+ +
Sbjct: 282 EEAQEKKEEK---KKEEREAKLAKLSPE---EQRKLEEKERKKQAR 321


>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 435 to 791 amino acids in
           length. This family is related to pfam00589 suggesting
           it may be an integrase enzyme.
          Length = 418

 Score = 29.3 bits (66), Expect = 8.4
 Identities = 11/66 (16%), Positives = 27/66 (40%), Gaps = 1/66 (1%)

Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-RSVRKIKAKLYNKQRRNEKIQMKK 394
           E I +   L    +    +  K +    ++R EK  R VR  + +L  + ++  + +  +
Sbjct: 292 ELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRRELKKKIREEFDE 351

Query: 395 KIKAHE 400
           +    +
Sbjct: 352 EQPVID 357


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.1 bits (63), Expect = 8.7
 Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 5/54 (9%)

Query: 359 EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
           E  +    AE  R   + KAK   ++ R +K  +K K +  E++  K++   K 
Sbjct: 49  EDEDDESTAESKRPEGRKKAK---EKLRRDK--LKAKKEEAEKEKEKEERFMKA 97


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 29.0 bits (66), Expect = 8.7
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 7/69 (10%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
           +L+ +Y E   +L    LD +E K   +A++  ++ E   +  K K +L+        + 
Sbjct: 44  NLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF-------VLD 96

Query: 392 MKKKIKAHE 400
            K  I A E
Sbjct: 97  FKGDIDASE 105


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 28.7 bits (64), Expect = 9.6
 Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 4/62 (6%)

Query: 347 RRLDYEERKRKREAREPHKRAEKARS----VRKIKAKLYNKQRRNEKIQMKKKIKAHEEK 402
            + D EE +++ E R+     EK         K   KL  + +  E    K+ IK  E  
Sbjct: 44  TKQDVEELRKETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKRLETI 103

Query: 403 LT 404
           +T
Sbjct: 104 IT 105


>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
          Length = 175

 Score = 28.1 bits (63), Expect = 9.6
 Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREP------HKRAEKARSVRKIKAKLYNKQR 385
           S+  E +E       RRLD  ER+ +++A +         R E    + +++ KL  + +
Sbjct: 84  SVDFEQLEAQLNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQ-KLEARLK 142

Query: 386 RNEKIQMKKKIKAHEEKLTKQKHEK 410
           + E I +    +   E+  K K  K
Sbjct: 143 KLEPIYITPDTEPTYEREKKPKRRK 167


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 28.8 bits (65), Expect = 9.8
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 364 HKRAEKARSVRKIKAK---LYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEE 413
            K  E+    R +K K   L +K       +++ K  A E++L  +   +KEE
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEE 234


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.133    0.435 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,212,973
Number of extensions: 3196491
Number of successful extensions: 3795
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3497
Number of HSP's successfully gapped: 237
Length of query: 657
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 554
Effective length of database: 6,369,140
Effective search space: 3528503560
Effective search space used: 3528503560
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.5 bits)