RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1625
         (130 letters)



>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 74.3 bits (183), Expect = 5e-18
 Identities = 65/116 (56%), Positives = 82/116 (70%)

Query: 1   MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
            GSSAGAGSGEFHVYR LRR+EY R + + EK +KE  DEEFQQK +EK++  E+KTAKK
Sbjct: 27  QGSSAGAGSGEFHVYRALRRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKK 86

Query: 61  RAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
           RAKR K+KQK K K K  K   KEE+E S+   +S DE  + +ED +E   E+ E+
Sbjct: 87  RAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 35.3 bits (82), Expect = 0.005
 Identities = 14/71 (19%), Positives = 38/71 (53%)

Query: 56  KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
           K  KK  K +++ +K + + KK K +     ++ E++ + E E  +++++ +E   E  +
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 116 RKTEIEQDENS 126
            + E ++ + +
Sbjct: 467 EEEEEKKKKQA 477



 Score = 33.0 bits (76), Expect = 0.034
 Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 33  SEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDD 92
           +E+E+   EF    ++  +  +    K   KR + K++ K K    K + +EEEEE E  
Sbjct: 395 TEEEI---EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451

Query: 93  SKSEDENSDKDEDNKEGNEEVSERKT 118
            + ++E  ++ E+ KE  EE  +++ 
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 31.8 bits (73), Expect = 0.074
 Identities = 18/72 (25%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 59  KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
           KK  K++   +K+  K +K + E K+E+++     K ++E  +++++ KE  +E  E + 
Sbjct: 406 KKATKKI---KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 119 EIEQDENSVQED 130
           E E++E   ++ 
Sbjct: 463 EEEKEEEEEKKK 474



 Score = 26.4 bits (59), Expect = 5.4
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 63  KRLKRKQKLKNKPKKVKTEGKEEEEES-EDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
           + L   +K   K KK+  + +++ EE  ++  K       K+E+ +E  E+  E K E E
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459

Query: 122 QDE 124
           ++ 
Sbjct: 460 EEA 462


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 34.6 bits (80), Expect = 0.009
 Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 2/78 (2%)

Query: 31   EKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESE 90
             K   E      ++K + +++T   K +K R K+    Q  +   +  K   K +    +
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKK--KSDSSSED 1367

Query: 91   DDSKSEDENSDKDEDNKE 108
            DD    D++ D+D+++ E
Sbjct: 1368 DDDSEVDDSEDEDDEDDE 1385



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 18/92 (19%), Positives = 42/92 (45%), Gaps = 1/92 (1%)

Query: 40   EEFQQKIQEKRQTFEDKTAKKRAK-RLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
            EE ++  +++    +   +K + K    RK KLK K KK K    ++ +++     S+  
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201

Query: 99   NSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
            +SD+     +  +      +  +Q+++  Q+ 
Sbjct: 1202 DSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKT 1233



 Score = 26.2 bits (58), Expect = 5.9
 Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 6/86 (6%)

Query: 32   KSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESED 91
              + +  +E     ++E+ +  E + AK++  RLK K K K    +     K+E+++ + 
Sbjct: 1131 LEDLDKFEE----ALEEQEEVEEKEIAKEQ--RLKSKTKGKASKLRKPKLKKKEKKKKKS 1184

Query: 92   DSKSEDENSDKDEDNKEGNEEVSERK 117
             +    + S      +  ++E  +  
Sbjct: 1185 SADKSKKASVVGNSKRVDSDEKRKLD 1210


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 34.1 bits (78), Expect = 0.011
 Identities = 25/101 (24%), Positives = 41/101 (40%), Gaps = 3/101 (2%)

Query: 11  EFHVYRHLRRKEYARQKHIQEKSEKELLDE--EFQQKI-QEKRQTFEDKTAKKRAKRLKR 67
           E H      +K    +K+  +  +KE+  +    +++I +E RQ      A+ RA+R   
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383

Query: 68  KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
           K K K        E    E E    S  + E +   E N+E
Sbjct: 384 KAKKKGLIDASPNEDTPSENEESKGSPPQVEATTTAEPNRE 424



 Score = 27.9 bits (62), Expect = 1.6
 Identities = 17/93 (18%), Positives = 39/93 (41%), Gaps = 15/93 (16%)

Query: 38  LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQK-------------LKNKPKKVKTEGKE 84
           L +EF++   E+R  + +K  +K+A     ++              L  +  ++K     
Sbjct: 279 LTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTA 338

Query: 85  EEEESEDDSKSEDEN--SDKDEDNKEGNEEVSE 115
           E+ E++   K   +   + + E N+E  +E + 
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAA 371


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.0 bits (75), Expect = 0.031
 Identities = 18/77 (23%), Positives = 33/77 (42%)

Query: 34   EKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDS 93
             ++L DE  ++ + E  Q   +++A      L  K+      +    + KE+EEE  DD 
Sbjct: 3904 NEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDV 3963

Query: 94   KSEDENSDKDEDNKEGN 110
              +DE     ++N    
Sbjct: 3964 GIDDEIQPDIQENNSQP 3980



 Score = 30.0 bits (67), Expect = 0.39
 Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 6/92 (6%)

Query: 33   SEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDD 92
             E + LDEE     ++   +  +K   +  +    + + K+       E      ES+  
Sbjct: 3883 EENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN------EQSAANNESDLV 3936

Query: 93   SKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
            SK +D  + +D+D +E  +E          DE
Sbjct: 3937 SKEDDNKALEDKDRQEKEDEEEMSDDVGIDDE 3968



 Score = 28.4 bits (63), Expect = 1.2
 Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 3/42 (7%)

Query: 88   ESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
             ++D   SE+ENSD +E+N++ +EEV++     E   NS+ E
Sbjct: 3867 VTKDSVVSENENSDSEEENQDLDEEVNDIP---EDLSNSLNE 3905



 Score = 27.3 bits (60), Expect = 3.0
 Identities = 19/102 (18%), Positives = 37/102 (36%), Gaps = 6/102 (5%)

Query: 34   EKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEE-----EEE 88
             KE  D E  + +Q++    E+        +      L    +K+  +G EE     EE 
Sbjct: 4019 NKEEADAEKDEPMQDE-DPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEES 4077

Query: 89   SEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
            +ED  KS++E    +    +  +   +   +         E+
Sbjct: 4078 TEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEE 4119


>gnl|CDD|237115 PRK12489, PRK12489, anaerobic C4-dicarboxylate transporter;
           Reviewed.
          Length = 443

 Score = 32.5 bits (75), Expect = 0.036
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 5/48 (10%)

Query: 18  LRR-KEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKR 64
           LRR K+  +    QE+    L D EF++ +     T  DK   K AK 
Sbjct: 193 LRRGKDLDKDPEFQER----LKDPEFREYVYGTTATLLDKKLPKSAKL 236


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 32.2 bits (74), Expect = 0.051
 Identities = 19/117 (16%), Positives = 44/117 (37%), Gaps = 13/117 (11%)

Query: 22  EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
           +Y RQ     K+E   +++E ++K +E +  FE + A+   ++  R+ + K   +    +
Sbjct: 429 QYYRQ----AKAEIRAIEQE-KKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAK 483

Query: 82  GKEE--------EEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
            K+         + +    ++     +    DN         RK +    +   Q  
Sbjct: 484 DKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAA 540


>gnl|CDD|222127 pfam13436, Gly-zipper_OmpA, Glycine-zipper containing OmpA-like
           membrane domain. 
          Length = 116

 Score = 31.1 bits (71), Expect = 0.055
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 2   GSSAGAGSGEFHVYRHLRRKEYARQKHIQEK 32
           G++AGAGSG++  Y   RR + A ++ +  K
Sbjct: 86  GAAAGAGSGQYSSYGAQRRYDNAYRQCMYAK 116


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 32.1 bits (73), Expect = 0.059
 Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 1/77 (1%)

Query: 32  KSEKELLDEEFQQKIQEKRQTFEDKTAKKRA-KRLKRKQKLKNKPKKVKTEGKEEEEESE 90
           + E+ LL E       +K + +EDK    R         +    P+    E  EE E   
Sbjct: 530 RDEQILLIESTYPIKSKKIKYYEDKFFTSRLLPPTFVPTQEPYDPRANNNEASEETETIT 589

Query: 91  DDSKSEDENSDKDEDNK 107
               +E++  D DED+ 
Sbjct: 590 VPENNEEDEEDDDEDDD 606


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 31.7 bits (72), Expect = 0.061
 Identities = 19/73 (26%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 56  KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSE-DENSDKDEDNKEGNEEVS 114
           KT+KK+    ++K+  K   KK K + +  E  +E+ S SE +E +DK  D +    + +
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNT 108

Query: 115 ERKTEIEQDENSV 127
             +   E   +  
Sbjct: 109 AAEVNHEDVIDLS 121


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 0.068
 Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 1/94 (1%)

Query: 22  EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
              + +H  E    E  + E Q + Q      +D+T ++     + + + K   K V   
Sbjct: 797 AEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNA-ENQGEAKQDEKGVDGG 855

Query: 82  GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
           G  +  +SE++ + E+E  +++E+ +E  EE  E
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 30.0 bits (67), Expect = 0.32
 Identities = 14/49 (28%), Positives = 29/49 (59%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           E K++E+  +    S+  +S+++E+ +E  EE  E + E E++E   +E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 28.0 bits (62), Expect = 1.4
 Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 2/91 (2%)

Query: 40  EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDEN 99
           E+  ++   K     D +    A+     ++   + ++      E  EES  +++ E E 
Sbjct: 614 EQLSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGET 673

Query: 100 SDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
             K E+  EG    +ERK E E  E  ++  
Sbjct: 674 ETKGENESEGEIP-AERKGEQE-GEGEIEAK 702



 Score = 26.5 bits (58), Expect = 5.8
 Identities = 13/50 (26%), Positives = 21/50 (42%)

Query: 79  KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
             +  E E + E++S+ E     K E   EG  E  E   + E +   V+
Sbjct: 667 AEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716



 Score = 26.5 bits (58), Expect = 5.8
 Identities = 31/132 (23%), Positives = 50/132 (37%), Gaps = 5/132 (3%)

Query: 2   GSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDE---EFQQKIQEKRQTFEDKTA 58
           G   G G  E     H    E    +H  E   +   DE   E  ++ +E     E +  
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750

Query: 59  KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
            K     +  +K      + + EGKE+E+E E  +  + E   K ++  EG  E      
Sbjct: 751 GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEM--KGDEGAEGKVEHEGETE 808

Query: 119 EIEQDENSVQED 130
             E+DE+  Q +
Sbjct: 809 AGEKDEHEGQSE 820



 Score = 25.7 bits (56), Expect = 8.9
 Identities = 21/99 (21%), Positives = 44/99 (44%), Gaps = 13/99 (13%)

Query: 39  DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE-GKEEEEESEDDSK--- 94
           DE  + K++ + +T   +  +   +   +    + K +  + E   E + E++ D K   
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVD 853

Query: 95  ---------SEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
                    SE+E  +++E+ +E  EE  E + E E +E
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 31.9 bits (73), Expect = 0.072
 Identities = 15/69 (21%), Positives = 29/69 (42%)

Query: 20  RKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVK 79
                     +   E+  L ++F +KI+EK +  E+       +R +   KL+   KK++
Sbjct: 300 EPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLE 359

Query: 80  TEGKEEEEE 88
              K  E+ 
Sbjct: 360 DLEKRLEKL 368


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.6 bits (71), Expect = 0.094
 Identities = 23/91 (25%), Positives = 49/91 (53%), Gaps = 5/91 (5%)

Query: 44  QKIQEKRQTFEDKTAKKRAKRLKRKQ-KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDK 102
           ++ +EK  +  ++  +  AK+ K ++    N P + + EG+ E E S+  S +E+ +SD 
Sbjct: 81  KRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDP 140

Query: 103 ---DEDNKEGNEEV-SERKTEIEQDENSVQE 129
              D+DN+  +  + S +  E + D ++ Q+
Sbjct: 141 KDIDQDNRSSSPSIPSPQDNESDSDSSAQQQ 171


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 31.3 bits (72), Expect = 0.10
 Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 3/63 (4%)

Query: 24 ARQKHIQEKSEKEL--LDEEFQQKIQEKRQTFEDKTA-KKRAKRLKRKQKLKNKPKKVKT 80
            Q+   +K E E+  L+E++++  +E +    DK   K   K  K+K+K + K  K K+
Sbjct: 29 LAQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88

Query: 81 EGK 83
          + +
Sbjct: 89 KPR 91



 Score = 28.3 bits (64), Expect = 1.3
 Identities = 14/78 (17%), Positives = 32/78 (41%), Gaps = 14/78 (17%)

Query: 1  MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
          +     +  GE  V  +L      + K ++E+ +  LLD++ + K   K        A+K
Sbjct: 29 LAQRKKSQKGELEV-TNLNE----QYKEMKEELKAALLDKK-ELKAWHK--------AQK 74

Query: 61 RAKRLKRKQKLKNKPKKV 78
          + ++ + K        ++
Sbjct: 75 KKEKQEAKAAKAKSKPRL 92


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 31.6 bits (72), Expect = 0.10
 Identities = 19/101 (18%), Positives = 48/101 (47%)

Query: 30  QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
           QE  ++ +  EE + K    ++       ++     + ++ LK + K   +E +E EEE 
Sbjct: 805 QELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEI 864

Query: 90  EDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           ++ +++++E  +   D K+ N+ +  +   I+   +S +  
Sbjct: 865 QNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIK 905


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.1 bits (71), Expect = 0.13
 Identities = 20/84 (23%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 5   AGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKR 64
           A +G  +  V +   +     ++ ++E+ E+EL + E +++I E  +    K  +K+AK+
Sbjct: 46  AESGERKSAVDKLAMKPLEEIEEELREEYEEELKEYEAEKEIWEAEK----KGLEKKAKK 101

Query: 65  LKRKQKLKNKPKKVKTEGKEEEEE 88
             +K K +    +   E + EE E
Sbjct: 102 AIKKGKDEEALAEELLELEAEEPE 125


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.0 bits (70), Expect = 0.15
 Identities = 19/76 (25%), Positives = 37/76 (48%)

Query: 30  QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
            E  +K L +EE ++K +++ +  E +  K +A + + K KL+ +     T   ++ E+ 
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 90  EDDSKSEDENSDKDED 105
                 EDEN +   D
Sbjct: 66  SRKRDVEDENPEDFID 81


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 31.0 bits (70), Expect = 0.15
 Identities = 16/62 (25%), Positives = 25/62 (40%)

Query: 43   QQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDK 102
             ++ +EK Q    +     AK++ R+   K  PKK   +  E E   E    S  E  + 
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENV 1234

Query: 103  DE 104
             E
Sbjct: 1235 AE 1236


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 30.2 bits (69), Expect = 0.16
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 25 RQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
            K ++E  E  LLD++ + K  EK +    K  K +AK  K+K K +    ++
Sbjct: 50 EYKDLKESLEAALLDKK-ELKAWEKAEK---KAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 29.0 bits (66), Expect = 0.41
 Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 24 ARQKHIQEKSEKEL--LDEEFQQKIQEKRQTFEDK-TAKKRAKRLKRKQKLKNKPKKVKT 80
            QK   +K E E+  L+EE++   +       DK   K   K  K+ +K K K +K K 
Sbjct: 31 LAQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKK- 89

Query: 81 EGKEEEEES 89
            K+EE + 
Sbjct: 90 -AKKEEPKP 97


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 30.7 bits (69), Expect = 0.17
 Identities = 11/53 (20%), Positives = 26/53 (49%)

Query: 73  NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDEN 125
           ++P + + E + E E    +  +  E+   D +++ G EE+ +   +   DE+
Sbjct: 218 DQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDES 270



 Score = 28.0 bits (62), Expect = 1.7
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           E  EE  DD++SEDE  D D+D    NE+  + + E E  E S  ++
Sbjct: 198 ELAEEMGDDTESEDE-EDGDDDQPTENEQEEQGEGEGEGQEGSAPQE 243



 Score = 27.6 bits (61), Expect = 2.0
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
           E+EE+ +DD  +E+E  ++ E   EG E  + +++E    E
Sbjct: 210 EDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRE 250


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 30.6 bits (69), Expect = 0.20
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 82  GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           G +  EE  D+   +D+  D +ED++ G EE SE   E E+DE    + 
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREE-SEGSDESEEDEAEATDG 270



 Score = 29.0 bits (65), Expect = 0.63
 Identities = 14/52 (26%), Positives = 22/52 (42%)

Query: 73  NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
           ++P   +      EE    D   EDE    D + +EG  + +E   + E DE
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE 290



 Score = 28.3 bits (63), Expect = 1.1
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
            +EE+ +DD    +E+S+   +  EG++E  E + E    E
Sbjct: 231 ADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGE 271


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.5 bits (70), Expect = 0.23
 Identities = 17/103 (16%), Positives = 52/103 (50%), Gaps = 1/103 (0%)

Query: 27  KHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
           K I E+++KE  +   ++ + E ++       +   +  +R+ +L+   K++  + +  +
Sbjct: 41  KRILEEAKKEA-EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99

Query: 87  EESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
            + E   K E+E   K+++ ++  +E+ +++ E+E+      +
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.1 bits (68), Expect = 0.24
 Identities = 13/71 (18%), Positives = 27/71 (38%)

Query: 49  KRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
           KRQ F       RA   ++ +  K+K           ++++ +  K   +    +   K 
Sbjct: 2   KRQDFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61

Query: 109 GNEEVSERKTE 119
            ++E  E K +
Sbjct: 62  HSQEKKEEKKK 72



 Score = 26.3 bits (58), Expect = 5.0
 Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 5/86 (5%)

Query: 12 FHVYRHLRRKEYARQKHIQ-----EKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLK 66
          F   R +R  E  + K  +     E   K    ++  +K +   +    +  KK   + K
Sbjct: 7  FRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEK 66

Query: 67 RKQKLKNKPKKVKTEGKEEEEESEDD 92
          +++K K K KKV  +    +   +D+
Sbjct: 67 KEEKKKPKKKKVPLQVNPAQLFVDDE 92


>gnl|CDD|217952 pfam04189, Gcd10p, Gcd10p family.  eIF-3 is a multi-subunit complex
           that stimulates translation initiation in vitro at
           several different steps. This family corresponds to the
           gamma subunit if eIF3. The Yeast protein Gcd10p has also
           been shown to be part of a complex with the
           methyltransferase Gcd14p that is involved in modifying
           tRNA.
          Length = 299

 Score = 30.0 bits (68), Expect = 0.29
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 40  EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQK 70
           EE  +K+ E  +TF  KTA  + K LKRK+K
Sbjct: 124 EEIIEKLVENSETFHTKTAFSQEKYLKRKKK 154


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 30.1 bits (68), Expect = 0.33
 Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 63  KRLKRKQKLKNKPKKVKTEG----KEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
           K +   ++ KN    V   G    KEE++ ++  ++ ED +    E +KE  E++ + K 
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKE 273

Query: 119 EIE 121
           + +
Sbjct: 274 KAK 276


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 29.7 bits (67), Expect = 0.41
 Identities = 12/109 (11%), Positives = 49/109 (44%), Gaps = 3/109 (2%)

Query: 14  VYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKN 73
           + +  R  + A +   + + E E   E    + +E      ++  +++  R   +++L+ 
Sbjct: 25  LKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKE--LLLRERNQQRQEARR-EREELQR 81

Query: 74  KPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
           + +++  + ++ +  +E     E++  ++++       E+ E + +++ 
Sbjct: 82  EEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130



 Score = 25.8 bits (57), Expect = 8.3
 Identities = 18/117 (15%), Positives = 43/117 (36%), Gaps = 3/117 (2%)

Query: 10  GEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRK- 68
               + +   R     QK  +EK E  LL+ +     +  +Q  E +  ++  +R + + 
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86

Query: 69  -QKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
            QK +    + +     E +  E +        + +E  K+  +    R   +  ++
Sbjct: 87  VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ-LDNELYRVAGLTPEQ 142


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.8 bits (67), Expect = 0.42
 Identities = 22/81 (27%), Positives = 46/81 (56%), Gaps = 1/81 (1%)

Query: 45  KIQEKRQTFEDKTAKKRAKRLKR-KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKD 103
           K+ E+ ++  D +  KR + LK   +KL++  KK+     E +   E ++K+++E   ++
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410

Query: 104 EDNKEGNEEVSERKTEIEQDE 124
           E+ K+  E+  E K + ++DE
Sbjct: 411 ENEKKQKEQADEDKEKRQKDE 431



 Score = 26.7 bits (59), Expect = 4.1
 Identities = 21/91 (23%), Positives = 49/91 (53%), Gaps = 6/91 (6%)

Query: 23  YARQKHIQE-KSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
            A  K  +E KS  +L  ++ Q+ ++E  +  +D T     K+L   +   +  ++ + +
Sbjct: 347 LALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYT-----KKLGEVKDETDASEEAEAK 401

Query: 82  GKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
            KEE+ + E++ K + E +D+D++ ++ +E 
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDER 432


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 29.6 bits (67), Expect = 0.42
 Identities = 9/46 (19%), Positives = 24/46 (52%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           +EE + ED+  S+ E+ + D+D+ + ++  +  ++   +      E
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWE 175



 Score = 29.6 bits (67), Expect = 0.45
 Identities = 14/40 (35%), Positives = 20/40 (50%)

Query: 80  TEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE 119
            E  EE+EES      ED++ D D+D+    E   ER+  
Sbjct: 131 EESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRR 170



 Score = 28.4 bits (64), Expect = 1.2
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE 119
           +  EE +E +++S   +++ D D+D+ + +    ER  E
Sbjct: 128 DDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166



 Score = 28.0 bits (63), Expect = 1.7
 Identities = 10/42 (23%), Positives = 27/42 (64%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
           +  ++EE  E+D +S     D+D+D+ + +++++ R+  +E+
Sbjct: 126 DDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER 167


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 29.7 bits (67), Expect = 0.48
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 10/53 (18%)

Query: 82  GKEEEEESEDDSKSEDENSDKDEDNKEGNE---EVSERKTE-------IEQDE 124
           GK EE   E+  + +D+ S K   N   N    + S +K         I+Q+E
Sbjct: 841 GKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893



 Score = 27.4 bits (61), Expect = 2.9
 Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 3/65 (4%)

Query: 26  QKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEG-KE 84
           QK ++   + E   EE     +E  QT     A   A  LK+    K K K+ K+   K+
Sbjct: 834 QKLMENAGKMEEYVEE--NGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQ 891

Query: 85  EEEES 89
           EE E+
Sbjct: 892 EERET 896


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 29.4 bits (66), Expect = 0.48
 Identities = 14/68 (20%), Positives = 28/68 (41%)

Query: 57  TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
            A +  K   RKQK +NK  + +   K+ ++E + D          ++  KE  E     
Sbjct: 12  PAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAI 71

Query: 117 KTEIEQDE 124
           +  +  ++
Sbjct: 72  EEHLSIEQ 79


>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
           Glycoprotein E (gE) of Alphaherpesvirus forms a complex
           with glycoprotein I (gI) (pfam01688), functioning as an
           immunoglobulin G (IgG) Fc binding protein. gE is
           involved in virus spread but is not essential for
           propagation.
          Length = 437

 Score = 29.3 bits (66), Expect = 0.60
 Identities = 12/56 (21%), Positives = 20/56 (35%)

Query: 57  TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
             ++R ++          P        E  E+   DS+S+DE  D D D +     
Sbjct: 375 VTRRRRRKPYAVILNPFSPVYTSLPTNELFEDVSSDSESDDEEFDSDSDREPSGPS 430


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 29.2 bits (65), Expect = 0.60
 Identities = 12/52 (23%), Positives = 30/52 (57%)

Query: 64  RLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
            +K++  ++   K+ + E  EEE++ E+D   +D+  + D+D+ +  ++  E
Sbjct: 31  TMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDE 82



 Score = 27.3 bits (60), Expect = 2.4
 Identities = 14/61 (22%), Positives = 34/61 (55%)

Query: 70  KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           K +N  +K+  E +EE  E EDD + +D++ D ++++ + +++  + + E + D     +
Sbjct: 33  KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDD 92

Query: 130 D 130
            
Sbjct: 93  S 93



 Score = 26.5 bits (58), Expect = 5.7
 Identities = 17/70 (24%), Positives = 35/70 (50%)

Query: 57  TAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSER 116
           T     K+    +KL  + ++   E ++++EE +DD   EDE+ D D+D+++  +E  + 
Sbjct: 27  TLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDD 86

Query: 117 KTEIEQDENS 126
            T  +     
Sbjct: 87  STLHDDSSAD 96


>gnl|CDD|222025 pfam13286, HD_assoc, Phosphohydrolase-associated domain.  This
          domain is found on bacterial and archaeal
          metal-dependent phosphohydrolases.
          Length = 91

 Score = 27.8 bits (63), Expect = 0.62
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 12/57 (21%)

Query: 13 HVYRHLRRK-EYARQKHIQEK------SEKELLDEEFQQKIQEKRQTFEDKTAKKRA 62
          +VYRH   + E  + + I  +      ++ ELL  EF+ + +       D  A+ R 
Sbjct: 21 YVYRHPEVQREEYKARRIVRELFEALMADPELLPPEFRARWEA-----ADDDARARV 72


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 28.8 bits (65), Expect = 0.73
 Identities = 11/73 (15%), Positives = 33/73 (45%), Gaps = 14/73 (19%)

Query: 38  LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKP--------------KKVKTEGK 83
           L++E ++ +Q++     D+  +K  K  K+ ++ +++P                V    +
Sbjct: 465 LEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDPIE 524

Query: 84  EEEEESEDDSKSE 96
           +   E+E  ++++
Sbjct: 525 KTSLETEVSNEAK 537


>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
          Length = 211

 Score = 28.7 bits (64), Expect = 0.79
 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 2/60 (3%)

Query: 49  KRQTFEDKTAKKRAKRLKRKQKLKN--KPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
           K+   E     K    L ++ KLK   KP+K+   G   E   E   + E  N  KD D 
Sbjct: 38  KKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDK 97


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.5 bits (64), Expect = 0.87
 Identities = 18/86 (20%), Positives = 39/86 (45%), Gaps = 8/86 (9%)

Query: 31  EKSEKELLDEEFQ------QKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKE 84
           +KS+ + +   +       ++I       E+K A    K+ K++ K     KK  +E K+
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK--SEKKK 74

Query: 85  EEEESEDDSKSEDENSDKDEDNKEGN 110
           ++++ + + KSE E     +  K+  
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSK 100



 Score = 28.1 bits (63), Expect = 1.5
 Identities = 15/79 (18%), Positives = 37/79 (46%), Gaps = 2/79 (2%)

Query: 53  FEDKTAKKRAKRL--KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGN 110
           F   T +K   +        +    K++ +   EEE +    S  +D+  DK+ ++K+ +
Sbjct: 11  FFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKS 70

Query: 111 EEVSERKTEIEQDENSVQE 129
           E+  ++K E ++ ++  + 
Sbjct: 71  EKKKKKKKEKKEPKSEGET 89


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.2 bits (63), Expect = 0.92
 Identities = 19/75 (25%), Positives = 31/75 (41%)

Query: 56  KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSE 115
           K  KK+     RK     K KK   E   EEE+  +    + E ++ D DN E + +  E
Sbjct: 49  KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108

Query: 116 RKTEIEQDENSVQED 130
             +     +   ++D
Sbjct: 109 SASANSLSDIDNEDD 123


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 28.7 bits (64), Expect = 0.93
 Identities = 15/50 (30%), Positives = 21/50 (42%), Gaps = 2/50 (4%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
             ++ E ESE D   ED   D+ ED+ E NE V      + +    V   
Sbjct: 667 AIEDSESESESDG--EDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNL 714



 Score = 27.9 bits (62), Expect = 1.8
 Identities = 13/59 (22%), Positives = 31/59 (52%)

Query: 68  KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENS 126
           K        +   EG+EE+E+  +++  +++  +  ED++  +E   E   E EQ++++
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691



 Score = 27.9 bits (62), Expect = 1.8
 Identities = 12/53 (22%), Positives = 27/53 (50%)

Query: 77  KVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
             +T+  E+E E+ +DS+SE E+  +D +  E  ++    +  +  D+   + 
Sbjct: 655 LEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 0.97
 Identities = 29/115 (25%), Positives = 54/115 (46%), Gaps = 13/115 (11%)

Query: 19  RRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRK--------QK 70
           R  + A +K + E SE     EE  + +  +    + ++ KK  +R +RK        ++
Sbjct: 75  RSVKRATKKTVVEISEPL---EEGSELVVNEDAALDKES-KKTPRRTRRKAAAASSDVEE 130

Query: 71  LKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDE-DNKEGNEEVSERKTEIEQDE 124
            K + K  K    ++ +E  +D  SE E SD +E +     E  SE + ++E+D+
Sbjct: 131 EKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185


>gnl|CDD|222988 PHA03105, PHA03105, EEV glycoprotein; Provisional.
          Length = 188

 Score = 28.1 bits (62), Expect = 1.0
 Identities = 15/49 (30%), Positives = 32/49 (65%)

Query: 58  AKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
           A  R+ + ++  K+KN+ + ++ E K+ + E E+DS + +EN + D++N
Sbjct: 82  ANFRSSKKEKSFKIKNEKENIEVEIKDCDNEFEEDSNNNEENLNWDDNN 130


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 28.4 bits (63), Expect = 1.1
 Identities = 10/49 (20%), Positives = 22/49 (44%)

Query: 73  NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
             P +V   G  +   SED+   ++  S+  +D ++ ++   E K   +
Sbjct: 762 EGPSRVPERGNHDLLHSEDEMADDEAESENMDDYEDSDDNAYESKDHAD 810


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 28.5 bits (63), Expect = 1.4
 Identities = 11/47 (23%), Positives = 24/47 (51%)

Query: 75  PKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIE 121
            ++      EE+    D+   +D++ D+DED+ +  ++  E + E E
Sbjct: 133 KRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 28.3 bits (63), Expect = 1.4
 Identities = 15/54 (27%), Positives = 34/54 (62%), Gaps = 1/54 (1%)

Query: 76  KKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           K++K + +E++EE  ++ K+E+ + + +E+ K    E+ E K  IE ++  ++E
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYE-KVNIEANKKFIKE 329


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 28.0 bits (62), Expect = 1.4
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 76  KKVKTEGKEEEEESEDDSKSEDENSDKDED 105
           K + T  +++EEE + D K ED+   ++ +
Sbjct: 7   KLIGTSEEDDEEEMDMDVKEEDDGDRRNRE 36


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 4/87 (4%)

Query: 30  QEKSEKELLDEEFQQKIQEKRQTFED---KTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
           +        D   +Q+I+           K  KK+AK  KR+   +        EG   +
Sbjct: 198 EALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSD 257

Query: 87  EE-SEDDSKSEDENSDKDEDNKEGNEE 112
           ++  ED  +S+ ++SD D  +++G + 
Sbjct: 258 DDDDEDAIESDLDDSDDDVSDEDGEDL 284


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 20/108 (18%), Positives = 49/108 (45%), Gaps = 9/108 (8%)

Query: 26  QKHIQEKSEKELLDEEFQQKIQEKRQTFE--DKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
           Q  +   ++++LL+ E +   Q+  QT    DK  +++ +  + KQ+L   P K++    
Sbjct: 42  QAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLR---- 97

Query: 84  EEEEESEDDSKSEDENSDKDED-NKEGNEEVSERKTEIEQDENSVQED 130
             + ++E ++  +D + +  E  +     ++  R  +      + Q D
Sbjct: 98  --QAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQND 143


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 28.3 bits (63), Expect = 1.5
 Identities = 20/60 (33%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 18  LRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKK 77
           L  K    +K  +E++E+E   EE Q++ +E++   E   A+ +A+  KR++KL+N  KK
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEE-QRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 28.1 bits (62), Expect = 1.6
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           E  E EEE  +   S D+ SD+ ++++E +E   +   +  ++++S +ED
Sbjct: 930 ESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEED 979


>gnl|CDD|162284 TIGR01282, nifD, nitrogenase molybdenum-iron protein alpha chain.
           Nitrogenase consists of alpha (NifD) and beta (NifK)
          subunits of the molybdenum-iron protein and an
          ATP-binding iron-sulfur protein (NifH). This model
          describes a large clade of NifD proteins, but excludes
          a lineage that contains putative NifD and NifD homologs
          from species with vanadium-dependent nitrogenases
          [Central intermediary metabolism, Nitrogen fixation].
          Length = 466

 Score = 28.1 bits (63), Expect = 1.6
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 46 IQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
          I E  + + +K AKKR+K L   +        VK+  K
Sbjct: 2  IDEVLEVYPEKAAKKRSKHLVVNEPEGKSDCGVKSNIK 39


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 28.0 bits (62), Expect = 1.6
 Identities = 16/50 (32%), Positives = 26/50 (52%)

Query: 80  TEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
            E  EEEEE E++ + E++ S+++E   E  EE  E     E++     E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491



 Score = 26.8 bits (59), Expect = 4.6
 Identities = 19/93 (20%), Positives = 43/93 (46%)

Query: 38  LDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSED 97
           L+E   +   ++  T E++  K++ +  +      + P K  +   E    +  +S+ E+
Sbjct: 384 LEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443

Query: 98  ENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
              +++E+ +E  EE  E + E  +DE   +E 
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.5 bits (62), Expect = 1.7
 Identities = 16/62 (25%), Positives = 31/62 (50%)

Query: 30  QEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES 89
           Q KS K+ L E     ++ +      +   K A+  K K + +++ K+++ + KE E ES
Sbjct: 45  QAKSIKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104

Query: 90  ED 91
           + 
Sbjct: 105 DH 106


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 12/56 (21%), Positives = 24/56 (42%)

Query: 55  DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGN 110
               KK  K+ ++ +K   K +  K   K++ E +   +K  D  + K + +  G 
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGE 463


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 27.7 bits (61), Expect = 1.9
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 80  TEGKEEEEESEDDSKSEDENS--DKDEDNKEGNEEV-SERKTEIEQDENSV 127
            E  +E  + EDDS S    S    ++++   ++    E  + IE D+  V
Sbjct: 460 QESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGV 510


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 27.6 bits (61), Expect = 2.0
 Identities = 12/58 (20%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 55  DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEES----EDDSKSEDENSDKDEDNKE 108
              A  RAK+  +++   +K  K+++   E E E     + + K  D  +  +   K+
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.5 bits (62), Expect = 2.0
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 14/93 (15%)

Query: 20  RKEYARQK---HIQEKSEKELLDEEFQQKIQEKRQTFED-----------KTAKKRAKRL 65
           ++E   +K     +E    E  ++E QQ I+E ++  ++             A  +A  L
Sbjct: 550 KEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHEL 609

Query: 66  KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
              +K  NK  + K + K++++E +++ K  DE
Sbjct: 610 IEARKRLNKANEKKEKKKKKQKEKQEELKVGDE 642


>gnl|CDD|189012 cd09605, M3A, Peptidase M3A family includes Thimet oligopeptidase,
           dipeptidyl carboxypeptidase and mitochondrial
           intermediate peptidase.  The peptidase M3-like family,
           also called neurolysin-like family, is part of the
           "zincins" metallopeptidases, and includes M3, M2 and M32
           families of metallopeptidases.  The M3 family is
           subdivided into two subfamilies: the widespread M3A,
           which comprises a number of high-molecular mass endo-
           and exopeptidases from bacteria, archaea, protozoa,
           fungi, plants and animals, and the small M3B, whose
           members are enzymes primarily from bacteria. Well-known
           mammalian/eukaryotic M3A endopeptidases are the thimet
           oligopeptidase (TOP; endopeptidase 3.4.24.15),
           neurolysin (alias endopeptidase 3.4.24.16), and the
           mitochondrial intermediate peptidase. The first two are
           intracellular oligopeptidases, which act only on
           relatively short substrates of less than 20 amino acid
           residues, while the latter cleaves N-terminal
           octapeptides from proteins during their import into the
           mitochondria. The M3A subfamily also contains several
           bacterial endopeptidases, collectively called
           oligopeptidases A, as well as a large number of
           bacterial carboxypeptidases, called dipeptidyl
           peptidases (Dcp; Dcp II; peptidyl dipeptidase; EC
           3.4.15.5). The peptidases in the M3 family contain the
           HEXXH motif that forms the active site in conjunction
           with a C-terminally-located Glutamic acid (Glu) residue.
           A single zinc ion is ligated by the side-chains of the
           two Histidine (His) residues, and the more C-terminal
           Glu. Most of the peptidases are synthesized without
           signal peptides or propeptides, and function
           intracellularly. The structure of neurolysin shows
           similarities to those of angiotensin-converting enzyme
           (ACE; peptidyl-dipeptidase A) peptidase unit 2 belonging
           to peptidase family M2. ACE is an enzyme responsible for
           cleavage of dipeptides from the C-termini of proteins,
           notably converting angiotensin I to angiotensin II in
           mammals. There are similarities to the thermostable
           carboxypeptidases  from Pyrococcus furiosus
           carboxypeptidase (PfuCP), and Thermus aquaticus (TaqCP),
           belonging to peptidase family M32. Little is known about
           function of this family, including carboxypeptidases Taq
           and Pfu.
          Length = 590

 Score = 27.5 bits (62), Expect = 2.0
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 6/64 (9%)

Query: 11  EFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFE------DKTAKKRAKR 64
           EF    +L    YAR K + E  E   LD E ++ ++   + FE       +  ++R + 
Sbjct: 73  EFSTELNLNEDLYARLKAVYESRESLSLDPEQRRLLELLLRDFERSGAHLPEEKRERLRE 132

Query: 65  LKRK 68
           L  +
Sbjct: 133 LNEE 136


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 27.6 bits (61), Expect = 2.1
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 9/89 (10%)

Query: 51  QTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDED----- 105
             F+D+ A  +  +   K+  K+  KKV       ++E    S S  E+ D D+D     
Sbjct: 3   DGFKDRAAFIKGAKDIAKEVKKHAAKKVNKALDRAQDEYSQRSYSRFEDEDDDDDFPAPA 62

Query: 106 ----NKEGNEEVSERKTEIEQDENSVQED 130
                 E  ++  E +   +  E   ++D
Sbjct: 63  DGYSRGEAADDEEEGEASSDATEGHDEDD 91


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 27.5 bits (61), Expect = 2.1
 Identities = 17/83 (20%), Positives = 24/83 (28%), Gaps = 7/83 (8%)

Query: 28  HIQEKSEKELLDEEFQQKIQEKRQTF-------EDKTAKKRAKRLKRKQKLKNKPKKVKT 80
           H+Q     +L D  +  +     +         E K  KK  K    K     KPKK K 
Sbjct: 110 HLQMLHHPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169

Query: 81  EGKEEEEESEDDSKSEDENSDKD 103
           +G       +          D  
Sbjct: 170 KGSVSNRSVKMPGIDPRSKPDWK 192


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 27.4 bits (60), Expect = 2.4
 Identities = 30/101 (29%), Positives = 63/101 (62%), Gaps = 2/101 (1%)

Query: 24   ARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGK 83
            A +++  + +E+    EE ++K +E ++  ED+  KK A+ LK++ +   K +++K +  
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE--KKAAEALKKEAEEAKKAEELKKKEA 1712

Query: 84   EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
            EE++++E+  K+E+EN  K E+ K+  EE  ++  E ++DE
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753



 Score = 26.3 bits (57), Expect = 5.9
 Identities = 27/107 (25%), Positives = 62/107 (57%), Gaps = 2/107 (1%)

Query: 18   LRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKK 77
            LR+ E A++   +E   +E++    ++K  +  +  + + AK +A+ LK+ ++ K K ++
Sbjct: 1580 LRKAEEAKK--AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ 1637

Query: 78   VKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
            +K +  EE++++E+  K+E+EN  K  +  +  EE  ++  E ++ E
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 26.8 bits (60), Expect = 2.6
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERK 117
           EE +E E++   + ++S +DED +  +   ++ +
Sbjct: 113 EEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 27.4 bits (61), Expect = 2.7
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 89  SEDDSKSEDENSDKDEDNKEGNEEVSE 115
           SEDDS+      D D+D   G   + E
Sbjct: 33  SEDDSEDPASLDDDDDDRDSGGGMLEE 59


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 27.2 bits (61), Expect = 2.7
 Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 57  TAKKRAKRLKRKQKLKNKPKKVKT-EGKEEEEESEDDSKSEDE 98
            A   +KR +++ + K KP +    E +EEEEES DD    D 
Sbjct: 319 LAYLLSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDP 361


>gnl|CDD|184220 PRK13663, PRK13663, hypothetical protein; Provisional.
          Length = 493

 Score = 27.2 bits (61), Expect = 2.7
 Identities = 14/62 (22%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 29  IQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEE 88
            +E S++E++   +Q  +  K+    D+ A K+ + L     +  + +KV    +E+ EE
Sbjct: 295 CREASKQEIIRRYYQTLVDFKK-GGVDEEAVKKIELLMNDAGITPEDRKVVVAAREKAEE 353

Query: 89  SE 90
           + 
Sbjct: 354 TG 355


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 26.6 bits (59), Expect = 2.7
 Identities = 12/42 (28%), Positives = 26/42 (61%)

Query: 65  LKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDN 106
           L RK+   N P ++  +G + +++ EDD   E+ +++ D++N
Sbjct: 34  LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDEN 75


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 27.2 bits (61), Expect = 2.8
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query: 36  ELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEE 88
           + L EE  ++I+E ++ +     KKR ++  +K+K K K KK   + K++  +
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 27.3 bits (60), Expect = 2.8
 Identities = 15/59 (25%), Positives = 24/59 (40%)

Query: 71  LKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           +K K   V      EE  SE +S++  E     +D K+ + E  +   EI   +    E
Sbjct: 78  IKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSE 136


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 27.3 bits (61), Expect = 2.9
 Identities = 14/44 (31%), Positives = 27/44 (61%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
           +G ++E+E +DD   E+E  D D  ++E +EE  +   E +++E
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371



 Score = 25.7 bits (57), Expect = 8.6
 Identities = 11/52 (21%), Positives = 33/52 (63%)

Query: 79  KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           + E +EE+   ++D + +D++ +++E++ + ++E  + + E   DE+  +E+
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372



 Score = 25.7 bits (57), Expect = 9.7
 Identities = 12/41 (29%), Positives = 28/41 (68%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
           E+EE+ +DD + E+E+ D  ++ ++  +E S+ + + E++E
Sbjct: 333 EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 27.1 bits (60), Expect = 3.8
 Identities = 18/53 (33%), Positives = 26/53 (49%)

Query: 60  KRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
           K  K    K K  N    ++     + +E+EDD   EDE+ D+DED  E  +E
Sbjct: 858 KLVKNFISKFKELNDLTDLEEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
            Members of this protein family are polyribonucleotide
           nucleotidyltransferase, also called polynucleotide
           phosphorylase. Some members have been shown also to have
           additional functions as guanosine pentaphosphate
           synthetase and as poly(A) polymerase (see model
           TIGR02696 for an exception clade, within this family)
           [Transcription, Degradation of RNA].
          Length = 684

 Score = 26.7 bits (60), Expect = 3.9
 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 9/78 (11%)

Query: 26  QKHIQEKSEKE-------LLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
            + + E++ KE        +DEE + K++E       K A +  ++ +R   L    ++V
Sbjct: 210 IEELAEEAGKEKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEV 269

Query: 79  KTE--GKEEEEESEDDSK 94
                 +EEEEE  D  K
Sbjct: 270 LEALAEEEEEEEEADREK 287


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 26.5 bits (59), Expect = 4.1
 Identities = 17/57 (29%), Positives = 34/57 (59%)

Query: 68  KQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDE 124
           K+KL    +KV+ + +EEEEE   +++ E+E  D++E  ++       + TEI++ +
Sbjct: 6   KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKD 62


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 26.6 bits (59), Expect = 4.1
 Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 39  DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
               Q    E+     D       + +K ++K          + +++ +  ++D +SE+E
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANS----EDKEDKGDAEKEDEESEEE 94

Query: 99  NSDKDEDNKEGNEEVSERKTE 119
           N ++DE++ + NE+ +E KTE
Sbjct: 95  NEEEDEESSDENEKETEEKTE 115



 Score = 26.2 bits (58), Expect = 4.8
 Identities = 14/77 (18%), Positives = 38/77 (49%)

Query: 54  EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEV 113
           +    ++ AK+   ++  + +  K + +     E+ ED   +E E+ + +E+N+E +EE 
Sbjct: 43  QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102

Query: 114 SERKTEIEQDENSVQED 130
           S+   +  +++     +
Sbjct: 103 SDENEKETEEKTESNVE 119



 Score = 25.5 bits (56), Expect = 9.5
 Identities = 17/77 (22%), Positives = 34/77 (44%)

Query: 53  FEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
           F    + + A   +  +K  ++      E KEEE+E+ +    ED+   + ED +   E 
Sbjct: 36  FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95

Query: 113 VSERKTEIEQDENSVQE 129
             E +   +++E   +E
Sbjct: 96  EEEDEESSDENEKETEE 112


>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
          Length = 438

 Score = 26.5 bits (59), Expect = 4.2
 Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 4/52 (7%)

Query: 60  KRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNE 111
           K++ R KR++  KN  K+     +EE E+ EDD K+ D + +  E+ K   E
Sbjct: 81  KQSLREKRRKASKNTIKR----KREEIEKLEDDIKNLDVDDEMYEEIKTDLE 128


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 26.8 bits (59), Expect = 4.4
 Identities = 11/43 (25%), Positives = 15/43 (34%)

Query: 88  ESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           ES  D  S  E  D+DED             + +   NS  + 
Sbjct: 112 ESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQ 154


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.5 bits (59), Expect = 4.5
 Identities = 10/43 (23%), Positives = 24/43 (55%)

Query: 66  KRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKE 108
           K+KQ+ K + K+ K   ++++++ ++D   E      +E  +E
Sbjct: 183 KKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQRE 225



 Score = 26.1 bits (58), Expect = 5.4
 Identities = 7/43 (16%), Positives = 21/43 (48%)

Query: 55  DKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSED 97
             ++    K+ ++KQ+ K   +  + +  +E+E+ + +   E 
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEG 218


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 26.7 bits (59), Expect = 4.6
 Identities = 19/101 (18%), Positives = 31/101 (30%), Gaps = 25/101 (24%)

Query: 54  EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEE-------------------------E 88
            D  A   A+ L+ +   + +P+       E +                          +
Sbjct: 374 WDAFALWHARVLRGEFAERRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGD 433

Query: 89  SEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
            EDD   ED   D DED+    ++V E + E         E
Sbjct: 434 DEDDDDWEDLGFDLDEDDVYDLKDVDEWRLERRIATAERWE 474


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 26.4 bits (59), Expect = 4.7
 Identities = 12/42 (28%), Positives = 20/42 (47%)

Query: 45  KIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEE 86
           K    ++  + +  K++ K  KR +  KN  K+ K  G  EE
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 26.6 bits (59), Expect = 5.0
 Identities = 16/72 (22%), Positives = 28/72 (38%)

Query: 59  KKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKT 118
           ++R K  ++ Q    K  + K E    + E  D+ + + E S        GN++  E   
Sbjct: 308 RRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDP 367

Query: 119 EIEQDENSVQED 130
           + E D      D
Sbjct: 368 QEEADGQGSSTD 379


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 26.6 bits (59), Expect = 5.0
 Identities = 14/121 (11%), Positives = 42/121 (34%), Gaps = 18/121 (14%)

Query: 1   MGSSAGAGSGEFHVYRHLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
           M S    G  +      +       Q  I+++++   +  + + ++      +E      
Sbjct: 823 MESVPSGGGSQ-----DIIADHQGHQAIIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDT 877

Query: 61  RAKRLKRKQKLKNKPKKVKTEGKEE---------EEESED----DSKSEDENSDKDEDNK 107
           +      +  +K +  +++   K E         EE+S       + S +E   + ++ +
Sbjct: 878 QNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQ 937

Query: 108 E 108
           +
Sbjct: 938 D 938


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 26.5 bits (58), Expect = 5.1
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 82  GKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQD 123
           G E +++    S   DE  D D+D+ E N++ +    + + D
Sbjct: 270 GYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCD 311


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 26.5 bits (59), Expect = 5.1
 Identities = 7/41 (17%), Positives = 16/41 (39%)

Query: 58 AKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
          AK+  +    K++ K        +G+  +  + DD   +  
Sbjct: 19 AKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 881

 Score = 26.4 bits (58), Expect = 5.1
 Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 55 DKTAKKRAKRLKRKQKL--KNKPKKVKTEGKEEEEESEDDSKSEDE 98
          D++ KK+    ++ Q    K  P K   + +EEE+ SE+D+  + +
Sbjct: 3  DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGD 48


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 26.4 bits (59), Expect = 5.3
 Identities = 15/62 (24%), Positives = 30/62 (48%)

Query: 44  QKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKD 103
           +++     TF ++  ++ A+  +  +K++        E  EE EE +DD   +D + D D
Sbjct: 322 KELCWDLMTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDD 381

Query: 104 ED 105
           E 
Sbjct: 382 EG 383


>gnl|CDD|177775 PLN00180, PLN00180, NDF6 (NDH-dependent flow 6); Provisional.
          Length = 180

 Score = 25.9 bits (57), Expect = 5.4
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 23  YARQKHIQEKSEKELLDEEFQQKIQEK 49
           Y RQ+ I+E +  EL  EE  ++I++K
Sbjct: 141 YERQEDIEESARAELWREELIEEIEQK 167


>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4.  All proteins in this
           family for which functions are known are involved in
           targeting nucleotide excision repair to specific regions
           of the genome.This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 713

 Score = 26.4 bits (58), Expect = 5.5
 Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 1/91 (1%)

Query: 39  DEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDE 98
           + E Q K + ++   E++ + +R K+  +    +  P +V      +   S+D   S  +
Sbjct: 21  EAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLNEL-PHEVVLMCNLDSTHSDDRVVSVPD 79

Query: 99  NSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           +    E+     E+   R+ E     N V E
Sbjct: 80  SLSVSEEIPSREEDYDSREFEDVYLSNLVAE 110


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 25.8 bits (57), Expect = 5.8
 Identities = 12/48 (25%), Positives = 22/48 (45%)

Query: 81  EGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
           + +E EEE ED+  +E   S      K+ N ++      IE   ++ +
Sbjct: 16  DSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSNPK 63


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 26.2 bits (58), Expect = 6.1
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 47 QEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESE 90
          QE  +   +K +KK AK+   K +   +    K      E+E +
Sbjct: 3  QEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDD 46


>gnl|CDD|238566 cd01146, FhuD, Fe3+-siderophore binding domain FhuD.  These
           proteins have been shown to function as initial
           receptors in ABC transport of Fe3+-siderophores in many
           eubacterial species. They belong to the TroA-like
           superfamily of helical backbone metal receptor proteins
           that share a distinct fold and ligand binding mechanism.
            A typical TroA-like protein is comprised of two
           globular subdomains connected by a long alpha helix and
           binds its specific ligands in the cleft between these
           domains.
          Length = 256

 Score = 26.1 bits (58), Expect = 6.3
 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 30  QEKSEKELLDEEFQQKIQEKRQTFEDKTAKK 60
           +E++EK L   E+ Q++ E RQ   DK  K 
Sbjct: 115 EEEAEKLL--AEYDQRLAELRQKLPDKGPKP 143


>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
          Length = 903

 Score = 26.4 bits (58), Expect = 6.4
 Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 4/59 (6%)

Query: 72  KNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTE----IEQDENS 126
           K+   K  T   EE+  + D     ++    +++ +E N +  E KTE    +   E +
Sbjct: 763 KDYSSKETTRSVEEKGNAPDLDNEYEDEKLDNKNMRENNVDEVELKTELGTNVADTEQT 821


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 26.0 bits (58), Expect = 6.5
 Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 9/89 (10%)

Query: 25  RQKHIQEKSEKELLDEEFQQKIQEKRQ-TFEDKTAKKRAKRLKR--------KQKLKNKP 75
           +++ I   +   L D+E ++ +++  +   EDK  K+R +            ++ LK + 
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEG 548

Query: 76  KKVKTEGKEEEEESEDDSKSEDENSDKDE 104
            K+    K++ EE+ +  K E E  DK+E
Sbjct: 549 DKLPEADKKKVEEAIEWLKEELEGEDKEE 577


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 26.3 bits (58), Expect = 6.8
 Identities = 13/47 (27%), Positives = 23/47 (48%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           E+     DD + EDE  +++ED  EG  +      E E+D+   + +
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 25.9 bits (57), Expect = 7.3
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           +  EE  D+ +S D   ++DED+ + +E+  + + E     +S+ ED
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253



 Score = 25.5 bits (56), Expect = 9.6
 Identities = 13/48 (27%), Positives = 23/48 (47%)

Query: 83  KEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQED 130
           + E  +SED+   +D   D+D+D  E  E  S      + D +S + +
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEME 262


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 25.8 bits (57), Expect = 7.4
 Identities = 26/101 (25%), Positives = 54/101 (53%), Gaps = 2/101 (1%)

Query: 17  HLRRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPK 76
           HL+ + +A   +  E +E +   +E  ++I++ ++ +E+K   K  K+  +K+K K+K K
Sbjct: 42  HLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDK 101

Query: 77  KVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERK 117
           K   +  + E++ E   ++ED+  D  +   E    +SE K
Sbjct: 102 KDDKKDDKSEKKDEK--EAEDKLEDLTKSYSETLSTLSELK 140


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 25.8 bits (57), Expect = 7.9
 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 70  KLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQ 128
           K+K +P+ V   G +EE+  E   K+E+E  + D    E  +E  E + E E D+   +
Sbjct: 264 KVKGEPEVV---GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 26.1 bits (58), Expect = 7.9
 Identities = 14/67 (20%), Positives = 27/67 (40%)

Query: 54  EDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEV 113
           E    ++R +    + +  ++ K  K E +   EE       E E+ DK ED  +     
Sbjct: 228 ETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILK 287

Query: 114 SERKTEI 120
            ++  E+
Sbjct: 288 EKKDEEL 294


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 25.9 bits (57), Expect = 8.2
 Identities = 9/57 (15%), Positives = 24/57 (42%)

Query: 73  NKPKKVKTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
           +  +  + E +EEE+ S++  + E E+S+++      +             E+   +
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 25.7 bits (57), Expect = 8.3
 Identities = 21/90 (23%), Positives = 44/90 (48%), Gaps = 1/90 (1%)

Query: 40  EEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKEEEEESEDDSKSEDEN 99
            E QQ++ E ++    +  ++ A+  +  Q+LK +   ++ E +  ++E     +     
Sbjct: 69  PELQQELAELQEEL-AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127

Query: 100 SDKDEDNKEGNEEVSERKTEIEQDENSVQE 129
            + DE+N+E  EE++E K E E  E   + 
Sbjct: 128 IELDEENRELREELAELKQENEALEAENER 157


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 25.7 bits (57), Expect = 8.5
 Identities = 25/99 (25%), Positives = 42/99 (42%), Gaps = 9/99 (9%)

Query: 28  HIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKR--AKRLKRKQKLKNKPKKVKTE--GK 83
           HI++  E+     E  +K++       +   K R    +L   ++ K K      E   K
Sbjct: 32  HIEDLKEEL--SNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIK 89

Query: 84  EEEEESEDDSKSEDENSDKDEDNKEGNEEVSERKTEIEQ 122
           + EEE E     E E  + +E+  E   E+ E + EIE+
Sbjct: 90  DVEEELEKI---EKEIKELEEEISELENEIKELEQEIER 125


>gnl|CDD|223522 COG0445, GidA, Flavin-dependent tRNA uridine
           5-carboxymethylaminomethyl modification enzyme GidA  
           [Cell cycle control, cell division, chromosome
           partitioning].
          Length = 621

 Score = 26.0 bits (58), Expect = 8.5
 Identities = 9/48 (18%), Positives = 21/48 (43%), Gaps = 8/48 (16%)

Query: 37  LLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTEGKE 84
           L+D+E  ++  +K++  E        + ++R +     P +V  E   
Sbjct: 454 LVDDERYERFLKKKENIE--------EEIERLKSTWVTPSEVAKELLA 493


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 25.6 bits (56), Expect = 8.8
 Identities = 20/94 (21%), Positives = 35/94 (37%), Gaps = 7/94 (7%)

Query: 19  RRKEYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKV 78
              +Y   K     + +ELL E F   +        D  ++       RK+K +  P   
Sbjct: 309 YSSDYQAPKPEPIYTWEELLRERFPSDLFAISSL-PDSDSEASDSGPTRKRKRRRVPPL- 366

Query: 79  KTEGKEEEEESEDDSKSEDENSDKDEDNKEGNEE 112
                 E    ED+  S+++  D +++ K   EE
Sbjct: 367 -----PEYSSDEDEDDSDEDEVDYEKERKRRREE 395


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 25.4 bits (56), Expect = 9.3
 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 1/74 (1%)

Query: 22  EYARQKHIQEKSEKELLDEEFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVKTE 81
           E  +      K E     E   Q I+ +    +++  +KR +  + K+ L  +   + + 
Sbjct: 53  EDGKLAADLLKLEVARKKERLNQ-IRARISQLKEEIEQKRERIEELKRALAQRRSDLSSA 111

Query: 82  GKEEEEESEDDSKS 95
             + E+      + 
Sbjct: 112 SYQLEKRRASQLEK 125


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 25.6 bits (57), Expect = 9.5
 Identities = 12/47 (25%), Positives = 24/47 (51%), Gaps = 2/47 (4%)

Query: 35  KELLDE--EFQQKIQEKRQTFEDKTAKKRAKRLKRKQKLKNKPKKVK 79
           KE+L E  E + ++ ++R+ +  +  ++R   L+   K  NK   V 
Sbjct: 307 KEILKEFLEHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVI 353


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.296    0.119    0.298 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,290,625
Number of extensions: 573388
Number of successful extensions: 4898
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3119
Number of HSP's successfully gapped: 1294
Length of query: 130
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 44
Effective length of database: 7,123,158
Effective search space: 313418952
Effective search space used: 313418952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.9 bits)
S2: 53 (24.0 bits)