RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14604
         (209 letters)



>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain.  This domain is
           found in proteins involved in a variety of processes
           including transcription regulation (e.g., SNF2, STH1,
           brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
           recombination (e.g. RAD54), and chromatin unwinding
           (e.g. ISWI) as well as a variety of other proteins with
           little functional information (e.g. lodestar, ETL1).
          Length = 301

 Score =  115 bits (290), Expect = 2e-31
 Identities = 52/101 (51%), Positives = 59/101 (58%), Gaps = 1/101 (0%)

Query: 108 YQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHC-KGPFLVSAPLSTI 166
           YQLEG+NWL       +  ILADEMGLGKT+QTI  L +  KEG   +GP LV  PLST+
Sbjct: 1   YQLEGVNWLISLESNGLGGILADEMGLGKTLQTIALLATYLKEGKDRRGPTLVVCPLSTL 60

Query: 167 INWEREFETWAPDFYVVTYVGDKDCRIVLRDHDISWEDTAN 207
            NW  EFE WAP   VV Y GD   R  LR       DT +
Sbjct: 61  HNWLNEFEKWAPALRVVVYHGDGRERSKLRQSMAKRLDTYD 101


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 96.0 bits (239), Expect = 5e-23
 Identities = 56/159 (35%), Positives = 74/159 (46%), Gaps = 25/159 (15%)

Query: 46  KKKKSKGRGKTKTKE-------LQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFL 98
           KK K +GR  +K  E       L+E+EDG  GS   R    P      +R          
Sbjct: 122 KKAKGRGRHASKLTEEEEDEEYLKEEEDGLGGSGGTRLLVQPSCIKGKMRD--------- 172

Query: 99  DDTGMQLHAYQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGPFL 158
                    YQL G+NWL   +   I+ ILADEMGLGKT+QTI+ L  L +     GP +
Sbjct: 173 ---------YQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHM 223

Query: 159 VSAPLSTIINWEREFETWAPDFYVVTYVGDKDCRIVLRD 197
           V AP ST+ NW  E   + P    V + G+ + R   R+
Sbjct: 224 VVAPKSTLGNWMNEIRRFCPVLRAVKFHGNPEERAHQRE 262


>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 866

 Score = 90.9 bits (225), Expect = 3e-21
 Identities = 54/162 (33%), Positives = 74/162 (45%), Gaps = 8/162 (4%)

Query: 34  DEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTTDLR-KKYE 92
            E LK        +     R    ++ L++          R      ++   +LR  +  
Sbjct: 267 PEQLKEEDLFARLRLLDPLRLADLSQILEKFVRETLKLSARDLKDELKELLAELRLSEDL 326

Query: 93  KQPPFLDDTGMQLHAYQLEGINWLR---YSWGQNIDTILADEMGLGKTIQTITFLYSLFK 149
              P   D   +L  YQLEG+NWL     S       ILAD+MGLGKT+QTI  L SL +
Sbjct: 327 LNAPEPVDLSAELRPYQLEGVNWLSELLRSNLLGG--ILADDMGLGKTVQTIALLLSLLE 384

Query: 150 EGHC-KGPFLVSAPLSTIINWEREFETWAPDF-YVVTYVGDK 189
                 GP L+  P S + NW+REFE +APD   V+ Y G+K
Sbjct: 385 SIKVYLGPALIVVPASLLSNWKREFEKFAPDLRLVLVYHGEK 426


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score = 67.5 bits (165), Expect = 5e-14
 Identities = 29/97 (29%), Positives = 32/97 (32%), Gaps = 7/97 (7%)

Query: 104 QLHAYQLEGINWLRYSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGPFLVSAPL 163
            L  YQ E I  L        D ILA   G GKT+  +       K G   G  LV  P 
Sbjct: 8   PLRPYQKEAIEALLSGLR---DVILAAPTGSGKTLAALLPALEALKRGK-GGRVLVLVPT 63

Query: 164 -STIINWEREFETWAPDFY--VVTYVGDKDCRIVLRD 197
                 W  E +   P     VV   G    R  LR 
Sbjct: 64  RELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRK 100


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score = 55.0 bits (133), Expect = 6e-10
 Identities = 17/70 (24%), Positives = 24/70 (34%), Gaps = 3/70 (4%)

Query: 125 DTILADEMGLGKTIQTITFLYSLFKEGHCKGPFLVSAPLSTIINWEREFETWAPDFY--V 182
           D +LA   G GKT+  +  +  L       G  LV AP   + N   E           V
Sbjct: 2   DVLLAAPTGSGKTLAALLPILELLDSLK-GGQVLVLAPTRELANQVAERLKELFGEGIKV 60

Query: 183 VTYVGDKDCR 192
              +G    +
Sbjct: 61  GYLIGGTSIK 70


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 34.0 bits (78), Expect = 0.051
 Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 6/93 (6%)

Query: 14  RGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKP 73
           +G KKKK    ++K    +   D +      GKK + KGR K+        +  A G K 
Sbjct: 418 KGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKS---SKVPSDSKAGGKKE 474

Query: 74  RRYTPPPEK---PTTDLRKKYEKQPPFLDDTGM 103
              +        P   + KK  +  P L++ G 
Sbjct: 475 SVKSQEDNNNIPPEEWVMKKILEWVPDLEEDGT 507


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 32.1 bits (73), Expect = 0.15
 Identities = 14/74 (18%), Positives = 24/74 (32%)

Query: 8   VRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDG 67
              A +R K   +   R+ K  +    ED +    +  KK++ K R     +        
Sbjct: 146 ECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSS 205

Query: 68  ASGSKPRRYTPPPE 81
              S       PP+
Sbjct: 206 GGQSGLSTKDEPPK 219


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.19
 Identities = 17/67 (25%), Positives = 27/67 (40%)

Query: 11  AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASG 70
              +  K KK   +  K  D D  +D K        +K+++ + +  TK   E     S 
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138

Query: 71  SKPRRYT 77
            KPR+Y 
Sbjct: 139 LKPRKYA 145



 Score = 30.8 bits (70), Expect = 0.34
 Identities = 14/77 (18%), Positives = 26/77 (33%), Gaps = 12/77 (15%)

Query: 16  KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRR 75
           K+K K   + SK   D D            K KK   +     K+ +++ +       + 
Sbjct: 81  KQKWKWKKKKSKKKKDKD------------KDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128

Query: 76  YTPPPEKPTTDLRKKYE 92
           Y+      +    +KY 
Sbjct: 129 YSETLSTLSELKPRKYA 145



 Score = 26.6 bits (59), Expect = 8.4
 Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 1/79 (1%)

Query: 24  RASKAADDDDDEDLKAAYFN-DGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEK 82
           +  K    ++ E +K  Y      K K K   K K K+  + +D       ++     E 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120

Query: 83  PTTDLRKKYEKQPPFLDDT 101
              DL K Y +    L + 
Sbjct: 121 KLEDLTKSYSETLSTLSEL 139


>gnl|CDD|235319 PRK04914, PRK04914, ATP-dependent helicase HepA; Validated.
          Length = 956

 Score = 31.7 bits (73), Expect = 0.26
 Identities = 10/11 (90%), Positives = 11/11 (100%)

Query: 128 LADEMGLGKTI 138
           LADE+GLGKTI
Sbjct: 174 LADEVGLGKTI 184


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.0 bits (70), Expect = 0.42
 Identities = 17/94 (18%), Positives = 36/94 (38%)

Query: 2   KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKEL 61
           K++      A AR K+  +    A +  + +  ++  AA      ++ +    K    + 
Sbjct: 334 KTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDA 393

Query: 62  QEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
             +ED  S ++  + +PP  + TT      E   
Sbjct: 394 SPNEDTPSENEESKGSPPQVEATTTAEPNREPSQ 427


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.5 bits (69), Expect = 0.59
 Identities = 17/78 (21%), Positives = 28/78 (35%), Gaps = 4/78 (5%)

Query: 16  KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRR 75
           K    ST +  K   +++ +          KKKK + +      E +        SK  +
Sbjct: 48  KVATTSTKKDKKEDKNNESKKKSE----KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103

Query: 76  YTPPPEKPTTDLRKKYEK 93
             PP  KP  D+   + K
Sbjct: 104 KKPPKPKPNEDVDNAFNK 121


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 30.0 bits (67), Expect = 0.66
 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 10/70 (14%)

Query: 4   KRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQE 63
           KRR  +   A+  K +KS     K    DD +        D  ++K       + +E  +
Sbjct: 87  KRRTKK--KAKADKPEKSPKAVEKLCPPDDRD--------DKNEEKEPTEEAQRNEESGD 136

Query: 64  DEDGASGSKP 73
            E GASG  P
Sbjct: 137 AEGGASGRSP 146


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 30.2 bits (68), Expect = 0.86
 Identities = 13/74 (17%), Positives = 22/74 (29%)

Query: 13   ARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSK 72
             R   KKK+   A +  ++D+  DLK                K +          A+  K
Sbjct: 1242 GRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARK 1301

Query: 73   PRRYTPPPEKPTTD 86
                +      + D
Sbjct: 1302 KPLASVSVISDSDD 1315



 Score = 28.3 bits (63), Expect = 3.9
 Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 7/85 (8%)

Query: 3    SKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQ 62
             K  P R AAAR KK   S    S + DDDDD  ++ +     KKK  +       K   
Sbjct: 1289 VKAVPARRAAAR-KKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKA-- 1345

Query: 63   EDEDGASGSKPRRYTPPPEKPTTDL 87
                    +  ++  P   +    L
Sbjct: 1346 ----AKPPAAAKKRGPATVQSGQKL 1366



 Score = 27.1 bits (60), Expect = 9.3
 Identities = 15/69 (21%), Positives = 22/69 (31%), Gaps = 11/69 (15%)

Query: 2    KSKRRPVRGAAARGKKKKKSTGRASKAADDD----DDEDLKAAY-------FNDGKKKKS 50
            K +R   RG +   KK  +   +            + E  +  Y        N  +  K 
Sbjct: 1181 KLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKP 1240

Query: 51   KGRGKTKTK 59
            KGR   K K
Sbjct: 1241 KGRAGAKKK 1249


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 2   KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAY 41
             K +  + A    KKK K+  + +    +DDD+D     
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYD 231


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 29.8 bits (67), Expect = 1.2
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 10/66 (15%)

Query: 46  KKKKSKGRGKTKTKE-------LQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFL 98
           KKKKSK + K K +E       L+  E+G  GS   +     E P   L KK+ + P F+
Sbjct: 688 KKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEK---SSETPQLKLEKKFGQSPVFV 744

Query: 99  DDTGMQ 104
             T ++
Sbjct: 745 ASTLLE 750


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 29.2 bits (65), Expect = 1.9
 Identities = 19/74 (25%), Positives = 28/74 (37%), Gaps = 7/74 (9%)

Query: 27 KAADDDDDED------LKAAYFNDGKKKKSKGRGK-TKTKELQEDEDGASGSKPRRYTPP 79
          K  DD D  +      +      DG    S G  +    + +QE      G   RR  P 
Sbjct: 25 KEEDDGDRRNREKHIPVTGMVSVDGAAVSSSGNDEFMHQQSIQEQVGTPGGGGSRRSRPL 84

Query: 80 PEKPTTDLRKKYEK 93
           EK  T LR+++ +
Sbjct: 85 EEKERTKLRERHRR 98


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 28.7 bits (64), Expect = 2.3
 Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 4/90 (4%)

Query: 6   RPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDE 65
            P         + K+ T    K ++ D D D   +        K K R K K   +Q   
Sbjct: 107 SPGESLPESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQ--- 163

Query: 66  DGASGSKPRRY-TPPPEKPTTDLRKKYEKQ 94
              S ++   Y T       TD   KY   
Sbjct: 164 SPTSLTEEENYNTEICTVAPTDQIAKYFLM 193


>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
           ribosomal structure and biogenesis].
          Length = 239

 Score = 28.4 bits (64), Expect = 2.7
 Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 4/43 (9%)

Query: 55  KTKTKELQEDEDGAS----GSKPRRYTPPPEKPTTDLRKKYEK 93
           + +++E   +         G++ R Y   P +PT DLR   E+
Sbjct: 174 EKRSQEKNRERALKKLIGWGNQIRSYVLDPYQPTKDLRTGVER 216


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 8/89 (8%)

Query: 16  KKKKKSTGRASKAADDDDDEDLKAAYFND--------GKKKKSKGRGKTKTKELQEDEDG 67
           +K  K   + SK   DDDD  L      +         KKKK K + K K    +  +  
Sbjct: 19  QKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLA 78

Query: 68  ASGSKPRRYTPPPEKPTTDLRKKYEKQPP 96
                        +         + +Q  
Sbjct: 79  YDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 28.6 bits (63), Expect = 2.9
 Identities = 8/59 (13%), Positives = 19/59 (32%)

Query: 48  KKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQPPFLDDTGMQLH 106
           K++ G     +     +E     +     T  P   ++ LR ++    P +      + 
Sbjct: 712 KETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 28.4 bits (63), Expect = 3.2
 Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 11  AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGK-KKKSKGRGKTKTKELQEDEDGAS 69
           AAA+ K    +  +AS+   D  DED KA      K K  +  R KTK  E +++E+   
Sbjct: 209 AAAKAKAAALAKQKASQGNGDSGDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEE--- 265

Query: 70  GSKPRRYTPPPEKP 83
              P++  P   +P
Sbjct: 266 ---PKQEEPSVNQP 276


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 8/77 (10%)

Query: 1   GKSKR-RPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRG-KTKT 58
           GK+ R +  +G A +   K++  G+      D      K A     KK   K +    KT
Sbjct: 735 GKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKT 794

Query: 59  KELQEDEDGASGSKPRR 75
           +++      A+ +K +R
Sbjct: 795 QKI------AAATKAKR 805


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 28.5 bits (63), Expect = 3.5
 Identities = 22/100 (22%), Positives = 38/100 (38%), Gaps = 22/100 (22%)

Query: 15  GKKKKKSTGRASKAADD----DDDEDLKAAYFNDGKKKKS---------------KGRGK 55
           G+  + +TG   + ++D     DDE+L      +G K+K                 G+  
Sbjct: 339 GEHDEGATGETREESEDTESDGDDEELPRIVSREGTKRKRPPIFLRRLHRLLLMRAGKLT 398

Query: 56  TKTKELQEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
            + +E  E     +   PR   PP EKP  ++ +  E   
Sbjct: 399 ERAREALEKASEPTYGTPR---PPVEKPRPEVPQSLETAT 435


>gnl|CDD|215553 PLN03052, PLN03052, acetate--CoA ligase; Provisional.
          Length = 728

 Score = 28.1 bits (63), Expect = 3.6
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 182 VVTYVGDKDCRIVLRDHDISWED 204
           +V     K  R+ LR+ D+SW+D
Sbjct: 312 IVLPADGKSVRVKLREGDMSWDD 334


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 27.8 bits (62), Expect = 3.8
 Identities = 16/83 (19%), Positives = 35/83 (42%), Gaps = 1/83 (1%)

Query: 11  AAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASG 70
            AA  +++ K +     A  ++  E+ K A  ++ K+ K     + +  E +E+E+    
Sbjct: 43  QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE-EENEEEDEE 101

Query: 71  SKPRRYTPPPEKPTTDLRKKYEK 93
           S         EK  +++ K+   
Sbjct: 102 SSDENEKETEEKTESNVEKEITN 124


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 28.2 bits (63), Expect = 3.8
 Identities = 19/67 (28%), Positives = 29/67 (43%)

Query: 4   KRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQE 63
            R  +  A  + ++  K  G  +   DDDDD+D KA      K++K K   + K   L E
Sbjct: 58  LREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDE 117

Query: 64  DEDGASG 70
            E   + 
Sbjct: 118 KEKERAA 124


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This
          is a family of proteins conserved in fungi. The
          function is not known.
          Length = 436

 Score = 27.8 bits (62), Expect = 4.6
 Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 4/78 (5%)

Query: 10 GAAARGKKKKKSTGRASKAADDDDD--EDLK--AAYFNDGKKKKSKGRGKTKTKELQEDE 65
                 KK K    +  + +DD+D  E L            KK K   +  T    +  
Sbjct: 9  PDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKS 68

Query: 66 DGASGSKPRRYTPPPEKP 83
             + S        P KP
Sbjct: 69 SKPTESSAASSEEKPAKP 86


>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
           has ATPase activity and belongs to the larger group of
           P-loop NTPases. Myosins are actin-dependent molecular
           motors that play important roles in muscle contraction,
           cell motility, and organelle transport. The head domain
           is a molecular motor, which utilizes ATP hydrolysis to
           generate directed movement toward the plus end along
           actin filaments. A cyclical interaction between myosin
           and actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 679

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 15/75 (20%)

Query: 108 YQLEGINWLRYSWGQN---IDTILADEMGL-----------GKTIQT-ITFLYSLFKEGH 152
           YQ EGI+W    +  N   ID I     GL             T +T +  L +  K  +
Sbjct: 408 YQEEGIDWESIDFTDNQEVIDLIEKKPGGLLSLLDEECLFPKGTDETFLEKLNNKLKSNN 467

Query: 153 CKGPFLVSAPLSTII 167
              P   +AP    I
Sbjct: 468 AFYPAKKNAPTEFTI 482


>gnl|CDD|165245 PHA02934, PHA02934, Hypothetical protein; Provisional.
          Length = 253

 Score = 27.3 bits (60), Expect = 5.0
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 2/39 (5%)

Query: 118 YSWGQNIDTILADEMGLGKTIQTITFLYSLFKEGHCKGP 156
           Y     ++TIL D  GLG  + TI+F+  L +   C  P
Sbjct: 155 YDINNEVNTILMDNKGLGVRLATISFITELGRR--CMNP 191


>gnl|CDD|181766 PRK09295, PRK09295, bifunctional cysteine
           desulfurase/selenocysteine lyase; Validated.
          Length = 406

 Score = 27.8 bits (62), Expect = 5.1
 Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 4/26 (15%)

Query: 111 EGINWLRYSWG----QNIDTILADEM 132
           EGIN +  SWG    +  D I+  EM
Sbjct: 96  EGINLVANSWGNSNVRAGDNIIISEM 121


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 5.3
 Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 10/57 (17%)

Query: 16  KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSK 72
           KK  + T R + AA  D +E          +K + K R + K K++ ED +      
Sbjct: 111 KKTPRRTRRKAAAASSDVEE----------EKTEKKVRKRRKVKKMDEDVEDQGSES 157


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 27.6 bits (61), Expect = 5.8
 Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 3/57 (5%)

Query: 2    KSKRRPVRGAAA---RGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGK 55
            K+ RR V+ AAA     KKK      A KAA         AA     KK  +K   +
Sbjct: 976  KTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVAR 1032


>gnl|CDD|218777 pfam05847, Baculo_LEF-3, Nucleopolyhedrovirus late expression
           factor 3 (LEF-3).  This family consists of LEF-3
           Nucleopolyhedrovirus late expression factor 3 (LEF-3)
           sequences which are known to be ssDNA-binding proteins.
           Alkaline nuclease (AN) and LEF-3 may participate in
           homologous recombination of the baculovirus genome in a
           manner similar to that of exonuclease (Redalpha) and
           DNA-binding protein (Redbeta) of the Red-mediated
           homologous recombination system of bacteriophage lambda.
           LEF-3 is essential for transporting the putative
           baculovirus helicase protein P143 into the nucleus where
           they function together during viral DNA replication.
           LEF-3 and other proteins have been shown to bind to
           closely linked sites on viral chromatin in vivo,
           suggesting that they may form part of the baculovirus
           replisome.
          Length = 348

 Score = 27.2 bits (61), Expect = 6.3
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 31  DDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQED 64
           +D +E +KA ++ D K    K     K ++L+ D
Sbjct: 269 EDSNEWIKAVFYVDNKNNDKKNNDDDKLQKLEMD 302


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 25.9 bits (57), Expect = 6.5
 Identities = 15/67 (22%), Positives = 23/67 (34%), Gaps = 3/67 (4%)

Query: 26 SKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKPRRYTPPPEKPTT 85
           K  +  D  D+   + N  K+KK K R     +     +      K       PE P  
Sbjct: 3  EKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE---FPEFPEE 59

Query: 86 DLRKKYE 92
            R++ E
Sbjct: 60 SKRRRTE 66


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 27.0 bits (60), Expect = 7.3
 Identities = 12/58 (20%), Positives = 25/58 (43%)

Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGASGSKP 73
          KKKKK   +ASK + +      K  +  +  ++K +   +        +E  +S ++ 
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGTFNNYGTNESQSSSTQN 66


>gnl|CDD|153108 cd01049, RNRR2, Ribonucleotide Reductase, R2/beta subunit,
           ferritin-like diiron-binding domain.  Ribonucleotide
           Reductase, R2/beta subunit (RNRR2) is a member of a
           broad superfamily of ferritin-like diiron-carboxylate
           proteins. The RNR protein catalyzes the conversion of
           ribonucleotides to deoxyribonucleotides and is found in
           all eukaryotes, many prokaryotes, several viruses, and
           few archaea. The catalytically active form of RNR is a
           proposed alpha2-beta2 tetramer. The homodimeric alpha
           subunit (R1) contains the active site and redox active
           cysteines as well as the allosteric binding sites. The
           beta subunit (R2) contains a diiron cluster that, in its
           reduced state, reacts with dioxygen to form a stable
           tyrosyl radical and a diiron(III) cluster. This
           essential tyrosyl radical is proposed to generate a
           thiyl radical, located on a cysteine residue in the R1
           active site that initiates ribonucleotide reduction. The
           beta subunit is composed of 10-13 helices, the 8 longest
           helices form an alpha-helical bundle; some have 2
           addition beta strands. Yeast is unique in that it
           assembles both homodimers and heterodimers of RNRR2. The
           yeast heterodimer, Y2Y4, contains R2 (Y2) and a R2
           homolog (Y4) that lacks the diiron center and is
           proposed to only assist in cofactor assembly, and
           perhaps stabilize R1 (Y1) in its active conformation.
          Length = 288

 Score = 26.8 bits (60), Expect = 7.9
 Identities = 6/30 (20%), Positives = 12/30 (40%), Gaps = 1/30 (3%)

Query: 103 MQLHAYQLEGINWLRYSWGQNIDTILADEM 132
           +   A +LE   + R      I  +  ++M
Sbjct: 223 LIKEAVELE-KEFARDLLPDGILGLNKEDM 251


>gnl|CDD|130307 TIGR01240, mevDPdecarb, diphosphomevalonate decarboxylase.  This
           enzyme catalyzes the last step in the synthesis of
           isopentenyl diphosphate (IPP) in the mevalonate pathway.
           Alternate names: mevalonate diphosphate decarboxylase;
           pyrophosphomevalonate decarboxylase [Central
           intermediary metabolism, Other].
          Length = 305

 Score = 26.8 bits (59), Expect = 8.4
 Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 2/69 (2%)

Query: 1   GKSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYF--NDGKKKKSKGRGKTKT 58
           G + R    G  A  K K   +  A + ADD D           ND KK  S  +G   T
Sbjct: 136 GSACRSLFGGYVAWEKGKDDHSSAAVQVADDSDWPQXAMCVLVVNDIKKDVSSRQGMQLT 195

Query: 59  KELQEDEDG 67
               E    
Sbjct: 196 VATSELFKE 204


>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245).  This is
           a family of proteins conserved in fungi. The function is
           not known, and there is no S. cerevisiae member.
          Length = 145

 Score = 26.1 bits (57), Expect = 9.1
 Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 4/52 (7%)

Query: 2   KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDED----LKAAYFNDGKKKK 49
           K++  PV+  AA   K+ K + +     DDDDDED             KK+K
Sbjct: 94  KAEGGPVKRKAAVVAKEAKQSSKGVGDDDDDDDEDESRSAAFGKKGSNKKRK 145


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
          spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
          U5) and proteins, catalyzes the excision of introns
          from pre-mRNAs in two successive trans-esterification
          reactions. Step 2 depends upon integral spliceosome
          constituents such as U5 snRNA and Prp8 and
          non-spliceosomal proteins Prp16, Slu7, Prp18, and
          Prp22. ATP hydrolysis by the DEAH-box enzyme Prp16
          promotes a conformational change in the spliceosome
          that leads to protection of the 3'ss from targeted
          RNase H cleavage. This change, which probably reflects
          binding of the 3'ss PyAG in the catalytic centre of the
          spliceosome, requires the ordered recruitment of Slu7,
          Prp18, and Prp22 to the spliceosome. There is a close
          functional relationship between Prp8, Prp18, and Slu7,
          and Prp18 interacts with Slu7, so that together they
          recruit Prp22 to the spliceosome. Most members of the
          family carry a zinc-finger of the CCHC-type upstream of
          this domain.
          Length = 236

 Score = 26.6 bits (59), Expect = 9.2
 Identities = 11/53 (20%), Positives = 24/53 (45%)

Query: 16 KKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKELQEDEDGA 68
           +KK    +   + D D +++ +  Y ++     S  + +T  + L+  ED A
Sbjct: 32 LRKKLKAKKEDDSDDTDYEDEDEELYLDESDMGDSDSKTRTTVRNLRIREDTA 84


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 26.8 bits (59), Expect = 10.0
 Identities = 19/94 (20%), Positives = 30/94 (31%), Gaps = 2/94 (2%)

Query: 2   KSKRRPVRGAAARGKKKKKSTGRASKAADDDDDEDLKAAYFNDGKKKKSKGRGKTKTKEL 61
                    A A+   KK S  +A+K A         AA      KK +K   K   K +
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267

Query: 62  QEDEDGASGSKPRRYTPPPEKPTTDLRKKYEKQP 95
           +  +     +K            T  + K +K+ 
Sbjct: 268 K--KAAKKAAKAAAKAAKGAAKATKGKAKAKKKA 299


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.134    0.408 

Gapped
Lambda     K      H
   0.267   0.0643    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,994,145
Number of extensions: 1029466
Number of successful extensions: 1133
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1062
Number of HSP's successfully gapped: 93
Length of query: 209
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 116
Effective length of database: 6,812,680
Effective search space: 790270880
Effective search space used: 790270880
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.9 bits)