RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18120
         (392 letters)



>gnl|CDD|128456 smart00151, SWIB, SWI complex, BAF60b domains. 
          Length = 77

 Score =  110 bits (277), Expect = 4e-30
 Identities = 32/77 (41%), Positives = 39/77 (50%)

Query: 300 QPLQFKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCP 359
              +  L P LA++LG    TR  II  LW+YIK H LQD   +  I CD   EQIF   
Sbjct: 1   ITKKVTLSPELAKVLGAPEMTRTEIIKRLWEYIKEHNLQDPQNKREILCDSKLEQIFGKD 60

Query: 360 RMKFAEIPQRLNPLLHP 376
           RM   E+ + L P L  
Sbjct: 61  RMDMFEMNKLLTPHLIK 77


>gnl|CDD|145386 pfam02201, SWIB, SWIB/MDM2 domain.  This family includes the SWIB
           domain and the MDM2 domain. The p53-associated protein
           (MDM2) is an inhibitor of the p53 tumour suppressor gene
           binding the transactivation domain and down regulating
           the ability of p53 to activate transcription. This
           family contains the p53 binding domain of MDM2.
          Length = 76

 Score = 97.7 bits (244), Expect = 2e-25
 Identities = 27/75 (36%), Positives = 38/75 (50%)

Query: 300 QPLQFKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCP 359
               F L P LA+ LG    +R  ++  LWQYIK H LQD   +  I CD+  + IF   
Sbjct: 1   LTKPFPLSPDLAKFLGAGELSRTEVVKKLWQYIKEHNLQDPKNKRIILCDEKLKSIFGGD 60

Query: 360 RMKFAEIPQRLNPLL 374
           R+ F E+ + L+   
Sbjct: 61  RVGFFEMSKLLSSHF 75


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 70.2 bits (172), Expect = 8e-14
 Identities = 48/186 (25%), Positives = 82/186 (44%), Gaps = 12/186 (6%)

Query: 197 LLEDSKNDPNKSGEGEEGSVASWELRVEGRLLEDSKNDPNKVKRKFSSFFKSLVIE---- 252
            L+  + D N   +  +   + W +R+EGR      ND   ++ KF S  +   +     
Sbjct: 14  WLQLDERDTNNEKDVGKLLFSEWTVRLEGRF--KDNNDL--IRDKFDSLAEEPRVLRKEK 69

Query: 253 ---LDKDLYGPDNHLVEWHRTPTTQET-DGFQVKRPGDKNVRCTILLLLDYQPLQFKLDP 308
                K     D    E    P+++ET +G    +  DK  + + +        + KL P
Sbjct: 70  YNITRKTTGKNDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVKLSP 129

Query: 309 RLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKFAEIPQ 368
           +LA +LG+   TRP  +  LW+YIK H LQD + +  I CD   +++     +   E+ +
Sbjct: 130 KLAAILGLEPGTRPEAVKKLWKYIKKHNLQDPNNKRLILCDSKLKKVLGSDPIDMFELTK 189

Query: 369 RLNPLL 374
            L+P L
Sbjct: 190 PLSPHL 195


>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
          Length = 987

 Score = 44.9 bits (106), Expect = 6e-05
 Identities = 19/57 (33%), Positives = 29/57 (50%)

Query: 310 LARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKFAEI 366
           LA ++G     RP +I  LW YIK + LQD  ++  IN D     +F   ++   E+
Sbjct: 922 LAAVIGAEPVARPEVIKKLWDYIKANNLQDPADKRAINADAKLRPVFGKDQVTMFEL 978


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 44.4 bits (105), Expect = 9e-05
 Identities = 15/68 (22%), Positives = 29/68 (42%)

Query: 304 FKLDPRLARLLGVHTQTRPVIISALWQYIKTHKLQDAHEREFINCDKFFEQIFSCPRMKF 363
           +   P LA ++G     R      +W YIK H LQ    ++ I  D   + +     +  
Sbjct: 788 YTPSPALAAMIGAEPVGRGEATKKVWDYIKEHGLQSPENKKLIIPDSKLQGVIGPDPIDM 847

Query: 364 AEIPQRLN 371
            ++ ++L+
Sbjct: 848 FQLSKKLS 855


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 38.4 bits (89), Expect = 0.008
 Identities = 29/103 (28%), Positives = 37/103 (35%), Gaps = 16/103 (15%)

Query: 7   GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG--PRPGGPGVPPNQQPPYT 64
           G    +Q     P P+  P        A  P P  TP P        GP  PP +QPP  
Sbjct: 398 GPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457

Query: 65  GMRPSGPVNPNIANKRPSDA----RPPNNLKNDYQHGPPGPGP 103
              P+   +P+ A ++  DA    RPP          PPG   
Sbjct: 458 ATEPAPD-DPDDATRKALDALRERRPPE---------PPGADL 490



 Score = 37.6 bits (87), Expect = 0.012
 Identities = 25/90 (27%), Positives = 28/90 (31%), Gaps = 14/90 (15%)

Query: 19   PPPSGPPMRYGNQNMAMQPRPGFTPTPPGP----RPGGPGVPPNQQPPYTGMRPSG-PVN 73
            PPP  PP           P P   P P  P    R   P  PP    P   +   G P  
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRG 2610

Query: 74   PNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
            P   +  P D   P+         PP P P
Sbjct: 2611 PAPPSPLPPDTHAPD---------PPPPSP 2631



 Score = 36.5 bits (84), Expect = 0.030
 Identities = 20/91 (21%), Positives = 23/91 (25%), Gaps = 14/91 (15%)

Query: 17   RYPPPSGPPMRYGNQNMAMQPRP----GFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
              PPP   P          +P P      TP PPGP       P     P     P+GP 
Sbjct: 2700 DPPPPPPTP----------EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749

Query: 73   NPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
             P    +                  P    P
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPP 2780



 Score = 32.2 bits (73), Expect = 0.52
 Identities = 28/95 (29%), Positives = 35/95 (36%), Gaps = 16/95 (16%)

Query: 8    SQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMR 67
            S++T    L    P  PP          QP+    P PP P+P  P  P  Q PP    R
Sbjct: 2892 SRSTESFALPPDQPERPP----------QPQA---PPPPQPQPQPPPPPQPQPPPPPPPR 2938

Query: 68   PSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPG 102
            P  P+ P      P+ A  P+        G   PG
Sbjct: 2939 PQPPLAP---TTDPAGAGEPSGAVPQPWLGALVPG 2970



 Score = 32.2 bits (73), Expect = 0.54
 Identities = 23/93 (24%), Positives = 28/93 (30%), Gaps = 9/93 (9%)

Query: 14   APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
                 PPPS  P    N+             PP  RP     P     P    R      
Sbjct: 2622 HAPDPPPPSPSP--AANE----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675

Query: 74   PNIANKRPSD--ARPP-NNLKNDYQHGPPGPGP 103
             +   +RP    ARP   +L +     PP P P
Sbjct: 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708



 Score = 31.8 bits (72), Expect = 0.75
 Identities = 18/88 (20%), Positives = 24/88 (27%), Gaps = 1/88 (1%)

Query: 15   PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP-PGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
                PP               QP+P   P P P P P  P  P     P T    +G  +
Sbjct: 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956

Query: 74   PNIANKRPSDARPPNNLKNDYQHGPPGP 101
              +         P       ++   P P
Sbjct: 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAP 2984



 Score = 31.8 bits (72), Expect = 0.76
 Identities = 19/91 (20%), Positives = 19/91 (20%), Gaps = 6/91 (6%)

Query: 13   QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
                  P    PP          QP P   P PP P P  P  P        G       
Sbjct: 2905 PERPPQPQAPPPPQP------QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958

Query: 73   NPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
             P          R         Q  P    P
Sbjct: 2959 VPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989



 Score = 30.7 bits (69), Expect = 1.6
 Identities = 26/117 (22%), Positives = 34/117 (29%), Gaps = 3/117 (2%)

Query: 11   TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGP-RPGGPGVPPNQQPPYTGMRPS 69
            +     +   P+ PP+R   +    +    F   P  P RP  P  PP  QP      P 
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927

Query: 70   GPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPES 126
             P  P     RP                P G  P      L    +    R  VP+ 
Sbjct: 2928 QPQPPPPPPPRP-QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV-PRFRVPQP 2982



 Score = 29.9 bits (67), Expect = 3.0
 Identities = 25/95 (26%), Positives = 31/95 (32%), Gaps = 15/95 (15%)

Query: 11   TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGP--GVPPNQQPPYTGMRP 68
            T+  P   PPP GPP             P   P      PGG     PP++ P      P
Sbjct: 2832 TSAQPTAPPPPPGPP-------------PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878

Query: 69   SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
            + P    +A    S +     L  D    PP P  
Sbjct: 2879 ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA 2913



 Score = 29.5 bits (66), Expect = 3.9
 Identities = 18/69 (26%), Positives = 23/69 (33%), Gaps = 3/69 (4%)

Query: 19   PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
             PP+           A  P     P P   +P  P  PP   PP   +   G V P    
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGP-LPPPTSAQPTAPPPPPGPPPPSLPL--GGSVAPGGDV 2862

Query: 79   KRPSDARPP 87
            +R   +R P
Sbjct: 2863 RRRPPSRSP 2871


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 37.4 bits (86), Expect = 0.009
 Identities = 29/91 (31%), Positives = 34/91 (37%), Gaps = 11/91 (12%)

Query: 19  PPPSGPPMRYG-NQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQ----PPYTGM--RPSGP 71
           P     P R+G      M  RPGF   PP      PG+PPNQ      P  GM  RP   
Sbjct: 165 PQQRINPQRFGFPMQPNMGMRPGFNQMPPH----MPGMPPNQMRPGFNPMPGMPPRPGFN 220

Query: 72  VNPNIANKRPSDARPPNNLKNDYQHGPPGPG 102
            NPN+          P     ++   P GP 
Sbjct: 221 QNPNMMPNMNRPGFRPQPGGFNHPGTPMGPN 251



 Score = 33.9 bits (77), Expect = 0.11
 Identities = 27/85 (31%), Positives = 28/85 (32%), Gaps = 9/85 (10%)

Query: 12  NQAPLRYPPPSGPPMRYG-NQNMAMQPRPGFTPTP--------PGPRPGGPGVPPNQQPP 62
           NQ P   P      MR G N    M PRPGF   P        PG RP   G      P 
Sbjct: 189 NQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPM 248

Query: 63  YTGMRPSGPVNPNIANKRPSDARPP 87
              M+     NPN     P     P
Sbjct: 249 GPNMQQRPGFNPNQGMNPPPHMAGP 273



 Score = 31.2 bits (70), Expect = 0.91
 Identities = 23/73 (31%), Positives = 25/73 (34%), Gaps = 11/73 (15%)

Query: 1   MSQRFQGSQNTNQAPLRY-----PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGV 55
           M  R   +QN N  P        P P G           MQ RPGF P   G  P     
Sbjct: 213 MPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQ-GMNP----- 266

Query: 56  PPNQQPPYTGMRP 68
           PP+   P  G  P
Sbjct: 267 PPHMAGPRAGFPP 279



 Score = 29.3 bits (65), Expect = 3.8
 Identities = 16/65 (24%), Positives = 22/65 (33%), Gaps = 1/65 (1%)

Query: 25  PMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDA 84
           P    N     QP        P P+   P   PN Q      R   P+ PN+  +   + 
Sbjct: 132 PTEEVNTQEPTQPAGVNVANNPQPQVQ-PQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQ 190

Query: 85  RPPNN 89
            PP+ 
Sbjct: 191 MPPHM 195


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 36.7 bits (85), Expect = 0.018
 Identities = 29/140 (20%), Positives = 38/140 (27%), Gaps = 12/140 (8%)

Query: 3   QRFQGSQNTNQAPLRYPPP--------SGPPMRYGNQNMAMQPRP-GFTPTPPGPRPGGP 53
              Q      Q  +R  P           PP            +P G+      P P GP
Sbjct: 369 AHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428

Query: 54  GVP--PNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP-IKKKKKL 110
           G P  PN   P   +R       N A K P                   P P     +  
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488

Query: 111 ADKILPQKVRDLVPESQAYM 130
            +K L Q +    P+ Q  +
Sbjct: 489 QNKKLAQVLASATPQMQKQV 508


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 35.6 bits (82), Expect = 0.032
 Identities = 20/78 (25%), Positives = 24/78 (30%), Gaps = 8/78 (10%)

Query: 10  NTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
           N + AP   PP +           A  P     P PP   P      PN  PP     P+
Sbjct: 38  NADPAPPPPPPSTAAA--------APAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPN 89

Query: 70  GPVNPNIANKRPSDARPP 87
            P  P +    P    P 
Sbjct: 90  APPPPPVDPNAPPPPAPE 107


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 35.9 bits (83), Expect = 0.043
 Identities = 34/160 (21%), Positives = 49/160 (30%), Gaps = 12/160 (7%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPY----TGMRP 68
           + P  YP P             +         PP P       PP QQP +      M P
Sbjct: 198 EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP 257

Query: 69  SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQA 128
             P  P    + P     P           P PG  + +          ++  LV + Q 
Sbjct: 258 PPPQPPQQQQQPPQPQAQPPPQ----NQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQG 313

Query: 129 YMDLLAFERKLDSTIMRKRLDI-QEALKRPMKQKRKLRIF 167
                 F  +L     ++R  + QE  KR    KR+ +I 
Sbjct: 314 QQRGPQFREQLVQLSQQQREALSQEEAKR---AKRRHKIV 350



 Score = 31.3 bits (71), Expect = 0.92
 Identities = 15/69 (21%), Positives = 20/69 (28%), Gaps = 2/69 (2%)

Query: 6   QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
           Q +    Q P +  P   PP +          +P   P PP   P           P   
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPS-- 224

Query: 66  MRPSGPVNP 74
             P+ P  P
Sbjct: 225 QAPAQPPLP 233


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
          (LMP2) protein.  This family consists of several
          Gammaherpesvirus latent membrane protein (LMP2)
          proteins. Epstein-Barr virus is a human
          Gammaherpesvirus that infects and establishes latency
          in B lymphocytes in vivo. The latent membrane protein 2
          (LMP2) gene is expressed in latently infected B cells
          and encodes two protein isoforms, LMP2A and LMP2B, that
          are identical except for an additional N-terminal 119
          aa cytoplasmic domain which is present in the LMP2A
          isoform. LMP2A is thought to play a key role in either
          the establishment or the maintenance of latency and/or
          the reactivation of productive infection from the
          latent state. The significance of LMP2B and its role in
          pathogenesis remain unclear.
          Length = 489

 Score = 35.6 bits (82), Expect = 0.044
 Identities = 15/56 (26%), Positives = 17/56 (30%), Gaps = 2/56 (3%)

Query: 33 MAMQP-RPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPP 87
          + MQP   G  P   G   G  G      P   G     P  P   +      RPP
Sbjct: 4  LEMQPLGAG-GPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPP 58


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 35.0 bits (80), Expect = 0.060
 Identities = 16/50 (32%), Positives = 19/50 (38%)

Query: 15  PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYT 64
           P + P    P  +   QN A QP+      P  P        P QQPP T
Sbjct: 203 PTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQT 252



 Score = 28.1 bits (62), Expect = 8.2
 Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 5/88 (5%)

Query: 15  PLRYPPPSGPPMRYGNQN-----MAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
           P R  P  G P  + + +      A +P    T   P  +P         Q         
Sbjct: 176 PPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQ 235

Query: 70  GPVNPNIANKRPSDARPPNNLKNDYQHG 97
               P   N      +     K   + G
Sbjct: 236 PAQQPTPQNPAQQPPQTEQGHKRSREQG 263


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 34.1 bits (78), Expect = 0.083
 Identities = 21/70 (30%), Positives = 25/70 (35%), Gaps = 4/70 (5%)

Query: 3  QRFQGSQNTNQAPLRYPPPSGPPMR-YGNQNMAMQPRPGFTPTPPGPRPGGP---GVPPN 58
          Q +  + N       YP P  P  + Y NQ M      G  P   G  PGGP   G   +
Sbjct: 5  QNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGS 64

Query: 59 QQPPYTGMRP 68
            PP  G   
Sbjct: 65 GGPPGGGEVA 74


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 34.7 bits (79), Expect = 0.093
 Identities = 20/82 (24%), Positives = 29/82 (35%), Gaps = 3/82 (3%)

Query: 6   QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
           Q +Q    AP    P +G   +   Q     P+  +       +P     P  QQP    
Sbjct: 427 QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE-PLYQQPQPVE 485

Query: 66  MRPSGPVNPNIANKRPSDARPP 87
            +P     P +   +P  ARPP
Sbjct: 486 QQPVVEPEPVVEETKP--ARPP 505



 Score = 29.7 bits (66), Expect = 3.9
 Identities = 21/113 (18%), Positives = 30/113 (26%), Gaps = 1/113 (0%)

Query: 15  PLRYPPPSG-PPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
           P+  P P G P      Q       P   P  P      P      Q PY    P  P  
Sbjct: 371 PVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQ 430

Query: 74  PNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPES 126
                  P      N  + + Q     P    + ++   +   Q+     P+ 
Sbjct: 431 QPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQP 483



 Score = 28.5 bits (63), Expect = 7.2
 Identities = 25/98 (25%), Positives = 36/98 (36%), Gaps = 16/98 (16%)

Query: 3   QRFQGSQNTNQAPLRYPPPS---GPPMRYGNQNMAMQPRPGFT----PTPPGPRPGGPGV 55
           Q++Q  Q       +Y  P     P  +Y      + P+P +     P  P P+   P  
Sbjct: 763 QQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ 822

Query: 56  PPNQQPPYTGMRPSGPVNPNIANK-------RPSDARP 86
           P   QP Y   +P  PV P   +        R  D+RP
Sbjct: 823 PVAPQPQY--QQPQQPVAPQPQDTLLHPLLMRNGDSRP 858


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 34.8 bits (80), Expect = 0.11
 Identities = 18/85 (21%), Positives = 23/85 (27%), Gaps = 4/85 (4%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
           PPP        N++ +       T  P  P   G   PP    P     P  P   +   
Sbjct: 71  PPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDP--PPPTPPPASPPP 128

Query: 79  KRPSDARPPNNLKNDYQHGPPGPGP 103
               D      L+     GPP    
Sbjct: 129 SPAPDLSEM--LRPVGSPGPPPAAS 151



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 16/86 (18%), Positives = 20/86 (23%), Gaps = 5/86 (5%)

Query: 2   SQRFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
                   + +  P R P  +  P                 P   G R  G       +P
Sbjct: 822 RSHTPDGGSESSGPARPPGAAARPPP-----ARSSESSKSKPAAAGGRARGKNGRRRPRP 876

Query: 62  PYTGMRPSGPVNPNIANKRPSDARPP 87
           P    RP     P  A   P    P 
Sbjct: 877 PEPRARPGAAAPPKAAAAAPPAGAPA 902



 Score = 29.8 bits (67), Expect = 3.5
 Identities = 19/93 (20%), Positives = 21/93 (22%)

Query: 11  TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSG 70
           +  A  R      P     +   A  P     P P          P        G     
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRR 872

Query: 71  PVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
              P     RP  A PP         G P P P
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 34.3 bits (78), Expect = 0.12
 Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 7/121 (5%)

Query: 9   QNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRP 68
           Q+   A    P P+GP +R    +++  P   F P  P P P  P VP    P     RP
Sbjct: 489 QDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEP-VP---VPTVALERP 544

Query: 69  SGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQA 128
             P  P IA + P +      ++  ++  P  P P +   +++ +    ++R    E+Q 
Sbjct: 545 VCPAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLARLR---AEAQP 601

Query: 129 Y 129
           Y
Sbjct: 602 Y 602


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 34.2 bits (78), Expect = 0.14
 Identities = 12/53 (22%), Positives = 14/53 (26%), Gaps = 1/53 (1%)

Query: 14  APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGM 66
           A      P GP            P+PG     P  R GG   P   +      
Sbjct: 209 AGQSMYQPPGPYPNAMVGRQPFYPQPG-AVAGPPKRRGGHKAPRGHRASQAAG 260


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 32.0 bits (73), Expect = 0.17
 Identities = 14/72 (19%), Positives = 19/72 (26%)

Query: 16  LRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPN 75
            +   P      Y     A               P   G PP   P   G+ P+ P   +
Sbjct: 34  AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNS 93

Query: 76  IANKRPSDARPP 87
                P +  PP
Sbjct: 94  GYMADPQEPPPP 105


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 33.9 bits (77), Expect = 0.17
 Identities = 23/100 (23%), Positives = 28/100 (28%), Gaps = 11/100 (11%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP---------PGPRPGGPGVPPNQQPPY 63
           +AP    PP+ PP R      A          P         PG        P   +PP 
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 758

Query: 64  TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
                + P  P  A   P+   PP       Q     P P
Sbjct: 759 AAPGRARP--PAAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796



 Score = 30.0 bits (67), Expect = 2.8
 Identities = 21/104 (20%), Positives = 28/104 (26%), Gaps = 9/104 (8%)

Query: 6   QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
             +    + P   P  + PP          +P P   P    P    PG P  Q PP   
Sbjct: 728 AAAPGRARPPAAAPGRARPP---AAAPGRARP-PAAAPGRARPPAAAPGAPTPQPPP--- 780

Query: 66  MRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKK 109
              + P         P+   PP       Q  P      +   K
Sbjct: 781 --QAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTK 822


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.0 bits (73), Expect = 0.42
 Identities = 27/135 (20%), Positives = 39/135 (28%), Gaps = 14/135 (10%)

Query: 4   RFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG--PRPGGPGVPPNQQP 61
                                 +  G+Q     P       PP     P GP  P  Q  
Sbjct: 57  PPVAQLPQPLPQPPPTQAL-QALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTE 115

Query: 62  P---YTGMRPSG-PVNP-NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKILP 116
           P   Y    P     NP N    +P+  R    L+  Y  G P  G +  +++ A K   
Sbjct: 116 PGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY--GAPASGQLPSQQQSAQKNDE 173

Query: 117 QKVRDL----VPESQ 127
            +++       P  Q
Sbjct: 174 SQLQQQPNGETPPQQ 188


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 31.8 bits (72), Expect = 0.43
 Identities = 13/47 (27%), Positives = 15/47 (31%), Gaps = 4/47 (8%)

Query: 12  NQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPN 58
            Q       P+G P          Q RPG    P       P +PPN
Sbjct: 110 TQYQGHAGAPAGHPG----GGGPQQFRPGAGQPPGMQHMPAPALPPN 152


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.5 bits (74), Expect = 0.48
 Identities = 21/85 (24%), Positives = 22/85 (25%), Gaps = 19/85 (22%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
            P   PP            R   T  P  PRPG         P      P  P    +  
Sbjct: 94  EPAPPPPH---------ARR---TSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPT 141

Query: 79  KRPSDARPPNNLKNDYQHGPPGPGP 103
            RP  A P        Q   PG  P
Sbjct: 142 ARP--AYP-----AYQQRPEPGAWP 159



 Score = 29.0 bits (65), Expect = 5.2
 Identities = 18/77 (23%), Positives = 23/77 (29%), Gaps = 3/77 (3%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPV 72
           +   R      P   +         RP   P       GGPG P     P   +RPS P 
Sbjct: 203 EYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPG 262

Query: 73  NPNIANKRPSDARPPNN 89
                  +P+ A  P  
Sbjct: 263 PL---AAQPAPAPGPGE 276


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 32.0 bits (73), Expect = 0.49
 Identities = 20/108 (18%), Positives = 31/108 (28%), Gaps = 6/108 (5%)

Query: 7   GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPP---Y 63
            S   + +P     PS P       + ++ P P  +       P  P  P N  PP    
Sbjct: 184 PSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP--SSFQSDTPPPSPESPTNPSPPPGPA 241

Query: 64  TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLA 111
               P     P ++  +P+                     I K +K A
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT-LDDDAIAKAQKHA 288



 Score = 31.2 bits (71), Expect = 0.86
 Identities = 20/100 (20%), Positives = 26/100 (26%), Gaps = 17/100 (17%)

Query: 19  PPPSGPPMRYGNQNMA---MQPRPGFTPTPPGPRPGGPGVPPNQQP-------------- 61
            P  GPP+   +++           F      P    P  PP+  P              
Sbjct: 147 DPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS 206

Query: 62  PYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGP 101
           P     P  P +       PS   P N         PP P
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPP 246



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 1/83 (1%)

Query: 20  PPSGPPMRYGNQNMAMQPRPGFTPTPPGPR-PGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
           P S  P    + +  +   P     P  P     P  P + Q       P  P NP+   
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP 238

Query: 79  KRPSDARPPNNLKNDYQHGPPGP 101
              +   PP           P P
Sbjct: 239 GPAAPPPPPVQQVPPLSTAKPTP 261


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 32.4 bits (73), Expect = 0.52
 Identities = 26/96 (27%), Positives = 36/96 (37%), Gaps = 14/96 (14%)

Query: 6   QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQP--RPGFTPTPPGPRPGGPGVPPNQQPPY 63
           Q +   + A    P  S P   Y N   +  P   P  + TP          PPN  PPY
Sbjct: 404 QSNAGFSNAGYSNPGNSNPG--YNNAPNSNTPYNNPPNSNTPYSN-------PPNSNPPY 454

Query: 64  TGMRPSGP--VNPNIANKRPSDARP-PNNLKNDYQH 96
           + +  S     N  ++N  PS A+   +     YQH
Sbjct: 455 SNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQH 490


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 32.1 bits (73), Expect = 0.54
 Identities = 21/89 (23%), Positives = 30/89 (33%), Gaps = 17/89 (19%)

Query: 18  YPPPS---GPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNP 74
             P     G P+R    + A++        P  PRPGGP       PP   +  +    P
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRA-------PQAPRPGGPPGGGGGLPPPPDLPAAAGPAP 724

Query: 75  NIANKRPSDARPPNNLKNDYQHGPPGPGP 103
             ++   S   PP       +  PPG   
Sbjct: 725 CGSSLIASPTAPP-------EPEPPGAEQ 746



 Score = 31.7 bits (72), Expect = 0.76
 Identities = 20/88 (22%), Positives = 23/88 (26%), Gaps = 10/88 (11%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
           P       R          R G    P  P P   G+P +   P    R   PV P   +
Sbjct: 637 PVFKTALPRPDYN------RGGEAGGPGVPGPVPVGMPAHTARPSRVAR-GDPVRPTAHH 689

Query: 79  KR---PSDARPPNNLKNDYQHGPPGPGP 103
                P   RP           PP   P
Sbjct: 690 AALRAPQAPRPGGPPGGGGGLPPPPDLP 717



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 15/79 (18%), Positives = 18/79 (22%), Gaps = 16/79 (20%)

Query: 10  NTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPS 69
             + A LR P    P              PG     P P        P    P      +
Sbjct: 686 TAHHAALRAPQAPRPG-----------GPPGGGGGLPPPPDLPAAAGP---APCGSSLIA 731

Query: 70  GPVNPNIANKRPSDARPPN 88
            P  P      P  A   +
Sbjct: 732 SPTAP--PEPEPPGAEQAD 748


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 31.1 bits (70), Expect = 1.2
 Identities = 24/102 (23%), Positives = 29/102 (28%), Gaps = 7/102 (6%)

Query: 2   SQRFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
            Q  QG    N           P M      +         P PP P    PG P +Q P
Sbjct: 391 IQGNQGGLGANPMQ-----QGQPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPP 445

Query: 62  PY--TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGP 101
                GM PS P      + + S +             P GP
Sbjct: 446 QSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGP 487


>gnl|CDD|219914 pfam08577, PI31_Prot_C, PI31 proteasome regulator.  PI31 is a
          cellular regulator of proteasome formation and of
          proteasome-mediated antigen processing.
          Length = 68

 Score = 28.1 bits (63), Expect = 1.6
 Identities = 20/67 (29%), Positives = 22/67 (32%), Gaps = 3/67 (4%)

Query: 7  GSQNTNQAPLRYPPPSGPPMRYGNQNMA--MQPRPGFTPTPPGPRPGGPGVPPNQQPPYT 64
          G ++     L  P P  P       N    M   P   P    PR GG   PP   PP  
Sbjct: 3  GDRDLYPPGLGPPDPFDPLPAPLGGNGQGGMIFDP-NRPGFGPPRGGGGDGPPPGVPPGA 61

Query: 65 GMRPSGP 71
             P GP
Sbjct: 62 RYDPFGP 68


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 30.8 bits (69), Expect = 1.7
 Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 6/89 (6%)

Query: 44  TPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQH---GPPG 100
           TPP   P   G PP   PP      +GP      +  P D  PP+           GPP 
Sbjct: 551 TPPKVSPSDRG-PPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPA 609

Query: 101 PGPIKKKKKLA--DKILPQKVRDLVPESQ 127
            GP +K+   +    + P  VR  + E  
Sbjct: 610 SGPHEKQPPSSAPRDMAPSVVRMFLRERL 638


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 17/87 (19%), Positives = 25/87 (28%), Gaps = 1/87 (1%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIAN 78
           P   G      +Q    +        P      G         P  G    GP +   + 
Sbjct: 191 PDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPPSR 250

Query: 79  KRPS-DARPPNNLKNDYQHGPPGPGPI 104
            RP+ +A P     + Y     GPG +
Sbjct: 251 YRPAYEAAPLAPAISSYGPAGGGPGSV 277


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.3 bits (69), Expect = 1.9
 Identities = 26/110 (23%), Positives = 32/110 (29%), Gaps = 19/110 (17%)

Query: 6   QGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTG 65
           Q    +N A    PP   PP           P P  T T P P P  P  PP      + 
Sbjct: 513 QSGSASNTAKTPPPPQKSPP-----PPAPTPPLPQPTATAPPPTP--PPPPPTATQASSN 565

Query: 66  MRPSGPVNPNIANKRPS--DARPPNNLKNDYQHGPPGPGPIKKK-KKLAD 112
                P + +     P      P  +   +          I K  K LAD
Sbjct: 566 APAQIPADSSPPPPIPEEPTPSPTKDSSPE---------EIDKAAKNLAD 606


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 12/56 (21%), Positives = 17/56 (30%), Gaps = 5/56 (8%)

Query: 1   MSQRFQGSQNTNQAPLRYPPPSGPPMR--YGNQNMAMQPRPGFTPTPPGPRPGGPG 54
           +S  F G             P  PP R  +G+       + G    P   +PG   
Sbjct: 83  LSGMFGGGAPRPPPAAPAVQPPAPPARPGWGSGG---PSQQGAGQQPGYAQPGPGS 135


>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
           This family consists of several Neisseria meningitidis
           TspB virulence factor proteins.
          Length = 502

 Score = 30.3 bits (68), Expect = 2.0
 Identities = 21/79 (26%), Positives = 23/79 (29%), Gaps = 11/79 (13%)

Query: 12  NQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTP---PGPRPGGPGVPPNQQPPYTGMRP 68
              PL    P+  P    N       RP   P P   P   P   G P        G RP
Sbjct: 328 EAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQP--------GTRP 379

Query: 69  SGPVNPNIANKRPSDARPP 87
             P  P+  N R    R  
Sbjct: 380 DSPAVPDRPNGRHRKERKE 398


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 30.2 bits (68), Expect = 2.2
 Identities = 20/71 (28%), Positives = 25/71 (35%), Gaps = 6/71 (8%)

Query: 19  PPPSGPPMRYGNQNMAMQP--RPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNI 76
           P PS  P      N+  +   R   TP P  PRP  P VP   +      R + PV+   
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD--- 439

Query: 77  ANKRPSDARPP 87
             K       P
Sbjct: 440 -EKPKYTPPAP 449


>gnl|CDD|223631 COG0557, VacB, Exoribonuclease R [Transcription].
          Length = 706

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 8/98 (8%)

Query: 113 KILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKRLDIQEALKRPMKQKRKLRIFISNTF 172
           ++LP   R    E  A + +L  ER   + +   R         P  ++    I I    
Sbjct: 114 ELLPSDKRGRFKE-AAVVRIL--ERANSALVGELRPSQGIGRVLPDDKRLPFLIAIPPEQ 170

Query: 173 YPAKESGEGEEGSVASWELR---VEGRLLED--SKNDP 205
            P  E G      +  W        G ++E      DP
Sbjct: 171 APGAEEGHLVVVEITRWPDTSRPPFGEVVEVLGDLGDP 208


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 30.0 bits (67), Expect = 2.4
 Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 11/133 (8%)

Query: 8   SQNTNQAPLRYPPPSGPPMRYGN-QNMAMQPRPGFTPTPPGPR-PGGPGVPPNQQPPYTG 65
           SQ       ++PP +    +           R    P     R P     PP        
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET 298

Query: 66  MRPS-GPVNPNIANKRPSDARPP--------NNLKNDYQHGPPGPGPIKKKKKLADKILP 116
            RP+  P     +   P  + PP         NL +  +  PP P  I     + ++ LP
Sbjct: 299 GRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSICYGVGIYNEALP 358

Query: 117 QKVRDLVPESQAY 129
           +    +VP    Y
Sbjct: 359 RGCDIVVPLCSTY 371


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 29.4 bits (66), Expect = 3.4
 Identities = 15/79 (18%), Positives = 24/79 (30%), Gaps = 9/79 (11%)

Query: 4   RFQGSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPY 63
           R   +Q ++   L Y  P GP  R    ++       +                    PY
Sbjct: 134 RESSAQRSSAQHLFYIIPGGPSWR---TSIKPFSSSHYGSHN----SSAYSDHLRPLQPY 186

Query: 64  TGM--RPSGPVNPNIANKR 80
           +G    P GP      +K+
Sbjct: 187 SGELPPPLGPNWQASNSKK 205


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 29.6 bits (66), Expect = 3.5
 Identities = 30/134 (22%), Positives = 43/134 (32%), Gaps = 16/134 (11%)

Query: 37  PRPGFTPTPPGP----RPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKN 92
                  T  GP    RP  P   P   P  + M    PV     +        P +   
Sbjct: 367 AVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGT 426

Query: 93  DYQHGPPGPGPIKKKKKLADKILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKR-LDIQ 151
            Y   P GP P +          P     + P S A M      ++      RKR  +++
Sbjct: 427 SYGPEPVGPVPPQ----------PTNPYVM-PISMANMVYPGHPQEHGHERKRKRGGELK 475

Query: 152 EALKRPMKQKRKLR 165
           E L   +K  +KL+
Sbjct: 476 EELIETLKLVKKLK 489


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 29.4 bits (66), Expect = 3.6
 Identities = 10/33 (30%), Positives = 11/33 (33%)

Query: 39  PGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
           PG       P   GP  PP   PP    +P   
Sbjct: 906 PGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAA 938



 Score = 29.1 bits (65), Expect = 4.9
 Identities = 11/35 (31%), Positives = 13/35 (37%)

Query: 37  PRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
              G   + P P  G P  P    PP T   P+ P
Sbjct: 905 APGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAP 939


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 17/63 (26%), Positives = 18/63 (28%), Gaps = 11/63 (17%)

Query: 42  TPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPN-IANKRPSDARPPNNLKNDYQHGPPG 100
            P PP    G P       PP     P  P  P   A  RP+ A  P          P  
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAP----------PAA 430

Query: 101 PGP 103
             P
Sbjct: 431 AAP 433


>gnl|CDD|205967 pfam13794, MiaE_2, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like. 
          Length = 185

 Score = 28.7 bits (64), Expect = 4.1
 Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 5/67 (7%)

Query: 184 GSVASWELRVEGRLLEDSKNDPNKSGEGEEGSVASWELR----VEGRLLEDSKNDPNKVK 239
           G++A  EL    RL ED++  P          +A  E R    +  R L     D     
Sbjct: 12  GALAYGELSAFERLAEDARYAPTLHDRAVLARMAVAEFRHYELLSDR-LAARGIDAEDAM 70

Query: 240 RKFSSFF 246
             F +  
Sbjct: 71  LPFVAAV 77


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 29.2 bits (66), Expect = 4.8
 Identities = 20/92 (21%), Positives = 22/92 (23%), Gaps = 1/92 (1%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAM-QPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGP 71
            AP   PP   P         A  QP P    TPP  +   P   P Q            
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAA 735

Query: 72  VNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
            +P      P D   P          P     
Sbjct: 736 DDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPA 767



 Score = 28.8 bits (65), Expect = 6.2
 Identities = 14/90 (15%), Positives = 15/90 (16%), Gaps = 8/90 (8%)

Query: 14  APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVN 73
                P  + P         A        P             P   P      P  P  
Sbjct: 389 GGAGAPAAAAPSAAAAAPA-AAPAPAAAAPAAAAAPAPAAAPQPAPAPA-PAPAPPSPAG 446

Query: 74  PNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
              A   PS                P P P
Sbjct: 447 NAPAGGAPSPPPAAA------PSAQPAPAP 470


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 28.9 bits (64), Expect = 4.9
 Identities = 19/77 (24%), Positives = 26/77 (33%), Gaps = 7/77 (9%)

Query: 11  TNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSG 70
            +QA  R P P+  P    ++    +P     PTP       P   P+Q        P  
Sbjct: 46  PHQAASRAPDPAVAPTSAASR----KPDLAQAPTPAASEKFDPAPAPHQA---ASRAPDP 98

Query: 71  PVNPNIANKRPSDARPP 87
            V P +A     DA   
Sbjct: 99  AVAPQLAAAPKPDAAEA 115


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 28.3 bits (63), Expect = 5.0
 Identities = 25/94 (26%), Positives = 31/94 (32%), Gaps = 13/94 (13%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGG----PGVPPNQQPPYTGMRPSGPVNP 74
             P   P  +G Q   M P     P      PGG    P +PP   P      PS  V  
Sbjct: 53  SSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPG 112

Query: 75  NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKK 108
           +      +DA+  +N            G IKK K
Sbjct: 113 STPLSMETDAKSSDNKDK---------GSIKKSK 137


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 29.0 bits (65), Expect = 5.2
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query: 98  PPGPGPIKKKKKLADKILPQKVRDLVPESQAYMDLLAFERKLDSTIMRKRLDIQEALKRP 157
           PP P  IK+KKK        K+  + P ++  + LL  ER  D  ++ +     +  ++P
Sbjct: 26  PPPPHEIKRKKKRKGPDAASKLPKVTPNTKCRLRLLKLERIKDYLLLEEEFITNQEAQKP 85

Query: 158 MKQK 161
            ++K
Sbjct: 86  AQEK 89


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 28.8 bits (65), Expect = 5.4
 Identities = 21/83 (25%), Positives = 25/83 (30%), Gaps = 7/83 (8%)

Query: 17  RYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNI 76
           +Y PP         +    QP     P    PRP  P   P QQP Y       P  P  
Sbjct: 91  QYQPPYASAQ---PRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQ----PQPEQPLQ 143

Query: 77  ANKRPSDARPPNNLKNDYQHGPP 99
               P  A  P  + +  Q    
Sbjct: 144 QPVSPQVAPAPQPVHSAPQPAQQ 166


>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein). 
          Length = 560

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 12/69 (17%), Positives = 17/69 (24%), Gaps = 4/69 (5%)

Query: 7   GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG---PRPGGPGVPPNQ-QPP 62
           G Q +   P   P          +  +  Q       TPP       G   V P + +  
Sbjct: 46  GGQVSWSPPCTPPQSVCSGCPDLSAYLTDQSLWPPCVTPPKRTPHANGSFYVCPGECRTR 105

Query: 63  YTGMRPSGP 71
                  G 
Sbjct: 106 ANRRHCGGY 114


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 28.7 bits (64), Expect = 6.2
 Identities = 21/106 (19%), Positives = 28/106 (26%), Gaps = 7/106 (6%)

Query: 4   RFQGSQNTNQAPLRYPPPSG-----PPMRYGNQNMAMQPRPGFTPT-PPGPRPGGPGVPP 57
           R Q    +     R     G     PP +  N++     R    P   P  + G    P 
Sbjct: 453 RGQRGPRSQPQNQRRQQNRGRQSSQPPRQQQNRSNQNNQRQSQGPNQGPRGQGGYNLRPR 512

Query: 58  NQQPPYTGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
             QP   G       NPN   ++    R           G     P
Sbjct: 513 TYQPQRYGGGQGRRWNPNPY-RQSGQGRSQQQQPQPEARGNQSRTP 557


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
          modification].
          Length = 2365

 Score = 28.8 bits (64), Expect = 6.3
 Identities = 15/30 (50%), Positives = 15/30 (50%), Gaps = 5/30 (16%)

Query: 33 MAMQPRPGFTPTPPGPRPGGPGVPPNQQPP 62
          MA  P PG  P PP P    PG  P  QPP
Sbjct: 1  MASLP-PGNPPPPPPP----PGFEPPSQPP 25


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 28.0 bits (62), Expect = 6.5
 Identities = 13/37 (35%), Positives = 13/37 (35%), Gaps = 8/37 (21%)

Query: 21  PSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPP 57
             G P  Y          P     PP P PG PG PP
Sbjct: 151 APGVPPPYNP--------PMLMMKPPPPSPGPPGAPP 179


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 28.7 bits (64), Expect = 6.7
 Identities = 11/54 (20%), Positives = 13/54 (24%)

Query: 34 AMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARPP 87
          A QP P        P P  P               + P        +P   R P
Sbjct: 14 AEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKP 67


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 28.2 bits (63), Expect = 6.9
 Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 2/49 (4%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPP--GPRPGGPGVPPNQ 59
           Q P   P    P +            P   P PP  G   GGP  PPNQ
Sbjct: 134 QPPKPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQ 182


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 28.6 bits (64), Expect = 7.1
 Identities = 24/133 (18%), Positives = 34/133 (25%), Gaps = 17/133 (12%)

Query: 15  PLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNP 74
           P+   P + P             +      P  P+       P   PP   + P   V  
Sbjct: 381 PVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQ---PAGTPPTVSVDPPAAVPV 437

Query: 75  NIANKRPSDARPPNNLKNDYQHGPPGPGPIKKKKKLADKIL-PQK-----VRDLVPESQA 128
           N  +  P   RP        Q       P+ K   L    L P +         + E+  
Sbjct: 438 NPPSTAPQAVRPA-------QFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPT 490

Query: 129 YMDLLAF-ERKLD 140
                 F E  L 
Sbjct: 491 GTQKEIFTEEDLQ 503


>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
           [Chromatin structure and dynamics].
          Length = 1163

 Score = 28.7 bits (64), Expect = 7.4
 Identities = 19/95 (20%), Positives = 27/95 (28%), Gaps = 12/95 (12%)

Query: 8   SQNTNQAPLRYPPPSGPPMRYGNQN---MAMQPRPGFTPTPPGPRPGGPGV-PPNQQPPY 63
               N + L    P GP      Q+   +A       T  P  P    P    P+  P  
Sbjct: 201 LPQPNGSRLHVTTPQGPLSSPPVQSSYYVAPCNHDQRTSHPTLPSDSQPEPSAPSHMPSD 260

Query: 64  TGMRPSGPVNPNIANKRPSDARPPNNLKNDYQHGP 98
              +     N  I           N +K  +Q+ P
Sbjct: 261 ARGKHQVDFNQAIIFV--------NKVKVRFQNNP 287


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 28.5 bits (63), Expect = 7.7
 Identities = 26/97 (26%), Positives = 33/97 (34%), Gaps = 13/97 (13%)

Query: 13  QAPLRYPPPSGPPMRYGNQNMAMQPRP------GFTPTPPGPRPGGPGVPPNQQPPYTGM 66
                  P S PP  +G     + P P        + TPP      P  PP +QP     
Sbjct: 282 GPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQP----- 336

Query: 67  RPSGPVNPNIANKRPSDARPPNNLKNDYQHGPPGPGP 103
            P  P  P+I  K P     P      ++H P   GP
Sbjct: 337 LPPAPSMPHI--KPPPTTPIPQLPNQSHKHPPHLQGP 371


>gnl|CDD|173839 cd01150, AXO, Peroxisomal acyl-CoA oxidase.  Peroxisomal acyl-CoA
           oxidases (AXO) catalyze the first set in the peroxisomal
           fatty acid beta-oxidation, the alpha,beta
           dehydrogenation of the corresponding trans-enoyl-CoA by
           FAD, which becomes reduced. In a second oxidative
           half-reaction, the reduced FAD is reoxidized by
           molecular oxygen. AXO is generally a homodimer, but it
           has been reported to form a different type of oligomer
           in yeast. There are several subtypes of AXO's, based on
           substrate specificity. Palmitoyl-CoA oxidase acts on
           straight-chain fatty acids and prostanoids; whereas, the
           closely related Trihydroxycoprostanoly-CoA oxidase has
           the greatest activity for  2-methyl branched side chains
           of bile precursors. Pristanoyl-CoA oxidase, acts on
           2-methyl branched fatty acids.  AXO has an additional
           domain, C-terminal to the region with similarity to
           acyl-CoA dehydrogenases, which is included in this
           alignment.
          Length = 610

 Score = 28.4 bits (64), Expect = 8.0
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 296 LLDYQPLQFKLDPRLARLLGVHTQTR 321
           +LDYQ  Q++L P+LA     H   +
Sbjct: 328 ILDYQLQQYRLFPQLAAAYAFHFAAK 353


>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 121

 Score = 27.1 bits (60), Expect = 8.3
 Identities = 12/48 (25%), Positives = 14/48 (29%), Gaps = 1/48 (2%)

Query: 7   GSQNTNQAPLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPG-PRPGGP 53
           G  +           S      G    A  P  G TP+  G   PGG 
Sbjct: 67  GGASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSAYGPATPGGG 114


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 28.0 bits (62), Expect = 8.6
 Identities = 13/70 (18%), Positives = 15/70 (21%), Gaps = 2/70 (2%)

Query: 19  PPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRP--SGPVNPNI 76
           P    P  R              T  P    P         Q   T        P  P  
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264

Query: 77  ANKRPSDARP 86
               P++A P
Sbjct: 265 GEAPPANATP 274


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 28.3 bits (63), Expect = 8.7
 Identities = 12/48 (25%), Positives = 16/48 (33%), Gaps = 4/48 (8%)

Query: 14  APLRYPPPSGPPMRYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQP 61
           AP      +G P+     +   +PR    P P       P V P   P
Sbjct: 380 APTAAATAAGAPL----PDFDPRPRGPPAPEPARSAEAPPLVAPAAAP 423


>gnl|CDD|226683 COG4230, COG4230, Delta 1-pyrroline-5-carboxylate dehydrogenase
           [Energy production and conversion].
          Length = 769

 Score = 28.1 bits (63), Expect = 9.3
 Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 9/56 (16%)

Query: 221 LRVEGRLLEDSKNDPNKVKRKFSSFFKSLVIELD------KDLYGPDNHLVEWHRT 270
           +R +GRL+  +    +  K    +F    +IEL+      ++++GP  H+V + R 
Sbjct: 440 MRSKGRLVHQAAAPNSLQK---GTFVAPTLIELENLDELQREVFGPVLHVVRYKRD 492


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 28.1 bits (62), Expect = 9.6
 Identities = 12/63 (19%), Positives = 18/63 (28%)

Query: 27  RYGNQNMAMQPRPGFTPTPPGPRPGGPGVPPNQQPPYTGMRPSGPVNPNIANKRPSDARP 86
             G+ N A           P      P   P  + P+    P  P         P+ A P
Sbjct: 199 IIGDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAP 258

Query: 87  PNN 89
            ++
Sbjct: 259 VSS 261


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.138    0.421 

Gapped
Lambda     K      H
   0.267   0.0630    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,943,795
Number of extensions: 2075022
Number of successful extensions: 2892
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2454
Number of HSP's successfully gapped: 229
Length of query: 392
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 293
Effective length of database: 6,546,556
Effective search space: 1918140908
Effective search space used: 1918140908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.1 bits)