RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4747
         (414 letters)



>gnl|CDD|227640 COG5333, CCL1, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH/TFIIK, cyclin H subunit [Cell division and
           chromosome partitioning / Transcription / DNA
           replication, recombination, and repair].
          Length = 297

 Score =  102 bits (255), Expect = 1e-24
 Identities = 59/185 (31%), Positives = 96/185 (51%), Gaps = 13/185 (7%)

Query: 55  SFREGIDYETEQRYRREGARFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVT 114
                +  E E        + I+D  T+++L    +AT ++FF RFY+ +S +    Y  
Sbjct: 31  VLEPELTLEKELNLVIYYLKLIMDLCTRLNLPQTVLATAILFFSRFYLKNSVEEISLYSV 90

Query: 115 ASCCLFLAGKVEETPKKCKDVIKTARMLVDD---KKFKTFGDDPREEVMTLERILLQTIK 171
            + C++LA KVE+TP+     I        D   ++ K+     RE ++  E  LL+ + 
Sbjct: 91  VTTCVYLACKVEDTPRD----ISIESFEARDLWSEEPKSS----RERILEYEFELLEALD 142

Query: 172 FDLQVDHPYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMY 231
           FDL V HPY +L  + K L+       K+LQ+AW  +ND+L T LCL + P IIA+A + 
Sbjct: 143 FDLHVHHPYKYLEGFLKDLQEKDKY--KLLQIAWKIINDALRTDLCLLYPPHIIALAALL 200

Query: 232 LAGKL 236
           +A ++
Sbjct: 201 IACEV 205


>gnl|CDD|238003 cd00043, CYCLIN, Cyclin box fold. Protein binding domain
           functioning in cell-cycle and transcription control.
           Present in cyclins, TFIIB and Retinoblastoma (RB).The
           cyclins consist of 8 classes of cell cycle regulators
           that regulate cyclin dependent kinases (CDKs). TFIIB is
           a transcription factor that binds the TATA box. Cyclins,
           TFIIB and RB contain 2 copies of the domain.
          Length = 88

 Score = 63.8 bits (156), Expect = 6e-13
 Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 12/99 (12%)

Query: 69  RREGARFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVTASCCLFLAGKVEET 128
           R     F+      + L   T+   V    RF + +S       + A+  L+LA KVEE 
Sbjct: 2   RPTPLDFLRRVAKALGLSPETLTLAVNLLDRFLLDYSVLGRSPSLVAAAALYLAAKVEEI 61

Query: 129 PKKCKDVIKTARMLVDDKKFKTFGDDPREEVMTLERILL 167
           P   KD++               G    EE++ +E++LL
Sbjct: 62  PPWLKDLVHVT------------GYATEEEILRMEKLLL 88



 Score = 47.2 bits (113), Expect = 4e-07
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 4/59 (6%)

Query: 178 HPYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLAGKL 236
            P  FL + AK+L        + L +A   ++  L     L   P ++A A +YLA K+
Sbjct: 4   TPLDFLRRVAKALGLSP----ETLTLAVNLLDRFLLDYSVLGRSPSLVAAAALYLAAKV 58


>gnl|CDD|214641 smart00385, CYCLIN, domain present in cyclins, TFIIB and
           Retinoblastoma.  A helical domain present in cyclins and
           TFIIB (twice) and Retinoblastoma (once). A protein
           recognition domain functioning in cell-cycle and
           transcription control.
          Length = 83

 Score = 57.6 bits (140), Expect = 8e-11
 Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 12/95 (12%)

Query: 74  RFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVTASCCLFLAGKVEETPKKCK 133
            F+      ++L   T+   V    RF   + F  +   + A+  L+LA K EETP   K
Sbjct: 1   DFLRRVCKALNLDPETLNLAVNLLDRFLSDYKFLKYSPSLIAAAALYLASKTEETPPWTK 60

Query: 134 DVIKTARMLVDDKKFKTFGDDPREEVMTLERILLQ 168
           ++                G    EE++ +ER+LL+
Sbjct: 61  EL------------VHYTGYFTEEEILRMERLLLE 83



 Score = 41.8 bits (99), Expect = 3e-05
 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 4/55 (7%)

Query: 182 FLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLAGKL 236
           FL +  K+L  D     + L +A   ++  L     L++ P +IA A +YLA K 
Sbjct: 2   FLRRVCKALNLDP----ETLNLAVNLLDRFLSDYKFLKYSPSLIAAAALYLASKT 52


>gnl|CDD|215740 pfam00134, Cyclin_N, Cyclin, N-terminal domain.  Cyclins regulate
           cyclin dependent kinases (CDKs). Human cyclin-O is a
           Uracil-DNA glycosylase that is related to other cyclins.
           Cyclins contain two domains of similar all-alpha fold,
           of which this family corresponds with the N-terminal
           domain.
          Length = 127

 Score = 56.4 bits (137), Expect = 5e-10
 Identities = 25/135 (18%), Positives = 51/135 (37%), Gaps = 13/135 (9%)

Query: 42  YWYFDKKDIRNSP-SFREGIDYETEQRYRREGARFIIDCGTKMDLGYNTMATGVVFFHRF 100
           Y Y  + +  + P         +   + R     ++++   +  L   T+   V +  RF
Sbjct: 3   YAYLRELEEEDRPPPDYLDQQPDINPKMRAILIDWLVEVHEEFKLLPETLYLAVNYLDRF 62

Query: 101 YMYHSFKTFPRYVTASCCLFLAGKVEET-PKKCKDVIKTARMLVDDKKFKTFGDDPREEV 159
                       +    CL +A K EE  P   +D +      + D  +       +EE+
Sbjct: 63  LSKQPVPRTKLQLVGVTCLLIAAKYEEIYPPSVEDFV-----YITDNAYT------KEEI 111

Query: 160 MTLERILLQTIKFDL 174
           + +E ++L T+ +DL
Sbjct: 112 LRMELLILSTLNWDL 126


>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1.  All proteins in this family for
           which functions are known are cyclins that are
           components of TFIIH, a complex that is involved in
           nucleotide excision repair and transcription initiation.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 305

 Score = 55.6 bits (134), Expect = 1e-08
 Identities = 40/150 (26%), Positives = 70/150 (46%), Gaps = 24/150 (16%)

Query: 91  ATGVVFFHRFYMYHSFKTF-PRYVTASCCLFLAGKVEE----TPKKCKDVIKTARMLVDD 145
            T +++F RFY+ +S   + P+ +  +C +FLA KVEE      +   ++ +T    ++ 
Sbjct: 80  GTAIMYFKRFYLNNSVMEYHPKIIMLTC-VFLACKVEEFNVSIDQFVGNLKETPLKALE- 137

Query: 146 KKFKTFGDDPREEVMTLERILLQTIKFDLQVDHPY----SFLLKYAKSLKGDKTKLEKML 201
                       +V+  E +L+Q + F L V +PY     FL+     L G     E + 
Sbjct: 138 ------------QVLEYELLLIQQLNFHLIVHNPYRPLEGFLIDIKTRLPG-LENPEYLR 184

Query: 202 QMAWTFVNDSLCTTLCLQWEPEIIAVALMY 231
           + A  F+N +L T   L + P  IA+A + 
Sbjct: 185 KHADKFLNRTLLTDAYLLYTPSQIALAAIL 214


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 48.0 bits (114), Expect = 9e-06
 Identities = 35/113 (30%), Positives = 38/113 (33%), Gaps = 5/113 (4%)

Query: 280  QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-PHSAIPTLGAQYPQRVAP 338
               Q   Q   PPQ QP P   P PQ  L P+        P      P LGA  P RVA 
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

Query: 339  P----PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQT 387
            P    P        PA   P    H +  V    S    H    P   S +QT
Sbjct: 2975 PRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQT 3027



 Score = 44.2 bits (104), Expect = 1e-04
 Identities = 29/131 (22%), Positives = 43/131 (32%), Gaps = 9/131 (6%)

Query: 283  QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ-YPQRVAPPPS 341
            + A  + +     P    + PPQ   PP         PP    P       PQ    P +
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947

Query: 342  GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSY 401
                   P+  VP P +      + PG       R P  A S +       S+TP    +
Sbjct: 2948 DPAGAGEPSGAVPQPWL----GALVPGRVAVPRFRVPQPAPSREA----PASSTPPLTGH 2999

Query: 402  PGSPISTTCST 412
              S +S+  S+
Sbjct: 3000 SLSRVSSWASS 3010



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 28/118 (23%), Positives = 37/118 (31%), Gaps = 5/118 (4%)

Query: 279  SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
            ++   +A+Q  SPPQ +P      P    L      P     P  A   L +  P    P
Sbjct: 2667 ARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGP 2725

Query: 339  PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
              +       PA P P     P     P G       RPP  AG        + +  P
Sbjct: 2726 AAARQASPALPAAPAPPAV--PAGPATPGGPARPA--RPPTTAGPPAPAPPAAPAAGP 2779



 Score = 37.2 bits (86), Expect = 0.018
 Identities = 30/120 (25%), Positives = 38/120 (31%), Gaps = 16/120 (13%)

Query: 290  SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-----------PHSAIPTLGAQYP-QRVA 337
             PP T  +P+  PPP    PPS      VAP             +A P   A+ P +R+A
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887

Query: 338  PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPL 397
             P        +   P          A  PP  Q     +PPP                PL
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ----PQPPPPPQPQPPPPPPPRPQPPL 2943



 Score = 36.8 bits (85), Expect = 0.022
 Identities = 30/139 (21%), Positives = 43/139 (30%), Gaps = 17/139 (12%)

Query: 285  ASQQDSPPQTQPEPSTKPPPQSHLPPSKLK-----PDLVAPPHSAIPTLGAQYPQRVAPP 339
            A+ + + P     P+    P     P         P    PP  A P   A  P R    
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785

Query: 340  PSG--------HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR- 390
            P+           P+P+     P   +    A +PP +   G   PP +A  T       
Sbjct: 2786 PAVASLSESRESLPSPWDPADPPAA-VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844

Query: 391  -SCSTTPLCHSY-PGSPIS 407
                + PL  S  PG  + 
Sbjct: 2845 PPPPSLPLGGSVAPGGDVR 2863



 Score = 36.5 bits (84), Expect = 0.035
 Identities = 25/138 (18%), Positives = 30/138 (21%), Gaps = 14/138 (10%)

Query: 291  PPQTQPEPSTKPPPQSHLPPSKLKP------------DLVAPPHSAIPTLGAQYPQRVAP 338
            PP   P P   P       P    P                P   A P       +   P
Sbjct: 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP 2760

Query: 339  PPSGHYPNPYP--AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
            P +   P P P  A     P      AV            P   A         + +  P
Sbjct: 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820

Query: 397  LCHSYPGSPISTTCSTTP 414
                    P  T+   T 
Sbjct: 2821 AASPAGPLPPPTSAQPTA 2838



 Score = 35.7 bits (82), Expect = 0.062
 Identities = 23/124 (18%), Positives = 30/124 (24%), Gaps = 16/124 (12%)

Query: 290  SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP--------PS 341
            SP      P+    P + LPP+      + PP SA PT     P    P         P 
Sbjct: 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859

Query: 342  GHY---PNPYPAHPVPIPNIHP-----VHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
            G     P        P     P         V   ++ +      P      Q       
Sbjct: 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919

Query: 394  TTPL 397
                
Sbjct: 2920 QPQP 2923



 Score = 34.5 bits (79), Expect = 0.14
 Identities = 25/113 (22%), Positives = 35/113 (30%), Gaps = 11/113 (9%)

Query: 282  QQQASQQDSPPQ-TQPEPSTKP--PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
              +A + D+PPQ  +P         P+   PPS L PD  AP            P     
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP--------PSPSPA 2634

Query: 339  PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
                    P    P   P   P    V    +     R   A+   Q+ + R+
Sbjct: 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687



 Score = 33.0 bits (75), Expect = 0.40
 Identities = 23/91 (25%), Positives = 25/91 (27%), Gaps = 7/91 (7%)

Query: 298  PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPN 357
            PS  P  +   P       L  P  S      A  P +   PP    P P    P P P 
Sbjct: 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP 2926

Query: 358  IHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
              P     PP        RP P    T    
Sbjct: 2927 PQPQPPPPPPP-------RPQPPLAPTTDPA 2950



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 23/99 (23%), Positives = 28/99 (28%), Gaps = 11/99 (11%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
           D      P P++ P P     P+       APP  A P   A+      P P    P   
Sbjct: 403 DQTRPAAPVPASVPTPAPTPVPAS------APPPPATPLPSAEPGSDDGPAPP---PERQ 453

Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRP--PPAAGSTQ 385
           P  P   P                   RP  PP A   +
Sbjct: 454 PPAPATEPAPDDPDDATRKALDALRERRPPEPPGADLAE 492



 Score = 28.8 bits (64), Expect = 7.8
 Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 7/92 (7%)

Query: 290  SPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTL--GAQYPQRVAPPPSGHYPN 346
            +P ++ P P   P P +  +     +PD  APP SA P      +   R   PPS   P+
Sbjct: 2563 APDRSVPPPRPAPRPSEPAVTSRARRPD--APPQSARPRAPVDDRGDPRGPAPPSPLPPD 2620

Query: 347  PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
             +   P P P+  P  A  P          P 
Sbjct: 2621 THAPDP-PPPSPSP-AANEPDPHPPPTVPPPE 2650



 Score = 28.4 bits (63), Expect = 9.2
 Identities = 29/150 (19%), Positives = 41/150 (27%), Gaps = 25/150 (16%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQ------SHLPPSKLKPDLVAPPHSAIPTLGAQYPQ 334
           ++        P    P  +  PP        +  P +   P    PP  A          
Sbjct: 272 ARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDED 331

Query: 335 RV------APPPSGHYPNPYPAH-------PVPIPNI-----HPVHAVVPPGSQYYGHHR 376
                    P P  HYP  +P         P  + ++     HP  A +P   +    H 
Sbjct: 332 GAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHA 391

Query: 377 PPP-AAGSTQQTQTRSCSTTPLCHSYPGSP 405
             P A G     QTR  +  P     P   
Sbjct: 392 ATPFARGPGGDDQTRPAAPVPASVPTPAPT 421


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 46.5 bits (110), Expect = 1e-05
 Identities = 32/145 (22%), Positives = 44/145 (30%), Gaps = 9/145 (6%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPP-HSAIPTLGAQYP---- 333
           SQ Q   +QQ    Q QP P      Q   PP + +     PP   + P +   YP    
Sbjct: 126 SQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESP 185

Query: 334 -QRVAPPPSGHYPNPYPAHPVP---IPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
            Q  + PP+   P+     P      P+        P    Y G    P +   + Q   
Sbjct: 186 YQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245

Query: 390 RSCSTTPLCHSYPGSPISTTCSTTP 414
            S       +S P        S   
Sbjct: 246 PSQGQEGYGYSGPPPSKGNHGSVAS 270



 Score = 36.1 bits (83), Expect = 0.029
 Identities = 33/134 (24%), Positives = 40/134 (29%), Gaps = 9/134 (6%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPP--QSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
            Q    QQ   P +   PS  PP   QS       + +   PP S      AQ PQ   P
Sbjct: 82  PQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQP 141

Query: 339 PPSGHYPN-----PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
            P    P        P  P    N  P     P  S  Y    P      +        S
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP--QSYPPNEPLPS 199

Query: 394 TTPLCHSYPGSPIS 407
           +  +   Y G+P S
Sbjct: 200 SMAMQPPYSGAPPS 213



 Score = 33.8 bits (77), Expect = 0.15
 Identities = 30/126 (23%), Positives = 36/126 (28%), Gaps = 24/126 (19%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSH---LPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
           S       Q  +P Q         P Q     LPP        AP   +      Q  Q 
Sbjct: 41  SFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPP--------APAPQSPQPDQQQQSQ- 91

Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVP----------PGSQYYGHHRPPPAAGSTQ 385
              PPS  YP+  P   V      P     P          P +Q     +P P     Q
Sbjct: 92  --APPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQ 149

Query: 386 QTQTRS 391
           Q Q +S
Sbjct: 150 QQQYQS 155


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 44.3 bits (105), Expect = 9e-05
 Identities = 18/108 (16%), Positives = 23/108 (21%), Gaps = 1/108 (0%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
            P   P       P     P+       APP +A P   A              P     
Sbjct: 388 APAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447

Query: 351 HPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLC 398
            P P     P    +P              A +      R  + T   
Sbjct: 448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAAR-LTPTEEG 494



 Score = 34.7 bits (80), Expect = 0.088
 Identities = 16/68 (23%), Positives = 21/68 (30%), Gaps = 3/68 (4%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
            P       +  P       P+ +      P  +A  T     P RVAP P+     P P
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET--VAIPVRVAPEPAVASAAPAP 479

Query: 350 A-HPVPIP 356
           A  P    
Sbjct: 480 AAAPAAAR 487



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 18/119 (15%), Positives = 27/119 (22%), Gaps = 2/119 (1%)

Query: 296 PEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
            E  T   P++  P +       A P  A     A       P  +   P   PA   P 
Sbjct: 376 AEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPA 435

Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGSPISTTCSTTP 414
                  A V            P       +        +        +P +   + T 
Sbjct: 436 AAPAAAPAAVALAPAPP-AQAAPETVAIPVRVAPEPAVASAAPAP-AAAPAAARLTPTE 492



 Score = 28.9 bits (65), Expect = 6.7
 Identities = 22/99 (22%), Positives = 29/99 (29%), Gaps = 9/99 (9%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP---PHSA------IPTLGA 330
            +Q  A+   +        +   PP +  P     P   AP   P +A       P   A
Sbjct: 394 VAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPA 453

Query: 331 QYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGS 369
           Q        P    P P  A   P P   P  A + P  
Sbjct: 454 QAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492



 Score = 28.1 bits (63), Expect = 10.0
 Identities = 19/89 (21%), Positives = 28/89 (31%), Gaps = 6/89 (6%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
            +     A+   + P   P  +      +  PP++  P+ VA P    P          A
Sbjct: 420 AAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASA---A 476

Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVP 366
           P P+     P  A   P       HA V 
Sbjct: 477 PAPAAA---PAAARLTPTEEGDVWHATVQ 502


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 44.0 bits (104), Expect = 1e-04
 Identities = 25/121 (20%), Positives = 28/121 (23%), Gaps = 4/121 (3%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
            Q   L    QQ   Q   P Q        P      P           P   +P     
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226

Query: 332 YPQRVAPP--PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGH--HRPPPAAGSTQQT 387
             Q   PP  P    P   P  P     + P     P   Q       +PPP    T   
Sbjct: 227 PAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHP 286

Query: 388 Q 388
            
Sbjct: 287 G 287



 Score = 42.1 bits (99), Expect = 5e-04
 Identities = 27/112 (24%), Positives = 29/112 (25%), Gaps = 6/112 (5%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           Y Q  Q   +Q  P Q  P PS  P       P +L            P L  Q P    
Sbjct: 203 YPQPPQGHPEQVQPQQFLPAPSQAPAQPPL--PPQLPQQPPPLQQPQFPGLSQQMPPPPP 260

Query: 338 PPPSGHYPNPYP-AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
            PP      P P A P P     P                PP         Q
Sbjct: 261 QPPQQQQQPPQPQAQPPPQNQPTPHP---GLPQGQNAPLPPPQQPQLLPLVQ 309



 Score = 42.1 bits (99), Expect = 5e-04
 Identities = 29/115 (25%), Positives = 36/115 (31%), Gaps = 8/115 (6%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
            Q     SQ  + P   P+   +PPP        L    + PP    P    Q PQ  A 
Sbjct: 217 QQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL-SQQMPPPPPQPPQQQQQPPQPQAQ 275

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHA-------VVPPGSQYYGHHRPPPAAGSTQQ 386
           PP  + P P+P  P       P          V  P  Q  G          +QQ
Sbjct: 276 PPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330



 Score = 41.7 bits (98), Expect = 6e-04
 Identities = 24/116 (20%), Positives = 26/116 (22%), Gaps = 5/116 (4%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY--PQR 335
                Q       P Q  P    + P  S   P    P    P     P        PQ 
Sbjct: 224 SQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP---PPPQPPQQQQQPPQPQAQPPPQN 280

Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
              P  G         P P           P G Q     R      S QQ +  S
Sbjct: 281 QPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALS 336



 Score = 41.3 bits (97), Expect = 8e-04
 Identities = 20/115 (17%), Positives = 22/115 (19%), Gaps = 8/115 (6%)

Query: 278 YSQSQQQASQQDSPPQTQP----EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYP 333
               Q   SQ       Q     E       Q    P   +P     P    P   A   
Sbjct: 134 PQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQ 193

Query: 334 QRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
           Q     P G+   P        P          P         PP         Q
Sbjct: 194 QGPPEQPPGYPQPPQGHPEQVQPQQFLPA----PSQAPAQPPLPPQLPQQPPPLQ 244



 Score = 38.2 bits (89), Expect = 0.007
 Identities = 21/108 (19%), Positives = 23/108 (21%), Gaps = 15/108 (13%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
            Q       SQQ       PPQ Q +P   P PQ+  PP          P          
Sbjct: 243 LQQPQFPGLSQQMPPPPPQPPQQQQQP---PQPQAQPPPQNQPT---PHPGLPQGQNAPL 296

Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
            P    P      P                  V     Q     R   
Sbjct: 297 PP----PQQPQLLPLVQQPQGQQRGPQFREQLV-----QLSQQQREAL 335



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 21/100 (21%), Positives = 28/100 (28%), Gaps = 7/100 (7%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
           + + Q  Q+   PQ         PPQ  LP          P     P     YPQ     
Sbjct: 158 EVEAQLQQRQQAPQLPQ------PPQQVLPQGMPPRQAAFPQQGP-PEQPPGYPQPPQGH 210

Query: 340 PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
           P    P  +   P   P   P+   +P         + P 
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPG 250



 Score = 37.4 bits (87), Expect = 0.012
 Identities = 22/119 (18%), Positives = 32/119 (26%), Gaps = 15/119 (12%)

Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDL-------VAPPHSAIPTLGAQYPQ 334
           Q   S   + P+ +P+P   P  Q   P +  +  L               P L    PQ
Sbjct: 120 QPDPSLWGTAPKPEPQPPQAPESQ-PQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQP-PQ 177

Query: 335 RVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHH----RPPPAAGSTQQTQT 389
           +V P         +P      P   P +   P G             P  A +      
Sbjct: 178 QVLPQGMPPRQAAFPQQGP--PEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 43.5 bits (102), Expect = 2e-04
 Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 18/129 (13%)

Query: 274 VLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYP 333
            L L SQ+Q  +    S    QP+   +PP +  LPP+   P +  PP + IP L     
Sbjct: 306 PLPLPSQAQPHSHTPPSQSALQPQ---QPPREQPLPPAPSMPHIKPPPTTPIPQLP---N 359

Query: 334 QRVAPPPSGHYPNPYPAHPV---PIPNIHPVHAVV---PPGSQYYGHHRPPPAAGSTQQT 387
           Q    PP    P+P+P  P    P P + P+ ++    PP +       PPP     Q  
Sbjct: 360 QSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAH------PPPLQLMPQSQ 413

Query: 388 QTRSCSTTP 396
             +S    P
Sbjct: 414 PLQSVPAQP 422



 Score = 40.1 bits (93), Expect = 0.002
 Identities = 42/172 (24%), Positives = 55/172 (31%), Gaps = 40/172 (23%)

Query: 280 QSQQQASQQDSPPQTQPEPST------KPPPQSHLP----------------------PS 311
              Q A Q   PP+ QP P        KPPP + +P                      PS
Sbjct: 320 PPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPS 379

Query: 312 KLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVP--IPNIHPVHAVVPPGS 369
            L P     P S++PT    +P    PPP    P   P   VP   P +    ++ P  S
Sbjct: 380 NLPPPPALKPLSSLPT---HHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKAS 436

Query: 370 ---QYYGHHRPP--PAAGSTQQTQTRSCSTTPLCH--SYPGSPISTTCSTTP 414
                  H  PP  P A     +        P     S P +P   +  + P
Sbjct: 437 THPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQP 488



 Score = 37.7 bits (87), Expect = 0.011
 Identities = 32/146 (21%), Positives = 41/146 (28%), Gaps = 19/146 (13%)

Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSH--LPP------SKLKPDLVAPPHSAIPTL 328
              Q Q   S Q  PP     PS  PP  S   +PP      ++  P    P   ++ + 
Sbjct: 171 QLLQPQGPPSIQ-VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISA 229

Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
            + +PQR+  P     P          P     H           HH P P      Q  
Sbjct: 230 PSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSS-------HHGPGPPMPHALQQG 282

Query: 389 TRSCSTTPLCHSYPGSPISTTCSTTP 414
                      S P  P     S  P
Sbjct: 283 P---VFLQHPSSNPPQPFGLAQSQVP 305



 Score = 34.7 bits (79), Expect = 0.10
 Identities = 34/126 (26%), Positives = 45/126 (35%), Gaps = 16/126 (12%)

Query: 283 QQASQQDSPPQTQPEPS--TKPPPQSHLPPSKLKPD-LVAPPHSAIPTLGAQYPQRVAPP 339
           Q  S    PP  +P  S  T  PP +H PP +L P         A P +  Q  Q + P 
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQ-SQSLPPK 434

Query: 340 PSGHYPN---------PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
            S H  +         P+  HP       P  A+ PP S        PP A S  Q    
Sbjct: 435 ASTHPHSGLHSGPPQSPFAQHPF-TSGGLP--AIGPPPSLPTSTPAAPPRASSGSQPPGS 491

Query: 391 SCSTTP 396
           +  ++ 
Sbjct: 492 ALPSSG 497



 Score = 34.3 bits (78), Expect = 0.14
 Identities = 24/123 (19%), Positives = 33/123 (26%), Gaps = 6/123 (4%)

Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           Q Q + Q SP    P             P         P     P+     PQ      S
Sbjct: 245 QPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNP--PQPFGLAQS 302

Query: 342 GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSY 401
              P P P+   P  +  P  + + P         PP  +        +   TTP+    
Sbjct: 303 QVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP----HIKPPPTTPIPQLP 358

Query: 402 PGS 404
             S
Sbjct: 359 NQS 361



 Score = 28.5 bits (63), Expect = 7.7
 Identities = 25/122 (20%), Positives = 31/122 (25%), Gaps = 17/122 (13%)

Query: 275 LDLYSQSQQQAS----------QQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
           L L  QSQ   S           Q  PP+    P +         P    P       + 
Sbjct: 406 LQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAI 465

Query: 325 IPTLGAQYPQRVAPP-------PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
            P          APP       P G           P P + P+     P  +      P
Sbjct: 466 GPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESP 525

Query: 378 PP 379
           PP
Sbjct: 526 PP 527


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 41.6 bits (97), Expect = 8e-04
 Identities = 22/106 (20%), Positives = 31/106 (29%), Gaps = 6/106 (5%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
           +   Q SQ   P     EP  +P  PQ        +     P ++  P   AQ P     
Sbjct: 378 EGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA 437

Query: 339 PPSGHYPNPYPAHPV-----PIPNIHPVHAVVPPGSQYYGHHRPPP 379
           P      N + A        P            P +Q   + +P P
Sbjct: 438 PEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQP 483



 Score = 40.8 bits (95), Expect = 0.001
 Identities = 28/89 (31%), Positives = 33/89 (37%), Gaps = 6/89 (6%)

Query: 279 SQSQQQASQQDSPPQT---QPEPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYP 333
            Q Q Q  QQ   PQ    QP+    P PQ   P  P   +P    P     P    Q P
Sbjct: 761 PQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 820

Query: 334 QR-VAPPPSGHYPNPYPAHPVPIPNIHPV 361
           Q+ VAP P    P    A       +HP+
Sbjct: 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849



 Score = 35.8 bits (82), Expect = 0.052
 Identities = 20/92 (21%), Positives = 27/92 (29%), Gaps = 9/92 (9%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           Y+ + +Q +QQ        +P      Q+    S   P                Y Q  A
Sbjct: 421 YAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQT-------YQQPAA 473

Query: 338 PPPSGHYPNPYPAHPV--PIPNIHPVHAVVPP 367
             P    P P    PV  P P +       PP
Sbjct: 474 QEPLYQQPQPVEQQPVVEPEPVVEETKPARPP 505



 Score = 32.0 bits (72), Expect = 0.73
 Identities = 29/114 (25%), Positives = 36/114 (31%), Gaps = 3/114 (2%)

Query: 275 LDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQ 334
           LD +  S  +A   D P +    P  +P  Q   P +  +         A      Q  Q
Sbjct: 724 LDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQ 783

Query: 335 RVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
            VAP P    P    A   P P        V P  QY    +P       QQ Q
Sbjct: 784 PVAPQPQYQQPQQPVA---PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 834



 Score = 32.0 bits (72), Expect = 0.81
 Identities = 29/127 (22%), Positives = 41/127 (32%), Gaps = 13/127 (10%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
           +Q  A+  +   QT P  S   PP    P    +P  V  P +  P         +AP P
Sbjct: 330 TQSWAAPVEPVTQTPPVASVDVPPAQ--PTVAWQP--VPGPQTGEPV--------IAPAP 377

Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHS 400
            G+      A P  +    P+   V P   YY      PA            +  P    
Sbjct: 378 EGYPQQSQYAQP-AVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAP 436

Query: 401 YPGSPIS 407
            P  P++
Sbjct: 437 APEQPVA 443



 Score = 31.2 bits (70), Expect = 1.3
 Identities = 26/146 (17%), Positives = 38/146 (26%), Gaps = 17/146 (11%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL------------KPDLVAPPHSAIPTL 328
           +Q   + Q  P     EP   P P+ +   S+             +P     P+ A    
Sbjct: 354 AQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAE 413

Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
                   AP P      PY A     P            S +      P +   T+QT 
Sbjct: 414 QPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF-----APQSTYQTEQTY 468

Query: 389 TRSCSTTPLCHSYPGSPISTTCSTTP 414
            +  +  PL                P
Sbjct: 469 QQPAAQEPLYQQPQPVEQQPVVEPEP 494



 Score = 28.9 bits (64), Expect = 6.4
 Identities = 26/110 (23%), Positives = 33/110 (30%), Gaps = 2/110 (1%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-PHSAIPTLGAQYPQRVAP 338
           Q+   AS    P Q        P PQ+  P     P+        A P +    P +   
Sbjct: 342 QTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
            P   Y  P    P   P   P     P    YY      P AG+  Q +
Sbjct: 402 QPQQPYYAPAAEQPAQQPYYAP-APEQPAQQPYYAPAPEQPVAGNAWQAE 450


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 40.4 bits (95), Expect = 0.001
 Identities = 26/118 (22%), Positives = 39/118 (33%), Gaps = 12/118 (10%)

Query: 279 SQSQQQASQQDSPP---QTQPEPSTKPPPQSHLP-PSKLKPDLVAPPHSAIPTLGAQYPQ 334
            ++ + + Q    P     QP    + PP++ +P     +P   AP     P    Q  Q
Sbjct: 81  HEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQ 140

Query: 335 RVAPPPSGHY-PNPYPAHPVPIP---NIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
            +  P S    P P P H  P P      P   V  P  +        PA    +  +
Sbjct: 141 PLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPE----PVAEPAPVMDKPKR 194



 Score = 35.0 bits (81), Expect = 0.059
 Identities = 26/118 (22%), Positives = 35/118 (29%), Gaps = 10/118 (8%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
           HQ    Y+ +Q +   Q  P    P      P Q    P +       P     P     
Sbjct: 90  HQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAY--QPQPEQ-PLQQPV 146

Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
            PQ VAP P   +  P PA         P   V  P  +      P P     ++ + 
Sbjct: 147 SPQ-VAPAPQPVHSAPQPAQQA----FQPAEPVAAPQPE--PVAEPAPVMDKPKRKEA 197



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 12/97 (12%), Positives = 19/97 (19%), Gaps = 4/97 (4%)

Query: 300 TKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIH 359
            +                 +P H   P   +  P++    P        P    P P   
Sbjct: 69  HRVNHAPANAQEHEAARP-SPQHQYQPPYASAQPRQPVQQPPEA---QVPPQHAPRPAQP 124

Query: 360 PVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
               V  P  Q                   +   + P
Sbjct: 125 APQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAP 161



 Score = 29.2 bits (66), Expect = 3.6
 Identities = 16/97 (16%), Positives = 18/97 (18%), Gaps = 10/97 (10%)

Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
                            Q +P   Q       P Q    P          P  A      
Sbjct: 106 QQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQP--------VSPQVAPAP--Q 155

Query: 331 QYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
                  P      P    A P P P   P   +  P
Sbjct: 156 PVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKP 192



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 17/109 (15%), Positives = 25/109 (22%), Gaps = 10/109 (9%)

Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
            + +   +  Q        P  Q   P +  +P               Q P+   PP   
Sbjct: 69  HRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQ----------QPPEAQVPPQHA 118

Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
             P      PV  P   P                 P    S  Q   ++
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 40.6 bits (95), Expect = 0.002
 Identities = 20/99 (20%), Positives = 31/99 (31%), Gaps = 8/99 (8%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPE--PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
            ++ +L S   + A+   +    Q     + +  P      S   P++  P   A    G
Sbjct: 30  RELRELRSLVAEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEG 89

Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
                  A  PS   P P  A P P   + P      P 
Sbjct: 90  ---EAPAAEQPSAV-PAP-SAAPAPAEPVEPS-LAANPF 122


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 38.3 bits (89), Expect = 0.003
 Identities = 27/118 (22%), Positives = 36/118 (30%), Gaps = 30/118 (25%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP------SG 342
            S      + S K   Q   P   + PD           + A Y  R  PP       S 
Sbjct: 3   SSEEGKPSKSSPKTSVQEDTPTPTVYPDW--------SAMQAYYGPRPPPPYFNSSVASS 54

Query: 343 HYPNPY---PAH--------PVPIPNIHP-----VHAVVPPGSQYYGHHRPPPAAGST 384
             P+PY   P          P P   ++P      H  +PPGS  +  +  P A    
Sbjct: 55  PQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPG 112


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 38.3 bits (89), Expect = 0.005
 Identities = 20/74 (27%), Positives = 24/74 (32%), Gaps = 4/74 (5%)

Query: 295 QPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVP 354
            P P   PP  +   P+   P    PP +             APPP    PN  P  P P
Sbjct: 40  DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPD--DPNAAPPPPPADPNAPP--PPP 95

Query: 355 IPNIHPVHAVVPPG 368
           +    P      PG
Sbjct: 96  VDPNAPPPPAPEPG 109



 Score = 37.2 bits (86), Expect = 0.013
 Identities = 17/68 (25%), Positives = 21/68 (30%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
            PP T         P    PP+         P++A P   A       PP   + P P  
Sbjct: 46  PPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105

Query: 350 AHPVPIPN 357
             P  I N
Sbjct: 106 PEPGRIDN 113



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 16/57 (28%), Positives = 19/57 (33%), Gaps = 1/57 (1%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
            P   P P+  P PQ   P +   P    P     P +    P   AP P G   N 
Sbjct: 59  APPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEP-GRIDNA 114


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 24/99 (24%), Positives = 30/99 (30%), Gaps = 2/99 (2%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQ-SHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
           AS  D P  +   PS   PP+    P     P   +   S  P    + P   +PPP   
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241

Query: 344 YPNPYPAHPVP-IPNIHPVHAVVPPGSQYYGHHRPPPAA 381
            P P P   VP +    P            G       A
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDA 280



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 20/93 (21%), Positives = 24/93 (25%), Gaps = 3/93 (3%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYPQR 335
            S S   +S    P    P      P  S LP  PS  + D   P   +           
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241

Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
             PPP      P  +   P P           G
Sbjct: 242 APPPPPVQ-QVPPLSTAKPTPPSASATPAPIGG 273



 Score = 35.4 bits (82), Expect = 0.042
 Identities = 23/115 (20%), Positives = 31/115 (26%), Gaps = 9/115 (7%)

Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA--PPHSAIPTLGAQYP 333
              + +    S             + PP  S   PS   P      P  S++P   + + 
Sbjct: 161 ADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQ 220

Query: 334 QRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
               PP      NP P      P   PV  V P  +         P   S   T 
Sbjct: 221 SDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLST-------AKPTPPSASATP 268



 Score = 34.3 bits (79), Expect = 0.095
 Identities = 18/97 (18%), Positives = 26/97 (26%), Gaps = 1/97 (1%)

Query: 286 SQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYP 345
           +   S   + P  S+   P    PP        +    A  +  +  P      P+   P
Sbjct: 177 ADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSP 236

Query: 346 NPYP-AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
            P P A P P     P  +   P          P   
Sbjct: 237 PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGG 273



 Score = 33.5 bits (77), Expect = 0.19
 Identities = 22/88 (25%), Positives = 28/88 (31%), Gaps = 6/88 (6%)

Query: 274 VLDLYSQSQQQASQQDSPPQTQP-EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY 332
           V    S  +  +S  DS     P    +  PP S   P+   P     P +  P    Q 
Sbjct: 194 VPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT--NPSPPPGPAAPPPPPVQQV 251

Query: 333 PQRVAPPPSGHYPNPYPAHPVPIPNIHP 360
           P      P+   P    A P PI  I  
Sbjct: 252 PPLSTAKPT---PPSASATPAPIGGITL 276



 Score = 33.1 bits (76), Expect = 0.22
 Identities = 20/109 (18%), Positives = 27/109 (24%), Gaps = 8/109 (7%)

Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
              +            S   PP S  P     P   +PP        +  P    P PS 
Sbjct: 167 NSDNSFPGEDADPASASPSDPP-SSSPGV---PSFPSPPEDPSSPSDSSLP----PAPSS 218

Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
              +  P  P    N  P      P         P   A  T  + + +
Sbjct: 219 FQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 38.6 bits (90), Expect = 0.006
 Identities = 20/93 (21%), Positives = 30/93 (32%), Gaps = 8/93 (8%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHL--PPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           Q  +  +   SP +  P PST+P   +    PP +   +   PP      +    P    
Sbjct: 367 QPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVA---PPVPH 423

Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQ 370
            P S        A PV     +      PP  +
Sbjct: 424 TPESAP-KLTRAAIPVDEKPKYT--PPAPPKEE 453



 Score = 30.2 bits (68), Expect = 2.2
 Identities = 20/97 (20%), Positives = 24/97 (24%), Gaps = 7/97 (7%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY-PQRVAPPPSGHYPNP 347
              P  QP   T   P     P +  P     P +A         P R    P    P P
Sbjct: 361 VPVPAPQPAKPTAAAP----SPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416

Query: 348 YPAHPVPIPNIHP--VHAVVPPGSQYYGHHRPPPAAG 382
                   P   P    A +P   +       PP   
Sbjct: 417 VAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEE 453


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 37.5 bits (87), Expect = 0.007
 Identities = 26/97 (26%), Positives = 32/97 (32%), Gaps = 16/97 (16%)

Query: 284 QASQQDSPPQT---QPEPSTKPP--PQSHLPPS-----KLKPDLVAPPH--SAIPTLGAQ 331
           Q      P Q    +P P  +    P +  PPS     K  P  V  P    AI      
Sbjct: 118 QEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAID 177

Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
            P  +  PP+   P P P   VP P+   VH      
Sbjct: 178 TPVTLELPPA---PQP-PPPVVPQPSTMVVHRRSRIK 210


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 38.4 bits (90), Expect = 0.007
 Identities = 22/115 (19%), Positives = 28/115 (24%), Gaps = 17/115 (14%)

Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
            A    +P      P+  P P +  P +       AP  +  P   A  P     PPS  
Sbjct: 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAA---AAAPAPAAAPQP---APAPAPAPAPPSPA 445

Query: 344 YPNPYPAHPVPIPNIHP-----------VHAVVPPGSQYYGHHRPPPAAGSTQQT 387
              P    P P P   P                 P         P  A  +    
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAP 500



 Score = 35.0 bits (81), Expect = 0.082
 Identities = 16/120 (13%), Positives = 17/120 (14%), Gaps = 17/120 (14%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS-----------AIPTLG 329
           +   A           E S  P P    P    K   V                A P   
Sbjct: 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP 681

Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
              P   AP          PA                                S      
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPA------PAATPPAGQADDPAAQPPQAAQGASAPSPAA 735



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 16/106 (15%), Positives = 24/106 (22%), Gaps = 3/106 (2%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           +     A    + P         P P  +        P    P  +   +  +       
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPV 739

Query: 338 PPPS--GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
           P P      P+P  A   P P   P  A  P  +            
Sbjct: 740 PLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785



 Score = 33.8 bits (78), Expect = 0.18
 Identities = 16/113 (14%), Positives = 22/113 (19%), Gaps = 14/113 (12%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
             +   A+          +P+  P                A           Q  Q  + 
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPP-----------AGQADDPAAQPPQAAQGASA 730

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
           P         P  P P     P  A  P          P  A  +       S
Sbjct: 731 PSPAADDPV-PLPPEPDD--PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780



 Score = 32.7 bits (75), Expect = 0.42
 Identities = 15/80 (18%), Positives = 19/80 (23%), Gaps = 6/80 (7%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
             +  Q +   +P    P       P    P         A P  A P    +     AP
Sbjct: 425 PAAAPQPAPAPAPAP-APPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPEPTAAPAP 482

Query: 339 PPSGHYPNPY----PAHPVP 354
            P            PA P  
Sbjct: 483 APPAAPAPAAAPAAPAAPAA 502



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 21/131 (16%), Positives = 31/131 (23%), Gaps = 13/131 (9%)

Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
             +     P      +    P      +       AP  +A     A  P     P    
Sbjct: 386 GVAGGAGAPAAAAPSAAAAAP----AAA------PAPAAAAPAAAAAPAPAAAPQPAPA- 434

Query: 344 YPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPG 403
            P P PA P       P      P        +P PA  +  +         P   +   
Sbjct: 435 -PAPAPAPP-SPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA 492

Query: 404 SPISTTCSTTP 414
           +P +      P
Sbjct: 493 APAAPAAPAAP 503



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 22/104 (21%), Positives = 29/104 (27%), Gaps = 19/104 (18%)

Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
             A    + PQ  P P+  P P S    +        PP        A    + AP P+ 
Sbjct: 420 AAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP-------AAAPSAQPAPAPAA 472

Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQ 386
                    P   P   P  A  P  +         PAA +   
Sbjct: 473 ------APEPTAAPAPAPPAAPAPAAAP------AAPAAPAAPA 504



 Score = 30.0 bits (68), Expect = 2.9
 Identities = 15/92 (16%), Positives = 21/92 (22%), Gaps = 4/92 (4%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
           + +   A+     P   P P    PP               P  +         P     
Sbjct: 420 AAAPAPAAAPQPAPAPAPAP---APPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPE 475

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQ 370
           P +   P P  A         P     P G+ 
Sbjct: 476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAD 507



 Score = 28.8 bits (65), Expect = 6.1
 Identities = 15/106 (14%), Positives = 21/106 (19%), Gaps = 7/106 (6%)

Query: 292 PQTQPEPSTKPPPQ------SHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYP 345
           P     P+    P       +    S      VA P      +                 
Sbjct: 615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674

Query: 346 NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
              PA P P P      A     +       P     + Q     +
Sbjct: 675 GAAPAAPPPAP-APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 37.9 bits (88), Expect = 0.009
 Identities = 18/114 (15%), Positives = 25/114 (21%), Gaps = 14/114 (12%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG------HY 344
           P    P       P+   P  +       P     P    +  Q     P+         
Sbjct: 95  PAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPE 154

Query: 345 PNPYPAHPVPIPNIHPVHAVVP-----PGSQYYGHH---RPPPAAGSTQQTQTR 390
           P  +P                P       + Y       R P  AG  +  Q R
Sbjct: 155 PGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRR 208



 Score = 33.6 bits (77), Expect = 0.18
 Identities = 20/110 (18%), Positives = 27/110 (24%), Gaps = 22/110 (20%)

Query: 301 KPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ--YPQRVAPPPSGHYPNPYPAHPVPIPNI 358
           +P P         +P+L  P        G      +    P     P   PA+P      
Sbjct: 94  EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYP------ 147

Query: 359 HPVHAVVPPGSQYYGHHRP---PPAAGSTQQTQTRSCSTTPLCHSYPGSP 405
                       Y     P   P AA      Q R        ++ P S 
Sbjct: 148 -----------AYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASY 186



 Score = 30.2 bits (68), Expect = 2.7
 Identities = 28/119 (23%), Positives = 39/119 (32%), Gaps = 13/119 (10%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
           D  P   P     P  +   P  + +P+  A P +A    G Q  QR+  PP   Y +P 
Sbjct: 127 DDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADD-YGWQQ-QRLGFPPRAPYASPA 184

Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPP---PAAGSTQQTQTRSCSTTPLCHSYPGS 404
              P    +  P  A    G   Y   R     P     +  + R    T      PG+
Sbjct: 185 SYAPEQERDREPYDA----GRPEYDQRRRDYDHPRPDWDRPRRDR----TDRPEPPPGA 235


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 37.8 bits (88), Expect = 0.009
 Identities = 17/79 (21%), Positives = 19/79 (24%), Gaps = 3/79 (3%)

Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
           Q A+           PS   P QS        P     P    PT+    P  V   P  
Sbjct: 385 QPAAAPQPSAAAAASPS---PSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPS 441

Query: 343 HYPNPYPAHPVPIPNIHPV 361
             P              PV
Sbjct: 442 TAPQAVRPAQFKEEKKIPV 460



 Score = 34.8 bits (80), Expect = 0.081
 Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 4/67 (5%)

Query: 279 SQSQQQASQQDSPPQTQP--EPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYPQ 334
           S S  Q+S    P   Q   +P+  PP  S  P     + P   AP          +   
Sbjct: 399 SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKI 458

Query: 335 RVAPPPS 341
            V+   S
Sbjct: 459 PVSKVSS 465



 Score = 32.1 bits (73), Expect = 0.69
 Identities = 12/75 (16%), Positives = 20/75 (26%), Gaps = 2/75 (2%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
           +  Q S   +   +  + S     Q   P S  +P    P  S  P            P 
Sbjct: 388 AAPQPSAAAAASPSPSQSSA--AAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445

Query: 341 SGHYPNPYPAHPVPI 355
           +           +P+
Sbjct: 446 AVRPAQFKEEKKIPV 460



 Score = 30.5 bits (69), Expect = 1.9
 Identities = 25/109 (22%), Positives = 37/109 (33%), Gaps = 5/109 (4%)

Query: 292 PQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR--VAPPPSGHYPNPYP 349
            Q   + S    P+ H+ P   +P   A P  +     +  P +   A  PS       P
Sbjct: 363 TQKGDDASGGRGPKQHIKPVFTQP--AAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQP 420

Query: 350 AHPVPIPNIHPVHAV-VPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPL 397
           A   P  ++ P  AV V P S      RP       +   ++  S  P 
Sbjct: 421 AGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPS 469


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 37.7 bits (87), Expect = 0.013
 Identities = 15/69 (21%), Positives = 19/69 (27%), Gaps = 1/69 (1%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
              PQ         P +S +      P     P    P           P P G  P P 
Sbjct: 381 RQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP-PQ 439

Query: 349 PAHPVPIPN 357
           P +P  +P 
Sbjct: 440 PTNPYVMPI 448


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 37.3 bits (86), Expect = 0.014
 Identities = 26/144 (18%), Positives = 40/144 (27%), Gaps = 23/144 (15%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA-------------PPH--SA 324
           Q QQQ    + P   Q + + +      +                        P    S 
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSP 414

Query: 325 IPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGST 384
            P    Q  Q +  PP      P P  P   P       ++P          P P+   +
Sbjct: 415 SPVPQVQTNQSMPQPPQPS--VPSPGGPGSQPPQSVSGGMIPSPP----ALMPSPSPQMS 468

Query: 385 QQTQTRSCSTTPLCHSYPGSPIST 408
           Q   ++   T       PG P++T
Sbjct: 469 QSPASQ--RTIQQDMVSPGGPLNT 490


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 35.9 bits (83), Expect = 0.030
 Identities = 19/106 (17%), Positives = 25/106 (23%), Gaps = 5/106 (4%)

Query: 287 QQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT-LGAQYPQRVAPPPSGHYP 345
              +PP     P   P PQ     +              PT   A  P      P+G   
Sbjct: 52  SPQAPPPVAQLPQ--PLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAG 109

Query: 346 NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
                   P         V+   +        P    + QQ Q R 
Sbjct: 110 PTI--QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY 153



 Score = 33.9 bits (78), Expect = 0.13
 Identities = 24/119 (20%), Positives = 29/119 (24%), Gaps = 20/119 (16%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQP--EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
                  +Q+ Q     D      P   P+  PP    LP     P +   P       G
Sbjct: 65  PLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTEP-------G 117

Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
             YP  V  P              P               Q YG     PA+G     Q
Sbjct: 118 QLYP--VQVPVMVTQNPANSPLDQPAQQRALQQLQ-----QRYGA----PASGQLPSQQ 165



 Score = 29.7 bits (67), Expect = 3.2
 Identities = 17/118 (14%), Positives = 21/118 (17%), Gaps = 19/118 (16%)

Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
                        +P PQ     +              PT                 P  
Sbjct: 51  PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPT-----------GSPAANPPA 99

Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP----PAAGSTQQTQTRSCSTTPLCHSY 401
             A P              PG Q Y    P       A S      +  +   L   Y
Sbjct: 100 TFALP---AGPAGPTIQTEPG-QLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY 153


>gnl|CDD|223535 COG0459, GroL, Chaperonin GroEL (HSP60 family) [Posttranslational
           modification, protein turnover, chaperones].
          Length = 524

 Score = 36.0 bits (84), Expect = 0.034
 Identities = 7/35 (20%), Positives = 15/35 (42%), Gaps = 3/35 (8%)

Query: 208 VNDSLCTTLCLQWEPEIIA---VALMYLAGKLSKF 239
           + D+L        E +I+     A +  A +L ++
Sbjct: 380 IEDALNVVRAAVEEGKIVPGGGAAEIEAALRLREY 414


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 35.4 bits (82), Expect = 0.035
 Identities = 16/45 (35%), Positives = 18/45 (40%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
           Q QQQA QQ  P  T       PP Q+  P     P  V    +A
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAA 164



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 13/61 (21%), Positives = 15/61 (24%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
             +     Q    Q QP  +T  P     P     P     P        A   Q V  P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173

Query: 340 P 340
            
Sbjct: 174 K 174



 Score = 29.6 bits (67), Expect = 2.9
 Identities = 12/71 (16%), Positives = 18/71 (25%), Gaps = 15/71 (21%)

Query: 279 SQSQQQASQQ---------DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
            Q+ Q                PP T  +P    PP+    P + +         A P   
Sbjct: 109 EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPV-- 166

Query: 330 AQYPQRVAPPP 340
                +    P
Sbjct: 167 ----TQAVEAP 173



 Score = 28.9 bits (65), Expect = 4.4
 Identities = 15/65 (23%), Positives = 21/65 (32%), Gaps = 2/65 (3%)

Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVA-PPHSAIPTLGA 330
           Q+     Q Q  A+     P T P  +T P  PQ+  P        V     +       
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEK 179

Query: 331 QYPQR 335
           +  QR
Sbjct: 180 EKEQR 184


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 35.9 bits (83), Expect = 0.048
 Identities = 19/124 (15%), Positives = 30/124 (24%), Gaps = 10/124 (8%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
                +PP         P P    PP    PDL +     + + G           +   
Sbjct: 102 REGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGAS-- 158

Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGS 404
           P    +            +            R P +  +     T   + +P       S
Sbjct: 159 PAAVASDAASSRQAALPLSSPEE------TARAPSSPPAEPPPSTPPAAASPRPPR-RSS 211

Query: 405 PIST 408
           PIS 
Sbjct: 212 PISA 215



 Score = 33.6 bits (77), Expect = 0.20
 Identities = 24/108 (22%), Positives = 32/108 (29%), Gaps = 6/108 (5%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPP---PQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
           S +++ +     P    P P T PP   P S  P        V  P              
Sbjct: 99  SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158

Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGS 383
            A   S    +   A P+  P      A   P ++       PPAA S
Sbjct: 159 PAAVASDAASSRQAALPLSSPE-ETARAPSSPPAEPPP--STPPAAAS 203



 Score = 31.3 bits (71), Expect = 1.3
 Identities = 25/129 (19%), Positives = 40/129 (31%), Gaps = 6/129 (4%)

Query: 292 PQTQPEPSTKPPPQSHL--PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
              + EP T PPP      P ++ +        +  P   A+      P PS   P P  
Sbjct: 61  ACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPT 120

Query: 350 AHPV-PIPNIHPVHAVVPPGSQYYGHH-RPPPAAGSTQQTQTRSCSTTPLCHSYPGS--P 405
             P  P P+  P  + +       G      P A         S + +    + P S   
Sbjct: 121 PPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180

Query: 406 ISTTCSTTP 414
            +    ++P
Sbjct: 181 ETARAPSSP 189


>gnl|CDD|227357 COG5024, COG5024, Cyclin [Cell division and chromosome
           partitioning].
          Length = 440

 Score = 34.4 bits (79), Expect = 0.11
 Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 18/118 (15%)

Query: 118 CLFLAGKVEETPKKCKDVIKTARMLVDDKKFKTFGDDPREEVMTLERILLQTIKFDLQVD 177
            LF+A K EE    C          + D  + T G   R++++  ER +L+ + F++   
Sbjct: 262 ALFIASKYEEV--NCP--------SIKDLVYATDGAFTRDDIIRAERYMLEVLDFNISWP 311

Query: 178 HPYSFLLKYAKSLKGD--KTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLA 233
            P SFL + +K+   D       K      + V+        +Q  P   A A MYL+
Sbjct: 312 SPMSFLRRISKASDYDIFSRTPAKFS-SEISPVDYKF-----IQISPSWCAAAAMYLS 363


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 32.5 bits (74), Expect = 0.14
 Identities = 20/76 (26%), Positives = 29/76 (38%), Gaps = 8/76 (10%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           YS+  Q +       Q     +  P P  +LP +++       P S+ P + A YP    
Sbjct: 8   YSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMA---VGPQSSHPPM-AYYPIGAP 63

Query: 338 PPPSGHYPNPYPAHPV 353
           PP    YP P     V
Sbjct: 64  PPV---YP-PGSTVLV 75



 Score = 31.0 bits (70), Expect = 0.50
 Identities = 19/71 (26%), Positives = 24/71 (33%), Gaps = 2/71 (2%)

Query: 320 PPHSAI-PTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQY-YGHHRP 377
           P +S +        PQ   P  S  YP P    P+         +  PP + Y  G   P
Sbjct: 6   PAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPP 65

Query: 378 PPAAGSTQQTQ 388
               GST   Q
Sbjct: 66  VYPPGSTVLVQ 76


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 34.1 bits (78), Expect = 0.14
 Identities = 28/102 (27%), Positives = 35/102 (34%), Gaps = 15/102 (14%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG-AQYPQRVAPPP-------- 340
           + P  QP P+   P  +   P+       A P +A      A  P R +P P        
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQ 438

Query: 341 -SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
            S   P   PA P P P   P  A  P  +      RP  AA
Sbjct: 439 ASARGPGGAPA-PAPAPAAAPAAAARPAAAGP----RPVAAA 475



 Score = 32.9 bits (75), Expect = 0.36
 Identities = 23/108 (21%), Positives = 31/108 (28%), Gaps = 14/108 (12%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP---- 340
           A      P   P  + +P      P +       A P  A P   A  P    PPP    
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAA---AAPARAAPA-AAPAPADDDPPPWEEL 502

Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
              + +P PA P   P      ++  P +        P  A  T    
Sbjct: 503 PPEFASPAPAQPDAAPAGWVAESIPDPAT------ADPDDAFETLAPA 544



 Score = 32.5 bits (74), Expect = 0.44
 Identities = 18/101 (17%), Positives = 23/101 (22%), Gaps = 8/101 (7%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTL---GAQYPQRVAPPPSGHYPNP 347
           P      P+  P   +        P   +P   A+       A+ P     P       P
Sbjct: 399 PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAP 458

Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
             A         PV A            R  PAA       
Sbjct: 459 AAAARPAAAGPRPVAAAAAAAPA-----RAAPAAAPAPADD 494



 Score = 29.1 bits (65), Expect = 5.2
 Identities = 13/84 (15%), Positives = 20/84 (23%), Gaps = 1/84 (1%)

Query: 284 QASQQDSPPQTQPE-PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
            A    +  +     P+ + P    L  ++           A     A  P   A P + 
Sbjct: 408 AAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAA 467

Query: 343 HYPNPYPAHPVPIPNIHPVHAVVP 366
                  A         P  A  P
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAP 491



 Score = 29.1 bits (65), Expect = 5.3
 Identities = 17/105 (16%), Positives = 29/105 (27%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
           A+   +P +  P  +  P      P  +L P+  +P  +      A +     P P+   
Sbjct: 474 AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATAD 533

Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
           P+       P P   P                P  +A        
Sbjct: 534 PDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFD 578



 Score = 29.1 bits (65), Expect = 5.8
 Identities = 14/82 (17%), Positives = 22/82 (26%)

Query: 300 TKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIH 359
             P   +  P ++  P   AP  +A            AP  +        A     P   
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431

Query: 360 PVHAVVPPGSQYYGHHRPPPAA 381
            + A     ++  G    P  A
Sbjct: 432 ALAAARQASARGPGGAPAPAPA 453



 Score = 28.7 bits (64), Expect = 6.4
 Identities = 15/98 (15%), Positives = 21/98 (21%), Gaps = 5/98 (5%)

Query: 293 QTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHP 352
              P  +   P     P +         P +      A      A       P      P
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430

Query: 353 VPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
             +       A  P G+       P PA  +      R
Sbjct: 431 EALAAARQASARGPGGAP-----APAPAPAAAPAAAAR 463


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 34.2 bits (79), Expect = 0.15
 Identities = 23/80 (28%), Positives = 26/80 (32%), Gaps = 8/80 (10%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
             Q  S   S     P P  K PP     P   +P   APP +  P      P   A   
Sbjct: 511 ESQSGSA--SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPP------PPPTATQA 562

Query: 341 SGHYPNPYPAHPVPIPNIHP 360
           S + P   PA   P P I  
Sbjct: 563 SSNAPAQIPADSSPPPPIPE 582



 Score = 33.8 bits (78), Expect = 0.16
 Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 1/80 (1%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
           +SQ  ++   +     P+ S  PP  +   P          P    PT         A  
Sbjct: 511 ESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQI 570

Query: 340 PSGHYP-NPYPAHPVPIPNI 358
           P+   P  P P  P P P  
Sbjct: 571 PADSSPPPPIPEEPTPSPTK 590



 Score = 31.9 bits (73), Expect = 0.74
 Identities = 15/78 (19%), Positives = 21/78 (26%), Gaps = 3/78 (3%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
              + A+       T     + PP        K K    A    + P   A  P  V   
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQ---AATTPSPPPAKASPPIPVPAE 420

Query: 340 PSGHYPNPYPAHPVPIPN 357
           P+   P P        P+
Sbjct: 421 PTEPSPTPPANAANAPPS 438


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 33.0 bits (75), Expect = 0.16
 Identities = 17/78 (21%), Positives = 23/78 (29%), Gaps = 2/78 (2%)

Query: 303 PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPV- 361
           P    +P  +        PH  +  L  Q P    P      P       +  P  HP+ 
Sbjct: 52  PLSPQMPQQQQSAHPKLTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQ 111

Query: 362 -HAVVPPGSQYYGHHRPP 378
                 P  Q   H +PP
Sbjct: 112 PTYGQNPQPQQPTHTQPP 129


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 33.7 bits (77), Expect = 0.18
 Identities = 16/76 (21%), Positives = 25/76 (32%), Gaps = 3/76 (3%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
            +++Q A    SP      P   PP ++  P +K      A P +  P       +R   
Sbjct: 12  GEAEQPAPAPPSPAAAPAPP---PPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPK 68

Query: 339 PPSGHYPNPYPAHPVP 354
           P S      +   P  
Sbjct: 69  PASLWKLEDFVVEPQE 84


>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 451

 Score = 33.6 bits (77), Expect = 0.19
 Identities = 26/137 (18%), Positives = 52/137 (37%), Gaps = 16/137 (11%)

Query: 216 LCLQWEPEIIAVALMYLAGKLSKFEVVDWIGRNSKHLR---WWEMFVEDMTMDLLEDICH 272
           L  Q+  ++ +VA  Y + +L   E++D++G ++KH++   + + F+E + + L+  I  
Sbjct: 295 LLKQYNKQLSSVATKYSSEQL--LEIIDFLGESAKHIQLTIFEKTFLETVIIHLIR-IYQ 351

Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPS-KLKPD-------LVAPPHSA 324
           +       SQ ++            P  K         S + +P        L+      
Sbjct: 352 RPTLSQLVSQIKSPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASP- 410

Query: 325 IPTLGAQYPQRVAPPPS 341
             T      Q V  P  
Sbjct: 411 -QTKKETIHQEVPSPSL 426


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 31.7 bits (72), Expect = 0.26
 Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 4/110 (3%)

Query: 244 WIGRNSKHLRWWEMFVE-DMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKP 302
           WIGR  +     +  VE ++  D   +I  Q+ + +  S +Q +++   P T P P    
Sbjct: 35  WIGRLKRSFNAIKQEVEREIGAD---EIRRQLHNEHILSMEQEARKILAPLTPPAPPEPV 91

Query: 303 PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHP 352
            P +   P+   P        A+P   A      A  P    P   P  P
Sbjct: 92  TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 33.1 bits (75), Expect = 0.32
 Identities = 24/94 (25%), Positives = 28/94 (29%), Gaps = 12/94 (12%)

Query: 322 HSAIPTLGAQYPQRVAP-----PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHR 376
            + IPT G Q PQ  AP     P S     P P+   P P + P     P   +     R
Sbjct: 394 AATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPS-AAPSPRV-PWDDAPPAPPRSGIPPR 451

Query: 377 PPPAAGSTQQT-----QTRSCSTTPLCHSYPGSP 405
           P P                S S  P     P   
Sbjct: 452 PAPRMPEASPVPGAPDSVASASDAPPTLGDPSDT 485


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 32.7 bits (74), Expect = 0.33
 Identities = 21/91 (23%), Positives = 31/91 (34%), Gaps = 8/91 (8%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA---IPTLGAQYPQRV 336
            + +  +           P+T   P S LP       +V+        +P L    P  V
Sbjct: 313 MAAEAVAGAAVLGPRAQSPAT---PSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVV 369

Query: 337 APPPSGHYPNPYPAHPVPIPN-IHPVHAVVP 366
            P P    P P P   + IP+ + P   V P
Sbjct: 370 KPGPM-EIPTPVPPPGLAIPSLVAPPGLVAP 399


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 32.8 bits (75), Expect = 0.37
 Identities = 24/129 (18%), Positives = 35/129 (27%), Gaps = 2/129 (1%)

Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV 336
           +   S QQ   +    ++  +       Q   P  +    L  PP              V
Sbjct: 330 IRKNSLQQTRARLGHSKSLGQSPNLRSSQLPPPTKRRLRLLPVPPPKVQALPLTALAPLV 389

Query: 337 APPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
              PS   P+P  A P  +      H  +PP        R    + S      R  +  P
Sbjct: 390 RHSPSIPLPHPPSALPSHVGASSSKHHRLPP--SVLPGPRLSSPSPSPSLPTRRPGTPPP 447

Query: 397 LCHSYPGSP 405
                  SP
Sbjct: 448 PASPPTPSP 456


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 32.2 bits (73), Expect = 0.39
 Identities = 22/92 (23%), Positives = 26/92 (28%), Gaps = 11/92 (11%)

Query: 290 SPPQTQPEPSTKPPPQSHLP-PSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
            P Q+ P      P     P      P L+A     +  L  QY      P  GH     
Sbjct: 71  HPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQ--LQKLNTQYQGHAGAPA-GHPGGGG 127

Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPA 380
           P    P           PPG Q+      PP 
Sbjct: 128 PQQFRPGAG-------QPPGMQHMPAPALPPN 152


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 32.2 bits (73), Expect = 0.43
 Identities = 22/96 (22%), Positives = 28/96 (29%), Gaps = 16/96 (16%)

Query: 308 LPPS--KLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVV 365
           LP S  +L P  V  P +  PT          P PS     P    P P   I    A  
Sbjct: 190 LPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGT 249

Query: 366 -----------PPGSQYYGHHRPPPAAGSTQQTQTR 390
                       PG    G   P  A  + + ++  
Sbjct: 250 TPEAEGTPAPPTPGG---GEAPPANATPAPEASRYE 282



 Score = 29.2 bits (65), Expect = 4.3
 Identities = 16/72 (22%), Positives = 21/72 (29%), Gaps = 8/72 (11%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV--------APPPSG 342
           P   +P P T   P++   PS           +   T+ A               P P G
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264

Query: 343 HYPNPYPAHPVP 354
               P  A P P
Sbjct: 265 GEAPPANATPAP 276


>gnl|CDD|217307 pfam02984, Cyclin_C, Cyclin, C-terminal domain.  Cyclins regulate
           cyclin dependent kinases (CDKs). Human CCNO is a
           Uracil-DNA glycosylase that is related to other cyclins.
           Cyclins contain two domains of similar all-alpha fold,
           of which this family corresponds with the C-terminal
           domain.
          Length = 117

 Score = 30.7 bits (70), Expect = 0.44
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 179 PYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLA 233
           P SFL +++K+   D     +   +A   +  SL     L++ P +IA A +YLA
Sbjct: 3   PLSFLRRFSKAADYDL----ETRTLAKYLLELSLLDYDFLKYPPSLIAAAAVYLA 53


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 31.7 bits (72), Expect = 0.49
 Identities = 18/119 (15%), Positives = 29/119 (24%), Gaps = 6/119 (5%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
                  EP  +P P    P    +P+    P    P    +  ++   P          
Sbjct: 48  VLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP-----K 102

Query: 350 AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLC-HSYPGSPIS 407
             P P P + P      P S+        P   +   +   +         SY      
Sbjct: 103 PKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAATGPSASYLSGLRR 161


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 32.4 bits (74), Expect = 0.52
 Identities = 10/60 (16%), Positives = 13/60 (21%)

Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
                D  +Q    A        T    S    P +        P+  A P       G 
Sbjct: 262 FLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRGR 321


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 32.2 bits (73), Expect = 0.53
 Identities = 26/87 (29%), Positives = 30/87 (34%), Gaps = 8/87 (9%)

Query: 296 PEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
           P     PPP        L P   +   SA  T G + P RV+  P    P P PA P   
Sbjct: 183 PGAIQPPPPS---SLPGLPPGSSSLAPSASSTPGNRLP-RVSFNP--FLPGPSPAQP-SA 235

Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAG 382
           P        +PP  QY     P P   
Sbjct: 236 PPASIPAPPIPPVIQYV-APPPVPPPQ 261



 Score = 30.7 bits (69), Expect = 1.6
 Identities = 24/93 (25%), Positives = 31/93 (33%), Gaps = 4/93 (4%)

Query: 279 SQSQQQASQQDSPPQTQP-EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
            Q    +S    PP +    PS    P + LP     P L  P  +      A  P    
Sbjct: 186 IQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPI 245

Query: 338 PPPSGHY---PNPYPAHPVPIPNIHPVHAVVPP 367
           PP   +    P P P   +PI +I  V    P 
Sbjct: 246 PPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPA 278


>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 702

 Score = 32.3 bits (73), Expect = 0.55
 Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 6/83 (7%)

Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL--KPDLVAPPHSAIPTLGAQYPQ 334
           L S+  QQ  Q +    +Q + + +  P S + P ++  +P +V P     P    +   
Sbjct: 367 LVSEPVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEP 426

Query: 335 RVAPPPSGHYPNPYPAHPVPIPN 357
              P P    P P P  P P PN
Sbjct: 427 EPEPEPE---PEPEPE-PEPQPN 445


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 30.9 bits (70), Expect = 0.62
 Identities = 22/101 (21%), Positives = 29/101 (28%), Gaps = 7/101 (6%)

Query: 286 SQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAI-------PTLGAQYPQRVAP 338
           SQQ  P  T       P   +  P    +P +  P   ++       P L     Q   P
Sbjct: 43  SQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQP 102

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
            P        P  P P  +  P     PP    +     PP
Sbjct: 103 QPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPP 143


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 31.8 bits (72), Expect = 0.63
 Identities = 19/75 (25%), Positives = 27/75 (36%), Gaps = 12/75 (16%)

Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           + +   + +PP T    +TKP    H  P  + PD  A   +  PT         APP +
Sbjct: 52  RSKKENEPAPPSTMIATNTKP---FHPAPIDVPPDPPAAQEAVQPT---------APPSA 99

Query: 342 GHYPNPYPAHPVPIP 356
              P      P   P
Sbjct: 100 QSEPERNEPRPEETP 114


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 31.8 bits (72), Expect = 0.70
 Identities = 15/46 (32%), Positives = 19/46 (41%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
           A+   + PQ  P PS   P   H P  + +P L   P S  P   A
Sbjct: 389 ANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAA 434


>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
          Length = 647

 Score = 31.9 bits (73), Expect = 0.75
 Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 8/69 (11%)

Query: 253 RWWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK 312
           +WWE F E    DL    C +  +L S   +      SPP   P    +    S   P K
Sbjct: 91  QWWEEFNEKWDEDLF---CDE--ELSSSDDEDEPPPPSPP---PSQEEESQSSSQATPPK 142

Query: 313 LKPDLVAPP 321
            K    AP 
Sbjct: 143 KKKKNPAPN 151


>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
           Provisional.
          Length = 177

 Score = 31.0 bits (70), Expect = 0.76
 Identities = 10/34 (29%), Positives = 13/34 (38%), Gaps = 1/34 (2%)

Query: 278 YSQSQQ-QASQQDSPPQTQPEPSTKPPPQSHLPP 310
           + Q QQ Q   Q S         + P   S+ PP
Sbjct: 135 WGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPP 168


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 31.9 bits (72), Expect = 0.78
 Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 11/79 (13%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA-PPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
           D+          + P  S    S+  PD  A  PH+A        P   AP P+      
Sbjct: 202 DANAAPAEPAEEEAPAPSE-AGSEPAPDPAARAPHAA--------PDPPAPAPAPAKTAA 252

Query: 348 YPAHPVPIPNIHPVHAVVP 366
            PA   P+ +      V P
Sbjct: 253 -PAAAAPVSSGDSGPYVTP 270



 Score = 30.8 bits (69), Expect = 1.5
 Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
           +A  + +P   +PE + +P   +  P     P   AP  +A    G+     V  P  G
Sbjct: 81  EAGSEPAPAAPEPEAAPEPEAPAPAP----TPAAEAPAPAAPQAGGSGEATEVKMPELG 135


>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21). 
          Length = 326

 Score = 31.2 bits (71), Expect = 0.87
 Identities = 17/85 (20%), Positives = 20/85 (23%), Gaps = 12/85 (14%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY-----PQR 335
                S Q   P         PP  S +  + L     A     +      Y      Q 
Sbjct: 245 PSVSPSGQPLSPAA-------PPGTSSVAGTALSASPAALFGDMVYVPLDAYNQLLAGQA 297

Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHP 360
              PP    P P      P P   P
Sbjct: 298 FNQPPDPQGPAPPAELAPPAPAPPP 322


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 31.7 bits (72), Expect = 0.89
 Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 9/87 (10%)

Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           Q    Q +  P   P  S  P P    P   L+P+ +AP  +A+             PP 
Sbjct: 403 QGPQQQFNGQPLGWPRMSMMPTPMG--PGGPLRPNGLAPM-NAVRAPSRNAQNAAQKPPM 459

Query: 342 G--HYPNPYPAHPV----PIPNIHPVH 362
               YP  Y + P+    P P      
Sbjct: 460 QPVMYPPNYQSLPLSQDLPQPQSTASQ 486


>gnl|CDD|226348 COG3827, COG3827, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 231

 Score = 31.0 bits (70), Expect = 0.97
 Identities = 12/78 (15%), Positives = 16/78 (20%), Gaps = 5/78 (6%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
            ++  A    +  Q Q     K  P S      +K         A P   A      A  
Sbjct: 80  IARDAAPGPAAVAQAQNPDEKKNEPASI--EDIVKEISGVIAPKARPPKNAAGENAPASE 137

Query: 340 P---SGHYPNPYPAHPVP 354
               S             
Sbjct: 138 DRPESTEAVTQSEEATAI 155


>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
          Length = 413

 Score = 31.4 bits (71), Expect = 1.00
 Identities = 14/68 (20%), Positives = 21/68 (30%), Gaps = 6/68 (8%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKL--KPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPN 346
           + P +  P  + KP  Q+  P S    KP   A   +  P       +  A P +   P 
Sbjct: 342 EKPAEAAPRAADKPAGQAADPASSSADKPGASADAAARTPA----RARDAAAPDADTPPG 397

Query: 347 PYPAHPVP 354
                   
Sbjct: 398 GASLAAAQ 405


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 31.0 bits (70), Expect = 1.0
 Identities = 16/101 (15%), Positives = 22/101 (21%), Gaps = 3/101 (2%)

Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
               +  +  +   P + P     P                +PP    P           
Sbjct: 121 AYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPPM 180

Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
             P   YP     H        P     PP  +     RPP
Sbjct: 181 SSPDSSYP---SEHRFQRQLSEPCLPFPPPPGRGSRDGRPP 218


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 31.2 bits (70), Expect = 1.0
 Identities = 29/117 (24%), Positives = 45/117 (38%), Gaps = 18/117 (15%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQS-HLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
           +Q   +  +  PQT+P PS + P  S   PPS    D       +    G ++ QR   P
Sbjct: 221 TQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQD------QSTTGDGQEHTQRRKTP 274

Query: 340 PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
           P+    N    H           A  PP ++     RP P   +T Q+ +    ++P
Sbjct: 275 PA--TSNRRSPHS---------TATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSP 320



 Score = 30.4 bits (68), Expect = 2.1
 Identities = 25/135 (18%), Positives = 47/135 (34%), Gaps = 6/135 (4%)

Query: 279 SQSQQQASQQDSPP-QTQPEPSTKPPPQSHLPPS-KLKPDLVAPPHSAIPTLGAQYPQRV 336
           S++    ++   P   +Q  P ++P  Q H   +   +     P  +   T  +    + 
Sbjct: 175 SKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQT 234

Query: 337 APPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQT--QTRSCST 394
            PPPS   P+  P HP    +     +    G ++    + PPA  + +          T
Sbjct: 235 EPPPSQRGPSGSPQHPPSTTS--QDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292

Query: 395 TPLCHSYPGSPISTT 409
           T    +   +P  T 
Sbjct: 293 TKRQETGRPTPRPTA 307


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 29/156 (18%), Positives = 46/156 (29%), Gaps = 24/156 (15%)

Query: 280 QSQQQASQQDSPPQTQPEP--STKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
              Q    QD+P QT+      T    +  +  +++ P  V+   +    +G        
Sbjct: 558 CMPQTPQPQDAPEQTEALSLVVTPKKKRHKVTDTRITPRTVSRILALRDAVGPAAGTHHQ 617

Query: 338 P-PPSGHYPN-----PYPAHPVP------IPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
           P  PS    +     P   HP P      IPN     + V  G  +   H   P+     
Sbjct: 618 PLHPSSLSASMGFHPPPFRHPFPLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSPPG 677

Query: 386 QTQTRSCSTTPLC----------HSYPGSPISTTCS 411
           + +       P            H    SP   + S
Sbjct: 678 KDRDSPDLPRPTTSLHPKLLSAHHHPGSSPDGLSLS 713


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 5/55 (9%)

Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQ 386
           Y Q  APPP G  P  YPA P   P   P ++      Q      PPP      Q
Sbjct: 62  YGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQ-----EPPPPYPGPPQ 111



 Score = 27.4 bits (61), Expect = 6.6
 Identities = 15/76 (19%), Positives = 20/76 (26%), Gaps = 7/76 (9%)

Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
           AQ  Q V+     + P     H  P+P      A  PP       + P P          
Sbjct: 34  AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAA--PPPYGPPPPYYPAPPGVYPTPPP- 90

Query: 390 RSCSTTPLCHSYPGSP 405
                     + P  P
Sbjct: 91  ----PNSGYMADPQEP 102


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 19/99 (19%), Positives = 26/99 (26%), Gaps = 13/99 (13%)

Query: 291 PPQTQPEPSTKP----PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPN 346
           P    P P   P     P      ++  P      H+A+    A    R   PP G    
Sbjct: 657 PGVPGPVPVGMPAHTARPSR---VARGDPVRPTAHHAALR---APQAPRPGGPPGGGGGL 710

Query: 347 PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
           P P          P  + +            P   G+ Q
Sbjct: 711 PPPPDLPAAAGPAPCGSSLIASPT---APPEPEPPGAEQ 746


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 11/50 (22%), Positives = 18/50 (36%), Gaps = 1/50 (2%)

Query: 319 APPHSAIPTLGAQYPQRVAPPPSGHYP-NPYPAHPVPIPNIHPVHAVVPP 367
           A     I   G  +   + P  S HY  +   A+   +  + P    +PP
Sbjct: 143 AQHLFYIIPGGPSWRTSIKPFSSSHYGSHNSSAYSDHLRPLQPYSGELPP 192


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 28/150 (18%), Positives = 38/150 (25%), Gaps = 22/150 (14%)

Query: 279 SQSQQQASQQDS--------------PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
            QS +  +Q+ S              P   Q   S+ P      PP  L P   AP  S 
Sbjct: 29  QQSSESPTQRASHSLASEGKKNRKKMPKVFQK--SSAPRQIQAAPPQALNPTAAAPQSSR 86

Query: 325 IPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGST 384
            PTL            +            P          V          RP     + 
Sbjct: 87  GPTLRELLALPEDDGETQA------PESSPSARRLTRSEGVARHEMEDLAGRPVVKPDAD 140

Query: 385 QQTQTRSCSTTPLCHSYPGSPISTTCSTTP 414
           +Q +    + +      P S    T S  P
Sbjct: 141 RQLRQDILNKSSSSRRPPVSKEEGTSSKMP 170


>gnl|CDD|148303 pfam06619, DUF1149, Protein of unknown function (DUF1149).  This
           family consists of several hypothetical bacterial
           proteins of unknown function.
          Length = 126

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 143 VDDKKFKTFGDDPREEVMTLERILLQTIK 171
           + D+ F    +  +EEV  L R LL  +K
Sbjct: 78  ILDRLFGEPSEFEQEEVEKLARPLLDILK 106


>gnl|CDD|221562 pfam12406, DUF3664, Surface protein.  This family of proteins is
           found in eukaryotes. Proteins in this family are
           typically between 131 and 312 amino acids in length.
          Length = 100

 Score = 28.9 bits (64), Expect = 1.7
 Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 3/77 (3%)

Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY 332
           Q++D   QSQ+   Q+   PQ   +P T+P     L P  +  ++  P  S  P    Q 
Sbjct: 22  QLIDQAEQSQEPTQQEPIEPQQPTQPETEP---EELEPETVTVEVPEPVTSEEPKESDQT 78

Query: 333 PQRVAPPPSGHYPNPYP 349
            +      S   P   P
Sbjct: 79  EEPTETQDSKQEPTQQP 95


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 29.9 bits (67), Expect = 1.8
 Identities = 23/121 (19%), Positives = 29/121 (23%), Gaps = 15/121 (12%)

Query: 267 LEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPP-PQSHLPPSKLKPDLVAPPHSAI 325
           LED+ H+  D+        S         P        P                 H   
Sbjct: 83  LEDVSHKHQDMGYYKGPPYSGYPFLMLPNPYLPNGSLSPLPPSSNKVPVVQPPHHVHPLT 142

Query: 326 PTLGAQYPQRVAPPPSGHYPN--------PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
           P +           P  H P         P P HP  I   +P    + PG    G    
Sbjct: 143 PLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPFYP----LSPGGV--GQIPH 196

Query: 378 P 378
           P
Sbjct: 197 P 197


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 1.8
 Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 8/69 (11%)

Query: 326 PTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHP-VHAVVPPGSQYYGH-----HRPPP 379
           PT+    P ++A       P PYP   V     +P   AV  P  +  GH     HR   
Sbjct: 200 PTIP--LPPQMAGQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHRASQ 257

Query: 380 AAGSTQQTQ 388
           AAG      
Sbjct: 258 AAGQAASLD 266


>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
           secretion system, contains a FHA domain [Intracellular
           trafficking, secretion, and vesicular    transport;
           Signal transduction mechanisms].
          Length = 430

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 16/103 (15%), Positives = 26/103 (25%), Gaps = 7/103 (6%)

Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
           D     +PEP     P     P +L   +  P   +   L A   + +   P      P 
Sbjct: 131 DFLADIEPEPR---LPPGLTDPLQLADPV--PNGFSADPLSALQSESLIAQPDPAGGAPS 185

Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
            +     P   P  A        +      P   ++       
Sbjct: 186 ISRNSEAPLSDPASA--GGIDTPFPDDLDVPPIAASPPGPQEG 226


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 30.0 bits (67), Expect = 1.9
 Identities = 13/69 (18%), Positives = 21/69 (30%), Gaps = 3/69 (4%)

Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPST---KPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
           Q  +   Q  + A  Q +P +T+   S        Q  LP    +    + P    P   
Sbjct: 152 QTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKAD 211

Query: 330 AQYPQRVAP 338
                +  P
Sbjct: 212 KADKTKPKP 220


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 30.6 bits (69), Expect = 1.9
 Identities = 16/83 (19%), Positives = 19/83 (22%), Gaps = 4/83 (4%)

Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS--AIPTLGAQYP 333
           D     +      DS   + P     P       P    P    P     A     A   
Sbjct: 460 DSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519

Query: 334 QRVAPP--PSGHYPNPYPAHPVP 354
              A    P+     P PA   P
Sbjct: 520 DAPAAAAPPAPEARPPTPAAAAP 542



 Score = 29.4 bits (66), Expect = 4.5
 Identities = 16/101 (15%), Positives = 23/101 (22%), Gaps = 5/101 (4%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
           A    +               +    + L P   A    A  T     P   APP +   
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA---AAATRAEAPPAAPAPPATADR 438

Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
            +       P+P      A              PPA   + 
Sbjct: 439 GDDAADGDAPVPAKANARASADSRCDERDAQ--PPADSGSA 477



 Score = 28.7 bits (64), Expect = 7.4
 Identities = 22/139 (15%), Positives = 31/139 (22%), Gaps = 30/139 (21%)

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLK-PDLVAPPHSAIPTLGAQYPQRVAPP---- 339
           A+ + +P    P  +T         P+     D  A      P      P   AP     
Sbjct: 488 AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547

Query: 340 -------------------------PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGH 374
                                     +        A P P      V    P      G 
Sbjct: 548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607

Query: 375 HRPPPAAGSTQQTQTRSCS 393
             P  AA + Q  ++R   
Sbjct: 608 APPNGAARAEQAAESRGAP 626


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 10/25 (40%), Positives = 11/25 (44%)

Query: 276 DLYSQSQQQASQQDSPPQTQPEPST 300
           DL   S Q  S  D P Q+Q E   
Sbjct: 710 DLPPSSSQAFSLSDLPMQSQSESGL 734


>gnl|CDD|218035 pfam04344, CheZ, Chemotaxis phosphatase, CheZ.  This family
           represents the bacterial chemotaxis phosphatase, CheZ.
           This protein forms a dimer characterized by a long
           four-helix bundle, composed of two helices from each
           monomer. CheZ dephosphorylates CheY in a reaction that
           is essential to maintain a continuous chemotactic
           response to environmental changes. It is thought that
           CheZ's conserved residue Gln 147 orientates a water
           molecule for nucleophilic attack at the CheY active
           site.
          Length = 205

 Score = 29.8 bits (68), Expect = 2.0
 Identities = 7/35 (20%), Positives = 17/35 (48%)

Query: 253 RWWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQ 287
           RW  +   ++ ++  +++CH      S  ++  SQ
Sbjct: 80  RWDRLMAREIELEEFKELCHDTDQFLSDVEEDTSQ 114


>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
          Length = 433

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 13/57 (22%), Positives = 19/57 (33%), Gaps = 2/57 (3%)

Query: 286 SQQDSPPQTQPE-PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           + + + P      PS     Q+    S  + DL    H   P  GA   + V    S
Sbjct: 8   NSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSE-VTKTAS 63


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 22/111 (19%), Positives = 35/111 (31%), Gaps = 18/111 (16%)

Query: 258 FVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDL 317
            + +M    L     ++  L+ + +Q+ S   SP    P P     P +           
Sbjct: 96  SINEMRKLEL--GLEELPQLHEEERQKISPATSPVPPVPLPDPPAFPST----------- 142

Query: 318 VAPPHSAIPTLGAQYPQRVAPPPSG--HYPNPYPAHPVPIPNIHPVHAVVP 366
             P ++A      Q  QR  P      H   P    P P+ N    H  + 
Sbjct: 143 -LPANAAAAAAAQQ--QRDVPQNLINRHRMPPSFQQPPPMKNCLSCHQQIH 190


>gnl|CDD|111097 pfam02166, Androgen_recep, Androgen receptor. 
          Length = 423

 Score = 30.1 bits (67), Expect = 2.3
 Identities = 22/62 (35%), Positives = 27/62 (43%), Gaps = 8/62 (12%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK-LKPDLVAPP---HSAIPT----LGAQ 331
             +QQ SQQ S  +  PE    P P +    SK L     APP    SA P+    LG  
Sbjct: 85  DEEQQPSQQQSALECHPESGCVPEPGAATAASKGLPQQPPAPPDEDDSAAPSTLSLLGPT 144

Query: 332 YP 333
           +P
Sbjct: 145 FP 146


>gnl|CDD|215194 PLN02340, PLN02340, endoglucanase.
          Length = 614

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLP 309
              ++ L+++ Q  +   D+    +P P+T  P QS  P
Sbjct: 483 NAPLVGLFAKLQSASETADAYGSYKPTPNTSSPKQSGAP 521


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 30.2 bits (69), Expect = 2.4
 Identities = 15/111 (13%), Positives = 24/111 (21%), Gaps = 22/111 (19%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL-----------------KPDLVAPP 321
           + +         PP     P   P        S+L                 K +  A  
Sbjct: 388 TAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAAAS 447

Query: 322 -----HSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
                +SA+  L +  P   A   +      Y         +       P 
Sbjct: 448 RARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498



 Score = 28.7 bits (65), Expect = 7.0
 Identities = 15/89 (16%), Positives = 28/89 (31%), Gaps = 7/89 (7%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
             +  QA+   +     P+    PPP +  P        V  P +    L A+   + A 
Sbjct: 377 PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQ---APAVPLPETTSQLLAARQQLQRAQ 433

Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
             +        + P       PV++ +  
Sbjct: 434 GATK----AKKSEPAAASRARPVNSALER 458


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 29.6 bits (67), Expect = 2.6
 Identities = 24/86 (27%), Positives = 26/86 (30%), Gaps = 19/86 (22%)

Query: 291 PPQTQPEPSTKPPPQSHL------PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
           P  TQ  P+  P   +            L P  VAPP     T     P  V PP     
Sbjct: 57  PAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPP----NTPVEPEPAPVEPPKPKPV 112

Query: 345 PNPYP---------AHPVPIPNIHPV 361
             P P         A P P P   PV
Sbjct: 113 EKPKPKPKPQQKVEAPPAPKPEPKPV 138


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 30.1 bits (67), Expect = 2.6
 Identities = 17/96 (17%), Positives = 30/96 (31%), Gaps = 2/96 (2%)

Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLK--PDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           Q +    P +  P+ +      +      ++   +   PP + +    A   +  AP   
Sbjct: 428 QENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSE 487

Query: 342 GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
               NP  A P          A   P   +YG+  P
Sbjct: 488 VPSENPIQATPNDEAVETETFAHEAPAEPFYGYGFP 523


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 29.7 bits (66), Expect = 2.6
 Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 8/55 (14%)

Query: 280 QSQQQASQQDSPP-------QTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
           Q +Q++    + P       QT  E ST P  +   PPS   P  +    S  P 
Sbjct: 100 QEEQKSLNISATPAPKQEQSQTTTE-STTPKTKVTTPPSTNTPQPMQSTKSDTPQ 153


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 28.5 bits (64), Expect = 2.7
 Identities = 13/54 (24%), Positives = 14/54 (25%), Gaps = 3/54 (5%)

Query: 332 YPQRVAPPPSGHYPNPY--PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGS 383
            PQ     P    P     P  P   P     H  +PP  Q Y    P      
Sbjct: 11  PPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHF-MPPQPQPYPKQSPQQQQPP 63


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.0 bits (67), Expect = 2.8
 Identities = 32/117 (27%), Positives = 37/117 (31%), Gaps = 14/117 (11%)

Query: 283 QQASQQDSPPQTQPEPSTKP--PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV---A 337
           Q A     PP   P P   P  PP     P+       A P +A P   A+ P      A
Sbjct: 688 QWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGR--ARPPAAAPG-RARPPAAAPGRA 744

Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCST 394
            PP+       P    P     P  A   P  Q      PPP A    Q + R   T
Sbjct: 745 RPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQ------PPPQAPPAPQQRPRGAPT 795


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 29.6 bits (66), Expect = 3.0
 Identities = 22/111 (19%), Positives = 35/111 (31%), Gaps = 9/111 (8%)

Query: 268 EDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
           E +  Q+  +  Q++QQA + D   +   +  T+P                 P     P 
Sbjct: 110 EQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGV---------NVANNPQPQVQPQ 160

Query: 328 LGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
            G    QR+ P   G    P          + P    +PP     G +  P
Sbjct: 161 FGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMP 211


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 29.8 bits (67), Expect = 3.2
 Identities = 24/102 (23%), Positives = 34/102 (33%), Gaps = 9/102 (8%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT-LGAQYPQRVA 337
            + QQ+ S  D+   + P+P     P+S   P K  P          P     +  +  A
Sbjct: 239 EKPQQETSSMDTEKSSAPKPRETLDPKS---PEKAPPIDTTEEELKSPEASPKESEEASA 295

Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
              S    +P P    P P   P  +   P S      RP P
Sbjct: 296 RKRSPSLLSPSPKAESPKPLASPGKSPRDPLSP-----RPKP 332


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 30.0 bits (67), Expect = 3.3
 Identities = 39/163 (23%), Positives = 47/163 (28%), Gaps = 38/163 (23%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
           S+    +S   + P +  E      P   L  S L PD   PP    P  G + PQ    
Sbjct: 701 SEESHLSSMSPTQPISHEEQPRYEDPDDPLDLS-LHPDQAPPPSHQAPYSGHEEPQAQQA 759

Query: 339 PPSGHY---PNPYPAHPVPIPNIHPVHAVVPPG---------------------SQYYGH 374
           P  G++   P   P      P    V     PG                     SQY GH
Sbjct: 760 PYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGH 819

Query: 375 ------------HRPPPAAGSTQQTQTRSCSTTPLCHSYPGSP 405
                       H PP   GS    Q +      L  S  G P
Sbjct: 820 GHPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHL-QSETGPP 861


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 29.7 bits (67), Expect = 3.3
 Identities = 19/71 (26%), Positives = 20/71 (28%), Gaps = 7/71 (9%)

Query: 321 PHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPA 380
           P       GA  P   A PP    P   PA P       P  A  PP +         P 
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAA-------AAPP 434

Query: 381 AGSTQQTQTRS 391
           A S       S
Sbjct: 435 ARSADPAAAAS 445


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 29.9 bits (68), Expect = 3.3
 Identities = 18/109 (16%), Positives = 24/109 (22%), Gaps = 13/109 (11%)

Query: 274 VLDLYSQSQQQASQQD----------SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS 323
           V ++Y Q     +  D           P  T    +      +        P   AP   
Sbjct: 11  VEEMYQQYLADPNSVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAP 70

Query: 324 AIPTLGAQYPQRVAPPPSGHYPNPYP--AHPVPIPNIHPVHAVVPPGSQ 370
           A     A       P P+          A P       P  A V     
Sbjct: 71  APAPPAAA-APAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118



 Score = 29.1 bits (66), Expect = 5.3
 Identities = 14/102 (13%), Positives = 23/102 (22%), Gaps = 10/102 (9%)

Query: 254 WWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL 313
           W E F  D                   +   A+   +        +  P   +  PP+  
Sbjct: 29  WREFF-ADYGPGSTAA---------PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAA 78

Query: 314 KPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
            P     P +A     A      A   +            P+
Sbjct: 79  APAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120


>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter.  The Epithelial Na+
           Channel (ENaC) Family (TC 1.A.06)The ENaC family
           consists of sodium channels from animals and has no
           recognizable homologues in other eukaryotes or bacteria.
           The vertebrate ENaC proteins from epithelial cells
           cluster tightly together on the phylogenetic tree:
           voltage-insensitive ENaC homologues are also found in
           the brain. Eleven sequenced C. elegans proteins,
           including the degenerins, are distantly related to the
           vertebrate proteins as well as to each other. At least
           some ofthese proteins form part of a mechano-transducing
           complex for touch sensitivity. Other members of the ENaC
           family, the acid-sensing ion channels, ASIC1-3,are homo-
           or hetero-oligomeric neuronal H+-gated channels that
           mediate pain sensation in response to tissue acidosis.
           The homologous Helix aspersa(FMRF-amide)-activated Na+
           channel is the first peptide neurotransmitter-gated
           ionotropic receptor to be sequenced.Mammalian ENaC is
           important for the maintenance of Na+ balance and the
           regulation of blood pressure. Three homologous ENaC
           subunits, a, b and g, havebeen shown to assemble to form
           the highly Na+-selective channel.This model is designed
           from the vertebrate members of the ENaC family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 595

 Score = 29.7 bits (67), Expect = 3.5
 Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 6/77 (7%)

Query: 266 LLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAI 325
           +++ I   +L L  + ++   ++  PP  +P     P P S   P  L+ D      SA+
Sbjct: 520 IIDLIFITLLRLLWRFRKWWQRRRGPPYAEP-----PEPVSADTPPSLQLDDPPTFPSAL 574

Query: 326 PTLGAQYPQ-RVAPPPS 341
           P   A        PPP+
Sbjct: 575 PLPHASGLSLPGTPPPN 591


>gnl|CDD|235161 PRK03814, PRK03814, oxaloacetate decarboxylase subunit gamma;
           Provisional.
          Length = 85

 Score = 27.4 bits (61), Expect = 3.6
 Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 6/48 (12%)

Query: 292 PQTQPEPSTKPPPQSHLPP--SKLKPDLVAPPHSAIPTLGAQYPQRVA 337
           PQ  PEP   P       P  S + P +VA    AI     Q+    A
Sbjct: 41  PQEVPEPIAAPKKSKKQQPTSSAVSPQVVA----AISAAVHQHRASKA 84


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 29.4 bits (66), Expect = 3.7
 Identities = 18/92 (19%), Positives = 26/92 (28%)

Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
            DS   T P+   +  P       + +P L+    S+    G         P +G     
Sbjct: 183 DDSRDYTNPDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMG 242

Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
            P  P          A + P    YG     P
Sbjct: 243 PPHGPPSRYRPAYEAAPLAPAISSYGPAGGGP 274


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 29.3 bits (65), Expect = 4.0
 Identities = 9/62 (14%), Positives = 17/62 (27%), Gaps = 4/62 (6%)

Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
            P+   +P+ + P Q     +  +     P    +        Q     P+   P     
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQ----PAQQPTPQNPAQQPPQTEQG 255

Query: 351 HP 352
           H 
Sbjct: 256 HK 257



 Score = 28.9 bits (64), Expect = 6.2
 Identities = 9/31 (29%), Positives = 10/31 (32%)

Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPP 310
              QQ  QQ      QP     P   +  PP
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPP 250



 Score = 28.5 bits (63), Expect = 7.7
 Identities = 8/28 (28%), Positives = 13/28 (46%)

Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPP 310
           QQ + Q+   Q Q +P  +P   +  P 
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPT 241


>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
           Provisional.
          Length = 172

 Score = 28.4 bits (63), Expect = 4.4
 Identities = 24/86 (27%), Positives = 32/86 (37%), Gaps = 11/86 (12%)

Query: 231 YLAGKLSKFEVVDWIGRNSKHLRWWEMFVEDM--TMDLL----EDICHQVLDLYSQSQQQ 284
           Y+ GKL   E   W     K  R+    V DM  TM LL    +          + +Q  
Sbjct: 78  YIEGKLQTRE---WEKDGIK--RYTTEIVVDMQGTMQLLGGRPQGDDQGGQGGGNYNQSA 132

Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPP 310
             QQ   PQ   +  ++P PQ   P 
Sbjct: 133 PRQQAQRPQQAAQQQSRPAPQQPAPQ 158



 Score = 28.0 bits (62), Expect = 6.1
 Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 1/53 (1%)

Query: 273 QVLDLYSQSQQQASQ-QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
           Q+L    Q   Q  Q   +  Q+ P    + P Q+    S+  P   AP  +A
Sbjct: 109 QLLGGRPQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161


>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER).  This
           family consists of several eukaryotic immediate early
           response (IER) 2 and 5 proteins. The role of IER5 is
           unclear although it play an important role in mediating
           the cellular response to mitogenic signals. Again,
           little is known about the function of IER2 although it
           is thought to play a role in mediating the cellular
           responses to a variety of extracellular signals.
          Length = 272

 Score = 28.7 bits (64), Expect = 4.7
 Identities = 21/121 (17%), Positives = 26/121 (21%), Gaps = 10/121 (8%)

Query: 292 PQTQPEPSTKPPPQSHL-PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
            Q    P  + P    L PPS  +P L A   S         P+   P P   +  P  +
Sbjct: 62  EQPMVPPPEQQPGAPRLHPPSGAEPPLPARAAS---------PEEEQPAPEPAFYRPSVS 112

Query: 351 HPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGSPISTTC 410
              P     P       G               T           P             C
Sbjct: 113 EGFPASGERPDATSTGSGQCSQTTVLDLDTHVVTTVESPYLHQDCPCQAEDGQPAPPAVC 172

Query: 411 S 411
            
Sbjct: 173 K 173


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 28.8 bits (64), Expect = 4.8
 Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 1/58 (1%)

Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
           QA+   +P  T+ E     PP + +   + KP+   PP + +     Q P    P  +
Sbjct: 26  QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEA-PQQTPNATTPSST 82



 Score = 28.1 bits (62), Expect = 7.9
 Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 5/47 (10%)

Query: 286 SQQDSPPQTQPE-----PSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
           + Q +PP T+ E     P+   PP + +   +  P+   P  + + T
Sbjct: 40  APQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVET 86


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.1 bits (65), Expect = 4.9
 Identities = 18/65 (27%), Positives = 22/65 (33%), Gaps = 5/65 (7%)

Query: 295 QPEPSTKPPPQSHLPPSKLK-PDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP--AH 351
            P P+  P  +   P   L  PD    P  A P      P  +  P S   P   P  A 
Sbjct: 381 APSPAGSPDVKKKAPEPDLPQPDRHPGP--AKPEAPGARPAELPSPASAPTPEQQPPVAR 438

Query: 352 PVPIP 356
             P+P
Sbjct: 439 SAPLP 443


>gnl|CDD|222456 pfam13925, Katanin_con80, con80 domain of Katanin.  The con80
           domain of katanin is the C-terminal region of the
           protein that binds to the N-terminal domain of
           katanin-p60, the catalytic ATPase. The complex
           associates with a specific subregion of the mitotic
           spindle leading to increased microtubule disassembly and
           targeting of p60 to the spindle poles. The assembly and
           function of the mitotic spindle requires the activity of
           a number of microtubule-binding proteins. Katanin, a
           heterodimeric microtubule-severing ATPase, is found
           localized at mitotic spindle poles. A proposed model is
           that katanin is targeted to spindle poles through a
           combination of direct microtubule binding by the p60
           subunit and through interactions between the WD40 domain
           and an unknown protein.
          Length = 164

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 4/23 (17%), Positives = 11/23 (47%)

Query: 48  KDIRNSPSFREGIDYETEQRYRR 70
           +   ++     G+D   E+R ++
Sbjct: 105 RSALSAAPSSVGVDLSREERLQK 127


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 21/98 (21%), Positives = 26/98 (26%), Gaps = 12/98 (12%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
           SQQQ     +    Q   ST P   + + PSK             P    Q P    P  
Sbjct: 154 SQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTP---PQKPATKKPAV 210

Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVP-------PGSQY 371
           + H          P  +  P             P S Y
Sbjct: 211 NHH-KTA-TVAVPPATSGKPKSGAASARALSSAPASHY 246



 Score = 28.1 bits (63), Expect = 8.0
 Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 7/67 (10%)

Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
           DL   S   +  Q SP       S           +  + D+ APP S  PT  A     
Sbjct: 80  DLSGSSSLSSGNQSSPSAANN-TSDGHDASGVKNTAPPQ-DISAPPISPTPTQAAP---- 133

Query: 336 VAPPPSG 342
               P+G
Sbjct: 134 -PQTPNG 139


>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
          Length = 214

 Score = 28.6 bits (63), Expect = 5.7
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 2/71 (2%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
           +P   Q +P    P Q+ +P       +V  PH+A        P  VAP  +G   +P P
Sbjct: 146 APVIAQAQPIAPAPHQTLVPAPH--DTIVPAPHAAHLPSAPATPVSVAPVDAGTSASPTP 203

Query: 350 AHPVPIPNIHP 360
           + P  I    P
Sbjct: 204 SEPTKIQEKQP 214


>gnl|CDD|215182 PLN02321, PLN02321, 2-isopropylmalate synthase.
          Length = 632

 Score = 28.8 bits (64), Expect = 5.9
 Identities = 11/69 (15%), Positives = 16/69 (23%), Gaps = 4/69 (5%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP-SGHYPNPY 348
           SP ++    +  P   S                 +            A P        P 
Sbjct: 14  SPAKSLSAFTPAPTRSSASSARFPAFLARPAAARSPSLASRASSALAASPSRPQVARRPR 73

Query: 349 PAHPVPIPN 357
           P +   IPN
Sbjct: 74  PEY---IPN 79


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 28.9 bits (64), Expect = 6.0
 Identities = 18/97 (18%), Positives = 24/97 (24%), Gaps = 20/97 (20%)

Query: 294 TQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPV 353
                +  P P+ H   +  KPD    PH A     ++ P     P S     P  A   
Sbjct: 19  IANLNTNIPIPELHTSAATQKPDPAPAPHQA----ASRAPDPAVAPTSAASRKPDLA--- 71

Query: 354 PIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
                        P        +  PA    Q     
Sbjct: 72  -----------QAPTPAAS--EKFDPAPAPHQAASRA 95


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 28.3 bits (63), Expect = 6.0
 Identities = 21/116 (18%), Positives = 28/116 (24%), Gaps = 23/116 (19%)

Query: 292 PQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP----------S 341
            Q  P            P    +     P   A+  +   +   VA P           +
Sbjct: 45  DQIVPGAEQAASAA---PVHAAREATADPEFVAVEPVPTPHVPAVALPGDTDAPAEPGAA 101

Query: 342 GHYP-NPYPAHPVPIPNIHPVH--AVVPPGSQYY---GHHRP---PPAAGSTQQTQ 388
            H       A   P+P+          PP        G   P   PPAAG      
Sbjct: 102 PHVVAERAAAMQAPLPS-ALAADDPQAPPAGATAADAGDAAPDATPPAAGDASPPA 156


>gnl|CDD|173815 cd07490, Peptidases_S8_6, Peptidase S8 family domain,
           uncharacterized subfamily 6.  This family is a member of
           the Peptidases S8 or Subtilases serine endo- and
           exo-peptidase clan. They have an Asp/His/Ser catalytic
           triad similar to that found in trypsin-like proteases,
           but do not share their three-dimensional structure and
           are not homologous to trypsin. The stability of
           subtilases may be enhanced by calcium, some members have
           been shown to bind up to 4 ions via binding sites with
           different affinity. Some members of this clan contain
           disulfide bonds. These enzymes can be intra- and
           extracellular, some function at extreme temperatures and
           pH values.
          Length = 254

 Score = 28.3 bits (63), Expect = 6.2
 Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 9/42 (21%)

Query: 304 PQSHLPPSKLKPDLVAPPHSAIPTLGAQ-YPQRVAPPPSGHY 344
                P    KPD+ AP        G   Y  R      G Y
Sbjct: 180 APDSPPDEYTKPDVAAP--------GVDVYSARQGANGDGQY 213


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 28.6 bits (64), Expect = 6.5
 Identities = 18/81 (22%), Positives = 31/81 (38%), Gaps = 11/81 (13%)

Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
             ++  A   +   +T PE +   P + +  P K     V  P+ +      +YP    P
Sbjct: 218 PAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP---VPKPYIS----PDEYPSAPLP 270

Query: 339 PPSGHYPNPY-PAHPVPIPNI 358
           P     P    P+  +P PN+
Sbjct: 271 P---ELPQLLQPSLVIPFPNL 288


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 28.4 bits (63), Expect = 6.5
 Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 1/89 (1%)

Query: 269 DICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTL 328
           D+    L   + ++ ++S  +S   +              PPS   P     P    P  
Sbjct: 124 DVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFPPGPPPPPPGF 183

Query: 329 GAQYPQ-RVAPPPSGHYPNPYPAHPVPIP 356
           G    +    PP    +P P+P  P  IP
Sbjct: 184 GRHGEKPSGWPPFLSGWPPPFPLGPPMIP 212


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 28.5 bits (63), Expect = 6.7
 Identities = 31/136 (22%), Positives = 43/136 (31%), Gaps = 14/136 (10%)

Query: 281 SQQQASQQDSPPQTQPEPSTKPP-----PQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
           S     + D    + P P    P      Q H P       + A PH     L       
Sbjct: 105 SSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSGISASPHPTPSPLHFPTSPI 164

Query: 336 VAPPPSGHYPNP---YPAHPVPIPNIHP-VHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
           +   PS ++P+    YP H  P   +   V  V  P SQ     +     GS Q      
Sbjct: 165 LPQQPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQ-----QAGQPNGSGQGKVPNH 219

Query: 392 CSTTPLCHSYPGSPIS 407
              TP+    P  P++
Sbjct: 220 FLPTPMLAPPPPPPMA 235


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 28.9 bits (64), Expect = 6.7
 Identities = 17/65 (26%), Positives = 22/65 (33%), Gaps = 1/65 (1%)

Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
           S  +     S K P    +P S  +P+    P    P      P+R   P     P P P
Sbjct: 602 SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP-P 660

Query: 350 AHPVP 354
             P P
Sbjct: 661 KSPKP 665


>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo
           red.  RCR proteins are ER membrane proteins that
           regulate chitin deposition in fungal cell walls.
           Although chitin, a linear polymer of beta-1,4-linked
           N-acetylglucosamine, constitutes only 2% of the cell
           wall it plays a vital role in the overall protection of
           the cell wall against stress, noxious chemicals and
           osmotic pressure changes. Congo red is a cell
           wall-disrupting benzidine-type dye extensively used in
           many cell wall mutant studies that specifically targets
           chitin in yeast cells and inhibits growth. RCR proteins
           render the yeasts resistant to Congo red by diminishing
           the content of chitin in the cell wall. RCR proteins are
           probably regulating chitin synthase III interact
           directly with ubiquitin ligase Rsp5, and the VPEY motif
           is necessary for this, via interaction with the WW
           domains of Rsp5.
          Length = 124

 Score = 27.4 bits (61), Expect = 7.0
 Identities = 19/86 (22%), Positives = 27/86 (31%), Gaps = 22/86 (25%)

Query: 278 YSQSQQQASQQDSPPQTQPEP----------------STKPPPQSHLPPSKLKPDLVAPP 321
           Y QSQQQ +        Q  P                  +  P    P  +L+P    PP
Sbjct: 45  YRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHPNPKAPAIELQP----PP 100

Query: 322 HSAIPTLGAQYPQRVAPPPSGHYPNP 347
           ++      +        PP+G  P P
Sbjct: 101 NAYERGTRSPTTDDEYQPPAG--PPP 124


>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
          Length = 2316

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 22/86 (25%), Positives = 33/86 (38%), Gaps = 16/86 (18%)

Query: 277  LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPP--SKLKPDL-----VAP-PHSAIPTL 328
            L++Q +  A+     P   P P+  PP    L    ++  P L      A  P + +  L
Sbjct: 1685 LHAQLEALAAA----PANTPSPAAPPPA---LSAALAQRFPPLAEAYAQAGTPRARLAAL 1737

Query: 329  GAQYPQRVAPPPSGHYPNPYPAHPVP 354
               YP    P P   +P   P + VP
Sbjct: 1738 VDHYPVARLPHPDEQFPEGQPGY-VP 1762


>gnl|CDD|218610 pfam05498, RALF, Rapid ALkalinization Factor (RALF).  RALF, a 5-kDa
           ubiquitous polypeptide in plants, arrests root growth
           and development.
          Length = 66

 Score = 26.2 bits (58), Expect = 7.8
 Identities = 11/44 (25%), Positives = 13/44 (29%), Gaps = 8/44 (18%)

Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCH 399
            N  P       G+ YY      PA        +R CS    C 
Sbjct: 30  RNSVPCS---RRGASYYNCKPGAPA-----NPYSRGCSAITRCR 65


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 28.3 bits (63), Expect = 7.9
 Identities = 23/96 (23%), Positives = 28/96 (29%), Gaps = 10/96 (10%)

Query: 282 QQQASQQDSPPQTQPEPSTKPP--PQSHLPPSKLKPDLVAPPHSA-IPTLG--AQYPQRV 336
            Q   +  S     P P       P +H  P++   D   PP  A  P L   A  PQ V
Sbjct: 88  HQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPV 147

Query: 337 A--PPPSG---HYPNPYPAHPVPIPNIHPVHAVVPP 367
               P        P        P   + P   V  P
Sbjct: 148 HSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVKRP 183


>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
          Length = 84

 Score = 26.5 bits (58), Expect = 8.1
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 13/87 (14%)

Query: 220 WEPEIIAVALMYLAGKLSKFEVVDWIGRNSKHLRWWEMFVEDMTMDLLEDICHQVLDLYS 279
           WE  II + ++ L G     +    IGR+   +R ++  V++M+ D              
Sbjct: 6   WEIGIIVLLIIVLFGAKKLPDAARSIGRS---MRIFKSEVKEMSKDD---------QTPE 53

Query: 280 QSQQQASQQDSPPQT-QPEPSTKPPPQ 305
           Q QQQ  QQ +P Q   P+P+ +   Q
Sbjct: 54  QQQQQPQQQIAPNQIEAPQPNFQQHYQ 80


>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP.  This
           is a family of eukaryotic single-stranded DNA binding
           proteins with specificity to a pyrimidine-rich element
           found in the promoter region of the alpha2(I) collagen
           gene.
          Length = 293

 Score = 28.1 bits (62), Expect = 8.1
 Identities = 18/76 (23%), Positives = 23/76 (30%), Gaps = 3/76 (3%)

Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
           A +    A  PS    N  P   +P     P       G Q+     P   +    Q  T
Sbjct: 8   AFHDYSAAAAPSPVLGNMPPGDGMPQGPDPPGFFQGAGGKQHQQKKTPQSGSTPQMQNTT 67

Query: 390 RSCSTTPLCHSYPGSP 405
                +P    YPG P
Sbjct: 68  SQPFMSP---RYPGGP 80


>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein). 
          Length = 560

 Score = 28.2 bits (63), Expect = 8.4
 Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 28/133 (21%)

Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKL-----KPDLVAPPHSAI----PTL---------- 328
           +D  P +QP P   PP      P          DL+   +  +    P+L          
Sbjct: 208 RDQKPPSQPHP-LAPPLPRPTSPPGPGTGDRLLDLLQGAYLTLNATNPSLTQDCWLCLVS 266

Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHR---PPPAAGSTQ 385
           G  Y + +   P+G   N   A   P  ++ P  A+        G      P        
Sbjct: 267 GPPYYEGI--APNGSVSNHTSA---PACSLGPQLALTLSEVTGQGCCIGGIPVGHQALCN 321

Query: 386 QTQTRSCSTTPLC 398
            T + S  +  LC
Sbjct: 322 TTVSTSSGSHYLC 334


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 28.1 bits (62), Expect = 8.7
 Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 6/62 (9%)

Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK-----LKPDLVAPPHSAIP 326
            + L+      ++  Q    PQ+QP+P  + P Q +    K     L PD   P ++ I 
Sbjct: 161 QEKLEESEIQSEKQEQPSQQPQSQPQPQQQ-PQQPNNNKRKAMEEGLSPDRTRPRYNGIT 219

Query: 327 TL 328
           T+
Sbjct: 220 TV 221


>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
          Length = 617

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 15/118 (12%), Positives = 23/118 (19%), Gaps = 13/118 (11%)

Query: 264 MDLLEDICHQVL-------------DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPP 310
            DL+     + L              +   ++ +       P      S      S  PP
Sbjct: 314 HDLIVQAIQEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSAPP 373

Query: 311 SKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
            + +           P          A   S       PA               P G
Sbjct: 374 EQSEESQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEAAEEADSFPLG 431


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 26.5 bits (59), Expect = 9.0
 Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 11/80 (13%)

Query: 339 PPSGHYP-----NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
           P +  YP     + Y   P P P   P     P   +     R    + S+  T   S  
Sbjct: 5   PVTPKYPLLDLLHTYTTPPRPPP--PPP----PWAPKKRCRRRLESDSDSSSTTSPSSPP 58

Query: 394 TTPLCHSYPGSPISTTCSTT 413
           T       P +  +T  S T
Sbjct: 59  TPQPTTECPWTVTTTGSSVT 78


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 28.5 bits (63), Expect = 9.6
 Identities = 19/82 (23%), Positives = 25/82 (30%), Gaps = 5/82 (6%)

Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP---PSGHY 344
              PP   P    +P  Q        +P     P  A P +          P    +   
Sbjct: 468 AQLPPG--PLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVM 525

Query: 345 PNPYPAHPVPIPNIHPVHAVVP 366
           P P P  PVP+P +     V P
Sbjct: 526 PQPMPVEPVPVPTVALERPVCP 547


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 27.2 bits (60), Expect = 9.9
 Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 1/42 (2%)

Query: 338 PPPSGHYPNPYPAH-PVPIPNIHPVHAVVPPGSQYYGHHRPP 378
           PPP G  P P P   P   P  +P      P  Q Y +   P
Sbjct: 78  PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGP 119


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.136    0.432 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,395,639
Number of extensions: 2058749
Number of successful extensions: 3904
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3020
Number of HSP's successfully gapped: 349
Length of query: 414
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 315
Effective length of database: 6,546,556
Effective search space: 2062165140
Effective search space used: 2062165140
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.1 bits)