RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14745
         (854 letters)



>gnl|CDD|111745 pfam02891, zf-MIZ, MIZ/SP-RING zinc finger.  This domain has SUMO
           (small ubiquitin-like modifier) ligase activity and is
           involved in DNA repair and chromosome organisation.
          Length = 50

 Score = 98.5 bits (246), Expect = 3e-25
 Identities = 30/50 (60%), Positives = 39/50 (78%)

Query: 177 VSLKCPITFKKITLPARGHDCKHIQCFDLESYLQLNCERGSWRCPVCNKP 226
           VSLKCPI++ +I++P RG  CKHIQCFDL S+L+ N    +W CPVC+KP
Sbjct: 1   VSLKCPISYLRISIPVRGRFCKHIQCFDLLSFLESNERTPTWNCPVCDKP 50


>gnl|CDD|192832 pfam11789, zf-Nse, Zinc-finger of the MIZ type in Nse subunit.
           Nse1 and Nse2 are novel non-SMC subunits of the fission
           yeast Smc5-6 DNA repair complex. This family is the
           zinc-finger domain similar to the MIZ type of
           zinc-finger.
          Length = 57

 Score = 39.2 bits (92), Expect = 3e-04
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 4/51 (7%)

Query: 176 KVSLKCPITFKKITLPARGHDCKHIQCFDLESYLQLNCERGSWRCPV--CN 224
            +SL CP+T +    P     C H+  F+ ++ L +     + +CPV  C+
Sbjct: 9   TISLTCPLTLQPFEEPVTSKKCNHV--FEKDAILSMLRRNKTVKCPVAGCS 57


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 43.2 bits (102), Expect = 7e-04
 Identities = 55/362 (15%), Positives = 89/362 (24%), Gaps = 53/362 (14%)

Query: 484 PLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPA 543
           P PG  T      S +         ST +P               +     PT     P 
Sbjct: 71  PPPGPGTEAPANESRS---TPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP 127

Query: 544 INPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLN 603
            +P                P    M     S G     +   +G +      D +     
Sbjct: 128 PSP---------------APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172

Query: 604 AMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPS 663
           A+  S  E+    P +P     P TP     P  P  SS       + +     +    +
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232

Query: 664 DLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPT 723
             +   ++  +  G G    N  P     P+ L + +        PSS     ++   P 
Sbjct: 233 GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPR 292

Query: 724 TNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIP 783
                                 +P +PG    PS P +SS+     ++SSS+  S+ +  
Sbjct: 293 ----------------ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESS 336

Query: 784 SDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGP 843
                 P                                PP             +     
Sbjct: 337 RGAAVSPGPSPSRSPSPSR-------------------PPPPADPSSPRKRPRPSRAPSS 377

Query: 844 TT 845
             
Sbjct: 378 PA 379



 Score = 37.5 bits (87), Expect = 0.037
 Identities = 41/261 (15%), Positives = 72/261 (27%), Gaps = 12/261 (4%)

Query: 588 GNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQD 647
           G+ G    D++  +   +    +      P T   P        N   S P  S +    
Sbjct: 38  GSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAP 97

Query: 648 TQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSY-LDPPDLA 706
              +   + +     S     P                 P   +  M        PP  A
Sbjct: 98  ASPAREGSPTPPGPSSPDPPPPTPPPASPPPS-------PAPDLSEMLRPVGSPGPPPAA 150

Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMP----HTPHTPHTPHTPHTPGGNGPPSVPPSS 762
           +PP++G+S         ++         P      P +P     P TP     P  P  S
Sbjct: 151 SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRS 210

Query: 763 STQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLD 822
           S       + +     +    +  +   ++  +  G G    N  P     P+ L + + 
Sbjct: 211 SPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW 270

Query: 823 PPDLATPPSSGSSNGNNPGGP 843
                  PSS     ++   P
Sbjct: 271 EASGWNGPSSRPGPASSSSSP 291



 Score = 34.0 bits (78), Expect = 0.46
 Identities = 42/246 (17%), Positives = 67/246 (27%), Gaps = 28/246 (11%)

Query: 482 SSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIK 541
            + +  +       A   S P   +  + SSP   PP S       P         S   
Sbjct: 159 PAAVASDAASSRQAALPLSSPEETAR-APSSPPAEPPPSTPPAAASPRPPRRSSPISA-- 215

Query: 542 PAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNS-GDFPG--DNS 598
            A +P P   A G    +++G  S   S + SSG   G  N+ P    +    P     +
Sbjct: 216 SASSPAP---APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEA 272

Query: 599 LESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVST 658
                   +        +P       +P +PG    PS P +SS+     ++SSS+  S+
Sbjct: 273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332

Query: 659 DDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNN 718
            +        P                                PP             + 
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSR-------------------PPPPADPSSPRKRPRPSR 373

Query: 719 PGGPTT 724
                 
Sbjct: 374 APSSPA 379



 Score = 29.8 bits (67), Expect = 8.8
 Identities = 36/198 (18%), Positives = 55/198 (27%), Gaps = 26/198 (13%)

Query: 481 HSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRD--VKPTFSDIKPTFS 538
            +   P    PP TP ++ S  P    +S  S   S P    GR        S    + S
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPR-RSSPISASASSPAPAPGRSAADDAGASSSDSSSS 242

Query: 539 DIK--------------PAINPHPQSIASGSMMNN---------NSGPPSYSMSMNRSSG 575
           +                PA    P  I   S  N          +S  P         S 
Sbjct: 243 ESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302

Query: 576 GNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPP 635
              G     P   +S     ++S  S ++  +S          +P    +P  P     P
Sbjct: 303 PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362

Query: 636 SVPPSSSTQQQDTQTSSS 653
           S P       +   + ++
Sbjct: 363 SSPRKRPRPSRAPSSPAA 380


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 42.2 bits (99), Expect = 0.001
 Identities = 72/433 (16%), Positives = 114/433 (26%), Gaps = 46/433 (10%)

Query: 468  QQNEMRNNLTYQQHSSPLPGNPT-PPLTPASSTSVPPFVSPNSTSSPFISP-PHSVSGRD 525
                 R +   Q      P +    P  PA  + +PP         P  SP  +      
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642

Query: 526  VKPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFP 585
                    +P        ++   ++   G     +S P        R + G+     D P
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702

Query: 586  SGGNSGDFPGDNSLESL-----NAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPS 640
                + +      + +       A  +  S  +P  P  P  P  P TPGG   P+ PP+
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762

Query: 641  SSTQQQDT---------------QTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNL 685
            ++                        +S + S + +PS  +                   
Sbjct: 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822

Query: 686  LPDNVVDPMELLSYLDPPDLATPPSSGSSNGNN--PGG--------------PTTNDDI- 728
             P   + P        PP    PP      G +  PGG              P       
Sbjct: 2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882

Query: 729  ---LALEQMPHTPHT----PHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDD 781
               LA   +  +  +    P  P  P  P    PP   P      Q              
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942

Query: 782  IPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPG 841
            +    +   A    G         L+P  V  P   +    P   A   S+    G++  
Sbjct: 2943 LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002

Query: 842  GPTTNDDILALFE 854
              ++    LAL E
Sbjct: 3003 RVSSWASSLALHE 3015



 Score = 39.9 bits (93), Expect = 0.008
 Identities = 55/321 (17%), Positives = 89/321 (27%), Gaps = 34/321 (10%)

Query: 486  PGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPAIN 545
            P  P PP  PA+    P  ++  + +S   S     S  D     + +    + + PA +
Sbjct: 2766 PPAPAPPAAPAAGP--PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823

Query: 546  PHPQSIASGSMMNNNSGPPSYSMSMNRSSGGN---GGGFNDFPSGGNSGDFPGDNSLESL 602
            P        S       PP      +   GG+   GG     P   +    P   +   +
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883

Query: 603  NAMEK-SLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDI 661
              + + ++S         P  P  P  P    PP   P      Q              +
Sbjct: 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943

Query: 662  PSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGG 721
                +   A    G         L+P  V  P   +    P   A   S+    G++   
Sbjct: 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003

Query: 722  PTTNDDILALEQMPHTP-----HTPHTPHT-----------------------PHTPGGN 753
             ++    LAL +    P      T   P                         P  P  +
Sbjct: 3004 VSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPH 3063

Query: 754  GPPSVPPSSSTQQQDTQTSSS 774
             P +  P  +T +   + S S
Sbjct: 3064 DPFAHEPDPATPEAGARESPS 3084



 Score = 31.1 bits (70), Expect = 4.0
 Identities = 20/82 (24%), Positives = 27/82 (32%), Gaps = 10/82 (12%)

Query: 685  LLPDNVVDPMELLSYLDPP---DLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTP 741
            L  D+  DP   L    PP   D + PP   +     P  P     + +  + P  P   
Sbjct: 2543 LASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPR---PSEPA----VTSRARRPDAPPQS 2595

Query: 742  HTPHTPHTPGGNGPPSVPPSSS 763
              P  P    G+     PPS  
Sbjct: 2596 ARPRAPVDDRGDPRGPAPPSPL 2617



 Score = 30.3 bits (68), Expect = 5.9
 Identities = 20/86 (23%), Positives = 27/86 (31%), Gaps = 3/86 (3%)

Query: 337  PNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFG--PNSAPSSRGSSVRQSTPSFPSSSGP 394
            P G + PG P+        +  P   P  AP    P        +S+ +S  S PS   P
Sbjct: 2745 PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP 2804

Query: 395  NSNANGGSNHQYFGAGNQFP-GGVPP 419
                               P G +PP
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPP 2830


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 38.6 bits (90), Expect = 0.015
 Identities = 27/151 (17%), Positives = 48/151 (31%), Gaps = 13/151 (8%)

Query: 626 PHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNL 685
           P TP    PP+     ++    +QTS+S ++  D   +     P      E         
Sbjct: 645 PKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVP------EAALASGSAP 698

Query: 686 LPDNVVDPMELLSYLDPPDLA----TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTP 741
            P  V DP +   + + P++A     P ++   N +      +N ++ A+EQ        
Sbjct: 699 APPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQV 758

Query: 742 HTPHTPHTPGGNGPPSVPPSSSTQQQDTQTS 772
                              SS  Q  +    
Sbjct: 759 QAEAQSPAS---TTALTQTSSEVQDTELNLV 786



 Score = 31.6 bits (72), Expect = 2.3
 Identities = 45/284 (15%), Positives = 71/284 (25%), Gaps = 50/284 (17%)

Query: 595 GDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPH-TPGGNGPPSVPPSSSTQQQDTQTSSS 653
           G  +  SL+A   ++ EQ+  T        +   T   +   S   ++     D  ++  
Sbjct: 471 GFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAED 530

Query: 654 NNVSTDDIPSDLNFDPA-----AVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATP 708
              S      D   D A              +    L  D                 A  
Sbjct: 531 TLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSAN--------VQSAQS 582

Query: 709 PSSGSSNGNNPGGPTT---------NDDIL---------------ALEQMPHTPHTPHTP 744
            +    +  +    +          +DDIL               AL             
Sbjct: 583 AAEAQPSSQSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLDALSPKEGDGKKSSAD 642

Query: 745 HTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENL 804
             P TP    PP+     ++    +QTS+S ++  D   +     P      E       
Sbjct: 643 RKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVP------EAALASGS 696

Query: 805 NLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDD 848
              P  V DP       D P     P   S+N         N  
Sbjct: 697 APAPPPVPDPY------DRPPWEEAPEVASANDGPNNAAEGNLS 734


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 38.1 bits (89), Expect = 0.015
 Identities = 16/104 (15%), Positives = 31/104 (29%), Gaps = 1/104 (0%)

Query: 676 EGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMP 735
             +  E+ ++   N  +              + P S S    +   P  +    +   +P
Sbjct: 154 LDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLP 213

Query: 736 HTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVST 779
             P +  +   P +P     PS PP  +             +ST
Sbjct: 214 PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPP-PPVQQVPPLST 256



 Score = 35.0 bits (81), Expect = 0.12
 Identities = 17/93 (18%), Positives = 23/93 (24%), Gaps = 3/93 (3%)

Query: 464 FTGFQQNEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSG 523
             G                S LP  P+   +     S     +P+    P   PP  V  
Sbjct: 191 SPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQ 250

Query: 524 RDVKPTFSDIKPTFSDIKPAINPH---PQSIAS 553
                T     P+ S     I        +IA 
Sbjct: 251 VPPLSTAKPTPPSASATPAPIGGITLDDDAIAK 283



 Score = 32.3 bits (74), Expect = 0.86
 Identities = 15/88 (17%), Positives = 24/88 (27%), Gaps = 1/88 (1%)

Query: 571 NRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPG 630
           N   G +    +  PS   S      +               +P  P +  +   P +P 
Sbjct: 170 NSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPE 229

Query: 631 GNGPPSVPPSSSTQQQDTQTSSSNNVST 658
               PS PP  +             +ST
Sbjct: 230 SPTNPSPPPGPAAPPP-PPVQQVPPLST 256



 Score = 31.2 bits (71), Expect = 2.4
 Identities = 18/85 (21%), Positives = 23/85 (27%)

Query: 293 SPGSMTLPTMSNWEMGQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHH 352
              S   P         S  P  P    S +P    ES    S P G + P  P  Q   
Sbjct: 193 GVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVP 252

Query: 353 QYPSQQPGSGPNGAPFGPNSAPSSR 377
              + +P      A   P    +  
Sbjct: 253 PLSTAKPTPPSASATPAPIGGITLD 277



 Score = 29.7 bits (67), Expect = 7.1
 Identities = 32/134 (23%), Positives = 43/134 (32%), Gaps = 6/134 (4%)

Query: 718 NPGGPTT-NDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSS-STQQQDTQTSSSN 775
           NPG P    D+   +         P     P +   + PPS  P   S        SS +
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208

Query: 776 NVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATP-PSSGS 834
           + S    PS    D             N +  P     P   +  + P   A P P S S
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPES---PTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSAS 265

Query: 835 SNGNNPGGPTTNDD 848
           +     GG T +DD
Sbjct: 266 ATPAPIGGITLDDD 279


>gnl|CDD|206094 pfam13923, zf-C3HC4_2, Zinc finger, C3HC4 type (RING finger). 
          Length = 45

 Score = 33.6 bits (77), Expect = 0.019
 Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 6/49 (12%)

Query: 178 SLKCPITFKKITLPARGHDCKHIQCFD-LESYLQLNCERGSWRCPVCNK 225
            L+CPI    +  P     C H+ C + +  YL     +   +CP+C  
Sbjct: 2   ELECPICLDLLRDPVVLTPCGHVFCRECILRYL-----KKKSKCPICRT 45


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 35.8 bits (83), Expect = 0.073
 Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 3/54 (5%)

Query: 353 QYPSQQPGSGPNG---APFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSN 403
             P +     P G     F   S  SSR SS   S      SS  NS     S+
Sbjct: 134 AIPLRFSPGLPKGFYIEGFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSS 187


>gnl|CDD|238093 cd00162, RING, RING-finger (Really Interesting New Gene) domain, a
           specialized type of Zn-finger of 40 to 60 residues that
           binds two atoms of zinc; defined by the 'cross-brace'
           motif C-X2-C-X(9-39)-C-X(1-3)-
           H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved
           in mediating protein-protein interactions; identified in
           a proteins with a wide range of functions such as viral
           replication, signal transduction, and development; has
           two variants, the C3HC4-type and a C3H2C3-type (RING-H2
           finger), which have different cysteine/histidine
           pattern; a subset of RINGs are associated with B-Boxes
           (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
          Length = 45

 Score = 32.0 bits (73), Expect = 0.081
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 5/48 (10%)

Query: 180 KCPITFKKITLPARGHDCKHIQCFD-LESYLQLNCERGSWRCPVCNKP 226
           +CPI  ++   P     C H+ C   ++ +L    + G   CP+C  P
Sbjct: 1   ECPICLEEFREPVVLLPCGHVFCRSCIDKWL----KSGKNTCPLCRTP 44


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 36.2 bits (83), Expect = 0.083
 Identities = 71/378 (18%), Positives = 115/378 (30%), Gaps = 30/378 (7%)

Query: 468 QQNEMRNNLTYQQHSSPLPGNPTPPLTP-ASSTSVPPFVSPNSTSSPFISPPHSVSGRDV 526
           Q NE  ++ + QQ      G P+  + P A+     P  +P++ + P    P +      
Sbjct: 158 QDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQ---- 213

Query: 527 KPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPS 586
            P     +P+   +  A + HPQ + S         PP    + ++ S       +  P 
Sbjct: 214 -PAPQPQQPSPLSLISAPSLHPQRLPS-------PHPPLQPQTASQQSPQPPAPSSRHPQ 265

Query: 587 GGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQ 646
             + G  P          M  +L +      H    P  P     +  P +P  S  Q  
Sbjct: 266 SSHHGPGP---------PMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316

Query: 647 DTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLA 706
                S + +     P +    PA  +        ++   P   +  +   S+  PP L 
Sbjct: 317 SHTPPSQSALQPQQPPREQPLPPAPSM-------PHIKPPPTTPIPQLPNQSHKHPPHLQ 369

Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPS-SSTQ 765
            P        N P  P            P + H P     P +      P+ PP  + +Q
Sbjct: 370 GPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQ 429

Query: 766 QQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPD 825
               + S+  +      P    F       G          LP +        S    P 
Sbjct: 430 SLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPP 489

Query: 826 LATPPSSGSSNGNNPGGP 843
            +  PSSG   G  P  P
Sbjct: 490 GSALPSSGGCAGPGPPLP 507


>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 780

 Score = 35.5 bits (81), Expect = 0.14
 Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 7/82 (8%)

Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPH-----TPHTPHTPGGNGPPSVPPS 761
           TPP+    NG NP  P T +      + P TP T        P TP T  GN   S   +
Sbjct: 669 TPPTGEGDNGENPTTPPTGEG--NNGENPTTPPTGEGNNGGNPTTPSTDEGNNAGSGQTT 726

Query: 762 SSTQQQDTQTSSSNNVSTDDIP 783
           +  Q     T+ S N    D+P
Sbjct: 727 TDNQNSKETTTVSENKEERDLP 748


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 33.6 bits (77), Expect = 0.45
 Identities = 31/175 (17%), Positives = 51/175 (29%), Gaps = 8/175 (4%)

Query: 497 SSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPAINPHPQSIASGSM 556
              +   F + +S++S   +    +     K                 + +  +    ++
Sbjct: 71  QDNNDKKFSTIDSSTSDSNNIIDFI----YKNLPQTNINQLLTKNKYDDNYSLTTLIQNL 126

Query: 557 MNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHT 616
            N NS    Y    N     N    N   S  N  D       ++ N    S +   P T
Sbjct: 127 FNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPST 186

Query: 617 PHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAA 671
            +       P  P  +     P S  T  Q +  SS +N S  D   D   D  +
Sbjct: 187 SNKQPNSPKPTQPNQSNSQ--PASDDTANQKS--SSKDNQSMSDSALDSILDQYS 237



 Score = 31.3 bits (71), Expect = 2.9
 Identities = 29/162 (17%), Positives = 53/162 (32%), Gaps = 4/162 (2%)

Query: 634 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDP 693
             S   SS++   +       N+   +I   L  +            +NL  L  ++ D 
Sbjct: 77  KFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD- 135

Query: 694 MELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHT-PHTPHTPHTPGG 752
            E     +     +  +S SS  N+    ++  D    ++ P + +T P T +       
Sbjct: 136 YEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPK 195

Query: 753 NGPPSVPPSSSTQQQDTQ--TSSSNNVSTDDIPSDLNFDPAA 792
              P+   S           +SS +N S  D   D   D  +
Sbjct: 196 PTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYS 237


>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
          Length = 609

 Score = 33.4 bits (76), Expect = 0.52
 Identities = 33/179 (18%), Positives = 62/179 (34%), Gaps = 35/179 (19%)

Query: 592 DFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTS 651
                  +E LN  E   S +   +  T   P++P   G + P S+    +  + ++++S
Sbjct: 449 GLSLKQVIERLNVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSS 508

Query: 652 SSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSS 711
           S +    D   S  +             + N++    +   PM  L   D   +   P S
Sbjct: 509 SVSFSQGDSNRSHYS------------DETNIS----DYSYPMADLDLEDEEPMEDHPHS 552

Query: 712 GSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQ 770
             S  +N     T+        + ++                 PP++ PSSS +QQ+  
Sbjct: 553 PQSISSNNSMSRTSR------ALQNSQR-------------RRPPTMFPSSSERQQNNA 592


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 33.2 bits (76), Expect = 0.57
 Identities = 16/81 (19%), Positives = 21/81 (25%), Gaps = 7/81 (8%)

Query: 340 YSMPGRPSFQGHHQYPSQQP-----GSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGP 394
           Y              P +       G G  G   G +              P +   +GP
Sbjct: 174 YKFFLSCGGVRGGPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGP 233

Query: 395 NSNANGGSNHQYFGAGNQFPG 415
           +S    GS     G G   PG
Sbjct: 234 SS--GYGSGGTRSGQGGWGPG 252


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 33.0 bits (75), Expect = 0.78
 Identities = 35/182 (19%), Positives = 57/182 (31%), Gaps = 26/182 (14%)

Query: 470 NEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPT 529
           +EM++ LT + +   L G P          SVP   SPN+   P         GR +  +
Sbjct: 59  DEMKDFLTDRSNQPHLVGIPKN--------SVPQ--SPNNKGKPSFFGDQR--GRSIPSS 106

Query: 530 FSDIKPTFSDIKPAINP-HPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGG 588
                 +     P+ N    Q  +        +G P  ++S ++  G  G   +      
Sbjct: 107 HQSTHTSAPGPAPSGNISQSQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSSHSHSGDHH 166

Query: 589 NSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGG----NGPPSVPPSSSTQ 644
           +       +  +          E    +P  P               N PPS+   SS  
Sbjct: 167 SKEICRSKSPRDL---------EVNVQSPERPPDSLVLLPSSQHSVQNFPPSLMSKSSAM 217

Query: 645 QQ 646
           QQ
Sbjct: 218 QQ 219


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 32.7 bits (75), Expect = 0.81
 Identities = 28/165 (16%), Positives = 46/165 (27%), Gaps = 22/165 (13%)

Query: 610 SEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDP 669
           S    +     H              S PP S T  Q     + N     ++P +++   
Sbjct: 94  SPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPGNIS--- 150

Query: 670 AAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDIL 729
               D   Q Q  +N    N               L T P++ + +       T      
Sbjct: 151 ----DALSQQQGQVNAASQNAQGN--------TSTLPTAPATVAPSKGAKVPATAE---- 194

Query: 730 ALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSS 774
                P  P  P T             +VPP++S + +    S+ 
Sbjct: 195 ---THPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236



 Score = 29.7 bits (67), Expect = 7.2
 Identities = 15/78 (19%), Positives = 23/78 (29%), Gaps = 11/78 (14%)

Query: 710 SSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDT 769
           SS  S+GN       N+               H              S PP S T  Q  
Sbjct: 84  SSSLSSGNQSSPSAANNT-----------SDGHDASGVKNTAPPQDISAPPISPTPTQAA 132

Query: 770 QTSSSNNVSTDDIPSDLN 787
              + N     ++P +++
Sbjct: 133 PPQTPNGQQRIELPGNIS 150


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 32.5 bits (73), Expect = 0.97
 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 3/84 (3%)

Query: 483 SPLPGNPTPPLTP-ASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDI--KPTFSD 539
           +P   +P P L+P     S P  +S  +  +PF +PP  +   + K T   +  K T   
Sbjct: 408 NPFASSPVPSLSPKVPKISAPALISDITKKAPFKNPPQPLKVFNKKTTTKTVLKKITPVK 467

Query: 540 IKPAINPHPQSIASGSMMNNNSGP 563
           I P +   P +    + +  N  P
Sbjct: 468 IAPKLAELPATKPQETAIGENKAP 491


>gnl|CDD|222279 pfam13639, zf-RING_2, Ring finger domain. 
          Length = 46

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 13/49 (26%), Positives = 19/49 (38%), Gaps = 8/49 (16%)

Query: 181 CPITFKKITLPARGH--DCKHIQCFD-LESYLQLNCERGSWRCPVCNKP 226
           CPI   +           C H+   + L+ +L     R S  CP+C  P
Sbjct: 3   CPICLDEFEPGEEVVVLPCGHVFHKECLDKWL-----RSSNTCPLCRAP 46


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 24/99 (24%), Positives = 32/99 (32%), Gaps = 17/99 (17%)

Query: 318 DMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFGPNSAPSSR 377
            MN+  P +   SS   S     S      F          P S    +P    S   + 
Sbjct: 128 PMNTSEPLVPGHSS--FSDSPSRSASPSRKFS---------PSSTIQQSPQLTPSNKPAS 176

Query: 378 GSSVRQSTPSFPSS-----SGPNSNANGGSNHQYFGAGN 411
            SS  QS PS+ SS     S  N +    S      +G+
Sbjct: 177 PSSSYQS-PSYSSSLGPVNSSGNRSNLRSSPWALRSSGD 214



 Score = 31.3 bits (71), Expect = 2.2
 Identities = 25/135 (18%), Positives = 38/135 (28%), Gaps = 8/135 (5%)

Query: 462 LCFTGFQQNEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSV 521
           L  +  Q   +       Q +       +PP +  S+            SS   SP  S 
Sbjct: 91  LVLSERQLGLLGVKAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSA 150

Query: 522 SGRDVKPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGF 581
           S    K + S        + P+  P     AS S    +    S    +N S   +    
Sbjct: 151 SPSR-KFSPSSTIQQSPQLTPSNKP-----ASPSSSYQSPSYSSSLGPVNSSGNRSNLRS 204

Query: 582 NDFPSGGNSGDFPGD 596
           +  P    S     D
Sbjct: 205 S--PWALRSSGDKKD 217


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 31.2 bits (70), Expect = 2.0
 Identities = 21/68 (30%), Positives = 26/68 (38%), Gaps = 5/68 (7%)

Query: 702 PPDLATP-PSSGSSNGNNPGGPTTNDDILALEQ----MPHTPHTPHTPHTPHTPGGNGPP 756
           P D+ATP PS   +           DD    EQ      H P +  T     T   N P 
Sbjct: 116 PNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPT 175

Query: 757 SVPPSSST 764
           + PP SS+
Sbjct: 176 ATPPQSSS 183


>gnl|CDD|166894 PRK00294, hscB, co-chaperone HscB; Provisional.
          Length = 173

 Score = 30.6 bits (69), Expect = 2.1
 Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 7/42 (16%)

Query: 18  PPFRLEHNLAVSNHVFQLKTTVH------QTLMWRSDLE-LQ 52
           PP R  + LA+S H   L+ TVH      Q +  R +LE LQ
Sbjct: 67  PPRRARYLLALSGHEVPLEVTVHDPEFLLQQMQLREELEELQ 108


>gnl|CDD|235088 PRK02943, PRK02943, SecA regulator SecM; Provisional.
          Length = 167

 Score = 30.3 bits (69), Expect = 2.2
 Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 599 LESLNAM---EKSLSEQMPHTPHTPHTPHTPHTPG 630
           L++LNA+   E +  E +  T +   TP    + G
Sbjct: 116 LDTLNALLTQEGTPPEIIRRTSYAHFTPQASFSTG 150


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 31.2 bits (70), Expect = 2.5
 Identities = 30/159 (18%), Positives = 46/159 (28%), Gaps = 4/159 (2%)

Query: 637 VPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMEL 696
            P ++ T  +   TS    VS    PS +         G      N  L     V     
Sbjct: 167 EPKTAVTTSK--TTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGT 224

Query: 697 LSYLDPPDLATPPSS--GSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNG 754
            +  +P     PP S  G S        TT+ D               TP          
Sbjct: 225 TTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPH 284

Query: 755 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAV 793
             + PP ++ +Q+  + +     +T    S  +  P  V
Sbjct: 285 STATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGV 323


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 31.3 bits (71), Expect = 2.5
 Identities = 20/109 (18%), Positives = 31/109 (28%), Gaps = 19/109 (17%)

Query: 696 LLSYLDP-PDLATPPSSGSSNGNNPGGPTTNDDILA----LEQ--------------MPH 736
           +L +LD  PD   P         +      ++DIL     LEQ               P 
Sbjct: 1   VLEFLDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPG 60

Query: 737 TPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSD 785
           TP  P     P           P       + T++S   + +      +
Sbjct: 61  TPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEE 109


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 25/135 (18%), Positives = 43/135 (31%), Gaps = 8/135 (5%)

Query: 635 PSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPM 694
           P++   ++ +  D +       S DD     + +      G G G       P +  D  
Sbjct: 17  PALAAQAAAEYPDERYLDEEG-SGDDDEFI-DDEMDDEYSGSGSGA-----TPSDDEDSE 69

Query: 695 ELLSYLDPPDLATPPSSGSSNGNNPGGPT-TNDDILALEQMPHTPHTPHTPHTPHTPGGN 753
            + +   PP L T  SS S++       T T+  +        +P    T     T    
Sbjct: 70  PVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTE 129

Query: 754 GPPSVPPSSSTQQQD 768
            P     S++T    
Sbjct: 130 TPTEGGSSAATDPSK 144


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 31.1 bits (70), Expect = 2.7
 Identities = 12/69 (17%), Positives = 24/69 (34%), Gaps = 1/69 (1%)

Query: 710 SSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDT 769
           +SG+S  ++      +   +       T HT     +P   G +   +   + ST Q   
Sbjct: 25  ASGASTSSSIENSDNSTAEMQSTPATPT-HTTSNLTSPFGTGTDNSTNANGTESTTQASQ 83

Query: 770 QTSSSNNVS 778
             S    ++
Sbjct: 84  PHSHETTIT 92


>gnl|CDD|219420 pfam07466, DUF1517, Protein of unknown function (DUF1517).  This
           family consists of several hypothetical glycine rich
           plant and bacterial proteins of around 300 residues in
           length. The function of this family is unknown.
          Length = 280

 Score = 30.8 bits (70), Expect = 2.7
 Identities = 18/43 (41%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 552 ASGSMMNNNS-GPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDF 593
           ASG  +   S   PS S S  RSS   GGG+   P GG    F
Sbjct: 3   ASGGRIGGGSFRAPSRSSSSPRSSSPGGGGYYGSPGGGFGFPF 45


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 30.1 bits (68), Expect = 3.0
 Identities = 9/54 (16%), Positives = 15/54 (27%)

Query: 483 SPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPT 536
                 P   +T  + +   P V+           P   + R+VK        T
Sbjct: 68  GNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPVSSVT 121


>gnl|CDD|221634 pfam12548, DUF3740, Sulfatase protein.  This domain family is found
           in eukaryotes, and is typically between 144 and 173
           amino acids in length. The family is found in
           association with pfam00884.
          Length = 144

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 3/54 (5%)

Query: 134 LQGLLRKRLLTADHCIAKIKRNFNNSNSSSMSSSDS-IEQ--TALKVSLKCPIT 184
           L  LL+ R L   H   + KR+    +   M S +S I     +++V+ +C I 
Sbjct: 32  LLDLLQSRNLLKRHEEHRSKRDTEEYSGEGMLSDNSNIASVPQSIQVTHRCFIL 85


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 30.8 bits (70), Expect = 3.3
 Identities = 24/86 (27%), Positives = 31/86 (36%), Gaps = 11/86 (12%)

Query: 469 QNEMRN--NLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDV 526
            NE RN   LT  Q  +PL G    PL       V P      T +P  +PP   SG  +
Sbjct: 25  MNEARNLRALTETQ--TPLLGGENTPLHETDFDGVTPRKQQIQTPNPLATPPFR-SGNGI 81

Query: 527 KPTFSDIKPTFSDI------KPAINP 546
             T       +  +      K +IN 
Sbjct: 82  GATPLRGGSGYGPLRTPNRDKLSIND 107


>gnl|CDD|227914 COG5627, MMS21, DNA repair protein MMS21 [DNA replication,
           recombination, and repair].
          Length = 275

 Score = 30.4 bits (68), Expect = 3.3
 Identities = 22/86 (25%), Positives = 31/86 (36%), Gaps = 13/86 (15%)

Query: 163 SMSSSDSIEQTALKV-------SLKCPITFKKITLPARGHDCKHIQCFDL-ESYLQLNCE 214
            M S +  E    K+       S +CPIT      P     C H    DL    LQ+ C 
Sbjct: 167 VMKSPNPEEGLVDKILIHQELLSNRCPITLNPDFYPILSSKCNHKPEMDLINKKLQVECT 226

Query: 215 RGSWRCPV--CNKPAQLEGLEVDQYM 238
           R    CP   C++   ++    D  +
Sbjct: 227 R---VCPRLICSQKEVVDPYVCDHIL 249


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 30.5 bits (69), Expect = 3.5
 Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 8/98 (8%)

Query: 346 PSFQGHHQYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSNHQ 405
              Q   Q P  Q            ++ P+   ++   +T + P        A       
Sbjct: 61  QLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP--------AGPAGPTI 112

Query: 406 YFGAGNQFPGGVPPNSNQYQEIKECDQKAQISMTYQMH 443
               G  +P  VP    Q       DQ AQ     Q+ 
Sbjct: 113 QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQ 150


>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein.  This family
           includes an extracellular region from the envelope
           glycoprotein of Ebola and Marburg viruses. This region
           is also produced as a separate transcript that gives
           rise to a non-structural, secreted glycoprotein, which
           is produced in large amounts and has an unknown
           function. Processing of this protein may be involved in
           viral pathogenicity.
          Length = 364

 Score = 30.6 bits (69), Expect = 3.8
 Identities = 14/58 (24%), Positives = 20/58 (34%)

Query: 606 EKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPS 663
              LS        T H+         N     P +++T    T  +S +N ST  IP 
Sbjct: 290 STCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTTISTNNTSKHNFSTQSIPL 347


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 30.9 bits (70), Expect = 4.0
 Identities = 22/133 (16%), Positives = 31/133 (23%), Gaps = 11/133 (8%)

Query: 294 PGSMTLPTMSNWEM-GQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHH 352
           P +     +S  E+  Q       P      P    +       P   + P +   +   
Sbjct: 146 PQTPAQKMLSLEEVEAQLQQRQQAPQ----LPQPPQQVLPQGMPPRQAAFPQQGPPEQPP 201

Query: 353 QYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSNHQYFGAGNQ 412
            YP    G      P     APS        + P  P    P              +   
Sbjct: 202 GYPQPPQGHPEQVQPQQFLPAPSQ-----APAQPPLPPQL-PQQPPPLQQPQFPGLSQQM 255

Query: 413 FPGGVPPNSNQYQ 425
            P    P   Q Q
Sbjct: 256 PPPPPQPPQQQQQ 268



 Score = 29.7 bits (67), Expect = 7.8
 Identities = 27/158 (17%), Positives = 40/158 (25%), Gaps = 15/158 (9%)

Query: 284 RGMCAGKAMSPGSMTLPTMSNWEMGQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMP 343
           + +  G      +           G    P   P+      F+   S A    P    +P
Sbjct: 178 QVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237

Query: 344 GRPSFQGHHQYP--SQQPGSGPNGAP------------FGPNSAPSSRGSSVRQSTPSFP 389
            +P      Q+P  SQQ    P   P              P + P+      +      P
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297

Query: 390 SSSGPNSNANGGSNHQYFGAGNQFPGGVPPNSNQYQEI 427
               P               G QF   +   S Q +E 
Sbjct: 298 PPQQPQLLPLVQQPQG-QQRGPQFREQLVQLSQQQREA 334


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 30.3 bits (68), Expect = 4.8
 Identities = 25/95 (26%), Positives = 37/95 (38%), Gaps = 10/95 (10%)

Query: 485 LPGNPTPPLTPASSTSVPPFVSP-NSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPA 543
           L  NP P L  + ++      S  +S+ +P I PP SV+        S   PT    KP+
Sbjct: 256 LSTNPNPTLWFSQNSKSFEMNSEFSSSMTPSILPP-SVT--------SSFLPTPMSYKPS 306

Query: 544 INPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNG 578
           I+    + +  S   N           N S+  NG
Sbjct: 307 ISLPSDNPSIPSFTVNGVRNWEAGAFSNNSNSSNG 341


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 30.4 bits (68), Expect = 4.8
 Identities = 21/87 (24%), Positives = 30/87 (34%)

Query: 576 GNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPP 635
           GN G  N  P+G  +      +S      +  S S    H    P TP    +P  + P 
Sbjct: 564 GNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPA 623

Query: 636 SVPPSSSTQQQDTQTSSSNNVSTDDIP 662
           S   S ST     ++S     +    P
Sbjct: 624 SHLGSPSTTPSSPESSIKVASTETASP 650


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 30.1 bits (67), Expect = 5.7
 Identities = 55/305 (18%), Positives = 88/305 (28%), Gaps = 40/305 (13%)

Query: 482 SSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIK 541
           +S  P   +P     +  +  P     S +    SP   V G     T +   P      
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIG----VTTTATSPPTGTTS 548

Query: 542 PAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLES 601
                 PQ +   S +NN + P   S     +S               +G  P       
Sbjct: 549 VPNATSPQ-VTEESPVNNTNTPVVTSAPSVLTSA-------VTTGQHGTGSSPT------ 594

Query: 602 LNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDI 661
                         +  TP +  T  TP      + P       ++T +  S    +   
Sbjct: 595 -----SQQPGIPSSSHSTPRSNSTSTTPLLTS--AHPTGGENITEETPSVPSTTHVSTLS 647

Query: 662 PSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGG 721
           P      P       G G  + +  P  V      ++   P   AT PS+ S        
Sbjct: 648 PGP---GPGTTSQVSGPGNSSTSRYPGEVH-----VTEGMPNPNATSPSAPSGQKTAVPT 699

Query: 722 PTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSV-------PPSSSTQQQDTQTSSS 774
            T+            +  T     +PHT  G    +        PPS+S++ +   T +S
Sbjct: 700 VTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTS 759

Query: 775 NNVST 779
             V+T
Sbjct: 760 PPVTT 764


>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
          Length = 732

 Score = 29.9 bits (67), Expect = 6.4
 Identities = 17/84 (20%), Positives = 26/84 (30%), Gaps = 17/84 (20%)

Query: 721 GPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTD 780
               +       Q P  P  P  P  P  P       V P  +    D ++ +  NV   
Sbjct: 611 ASGEDVTQAEQPQQPQQPQQPQQPQQPQQP-------VSPVIN----DKKSDAGVNVPAG 659

Query: 781 DIPSDLNFDPAAVIDGEGQGQENL 804
            I  +L   P      E + ++ L
Sbjct: 660 GIEQELKMKP------EEEMEQQL 677


>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182. 
          Length = 238

 Score = 29.5 bits (66), Expect = 6.5
 Identities = 30/107 (28%), Positives = 36/107 (33%), Gaps = 5/107 (4%)

Query: 336 FPNGYSMPGRPSF-QGHHQYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGP 394
           FP+G S    P F Q +        G GP G   G N   +S  S   Q      S   P
Sbjct: 16  FPSGGSSVNSPPFSQNNLPNNLGGGGGGPGGGGGGNNPNLASLSSLTSQGLGKILSGLQP 75

Query: 395 NSNANGGSNHQYFGAGNQFPGGVPPNSNQYQEIKECDQKAQISMTYQ 441
               NGG      GAG   P G            + + +AQ   T Q
Sbjct: 76  PPLGNGGG----SGAGGPGPVGGGGGPGVAPNNIQPNAQAQQPSTQQ 118


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 29.8 bits (67), Expect = 6.6
 Identities = 18/75 (24%), Positives = 24/75 (32%)

Query: 483 SPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKP 542
                   PP  P   T+ PP V P   + P    P S           D KP ++   P
Sbjct: 390 KAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449

Query: 543 AINPHPQSIASGSMM 557
                   IA G ++
Sbjct: 450 PKEEEKALIADGDVL 464


>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein.  This family includes the
           hamartin protein which is thought to function as a
           tumour suppressor. The hamartin protein interacts with
           the tuberin protein pfam03542. Tuberous sclerosis
           complex (TSC) is an autosomal dominant disorder and is
           characterized by the presence of hamartomas in many
           organs, such as brain, skin, heart, lung, and kidney. It
           is caused by mutation either TSC1 or TSC2 tumour
           suppressor gene. TSC1 encodes a protein, hamartin,
           containing two coiled-coil regions, which have been
           shown to mediate binding to tuberin. The TSC2 gene codes
           for tuberin pfam03542. These two proteins function
           within the same pathway(s) regulating cell cycle, cell
           growth, adhesion, and vesicular trafficking.
          Length = 667

 Score = 29.9 bits (67), Expect = 7.2
 Identities = 38/176 (21%), Positives = 56/176 (31%), Gaps = 32/176 (18%)

Query: 601 SLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDD 660
           SL+  E S SE       +   P++          S+P ++ST        +S   +T  
Sbjct: 262 SLDPTETS-SEDGYSFSRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNT-- 318

Query: 661 IPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSS-GSSNGNNP 719
                 FD AA    +           D++  P  L         ATPPSS G S     
Sbjct: 319 -----TFDKAASSGTK-----------DSLWSPSSLCGM------ATPPSSIGMSPLILS 356

Query: 720 GGPTTNDDILALEQMPHTP-HTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSS 774
             P+         + P T       P +  TP  + PP        +     +S S
Sbjct: 357 LSPSHLS-----GRAPGTTGSGKGEPASESTPSTSPPPPGLADDIVRAIFATSSRS 407


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 29.5 bits (66), Expect = 9.4
 Identities = 32/184 (17%), Positives = 50/184 (27%), Gaps = 3/184 (1%)

Query: 634 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDP 693
           P      SS    D ++   +   +   P  L    +   D           L  + VD 
Sbjct: 302 PLLDLTGSSLWSDDEESGDEDGDGSGFAPEPLIKTDSRSNDTLVDLGRGGGSLKLDSVDA 361

Query: 694 MELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGN 753
               SYL  P L+  P+SG            +  + + +     P             G 
Sbjct: 362 PGTSSYLFEPGLSPSPNSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGI 421

Query: 754 --GPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDP-AAVIDGEGQGQENLNLLPDN 810
                  P  +   ++    SS       D PS L       VID      +     P +
Sbjct: 422 LAWGLKTPGLAVNDERSIAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPAS 481

Query: 811 VVDP 814
            +D 
Sbjct: 482 RLDD 485


>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
          Length = 420

 Score = 29.4 bits (66), Expect = 9.8
 Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 337 PNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFGPNSAP----SSRGSSVRQSTPSFPSSS 392
            +    P   ++ G     S + GS  +   F  +S+     SS G+S   ST ++ S S
Sbjct: 311 GHDSRGPYNANYGGQFNSRSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRS 370

Query: 393 GPNSNANGGSNHQYFGA 409
             +S+  GGS  Q  G 
Sbjct: 371 TFDSSTGGGS--QSGGG 385


>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
           This is a family of conserved proteins of from 500 - 600
           residues found from worms to humans. Its function is not
           known.
          Length = 515

 Score = 29.3 bits (66), Expect = 9.8
 Identities = 21/69 (30%), Positives = 29/69 (42%), Gaps = 5/69 (7%)

Query: 357 QQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSF--PSSSGPNSNANGGSNHQYFGAGNQFP 414
           + P S PN    G +S  SS+ S    S  S    +SS   S AN GS   +  AG + P
Sbjct: 69  RSPSSKPNDTLSGASSKLSSKHSGSSHSLASVSDRNSSSSGSCANSGS---WEAAGMEEP 125

Query: 415 GGVPPNSNQ 423
                 + +
Sbjct: 126 INTTDTTPE 134


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.129    0.395 

Gapped
Lambda     K      H
   0.267   0.0641    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 42,491,098
Number of extensions: 3993594
Number of successful extensions: 4031
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3758
Number of HSP's successfully gapped: 208
Length of query: 854
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 749
Effective length of database: 6,280,432
Effective search space: 4704043568
Effective search space used: 4704043568
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.2 bits)