RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10461
         (1359 letters)



>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
            modification].
          Length = 975

 Score =  638 bits (1646), Expect = 0.0
 Identities = 239/464 (51%), Positives = 306/464 (65%), Gaps = 2/464 (0%)

Query: 896  ARLVGASEIINRVVDDLKDENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLIDGILYAF 955
            A++ G   +  ++++   DE E YRKM    + +  S LG    D RLEE+L D IL AF
Sbjct: 514  AKMGGDPRVSRKILEYYSDEPEPYRKMNAGLVSRIFSRLGRLGFDERLEERLYDSILNAF 573

Query: 956  QEQTTEDVVMLNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRIAV 1015
            QEQ T   ++L  F T++  L  R KP+L  I  TIL  L +K   VR +AADL+  +A 
Sbjct: 574  QEQDTTVGLILPCFSTVLVSLEFRGKPHLSMIVSTILKLLRSKPPDVRIRAADLMGSLAK 633

Query: 1016 VMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRL 1075
            V+K C E K +  LG +LYE LGE+YPEVLGSIL A+ +I +V     M PPI  +LP L
Sbjct: 634  VLKACGETKELAKLGNILYENLGEDYPEVLGSILKAICSIYSVHRFRSMQPPISGILPSL 693

Query: 1076 TPILKNRHEKVQENCIDLVGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNT 1135
            TPIL+N+H+KV  N I LVG I    PEY+  REWMRICFEL++ LK+  K IRR    T
Sbjct: 694  TPILRNKHQKVVANTIALVGTICMNSPEYIGVREWMRICFELVDSLKSWNKEIRRNATET 753

Query: 1136 FGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPE 1195
            FG I++AIGP DVL  LLNNLKVQERQ RVCT+VAI+IVAE C PF+VLP LM++Y  PE
Sbjct: 754  FGCISRAIGPQDVLDILLNNLKVQERQQRVCTSVAISIVAEYCGPFSVLPTLMSDYETPE 813

Query: 1196 LNVQNGVLKALSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTACATIKHMALGVY 1255
             NVQNGVLKA+ F+FEYIG+   DY+Y++TPLLEDAL DRD VHRQTA   I+H+ L   
Sbjct: 814  ANVQNGVLKAMCFMFEYIGQASLDYVYSITPLLEDALTDRDPVHRQTAMNVIRHLVLNCP 873

Query: 1256 GFGCEDALTHLLNYVWPNIFETSPHLVQAFMDAVEGLRVALGPVRILQYVLQGLFHPARK 1315
            G G EDA  HLLN +WPNI E SPH++Q+F + +E     LG   +++YV QGLFHP+  
Sbjct: 874  GTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGSGAMMKYVQQGLFHPSST 933

Query: 1316 VRDVYWKIYNSLYIGGQDALISAYPRIQNDMKNVYLRYELDYVL 1359
            VR  YW +YN +Y+   DA++  YP  ++   N  L   L   +
Sbjct: 934  VRKRYWTVYNIMYVFDSDAMVPCYPVEEDL--NPELARTLHICI 975



 Score =  281 bits (721), Expect = 6e-79
 Identities = 110/198 (55%), Positives = 129/198 (65%)

Query: 407 AKYMDNQPKGNLPFLKPEDAQYFDKLLVDVDEDSLSPEEAKERKIMKLLLKIKNGTPPMR 466
               D     +L F K ED +YF     D     L   E  ER + +LLLK+KNG   MR
Sbjct: 126 KALTDFHGYADLGFFKVEDLKYFADDEKDFFMPLLEDREGDERDVYRLLLKVKNGGKRMR 185

Query: 467 KAALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVH 526
              LR +TDKA  FGA  +FN++LP+LMS  LEDQERHL+VK+IDR+LY LDDL  PYVH
Sbjct: 186 MEGLRILTDKAVNFGAAAVFNKVLPMLMSRELEDQERHLVVKLIDRLLYGLDDLKVPYVH 245

Query: 527 KILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAF 586
           KILVV+ PLLIDED   R  GREII NL    GL   +S+MRPDI + DEYVRN T RA 
Sbjct: 246 KILVVVGPLLIDEDLKRRCMGREIILNLVYRCGLGFSVSSMRPDITSKDEYVRNVTGRAV 305

Query: 587 AVVASALGIPSLLPFLKA 604
            VVA ALG+  LLPFL+A
Sbjct: 306 GVVADALGVEELLPFLEA 323



 Score = 33.4 bits (76), Expect = 1.2
 Identities = 37/200 (18%), Positives = 62/200 (31%), Gaps = 43/200 (21%)

Query: 458 IKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLMS---PTLEDQERHLLVKVIDRIL 514
           I +    +R    R +   A   G      ++LP L +        + RH  +++  +I 
Sbjct: 290 ITSKDEYVRNVTGRAVGVVADALGV----EELLPFLEALCGSRKSWEARHTGIRIAQQIC 345

Query: 515 YKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAG--------------- 559
             L      ++  +L  I  LL D   + R++    +S LA+  G               
Sbjct: 346 ELLGRSRLSHLGPLLKCISKLLKDRSRFVRIDTANALSYLAELVGPYGIEQFDEVLCPLW 405

Query: 560 ----------LATMISTMRPDIDNID-EYVRNTTARAFAVVASAL------GIPSLLPFL 602
                     L + +  M   I  +  EY  + T     +V              LL   
Sbjct: 406 EGASQHRGKELVSFLKAMGFIIPLMSPEYACHDTREHMEIVFREFKSPDEEMKKDLLVVE 465

Query: 603 ----KAGGVTPAATRWDETP 618
               K G  TP   R   +P
Sbjct: 466 RICDKVGTDTPWKLRDQVSP 485



 Score = 32.2 bits (73), Expect = 2.3
 Identities = 31/148 (20%), Positives = 59/148 (39%), Gaps = 7/148 (4%)

Query: 469 ALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYV--- 525
           A+  I    R     P  + ILP L  P L ++ + ++   I  ++  +      Y+   
Sbjct: 669 AICSIYSVHRFRSMQPPISGILPSLT-PILRNKHQKVVANTIA-LVGTICMNSPEYIGVR 726

Query: 526 --HKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTA 583
              +I   +   L   +   R    E    +++A G   ++  +  ++   +   R  T+
Sbjct: 727 EWMRICFELVDSLKSWNKEIRRNATETFGCISRAIGPQDVLDILLNNLKVQERQQRVCTS 786

Query: 584 RAFAVVASALGIPSLLPFLKAGGVTPAA 611
            A ++VA   G  S+LP L +   TP A
Sbjct: 787 VAISIVAEYCGPFSVLPTLMSDYETPEA 814



 Score = 31.1 bits (70), Expect = 5.0
 Identities = 31/137 (22%), Positives = 57/137 (41%), Gaps = 10/137 (7%)

Query: 485 LFNQILPLLMSP--TLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYY 542
            F+ +LP LMS   T E   ++ ++K +  +   +      YV+ I  ++E  L D D  
Sbjct: 798 PFS-VLPTLMSDYETPEANVQNGVLKAMCFMFEYIGQASLDYVYSITPLLEDALTDRDPV 856

Query: 543 ARVEGREIISNLA-KAAGLA---TMISTMRPDIDNIDE---YVRNTTARAFAVVASALGI 595
            R     +I +L     G       I  +     NI E   +V  +        A+ LG 
Sbjct: 857 HRQTAMNVIRHLVLNCPGTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGS 916

Query: 596 PSLLPFLKAGGVTPAAT 612
            +++ +++ G   P++T
Sbjct: 917 GAMMKYVQQGLFHPSST 933



 Score = 30.3 bits (68), Expect = 8.5
 Identities = 48/266 (18%), Positives = 103/266 (38%), Gaps = 29/266 (10%)

Query: 905  INRVVDDLKDENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLIDGILYAFQEQTTEDVV 964
            + R++  +K+  ++ R   +  +     N GAA + +++   L+   L   ++Q    VV
Sbjct: 170  VYRLLLKVKNGGKRMRMEGLRILTDKAVNFGAAAVFNKVLPMLMSREL---EDQERHLVV 226

Query: 965  MLNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRIAVVMKTCQEEK 1024
             L     ++  L     PY+ +I   +   L ++  K R    ++I  +           
Sbjct: 227  KL--IDRLLYGLDDLKVPYVHKILVVVGPLLIDEDLKRRCMGREIILNLVYRCG------ 278

Query: 1025 LMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHE 1084
             +G     +   +  +   V      A+  + + +G+ ++ P ++ L        + RH 
Sbjct: 279  -LGFSVSSMRPDITSKDEYVRNVTGRAVGVVADALGVEELLPFLEALC-GSRKSWEARH- 335

Query: 1085 KVQENCIDLVGRIADRGPEYVSAREWMRICFELLE----LLKAHKKAIRRATVNTFGYIA 1140
                  I +  +I +     +  R  +     LL+    LLK   + +R  T N   Y+A
Sbjct: 336  ----TGIRIAQQICE-----LLGRSRLSHLGPLLKCISKLLKDRSRFVRIDTANALSYLA 386

Query: 1141 KAIGPH--DVLATLLNNLKVQERQNR 1164
            + +GP+  +    +L  L     Q+R
Sbjct: 387  ELVGPYGIEQFDEVLCPLWEGASQHR 412


>gnl|CDD|220070 pfam08920, SF3b1, Splicing factor 3B subunit 1.  This family
           consists of several eukaryotic splicing factor 3B
           subunit 1 proteins, which associate with p14 through a
           C-terminus beta-strand that interacts with beta-3 of the
           p14 RNA recognition motif (RRM) beta-sheet, which is in
           turn connected to an alpha-helix by a loop that makes
           extensive contacts with both the shorter C-terminal
           helix and RRM of p14. This subunit is required for 'A'
           splicing complex assembly (formed by the stable binding
           of U2 snRNP to the branchpoint sequence in pre-mRNA) and
           'E' splicing complex assembly.
          Length = 142

 Score =  119 bits (299), Expect = 5e-31
 Identities = 52/114 (45%), Positives = 63/114 (55%), Gaps = 13/114 (11%)

Query: 713 SSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMA 772
             K+RSRWD+TP    +GA+TP A T                TP M   G TP GN+   
Sbjct: 5   KKKKRSRWDQTPVPEAAGAVTPEAPTKK--------RSRWDQTPQM--EGQTPIGNE-GG 53

Query: 773 MATPTPGHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPP-GYKVLQPPA 825
           MATP       MTPEQ   +R   EIDERNRPLTD+EL+ + P  GYK+L+PP 
Sbjct: 54  MATPMTPSSV-MTPEQYIPFRIGTEIDERNRPLTDEELDELLPSEGYKILEPPP 106



 Score =  115 bits (289), Expect = 1e-29
 Identities = 50/81 (61%), Positives = 58/81 (71%), Gaps = 4/81 (4%)

Query: 330 AMTPEQIQAYRWEREIDERNRPLTDDELEAMFPP-GYKVLQPPAGYIPIRTPARKLTATP 388
            MTPEQ   +R   EIDERNRPLTD+EL+ + P  GYK+L+PP GY PIRTPARKL ATP
Sbjct: 63  VMTPEQYIPFRIGTEIDERNRPLTDEELDELLPSEGYKILEPPPGYEPIRTPARKLLATP 122

Query: 389 TPIAGTPTGFFIQQEDKTAKY 409
           TPI G   GF IQ+ED+    
Sbjct: 123 TPITG---GFMIQEEDRGLAK 140



 Score = 47.8 bits (114), Expect = 3e-06
 Identities = 23/36 (63%), Positives = 25/36 (69%), Gaps = 3/36 (8%)

Query: 1   YIPIRTPARKLTATPTPIAGTPTGFFIQQEDKTAKY 36
           Y PIRTPARKL ATPTPI G   GF IQ+ED+    
Sbjct: 108 YEPIRTPARKLLATPTPITG---GFMIQEEDRGLAK 140



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 20/49 (40%), Positives = 21/49 (42%), Gaps = 3/49 (6%)

Query: 613 RWDETPGHPKPGAETPGA-TPSTRLWDATPGH--ATPGAATPGRETPSH 658
           RWD+TP     GA TP A T     WD TP     TP     G  TP  
Sbjct: 11  RWDQTPVPEAAGAVTPEAPTKKRSRWDQTPQMEGQTPIGNEGGMATPMT 59



 Score = 38.2 bits (89), Expect = 0.007
 Identities = 24/68 (35%), Positives = 30/68 (44%), Gaps = 9/68 (13%)

Query: 626 ETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSS 685
            TP        WD TP     GA TP   T          +R+RWD+TP+ E +TP  + 
Sbjct: 1   NTPEKKKKRSRWDQTPVPEAAGAVTPEAPTK---------KRSRWDQTPQMEGQTPIGNE 51

Query: 686 GWAETPKT 693
           G   TP T
Sbjct: 52  GGMATPMT 59


>gnl|CDD|219152 pfam06728, PIG-U, GPI transamidase subunit PIG-U.  Many eukaryotic
           proteins are anchored to the cell surface via
           glycosylphosphatidylinositol (GPI), which is
           posttranslationally attached to the carboxyl-terminus by
           GPI transamidase. The mammalian GPI transamidase is a
           complex of at least four subunits, GPI8, GAA1, PIG-S,
           and PIG-T. PIG-U is thought to represent a fifth subunit
           in this complex and may be involved in the recognition
           of either the GPI attachment signal or the lipid portion
           of GPI.
          Length = 373

 Score = 67.7 bits (166), Expect = 1e-11
 Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 813 MFPPGYKVLQPP--AACFFIGCSMFAPTVWHLWIYTRSANANFYFGVTLAF 861
           +F   +K ++ P       +   + +P  +HLWIY  S NANF+F +TL +
Sbjct: 323 LFRHLFKYMRYPFLVLLTLLVALLLSPIFYHLWIYLGSGNANFFFAITLVY 373


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 52.7 bits (127), Expect = 1e-06
 Identities = 31/218 (14%), Positives = 45/218 (20%), Gaps = 10/218 (4%)

Query: 572 DNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTPAATRWDETPGHPKPGAETPGAT 631
           + +   +       + V A     P        G   PA++   E    P   A      
Sbjct: 568 EVLVTALAEELGGDWQVEAVVGPAPGAAGG--EGPPAPASSGPPEEAARPAAPAAPAAPA 625

Query: 632 PSTRLWDATPGHATPGAATPGRETPSH-----DKAQSSIRRNRWDETPKTERETP-GHSS 685
                           +A P     +        A             K     P     
Sbjct: 626 APAP--AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683

Query: 686 GWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPS 745
             A       AG         P     + +      + PQA    +    AA      P 
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP 743

Query: 746 TPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAA 783
            P  P   +     P         A   A P P   + 
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781



 Score = 36.5 bits (85), Expect = 0.12
 Identities = 21/145 (14%), Positives = 30/145 (20%), Gaps = 22/145 (15%)

Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSI 665
            +  A          P   A  P A P+     A    A   A  P              
Sbjct: 384 RLGVAGGAGAPAAAAPSAAAAAPAAAPA----PAAAAPAAAAAPAPAAAPQPAPAPAP-- 437

Query: 666 RRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
                              +  A    +             P P   ++   +     P 
Sbjct: 438 ----------APAPPSPAGNAPAGGAPSPPPAAAP---SAQPAPAPAAAPEPTA---APA 481

Query: 726 ATPSGAMTPSAATPGGMTPSTPITP 750
             P  A  P+AA      P+ P   
Sbjct: 482 PAPPAAPAPAAAPAAPAAPAAPAGA 506



 Score = 32.3 bits (74), Expect = 2.1
 Identities = 25/154 (16%), Positives = 33/154 (21%), Gaps = 22/154 (14%)

Query: 638 DATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAG 697
           +A  G A   A   G   P                 P  E   P   +  A       AG
Sbjct: 585 EAVVGPAPGAAGGEGPPAP-------------ASSGPPEEAARPAAPAAPAAPAAPAPAG 631

Query: 698 PGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPL 757
                      P   S+                A+  ++    G            + P 
Sbjct: 632 A-------AAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA--KAGGAAPAAPP 682

Query: 758 MTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
             P+   P      A A P P   A     Q   
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716



 Score = 30.3 bits (69), Expect = 8.9
 Identities = 21/110 (19%), Positives = 26/110 (23%), Gaps = 5/110 (4%)

Query: 685 SGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTP 744
           +G A  P                 P   ++   +     P A P  A  P+ A       
Sbjct: 388 AGGAGAPAAAAPSAAAA--APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPA 445

Query: 745 STPITPHVGSTPLMTPSGVTPTGNKAMAMAT---PTPGHLAAMTPEQIQA 791
                    S P        P    A A      P P   AA  P    A
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPA 495


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 51.3 bits (123), Expect = 4e-06
 Identities = 33/174 (18%), Positives = 57/174 (32%), Gaps = 28/174 (16%)

Query: 610 AATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGR-----ETPSHDKAQSS 664
            A+ W+      +PG  +  ++P  R    +P     G A          + S + + SS
Sbjct: 271 EASGWNGPS--SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSS 328

Query: 665 IRRNRWDETPKTERETPGHS----------SGWAETPKTDRAGPGGDLIQETPTPGGVSS 714
              +   E+ +    +PG S             A+     +               G  +
Sbjct: 329 TSSS--SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386

Query: 715 KRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGS------TPLMTPSG 762
           +RR+R     +A    A   +   P G    +P+     S       PL+TPSG
Sbjct: 387 RRRARAAVAGRARRRDA---TGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSG 437



 Score = 36.3 bits (84), Expect = 0.16
 Identities = 30/177 (16%), Positives = 46/177 (25%), Gaps = 24/177 (13%)

Query: 607 VTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIR 666
               +          + G+ TP    S       P    P A+ P    PS     S + 
Sbjct: 87  TPTWSLSTLAPASPAREGSPTPPGPSS----PDPPPPTPPPASPP----PSPAPDLSEML 138

Query: 667 RNRWDETPKTERETPGHSSGWAETPKTDRAGPGG----DLIQETPTPGGVSSKRRSRWDE 722
           R      P      P   +  A       +           +ET                
Sbjct: 139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS----------- 187

Query: 723 TPQATPSGAMTPSAATPGGMTPSTPI-TPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
           +P A P  +  P+AA+P     S+PI        P    S     G  +   ++   
Sbjct: 188 SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244



 Score = 33.6 bits (77), Expect = 0.91
 Identities = 28/178 (15%), Positives = 45/178 (25%), Gaps = 17/178 (9%)

Query: 607 VTPAATRWDETPGHP----KPGAETPGATPSTRLWDATPGHATPGAATPGRET--PSHDK 660
                 R   +P        P A      P      +    A+  A  PGR     +   
Sbjct: 178 SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS--ASSPAPAPGRSAADDAGAS 235

Query: 661 AQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRW 720
           +  S            E E P         P       G +    +  PG  SS      
Sbjct: 236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNG--PSSRPGPASS------ 287

Query: 721 DETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
             +     S + +PS+   G    S   +    S+   + S  + +   +   A    
Sbjct: 288 -SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPG 344


>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
            40 amino acid long tandemly repeated sequence motif first
            identified in the Drosophila segment polarity gene
            armadillo; these repeats were also found in the mammalian
            armadillo homolog beta-catenin, the junctional plaque
            protein plakoglobin, the adenomatous polyposis coli (APC)
            tumor suppressor protein, and a number of other proteins.
            ARM has been implicated in mediating protein-protein
            interactions, but no common features among the target
            proteins recognized by the ARM repeats have been
            identified; related to the HEAT domain; three consecutive
            copies of the repeat are represented by this alignment
            model.
          Length = 120

 Score = 45.0 bits (107), Expect = 2e-05
 Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 4/104 (3%)

Query: 1035 EYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDL--LPRLTPILKNRHEKVQENCID 1092
              L      V      AL  +    G       + +   LP L  +LK+  E+V +  + 
Sbjct: 14   SLLSSSDENVQREAAWALSNLS--AGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALW 71

Query: 1093 LVGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNTF 1136
             +  +A    +            +L+ LL +  + I++      
Sbjct: 72   ALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGAL 115



 Score = 31.9 bits (73), Expect = 0.66
 Identities = 18/100 (18%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 1070 DLLPRLTPILKNRHEKVQENCIDLVGRIADRGPE----YVSAREWMRICFE-LLELLKAH 1124
              LP L  +L +  E VQ      +  ++    +     V A          L++LLK+ 
Sbjct: 7    GGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAG-----GLPALVQLLKSE 61

Query: 1125 KKAIRRATVNTFGYIA-------KAIGPHDVLATLLNNLK 1157
             + + +A +     +A         +     +  L+N L 
Sbjct: 62   DEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLD 101



 Score = 31.1 bits (71), Expect = 1.4
 Identities = 11/69 (15%), Positives = 27/69 (39%), Gaps = 4/69 (5%)

Query: 990  TILWRLNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVV--LYEYLGEEYPEVLGS 1047
             ++  L ++  +V + A   +  +A   +  +   ++   G V  L   L     ++  +
Sbjct: 53   ALVQLLKSEDEEVVKAALWALRNLAAGPEDNK--LIVLEAGGVPKLVNLLDSSNEDIQKN 110

Query: 1048 ILGALKAIV 1056
              GAL  + 
Sbjct: 111  ATGALSNLA 119


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 44.5 bits (105), Expect = 5e-04
 Identities = 37/184 (20%), Positives = 45/184 (24%), Gaps = 11/184 (5%)

Query: 600  PFLKAGGVTPAATRWDETPGHPKPGAETPGAT-----PSTRLWDATPGHATPGAATPGRE 654
            P   A    P   R  +  G   P    P        P +    A      P    P  E
Sbjct: 2592 PPQSARPRAPVDDR-GDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650

Query: 655  TPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSS 714
             P  D A   + R R         +                 G    L    P P     
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710

Query: 715  KRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMA 774
               +    TP   P G      A+P    P+ P  P V + P  TP G            
Sbjct: 2711 APHALVSATP--LPPGPAAARQASPA--LPAAPAPPAVPAGP-ATPGGPARPARPPTTAG 2765

Query: 775  TPTP 778
             P P
Sbjct: 2766 PPAP 2769



 Score = 38.0 bits (88), Expect = 0.052
 Identities = 56/228 (24%), Positives = 68/228 (29%), Gaps = 16/228 (7%)

Query: 609  PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
            PAA      P  P P    P  T   R  DA P  A P A    R  P      S +   
Sbjct: 2561 PAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLP-- 2618

Query: 669  RWDETPKTERETPGHSSGWAETPKTDR-AGPGGDLIQETPTPGGVSSKRRSRWDETP--- 724
               +T   +   P  S    E         P  +  ++ P PG VS  RR+R        
Sbjct: 2619 --PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQA 2676

Query: 725  QATPSGAMTPSA-ATPGGMT-----PSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
             + P      +A  T G +T     P  P TP      L   +   P G  A   A+P  
Sbjct: 2677 SSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHAL-VSATPLPPGPAAARQASPAL 2735

Query: 779  GHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA 826
               A   P                RP T     A  PP      PP  
Sbjct: 2736 P-AAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782



 Score = 36.1 bits (83), Expect = 0.16
 Identities = 50/234 (21%), Positives = 67/234 (28%), Gaps = 22/234 (9%)

Query: 605 GGVTPAATRWDETPG-HPKPGAETPGATPSTRLWDA----TPGHATPGAATPGRETPSHD 659
               P   R    P   P+  A    A P   +W A     P  A P    P    P+ D
Sbjct: 265 ADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAP-LALPAPPDPPPPAPAGD 323

Query: 660 KAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSR 719
             +        +      R    +  G+   PK  R         E  + G    KR S 
Sbjct: 324 AEEEDDEDGAMEVVSPLPRPRQHYPLGF---PKRRRPTWTPPSSLEDLSAGRHHPKRASL 380

Query: 720 WDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
                ++    A TP A  PGG   + P  P   S P   P+ V  +     A   P+  
Sbjct: 381 PTRKRRSARHAA-TPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE 439

Query: 780 HLAAMTPEQIQAYRWEREIDERNRPLTD-------DELEAMFPPGYKVLQPPAA 826
             +   P      +      E      D       D L    PP     +PP A
Sbjct: 440 PGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPP-----EPPGA 488



 Score = 34.1 bits (78), Expect = 0.69
 Identities = 33/211 (15%), Positives = 51/211 (24%), Gaps = 12/211 (5%)

Query: 609  PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
            PA          P   A    A+P+     A P     G ATPG          ++    
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGPARPARPPTTA---- 2764

Query: 669  RWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATP 728
                 P         ++G              +  +  P+P   +    +         P
Sbjct: 2765 ----GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820

Query: 729  SGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQ 788
            + +       P    P+ P  P     P +   G    G                  P +
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880

Query: 789  IQAYRWEREIDERNR---PLTDDELEAMFPP 816
                R  R    R+     L  D+ E    P
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQP 2911



 Score = 33.4 bits (76), Expect = 1.2
 Identities = 42/211 (19%), Positives = 56/211 (26%), Gaps = 32/211 (15%)

Query: 609  PAATRWDETPGHPKPGAETPG-----ATPSTRLWDATPGHATPGAATPGRETPSHDKAQS 663
            P     D  P  P P A  P        P        P  A    + P R       AQ+
Sbjct: 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP--APGRVSRPRRARRLGRAAQA 2676

Query: 664  SIRRNRWDETPKTERETPGHSSGWAETPKTDR--AGPGGDLIQETPTPGGVSSKRRSR-- 719
            S    R     +  R T G  +  A+ P            L+  TP P G ++ R++   
Sbjct: 2677 SSPPQRP--RRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA 2734

Query: 720  --WDETPQATPSGAMT---------------PSAATPGGMTPSTP--ITPHVGSTPLMTP 760
                  P A P+G  T               P A  P     + P           L   
Sbjct: 2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794

Query: 761  SGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
                P+               AA+ P    A
Sbjct: 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPA 2825



 Score = 30.7 bits (69), Expect = 7.9
 Identities = 44/247 (17%), Positives = 55/247 (22%), Gaps = 35/247 (14%)

Query: 581  TTARAFAVVASALGIPSLLPFLKAGGVTPAATRWDETPGHPK-PGAETPGATPSTRLWDA 639
                A    + AL      P + AG  TP        P     P A  P A P+      
Sbjct: 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA----G 2778

Query: 640  TPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPG 699
             P   T  A      + S +   S      WD            +   A  P    AGP 
Sbjct: 2779 PPRRLTRPAVASL--SESRESLPSP-----WDPADPPAAVLAPAA---ALPPAASPAGP- 2827

Query: 700  GDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPIT--PHVGSTPL 757
                   P P             T    P G   PS    G + P   +   P   S   
Sbjct: 2828 ------LPPPTSAQ--------PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA- 2872

Query: 758  MTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPG 817
                   P       +A P                    +      P    +      P 
Sbjct: 2873 --AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930

Query: 818  YKVLQPP 824
                 PP
Sbjct: 2931 PPPPPPP 2937


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 42.4 bits (99), Expect = 0.002
 Identities = 29/87 (33%), Positives = 35/87 (40%), Gaps = 14/87 (16%)

Query: 697 GPGGDLIQETPTPGGVSSKRRSRWDETPQAT-PSGAMTPSAATPGGMTPS---TPITPHV 752
           GP        PTPG             PQ T P+  MTPS+A P    PS   +P  P  
Sbjct: 388 GPASGGAATIPTPGTQG----------PQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437

Query: 753 GSTPLMTPSGVTPTGNKAMAMATPTPG 779
            + P    SG+ P     M  A+P PG
Sbjct: 438 DAPPAPPRSGIPPRPAPRMPEASPVPG 464



 Score = 32.7 bits (74), Expect = 1.5
 Identities = 26/107 (24%), Positives = 34/107 (31%), Gaps = 14/107 (13%)

Query: 673 TPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAM 732
           TP T+    G +     TP +  A P        P+P       R  WD+ P A P   +
Sbjct: 399 TPGTQG-PQGTAPAAGMTPSS--AAPATPAPSAAPSP-------RVPWDDAPPAPPRSGI 448

Query: 733 TPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
            P  A         P  P   +    + S   PT       A  TP 
Sbjct: 449 PPRPAPRMPEASPVPGAPDSVA----SASDAPPTLGDPSDTAEHTPS 491



 Score = 32.3 bits (73), Expect = 2.0
 Identities = 24/90 (26%), Positives = 27/90 (30%), Gaps = 7/90 (7%)

Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRL-WDATPGHATPGAATPGRETPSHDKAQSS 664
           G  PAA     TP    P    P A PS R+ WD  P  A P +  P R  P   +A   
Sbjct: 407 GTAPAAGM---TPSSAAPATPAPSAAPSPRVPWDDAPP-APPRSGIPPRPAPRMPEASPV 462

Query: 665 IRRNRWDETPKTERETPGHSSGWAETPKTD 694
                 D         P           T 
Sbjct: 463 P--GAPDSVASASDAPPTLGDPSDTAEHTP 490


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 40.4 bits (94), Expect = 0.006
 Identities = 31/156 (19%), Positives = 47/156 (30%), Gaps = 17/156 (10%)

Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
            P           P        AT + RL       +T    T G  T S+ + Q+    
Sbjct: 186 HPTYPSQVTPQSQPATQGHQT-ATANQRL------SSTEPVGTQGTTTSSNPEPQTEPPP 238

Query: 668 NRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQAT 727
           ++   +  + +  P  +S           G G +  Q   TP   S++R      TP  T
Sbjct: 239 SQRGPS-GSPQHPPSTTS-----QDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292

Query: 728 P----SGAMTPSAATPGGMTPSTPITPHVGSTPLMT 759
                +G  TP          S P +   G     T
Sbjct: 293 TKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
            related to armadillo/beta-catenin-like repeats (see
            pfam00514). These EZ repeats are found in subunits of
            cyanobacterial phycocyanin lyase and other proteins and
            probably carry out a scaffolding role.
          Length = 55

 Score = 35.9 bits (83), Expect = 0.007
 Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 1042 PEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRI 1097
             EV  +   AL A+    G   + P + +LLP L P+LK+  ++V+E     +GRI
Sbjct: 1    WEVREAAALALGALAGG-GPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 34.3 bits (79), Expect = 0.023
 Identities = 11/48 (22%), Positives = 23/48 (47%)

Query: 966  LNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRI 1013
                G +     + ++P +P++   +L  L +   +VR+ AA  + RI
Sbjct: 8    ALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 28.2 bits (63), Expect = 3.7
 Identities = 12/53 (22%), Positives = 20/53 (37%)

Query: 1198 VQNGVLKALSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTACATIKHM 1250
            V+     AL  L     E+ +  +  + P L   L D D   R+ A   +  +
Sbjct: 3    VREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55


>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 121

 Score = 37.1 bits (86), Expect = 0.010
 Identities = 32/155 (20%), Positives = 40/155 (25%), Gaps = 42/155 (27%)

Query: 617 TPGHPKPGAETP---GATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDET 673
           TP     G++TP     TP T    A          +  R TP+   A S          
Sbjct: 4   TPAWGASGSKTPAWGSRTPGTAAGGAPTA----RGGSGSR-TPAWGGAGSR--------- 49

Query: 674 PKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMT 733
                 TP          +T   G          +  G  S   S       A   G   
Sbjct: 50  ------TPAWGGAGPTGSRTPAWGGA--------SAWGNKSSEGSAS---SWAAGPGG-A 91

Query: 734 PSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGN 768
             A TPG     +   P        TP G    G+
Sbjct: 92  YGAPTPGYGGTPSAYGPA-------TPGGGAMAGS 119



 Score = 31.7 bits (72), Expect = 0.73
 Identities = 22/111 (19%), Positives = 32/111 (28%), Gaps = 19/111 (17%)

Query: 688 AETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPS-T 746
             TP    +G       +TP  G   +   +          SG+ TP+    G  TP+  
Sbjct: 2   GRTPAWGASG------SKTPAWGS-RTPGTAAGGAPTARGGSGSRTPAWGGAGSRTPAWG 54

Query: 747 PITPHVGSTP-----------LMTPSGVTPTGNKAMAMATPTPGHLAAMTP 786
              P    TP               S  +       A   PTPG+    + 
Sbjct: 55  GAGPTGSRTPAWGGASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSA 105


>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 112

 Score = 37.1 bits (86), Expect = 0.010
 Identities = 22/67 (32%), Positives = 24/67 (35%), Gaps = 10/67 (14%)

Query: 600 PFLKAGGVTPA--ATRWDETP----GHPKPGAETPGATPSTRLWDATPGH----ATPGAA 649
           P    G  TPA        TP         GA TPGA P+      TPG      TPG +
Sbjct: 46  PAFGDGSRTPAYGGESNGRTPAYGGSGSYLGASTPGAAPTPAKSAPTPGAWGGAPTPGYS 105

Query: 650 TPGRETP 656
                TP
Sbjct: 106 ADSAPTP 112



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 31/146 (21%), Positives = 41/146 (28%), Gaps = 37/146 (25%)

Query: 624 GAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETP---KTERET 680
           G+ TP    ++     TP     G+ TP                     TP       +T
Sbjct: 1   GSRTPAY--NSSAGSRTPAWGQGGSRTP---AYGGAG----------GRTPAWGGAGNKT 45

Query: 681 PGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPG 740
           P    G + TP       GG+    TP  GG              +   GA TP AA   
Sbjct: 46  PAFGDG-SRTPAY-----GGESNGRTPAYGG------------SGSYL-GASTPGAAPTP 86

Query: 741 GMTPSTPITPHVGSTPLMTPSGVTPT 766
             +  TP       TP  +       
Sbjct: 87  AKSAPTPGAWGGAPTPGYSADSAPTP 112


>gnl|CDD|227493 COG5164, SPT5, Transcription elongation factor [Transcription].
          Length = 607

 Score = 40.0 bits (93), Expect = 0.010
 Identities = 32/174 (18%), Positives = 44/174 (25%), Gaps = 20/174 (11%)

Query: 628 PGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTER-ETPGHSSG 686
                S  +   TP +AT G A     T S     S  R   W + PKT     PG   G
Sbjct: 421 KPLRYSEAIGSKTPNYATGGIAAGAAATSS---GLSGYRTPGWKDGPKTPALNDPGIDWG 477

Query: 687 WAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWD--ETPQATPSGAMTPS----AATPG 740
             E     +      L +             + W    T  A+    + P          
Sbjct: 478 -DEKDNWYKGSLIHALGKGLALEDYRDGLFMTAWKGQATS-ASEISFVQPRWNNLVCVLD 535

Query: 741 GMTPSTP-----ITPHVGSTPLM-TPSGVT--PTGNKAMAMATPTPGHLAAMTP 786
           G    T      I    G T +  T   V   P  + +  +             
Sbjct: 536 GEKKGTCGILNGINGDWGGTVIRSTKGSVVHWPMNDNSKKIYDGGASAWGNQDD 589


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 39.8 bits (92), Expect = 0.011
 Identities = 42/191 (21%), Positives = 56/191 (29%), Gaps = 20/191 (10%)

Query: 609 PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
           P  T        P+   E+P    +T +  + P   T    T    T S   +Q      
Sbjct: 543 PTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGI-- 600

Query: 669 RWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATP 728
                P +   TP  +S       T     GG+ I E  TP   S+   S     P    
Sbjct: 601 -----PSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE-TPSVPSTTHVSTLSPGPGPGT 654

Query: 729 ----SGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGV-----TPTGNKAMAMA--TPT 777
               SG    S +   G    T   P+  +T    PSG      T T     A +    T
Sbjct: 655 TSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKET 714

Query: 778 PGHLAAMTPEQ 788
            G    M    
Sbjct: 715 SGS-TLMASTS 724



 Score = 34.8 bits (79), Expect = 0.41
 Identities = 36/187 (19%), Positives = 54/187 (28%), Gaps = 23/187 (12%)

Query: 594 GIPSLL---PFLKAGGVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAAT 650
           GIPS     P   +   TP  T    T G  +   E   + PST        H +  +  
Sbjct: 599 GIPSSSHSTPRSNSTSTTPLLTSAHPTGG--ENITEETPSVPSTT-------HVSTLSPG 649

Query: 651 PGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPG 710
           PG  T S      +   +R+          PG        P  +   P     Q+T  P 
Sbjct: 650 PGPGTTSQVSGPGNSSTSRY----------PGEVHVTEGMPNPNATSPSAPSGQKTAVPT 699

Query: 711 GVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKA 770
             S+  ++      + + S  M  ++         T         P  T S + P     
Sbjct: 700 VTSTGGKANSTTK-ETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFT 758

Query: 771 MAMATPT 777
               T  
Sbjct: 759 SPPVTTK 765



 Score = 34.8 bits (79), Expect = 0.41
 Identities = 28/123 (22%), Positives = 34/123 (27%), Gaps = 25/123 (20%)

Query: 617 TPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKT 676
           T G P P A +P A    +    T       A +  +ET                 T   
Sbjct: 676 TEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGS--------------TLMA 721

Query: 677 ERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET-PQATPSGAMTPS 735
                 +   +  TP              T  P   SSK R RW  T P  T   A  P 
Sbjct: 722 STSPHTNEGAFRTTPYN----------ATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPV 771

Query: 736 AAT 738
             T
Sbjct: 772 PPT 774



 Score = 31.3 bits (70), Expect = 4.1
 Identities = 35/166 (21%), Positives = 53/166 (31%), Gaps = 11/166 (6%)

Query: 617 TPGHPKPGAETPGATPS---TRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDET 673
           T  H +P   T   T           T   A P + TP   T S     +S        T
Sbjct: 437 TLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSAT 496

Query: 674 PKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMT 733
           P     TP  ++  A +P T +     +    TP   GV++        T  + P+G  +
Sbjct: 497 PNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTT--------TATSPPTGTTS 548

Query: 734 PSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
              AT   +T  +P+           PS +T            +P 
Sbjct: 549 VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPT 594


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 39.7 bits (92), Expect = 0.012
 Identities = 41/174 (23%), Positives = 58/174 (33%), Gaps = 8/174 (4%)

Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSH--DKAQS 663
                A      PG  +P A  PG     R   A PG A P AA PG  TP        +
Sbjct: 729 AAPGRARPPAAAPGRARPPAAAPGRA---RPPAAAPGRARPPAAAPGAPTPQPPPQAPPA 785

Query: 664 SIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET 723
             +R R   TP+   +    S          + GP   ++++  T G    +   +    
Sbjct: 786 PQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAA 845

Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
            +   +   TPS   PG  T    +   V   P++ P  V        A A  T
Sbjct: 846 LERQAAAGPTPS---PGSGTSDKIVQAPVFYPPVLQPIQVMRQLGSVRAAAAST 896



 Score = 35.8 bits (82), Expect = 0.22
 Identities = 43/210 (20%), Positives = 59/210 (28%), Gaps = 35/210 (16%)

Query: 609 PAATRWDETPGHPKPG----AETPGATPST-RLWDATPGHA-TPGAATPGRETPSHDKAQ 662
           PA+T        P PG       P  +P+T +L  + P +A TP       +TP     Q
Sbjct: 553 PASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQ 612

Query: 663 SSIRRN----RWDETPKTERETPGHSSGWAETPKTDRAGPGGD----------------- 701
           S I       +W   P   R  P                P                    
Sbjct: 613 SHIPETSAPRQW---PMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQI 669

Query: 702 -LIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTP 760
             I   P+P G ++    +W       P  A TP    P    P         +T    P
Sbjct: 670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTP-MRPPAA--PPGRAQRPAAATGRARP 726

Query: 761 SGVTPTGNKAMAMATPTPGHLAAMTPEQIQ 790
               P   +  A A P      A  P + +
Sbjct: 727 PAAAPGRARPPAAA-PGRARPPAAAPGRAR 755



 Score = 35.4 bits (81), Expect = 0.24
 Identities = 44/192 (22%), Positives = 54/192 (28%), Gaps = 26/192 (13%)

Query: 608 TPAATRWDETPGHPKPGAETP--GATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSI 665
            P        P    PG       AT   R   A PG A P AA PGR  P         
Sbjct: 696 PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPA------- 748

Query: 666 RRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
                   P   R         A       A   G    + P     + ++R R   TPQ
Sbjct: 749 ------AAPGRARPPAA-----APGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQ 797

Query: 726 ATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMT------PSGVTPTGNKAMAMATPTPG 779
             P    T     P         T  +    L        PS   P   +  A A PTP 
Sbjct: 798 PPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPS 857

Query: 780 HLAAMTPEQIQA 791
             +  + + +QA
Sbjct: 858 PGSGTSDKIVQA 869


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 38.8 bits (90), Expect = 0.019
 Identities = 30/173 (17%), Positives = 42/173 (24%), Gaps = 28/173 (16%)

Query: 621 PKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERET 680
              GA       S     AT         TP         A S          P T    
Sbjct: 123 GNAGASASITKTSNGSDAATTSSTAN---TPAAAKVLKANAAS---------APNTT-TG 169

Query: 681 PGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPG 740
            G ++  A    T    P          P      +++      +A  S      A+   
Sbjct: 170 VGSAATTAAISATTATTPT----TTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYN 225

Query: 741 GMTPSTPITPHVGSTPLMTPSGVTPTGNKAM-----AMATPTPGHLAAMTPEQ 788
            M   +   P   S     P+               AMA+P    L  M+P +
Sbjct: 226 QM--GSLGVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASP----LNNMSPMR 272



 Score = 35.4 bits (81), Expect = 0.18
 Identities = 12/76 (15%), Positives = 24/76 (31%), Gaps = 3/76 (3%)

Query: 705 ETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVT 764
           +    G  +S  ++        T S A TP+AA    +  +    P+  +  + + +   
Sbjct: 121 QLGNAGASASITKTSNGSDAATTSSTANTPAAAK--VLKANAASAPNTTTG-VGSAATTA 177

Query: 765 PTGNKAMAMATPTPGH 780
                     T T   
Sbjct: 178 AISATTATTPTTTQKK 193


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 37.8 bits (88), Expect = 0.038
 Identities = 42/232 (18%), Positives = 66/232 (28%), Gaps = 27/232 (11%)

Query: 607 VTPAATRW--DETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSS 664
           V PA+T+   +ET   P     +P +   +   +      +   A P +   S+    S 
Sbjct: 247 VWPASTQSPREETSAEP-LAPASPSSRRPSTAQE-EQIACSSPQAEPEQGVQSYVPQSSD 304

Query: 665 IRRNRWDETPKTERETPG-------HSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRR 717
            R +     P      P          +     P+       G         GG     R
Sbjct: 305 SRPSC---FPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPR 361

Query: 718 SRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
            R   + Q+ PS +   S+         TP         L+ P+ V      A     PT
Sbjct: 362 GRPSGSGQSPPSNSPLLSS------LADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPT 415

Query: 778 PGH---LAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA 826
           PG          E   +     + D+         + A  P  Y +  PP  
Sbjct: 416 PGSSLVPQPDEQELGPSVMALLDRDQG----ILALILAPIPGLYDLSGPPNL 463


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 36.0 bits (82), Expect = 0.057
 Identities = 33/173 (19%), Positives = 51/173 (29%), Gaps = 9/173 (5%)

Query: 621 PKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERET 680
           P  G   PG   S +     P    P  + P  + P        +      + P+     
Sbjct: 2   PVGGGSVPGPLQSKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQM------QVPQNVSLH 55

Query: 681 PGHSSGWA--ETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAAT 738
             H +     E  K  + G        TP                P   P     P  + 
Sbjct: 56  AAHDAAQKAVEAAKNQKQGLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSP 115

Query: 739 PGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMA-TPTPGHLAAMTPEQIQ 790
              ++  T ++P  G  P++T   V P   +  +M  TP  G +    P Q Q
Sbjct: 116 ASQLSLVTTVSPGSGLAPVLTQQQVPPQQPQQPSMVPTPALGGVQPPQPSQQQ 168


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 36.6 bits (85), Expect = 0.082
 Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 5/117 (4%)

Query: 136 AKKKVAAAEWEKEADVNIEAQIRDIQNRKKDVSEGGGEEKQKRVGLGESGYFDSDIYDGG 195
             +K+A AE EK     ++ +I  +      + E   E + K    GE G  + +  +  
Sbjct: 20  EDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAK 79

Query: 196 ---GKFEGYVKSIADDDFDYQASFNQNKRS--GYTAPAALLNDIAQSEKDYDPFADR 247
               +F  Y++   D   + + + +    S  GY  P     +I +  ++Y P    
Sbjct: 80  AEAAEFRAYLRGGDDALAEERKALSTGTDSDGGYLVPEEFSTEIIELLREYSPLRQL 136


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 36.3 bits (84), Expect = 0.14
 Identities = 28/134 (20%), Positives = 35/134 (26%), Gaps = 14/134 (10%)

Query: 620 HPKPGAETPGATPSTRLWDATPGHATPGA--ATPGRETPSHDKAQSSIRRNRWDETPKTE 677
            P+PGA           W        P A  A+P    P  +         R  E     
Sbjct: 152 RPEPGAWPR--AADDYGWQQQRLGFPPRAPYASPASYAPEQE---------RDREPYDAG 200

Query: 678 RETPGHSSGWAETPKTDRAGPGGDLIQET-PTPGGVSSKRRSRWDETPQATPSGAMTPSA 736
           R          + P+ D   P  D      P PG     R           P   + PSA
Sbjct: 201 RPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSA 260

Query: 737 ATPGGMTPSTPITP 750
             P    P+    P
Sbjct: 261 PGPLAAQPAPAPGP 274



 Score = 31.3 bits (71), Expect = 4.3
 Identities = 21/133 (15%), Positives = 29/133 (21%), Gaps = 17/133 (12%)

Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
                 W +      P A                              P +D+ +     
Sbjct: 161 AADDYGWQQQRLGFPPRAPYASPASYAP-------EQERDREPYDAGRPEYDQRRRDYDH 213

Query: 668 NR--WDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
            R  WD   +   + P    G        R GPG     + P         R        
Sbjct: 214 PRPDWDRPRRDRTDRPEPPPGAGHVH---RGGPGPPERDDAPVV-----PIRPSAPGPLA 265

Query: 726 ATPSGAMTPSAAT 738
           A P+ A  P   T
Sbjct: 266 AQPAPAPGPGEPT 278


>gnl|CDD|202817 pfam03937, Sdh5, Flavinator of succinate dehydrogenase.  This
           family of uncharacterized proteins. Based on personal
           observation it was previously annotated in Pfam as being
           a divergent TPR repeat but structural evidence has
           indicated this is not true.This family is now found to
           be a highly conserved mitochondrial protein, Sdh5. Both
           yeast and human Sdh5 interact with the catalytic subunit
           of the succinate dehydrogenase (SDH) complex, a
           component of both the electron transport chain and the
           tricarboxylic acid cycle. Sdh5 is required for
           SDH-dependent respiration and for Sdh1 flavination
           (incorporation of the flavin adenine dinucleotide
           cofactor). Mutational inactivation of Sdh5 confers tumor
           susceptibility in humans.
          Length = 51

 Score = 32.0 bits (74), Expect = 0.14
 Identities = 10/39 (25%), Positives = 20/39 (51%), Gaps = 10/39 (25%)

Query: 259 YRAIRRRM-----IISPERVDPFAEGHLAAMTPEQIQAY 292
           +R+ RR M     ++       FA+ HL  ++ E++ A+
Sbjct: 1   WRSWRRGMLELDLLLGR-----FADAHLDTLSEEELDAF 34


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 34.4 bits (79), Expect = 0.46
 Identities = 31/188 (16%), Positives = 44/188 (23%), Gaps = 24/188 (12%)

Query: 604 AGGVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQS 663
            GGV             P PGA    A  ++    A P       A      P     ++
Sbjct: 369 GGGVPARVAG-----AVPAPGARAAAAVGAS----AVPAVTAVTGAAGAALAP-----KA 414

Query: 664 SIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET 723
           +                P  ++   +      A                  +     D  
Sbjct: 415 AAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDER-----DAQ 469

Query: 724 PQATPSGAMTPSAATPGGMT--PSTPITPHVGSTPLMTPSGVTPTG---NKAMAMATPTP 778
           P A    A  P++  P      P+        +TP   P    P       A A A P  
Sbjct: 470 PPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPA 529

Query: 779 GHLAAMTP 786
                 TP
Sbjct: 530 PEARPPTP 537



 Score = 30.2 bits (68), Expect = 9.2
 Identities = 13/109 (11%), Positives = 21/109 (19%), Gaps = 7/109 (6%)

Query: 680 TPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATP 739
           T G + G     +   A P                                A+ P AA  
Sbjct: 363 TGGGAPGGGVPARVAGAVPAP-----GARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA 417

Query: 740 GGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQ 788
                +    P     P  T        +    +        +A +   
Sbjct: 418 --AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCD 464


>gnl|CDD|221752 pfam12752, SUZ, SUZ domain.  The SUZ domain is a conserved
           RNA-binding domain found in eukaryotes and enriched in
           positively charged amino acids. It was first
           characterized in the C.elegans protein Szy-20 where it
           has been shown to bind RNA and allow their localization
           to the centrosome. Warning- the domain has a
           compositionally biased character.
          Length = 57

 Score = 30.0 bits (68), Expect = 0.79
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 246 DRRQKTVAEKEDEYRAIRRR 265
           DR+ K++ E+E EY   R R
Sbjct: 32  DRKAKSLEEREAEYAEARER 51


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 33.5 bits (76), Expect = 0.86
 Identities = 22/103 (21%), Positives = 35/103 (33%), Gaps = 16/103 (15%)

Query: 700 GDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITP--------- 750
           G+      TP G ++   + +      +PS  + PS + P G   S P TP         
Sbjct: 564 GNPGSTNSTPTGSAASSNTTFSSD---SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTS 620

Query: 751 ----HVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQI 789
               H+GS      S  +     +   A+P      A T   +
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSV 663


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 33.4 bits (76), Expect = 0.92
 Identities = 34/112 (30%), Positives = 43/112 (38%), Gaps = 14/112 (12%)

Query: 672 ETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQAT--PS 729
             P  E    G S   A      R GP    +  TP P         R   +P+ T  PS
Sbjct: 174 APPLGEGSADG-SCDPALPLSAPRLGPADVFVPATPRP-------TPRTTASPETTPTPS 225

Query: 730 GAMTPSAATPGGMTPSTPITPHVGSTPLM--TPSGVTPTGNKAM-AMATPTP 778
              +P + T    +  T   P  G+TP    TP+  TP G +A  A ATP P
Sbjct: 226 TTTSPPSTTIPAPST-TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAP 276



 Score = 30.7 bits (69), Expect = 6.2
 Identities = 16/66 (24%), Positives = 21/66 (31%), Gaps = 2/66 (3%)

Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
           TP  +     P    P   T  A P         G  TP   TPG        A  +   
Sbjct: 221 TPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG--TPAPPTPGGGEAPPANATPAPEA 278

Query: 668 NRWDET 673
           +R++ T
Sbjct: 279 SRYELT 284


>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein.  This family
           is found in Trypanosome species, and appears to be one
           of two invariant surface glycoproteins, ISG65 and ISG75.
           that are found in the mammalian stage of the parasitic
           protozoan. the sequence suggests the two families are
           polypeptides with N-terminal signal sequences,
           hydrophilic extracellular domains, single trans-membrane
           alpha-helices and short cytoplasmic domains. they are
           both expressed in the bloodstream form but not in the
           midgut stage. Both polypeptides are distributed over the
           entire surface of the parasite.
          Length = 286

 Score = 32.9 bits (75), Expect = 0.93
 Identities = 27/157 (17%), Positives = 52/157 (33%), Gaps = 27/157 (17%)

Query: 71  DVGSRTYAEVMREQLVRGEEKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGRWDQTSD 130
            + SR+ ++     L   ++K++++   +KAKD   +A+P   +A    +K         
Sbjct: 84  KLESRSDSK-----LSDSDKKKIKEIC-EKAKDKIDEALPEAKKAGREAEKLA------- 130

Query: 131 GDVTPAKKKVAAAEWEKEADVNIEAQIRDIQN---RKKDVSEGGGEEKQKRVGLGES--G 185
            +V  A  K  A  W  +   +    +  + N     K   E        +    ++  G
Sbjct: 131 EEVKEAAAK--ALGWALDEGTSNSTGLSKVLNWHCGGKKPGEN------AKSSPSQNCDG 182

Query: 186 YFDSDIYDGGGKFEGYVKSI-ADDDFDYQASFNQNKR 221
                 Y  GG+          D  F    +    K 
Sbjct: 183 IAFKRHYSDGGRNVIDCSDTGVDRVFYTDVTSEAMKS 219


>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats.  Approx. 40
            amino acid repeat. Tandem repeats form superhelix of
            helices that is proposed to mediate interaction of
            beta-catenin with its ligands. Involved in transducing
            the Wingless/Wnt signal. In plakoglobin arm repeats bind
            alpha-catenin and N-cadherin.
          Length = 41

 Score = 29.3 bits (67), Expect = 0.98
 Identities = 7/28 (25%), Positives = 14/28 (50%)

Query: 1072 LPRLTPILKNRHEKVQENCIDLVGRIAD 1099
            LP L  +LK+  E+V +     +  ++ 
Sbjct: 14   LPALVELLKSEDEEVVKEAAWALSNLSS 41


>gnl|CDD|220165 pfam09290, AcetDehyd-dimer, Prokaryotic acetaldehyde dehydrogenase,
           dimerisation.  Members of this family are found in
           prokaryotic acetaldehyde dehydrogenase (acylating), and
           adopt a structure consisting of an
           alpha-beta-alpha-beta(3) core. They mediate dimerisation
           of the protein.
          Length = 137

 Score = 31.7 bits (73), Expect = 1.0
 Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 10/51 (19%)

Query: 543 ARVEGREIISNLA-KAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASA 592
           A V+  EI++++A K+AG  T     R    NIDE+   TTARA   V  A
Sbjct: 17  APVDYAEIVASIASKSAGPGT-----R---ANIDEFTE-TTARAIEKVGGA 58


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 33.3 bits (76), Expect = 1.2
 Identities = 38/195 (19%), Positives = 56/195 (28%), Gaps = 20/195 (10%)

Query: 642 GHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGD 701
           G A P  A      P    A ++         P      P  +   A   +   A P   
Sbjct: 370 GGAGPATAAAA---PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA-- 424

Query: 702 LIQETPTPGGVSSKRRSRWDE---TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLM 758
             + +P P  +++ R++        P   P+ A  P+AA         P+     + P  
Sbjct: 425 --RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPAR 482

Query: 759 TPSGVTPTGNKAMAMATP----TPGHLAAMTPEQIQAYRWEREIDERNRPLT---DDELE 811
                 P          P     P   A+  P Q  A       +    P T   DD  E
Sbjct: 483 AAPAAAP---APADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFE 539

Query: 812 AMFPPGYKVLQPPAA 826
            + P       P AA
Sbjct: 540 TLAPAPAAAPAPRAA 554



 Score = 30.2 bits (68), Expect = 8.5
 Identities = 29/186 (15%), Positives = 40/186 (21%), Gaps = 11/186 (5%)

Query: 607 VTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIR 666
             PAA         P      P A P+          A P   +P  E  +  +  S+  
Sbjct: 385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAA-PARRSPAPEALAAARQASARG 443

Query: 667 RNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSR------W 720
                         P      A  P      P        P     ++           W
Sbjct: 444 PGGAPAPAPAPAAAPAA----AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499

Query: 721 DETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGH 780
           +E P    S A     A P G    +   P          +            A      
Sbjct: 500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEP 559

Query: 781 LAAMTP 786
           + A  P
Sbjct: 560 VVAPRP 565


>gnl|CDD|239573 cd03493, SQR_QFR_TM, Succinate:quinone oxidoreductase (SQR) and
           Quinol:fumarate reductase (QFR) family, transmembrane
           subunits; SQR catalyzes the oxidation of succinate to
           fumarate coupled to the reduction of quinone to quinol,
           while QFR catalyzes the reverse reaction. SQR, also
           called succinate dehydrogenase or Complex II, is part of
           the citric acid cycle and the aerobic respiratory chain,
           while QFR is involved in anaerobic respiration with
           fumarate as the terminal electron acceptor. SQRs may
           reduce either high or low potential quinones while QFRs
           oxidize only low potential quinols. SQR and QFR share a
           common subunit arrangement, composed of a flavoprotein
           catalytic subunit, an iron-sulfur protein and one or two
           hydrophobic transmembrane subunits. The structural
           arrangement allows efficient electron transfer between
           the catalytic subunit, through iron-sulfur centers, and
           the transmembrane subunit(s) containing the electron
           donor/acceptor (quinol or quinone). The reversible
           reduction of quinone is an essential feature of
           respiration, allowing the transfer of electrons between
           respiratory complexes. SQRs and QFRs can be classified
           into five types (A-E) according to the number of their
           hydrophobic subunits and heme groups. This
           classification is consistent with the characteristics
           and phylogeny of the catalytic and iron-sulfur subunits.
           Type E proteins, e.g. non-classical archael SQRs,
           contain atypical transmembrane subunits and are not
           included in this hierarchy. The heme and quinone binding
           sites reside in the transmembrane subunits. Although
           succinate oxidation and fumarate reduction are carried
           out by separate enzymes in most organisms, some
           bifunctional enzymes that exhibit both SQR and QFR
           activities exist.
          Length = 98

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 10/46 (21%), Positives = 16/46 (34%)

Query: 831 GCSMFAPTVWHLWIYTRSANANFYFGVTLAFATSQIFLLTDLLFAY 876
           G ++      HL          + F   +AF +S +  L  LL   
Sbjct: 7   GVALLLFLPLHLLGLLALLGGPYAFAEVVAFLSSPLGKLLYLLLLL 52


>gnl|CDD|184128 PRK13541, PRK13541, cytochrome c biogenesis protein CcmA;
           Provisional.
          Length = 195

 Score = 32.2 bits (73), Expect = 1.4
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 11/76 (14%)

Query: 851 ANFYFGVTLAFATSQIFLLTDLLFAYLKRDYTLENGIQKTIKGKPARLVGASE---IINR 907
           +  Y      +A    F L DLL    ++ Y+L +G+QK +    ARL+       +++ 
Sbjct: 95  SEIYNSAETLYAAIHYFKLHDLLD---EKCYSLSSGMQKIV--AIARLIACQSDLWLLDE 149

Query: 908 VVDDLKDENEQYRKMV 923
           V  +L  EN   R ++
Sbjct: 150 VETNLSKEN---RDLL 162


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.4 bits (74), Expect = 1.6
 Identities = 20/73 (27%), Positives = 25/73 (34%), Gaps = 1/73 (1%)

Query: 720 WDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
           WD +PQA P  A  P        T +    P        TP+G +P  N     A P   
Sbjct: 49  WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTG-SPAANPPATFALPAGP 107

Query: 780 HLAAMTPEQIQAY 792
               +  E  Q Y
Sbjct: 108 AGPTIQTEPGQLY 120



 Score = 30.9 bits (70), Expect = 4.9
 Identities = 17/79 (21%), Positives = 24/79 (30%), Gaps = 10/79 (12%)

Query: 723 TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVT----------PTGNKAMA 772
           TP  +P+     + A P G    T  T      P+  P  VT          P   +A+ 
Sbjct: 88  TPTGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQ 147

Query: 773 MATPTPGHLAAMTPEQIQA 791
                 G  A+      Q 
Sbjct: 148 QLQQRYGAPASGQLPSQQQ 166


>gnl|CDD|236615 PRK09709, PRK09709, exonuclease VIII; Reviewed.
          Length = 877

 Score = 32.6 bits (74), Expect = 1.9
 Identities = 26/106 (24%), Positives = 38/106 (35%), Gaps = 11/106 (10%)

Query: 672 ETPKTERETPGHSSGWAET--PKTDRAGPGGDLI------QETPTPGGVSSKRRSRWDET 723
           E P      P H + +      +TD A P  +++         P P  V+ + +   +E 
Sbjct: 341 EAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEK 400

Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNK 769
           PQ  PSG      AT   + P      H  + PL   S V     K
Sbjct: 401 PQ--PSGTTADEQATAETVEPDA-TEHHQDTQPLDAQSQVNSVDAK 443


>gnl|CDD|213664 TIGR01909, C_GCAxxG_C_C, C_GCAxxG_C_C family probable redox protein. 
            This model represents a putative redox-active protein of
            about 140 residues, with four perfectly conserved Cys
            residues. It includes a CGAXXG motif. Most members are
            found within one or two loci of transporter or
            oxidoreductase genes. A member from Geobacter
            sulfurreducens, located in a molybdenum transporter
            operon, has a TAT (twin-arginine translocation) signal
            sequence for Sec-independent transport across the plasma
            membrane, a hallmark of bound prosthetic groups such as
            FeS clusters.
          Length = 120

 Score = 30.6 bits (69), Expect = 1.9
 Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 11/79 (13%)

Query: 1046 GSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQE-----NCIDLVGRIADR 1100
            G I GAL   V  +G+   T    + +     +    +EK +E     NC DL  +    
Sbjct: 38   GHICGALTGAVMALGLYFGTEGDGEQVNNTYALCAELYEKFREKNGSPNCGDLNPKKDLV 97

Query: 1101 GPEYVSAREWMRICFELLE 1119
              E        R C E++ 
Sbjct: 98   SKEQ------RRKCIEIVS 110


>gnl|CDD|233516 TIGR01661, ELAV_HUD_SF, ELAV/HuD family splicing factor.  This
           model describes the ELAV/HuD subfamily of splicing
           factors found in metazoa. HuD stands for the human
           paraneoplastic encephalomyelitis antigen D of which
           there are 4 variants in human. ELAV stnds for the
           Drosophila Embryonic lethal abnormal visual protein.
           ELAV-like splicing factors are also known in human as
           HuB (ELAV-like protein 2), HuC (ELAV-like protein 3,
           Paraneoplastic cerebellar degeneration-associated
           antigen) and HuR (ELAV-like protein 1). These genes are
           most closely related to the sex-lethal subfamily of
           splicing factors found in Dipteran insects (TIGR01659).
           These proteins contain 3 RNA-recognition motifs (rrm:
           pfam00076).
          Length = 352

 Score = 31.8 bits (72), Expect = 2.3
 Identities = 27/116 (23%), Positives = 45/116 (38%), Gaps = 9/116 (7%)

Query: 723 TPQATPSGAMTPSAATPGGMTPS-----TPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
           T   T   A  PS++   G+         P T  V  + ++T +G+ P  + A A   P+
Sbjct: 160 TEPITVKFANNPSSSNSKGLLSQLEAVQNPQTTRVPLSTILTAAGIGPM-HHAAARFRPS 218

Query: 778 PG-HLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA--CFFI 830
            G   A +  +Q Q    ++   +R  P   D   A    G ++     A  C F+
Sbjct: 219 AGDFTAVLAHQQQQHAVAQQHAAQRASPPATDGQTAGLAAGAQIAASDGAGYCIFV 274


>gnl|CDD|173356 PTZ00061, PTZ00061, DNA-directed RNA polymerase; Provisional.
          Length = 205

 Score = 31.3 bits (71), Expect = 2.7
 Identities = 29/117 (24%), Positives = 48/117 (41%), Gaps = 19/117 (16%)

Query: 209 DFDYQASFNQNKRSGYTAPAALLNDIAQSEKDYDPFADRRQKT-------VAEKEDEYRA 261
            F  +   N+  RS     A+   D   + +    FAD  +KT       + EK +E   
Sbjct: 37  TFKERFEENERLRSRMLMVASHKTD--PTNRIIVYFADETKKTGVKPIRELTEKMEE-HD 93

Query: 262 IRRRMIISPERVDPFAEGHLAAMTPEQIQAYRWE---------REIDERNRPLTDDE 309
           I+R ++++   + PFA+  +    P  I     E          E+  ++ PLTDDE
Sbjct: 94  IQRAILVTQNVLTPFAKDAILEAAPRHIIENFLETELLVNITKHELVPKHIPLTDDE 150


>gnl|CDD|227484 COG5155, ESP1, Separase, a protease involved in sister chromatid
            separation [Cell division and chromosome partitioning /
            Posttranslational modification, protein turnover,
            chaperones].
          Length = 1622

 Score = 31.9 bits (72), Expect = 3.0
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 4/60 (6%)

Query: 1171 IAIVAETCS--PFTVLPALMNE--YRVPELNVQNGVLKALSFLFEYIGEMGKDYIYAVTP 1226
            I ++ E+    P+  LP L N+  YRVP L      L +  +     G  G+ + Y + P
Sbjct: 1389 ILVLDESTHEFPWESLPVLRNKSVYRVPSLEYLENALSSFHYQLPLQGASGRSFSYILNP 1448


>gnl|CDD|223648 COG0575, CdsA, CDP-diglyceride synthetase [Lipid metabolism].
          Length = 265

 Score = 31.1 bits (71), Expect = 3.0
 Identities = 10/57 (17%), Positives = 18/57 (31%), Gaps = 7/57 (12%)

Query: 825 AACFFIGCSMFAPTVWHLWIYTRSANANFYFGVTLAFATSQIFLLTDLLFAYLKRDY 881
           A    +   +    +  L +       N +  + L         L DL  +Y+KR  
Sbjct: 179 ALGAVLVAVLVIFLLSSLIL-------NIWTLLILGLLLVLTSQLGDLFESYIKRLL 228


>gnl|CDD|105710 PRK12514, PRK12514, RNA polymerase sigma factor; Provisional.
          Length = 179

 Score = 30.9 bits (70), Expect = 3.1
 Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 8/73 (10%)

Query: 1072 LPRLTPILKNRHEKVQENCIDLVGRIAD--RGPE--YVSAREWMRI--CFELLELLKAHK 1125
            + R   I + R  K     ID    +AD   GPE   ++  E  RI  C E  EL K   
Sbjct: 78   IARNHAIDRLRARKAVAVDIDEAHDLADPSPGPEAEVIAGDEGQRIDACLE--ELEKDRA 135

Query: 1126 KAIRRATVNTFGY 1138
             A+RRA +    Y
Sbjct: 136  AAVRRAYLEGLSY 148


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.4 bits (71), Expect = 3.4
 Identities = 20/79 (25%), Positives = 27/79 (34%), Gaps = 1/79 (1%)

Query: 723 TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLA 782
           TP    S A   SAA    +   +P      +T   +PS   P+     A          
Sbjct: 98  TPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGSERE 157

Query: 783 AMTPEQIQAYRWEREIDER 801
               E+I    +ERE  ER
Sbjct: 158 TTI-EEIMEMGYEREEVER 175


>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
          Length = 346

 Score = 31.3 bits (71), Expect = 3.6
 Identities = 17/79 (21%), Positives = 27/79 (34%), Gaps = 8/79 (10%)

Query: 653 RETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGV 712
           R      +     R +      + +R +      W ++P    +   GD   E    GG+
Sbjct: 31  RTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDSPWRTVSRAPGDETPEKADHGGI 90

Query: 713 SSK--------RRSRWDET 723
           S K        RR R +ET
Sbjct: 91  SGKSFIDPEVLRRQRAEET 109


>gnl|CDD|236227 PRK08300, PRK08300, acetaldehyde dehydrogenase; Validated.
          Length = 302

 Score = 31.3 bits (72), Expect = 3.6
 Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 10/52 (19%)

Query: 542 YARVEGREIISNLA-KAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASA 592
            A V   EI++++A K+AG  T     R    NIDE+   TT+RA   V  A
Sbjct: 144 VAPVHYAEIVASIASKSAGPGT-----R---ANIDEFTE-TTSRAIEKVGGA 186


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 31.4 bits (71), Expect = 3.7
 Identities = 49/232 (21%), Positives = 71/232 (30%), Gaps = 36/232 (15%)

Query: 551 ISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTPA 610
           +SNL  A  +A M    R     + E +  TTA    +      IPS     K       
Sbjct: 279 VSNLQVAELMACMAKNRRLSYCKVVEVIAETTAPLTPMEELLAKIPSQRVPPKE----SD 334

Query: 611 ATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRW 670
           A    + P   KP      + P                     E P   KA      + +
Sbjct: 335 AADGPK-PVPTKPVTPEAPSPPI-------------------EEEPPQPKAVVPRPLSPY 374

Query: 671 ----DETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSS--KRRSRWDETP 724
               D  P T    P   S    + K+  A          P+PG  S+  +      E  
Sbjct: 375 TAYEDLKPPTS-PIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAK 433

Query: 725 QATPSGAMTPSAATPGGMTPSTPI-TPHVG-STPLMTPSGVTPTGNKAMAMA 774
           +  P   ++P A       P++P  T   G S  + + S V    + A A A
Sbjct: 434 KTRP---LSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATA 482


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.5 bits (71), Expect = 3.8
 Identities = 61/284 (21%), Positives = 95/284 (33%), Gaps = 52/284 (18%)

Query: 540 DYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLL 599
           ++Y    GR+ + +          + +  P      E VR  T      + ++    + +
Sbjct: 153 NHYISTSGRQRVKSYTPKP--RGEVKSSSPT--QTREKVRKYTPSPLGRMVNSYAPSTFM 208

Query: 600 PFLKAGGVTPAATRWD-ETPG-----HPKPGAETPGATPSTRLWDATPGHATPGAATPGR 653
              ++ G+TP  T  D E           P   T G T  T L   T    TP   T   
Sbjct: 209 TMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDN--TPTFLTREV 266

Query: 654 ETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPG--- 710
           ET      +S + +N      + E  +  +   W    K +   P G +++ TP      
Sbjct: 267 ETDLLTSPRSVVEKNTLTTPRRVESNSSTNH--WGLVGKNNLTTPQGTVLEHTPATSEGQ 324

Query: 711 -GVSSKRRSRWDETPQATP--------SGAMTP----SAATPGGM------TPSTPITPH 751
             +S    S   ET  +T         S    P    ++AT  G+       PSTP TP 
Sbjct: 325 VTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATFRGLEKNPSTAPSTPATPR 384

Query: 752 VGSTPLMT--------PSGVTPTGNKAMAMATPTPGHLAAMTPE 787
           V +             P+   PT        TP+P    A+ PE
Sbjct: 385 VRAVLTTQVHHCVVVKPAPAVPT--------TPSPSLTTALFPE 420


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.8 bits (70), Expect = 4.2
 Identities = 17/84 (20%), Positives = 33/84 (39%), Gaps = 1/84 (1%)

Query: 89  EEKEVRKKLQDKAKDGTLKA-VPTNGEAKAAPKKRGRWDQTSDGDVTPAKKKVAAAEWEK 147
           +E+E  K          LKA +    +AK   +++G  +   D       +K+   + ++
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104

Query: 148 EADVNIEAQIRDIQNRKKDVSEGG 171
           E+D+N  A +       KD     
Sbjct: 105 ESDLNNAADLFGGPAPTKDTDLES 128


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 30.6 bits (68), Expect = 4.6
 Identities = 22/102 (21%), Positives = 29/102 (28%), Gaps = 2/102 (1%)

Query: 690 TPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPIT 749
            P T  A    +    T T   V+    +    T   T         AT  G   ST +T
Sbjct: 66  APITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVT 125

Query: 750 PHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
            +V +    T S  T   N       PT          +  A
Sbjct: 126 SNVTTRSSSTTSATTRITNA--TTLAPTLSSKGTSNATKTTA 165


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 31.0 bits (69), Expect = 4.7
 Identities = 34/166 (20%), Positives = 48/166 (28%), Gaps = 18/166 (10%)

Query: 639 ATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSG------WAETPK 692
           A PG   P A     E P  D   SS   +  +E P  +       SG       A  P 
Sbjct: 3   APPGKTGPAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPA 62

Query: 693 TDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTP---IT 749
            +    G       P   G +    ++  E    +          TP   T +T      
Sbjct: 63  KESPQKGA-----PPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAK 117

Query: 750 PHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQAYRWE 795
           P   ++ +   S VTP G        P P          +Q  + E
Sbjct: 118 PLGKNSQVRPASTVTP-GPSGKGANLPCPQKAG---SAAVQVGKQE 159


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.3 bits (70), Expect = 5.1
 Identities = 27/104 (25%), Positives = 47/104 (45%), Gaps = 13/104 (12%)

Query: 83   EQLVRGEEK----EVRKKLQD--KAKDGTLKAVPTNGEAKAAPKK---RGRWDQTSDGDV 133
            ++  + EEK    E +KK ++  KA +   KA     +A AA KK     +  + +  + 
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352

Query: 134  TPAKKKVAAAEWEKEADVNIEAQIRDIQNRKKDVSEGGGEEKQK 177
              A  +  AAE + EA    + +      +K D ++   EEK+K
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEE----AKKKADAAKKKAEEKKK 1392


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 29.8 bits (67), Expect = 5.1
 Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 3/88 (3%)

Query: 90  EKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGRWDQTSDGDVTPAKKKVAAAEWEKEA 149
           EKE  ++ + + K   +      G    A    G  D   + D        A  +W K+A
Sbjct: 47  EKERAEREEAREKKAKISPNALRG---GATAGHGEEDTDDEEDEEDFATPSAVPQWGKKA 103

Query: 150 DVNIEAQIRDIQNRKKDVSEGGGEEKQK 177
                  IR +   ++ + E   +++ +
Sbjct: 104 RKRQRKVIRKLLEAEEQLREDQYDDEDE 131


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.9 bits (70), Expect = 6.3
 Identities = 15/75 (20%), Positives = 24/75 (32%), Gaps = 4/75 (5%)

Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAA 783
               PS A   S +       + P  P         P+G  PT +     A P      A
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAP----QSATQPAGTPPTVSVDPPAAVPVNPPSTA 443

Query: 784 MTPEQIQAYRWEREI 798
               +   ++ E++I
Sbjct: 444 PQAVRPAQFKEEKKI 458


>gnl|CDD|217439 pfam03229, Alpha_GJ, Alphavirus glycoprotein J. 
          Length = 126

 Score = 29.3 bits (65), Expect = 6.6
 Identities = 21/85 (24%), Positives = 29/85 (34%), Gaps = 17/85 (20%)

Query: 550 IISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTP 609
           ++++L   A LAT                  TTA   A  A+A+   +      A   + 
Sbjct: 15  LVTDLTPKAALAT-----------------GTTATVGAANATAVATATGANATSAAAGST 57

Query: 610 AATRWDETPGHPKPGAETPGATPST 634
            A      P    P  ETPG  P T
Sbjct: 58  IAATAGAPPTDSTPAGETPGPFPPT 82


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 30.5 bits (69), Expect = 7.1
 Identities = 20/61 (32%), Positives = 27/61 (44%), Gaps = 7/61 (11%)

Query: 77  YAEVMREQLVRGEEKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGR-WDQT--SDGDV 133
           YA+  R   +  E   V+ K +  AK    K    + E  AA K  G+ W +T  SD D 
Sbjct: 336 YAKTARRMALLAEAARVKAKKEKLAK----KRKQVSKEEAAAIKAAGKAWYKTMISDSDY 391

Query: 134 T 134
           T
Sbjct: 392 T 392


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple HEAT
            repeats.
          Length = 88

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 25/90 (27%), Positives = 37/90 (41%), Gaps = 18/90 (20%)

Query: 1068 IKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGPEYVSAREWMRICFELLELLKAHKKA 1127
            ++ LL  L   L +   +V+      +G + D  PE + A         LLELLK     
Sbjct: 1    LEALLEAL---LSDPDPEVRAAAARALGELGD--PEALPA---------LLELLKDPDPE 46

Query: 1128 IRRATVNTFGYIAKAIGPHDVLATLLNNLK 1157
            +RRA     G     +G  + L  LL  L+
Sbjct: 47   VRRAAAEALG----KLGDPEALPALLELLQ 72


>gnl|CDD|235677 PRK06029, PRK06029, 3-octaprenyl-4-hydroxybenzoate carboxy-lyase;
           Provisional.
          Length = 185

 Score = 29.5 bits (67), Expect = 7.7
 Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 15/70 (21%)

Query: 447 KERKIMKLLLKIKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLM-----SPTLEDQ 501
           KER+  +L+L ++    P+    LR +T K  E GA      I+P +        TLED 
Sbjct: 113 KERR--RLVLCVRE--TPLHLGHLRNMT-KLAEMGA-----IIMPPVPAFYHRPQTLEDM 162

Query: 502 ERHLLVKVID 511
               + +V+D
Sbjct: 163 VDQTVGRVLD 172


>gnl|CDD|177460 PHA02678, PHA02678, hypothetical protein; Provisional.
          Length = 89

 Score = 28.4 bits (63), Expect = 7.7
 Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 4/53 (7%)

Query: 734 PSAATPGGM--TPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAM 784
           P    P G+      P+ P  GSTP M      P   +A+ +  PT G LAA+
Sbjct: 13  PGTRFPEGLFDRQGRPLGP--GSTPTMEYEDSVPRDARALLLVNPTAGQLAAV 63


>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
           resistance protein A).  [Transport and binding proteins,
           Other].
          Length = 334

 Score = 30.1 bits (68), Expect = 7.8
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 173 EEKQKRVGLGESGYFDSDIYDGGGKFEGYVKSIA 206
           E + K V +G+     SD+Y     FEG V  I+
Sbjct: 249 ETQLKNVRIGQPVTIRSDLYGSDVVFEGKVTGIS 282


>gnl|CDD|202816 pfam03936, Terpene_synth_C, Terpene synthase family, metal binding
           domain.  It has been suggested that this gene family be
           designated tps (for terpene synthase). It has been split
           into six subgroups on the basis of phylogeny, called
           tpsa-tpsf. tpsa includes vetispiridiene synthase, 5-epi-
           aristolochene synthase, and (+)-delta-cadinene synthase.
           tpsb includes (-)-limonene synthase. tpsc includes
           kaurene synthase A. tpsd includes taxadiene synthase,
           pinene synthase, and myrcene synthase. tpse includes
           kaurene synthase B. tpsf includes linalool synthase.
          Length = 270

 Score = 30.0 bits (68), Expect = 8.3
 Identities = 23/103 (22%), Positives = 37/103 (35%), Gaps = 19/103 (18%)

Query: 850 NANFYFGVTLAFATSQIFL---LTDLLFAYLKRDYTLENGIQKTIKGKPARLVGASEIIN 906
           N     G  L    S + +     +  F +LK                  +LV  S +I 
Sbjct: 174 NGRVSIGAPLLLLLSFLGMGDSAGEEPFEWLK---------------SRPKLVKLSSLIL 218

Query: 907 RVVDDLKD-ENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLI 948
           R+++D+   E E  R  V  SIE  M   G  + ++    Q +
Sbjct: 219 RLLNDIASYEKEMARGEVANSIECYMKEYGVTEEEALEHVQKM 261


>gnl|CDD|236386 PRK09129, PRK09129, NADH dehydrogenase subunit G; Validated.
          Length = 776

 Score = 30.3 bits (69), Expect = 8.5
 Identities = 27/83 (32%), Positives = 33/83 (39%), Gaps = 31/83 (37%)

Query: 1089 NCIDL--VGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPH 1146
            N IDL  VG +  +   Y SAR W        EL        RR          K++ PH
Sbjct: 191  NMIDLCPVGALTSKPFRY-SARTW--------EL-------SRR----------KSVSPH 224

Query: 1147 DVLATLLNNLKVQERQNRVCTTV 1169
            D L +   NL V  + NRV   V
Sbjct: 225  DSLGS---NLVVHVKNNRVMRVV 244


>gnl|CDD|226587 COG4102, COG4102, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 418

 Score = 30.2 bits (68), Expect = 9.7
 Identities = 25/123 (20%), Positives = 35/123 (28%), Gaps = 17/123 (13%)

Query: 628 PGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGH-SSG 686
           P      RL+ A         ATP R   SH   Q         +  +T    PGH  +G
Sbjct: 94  PSLPNFQRLFQAGEAAVVHAVATPYRLR-SHFDGQ---------DVLETGSAGPGHLETG 143

Query: 687 W-----AETPKTDRAGPGGDLIQETPTPGGVSSKRRS-RWDETPQATPSGAMTPSAATPG 740
           W        P        G L      P  +  K  +  W  +P    +  +        
Sbjct: 144 WLNRLLEALPAGAAFTTAGGLSVGPTAPLVLRGKAPALSWAPSPLFLSAARLAMLLNDLY 203

Query: 741 GMT 743
           G+ 
Sbjct: 204 GVL 206


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.135    0.402 

Gapped
Lambda     K      H
   0.267   0.0630    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 72,877,320
Number of extensions: 7599583
Number of successful extensions: 8484
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8311
Number of HSP's successfully gapped: 149
Length of query: 1359
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1250
Effective length of database: 6,103,016
Effective search space: 7628770000
Effective search space used: 7628770000
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)