RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16259
         (896 letters)



>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 51.3 bits (123), Expect = 1e-06
 Identities = 29/185 (15%), Positives = 52/185 (28%), Gaps = 9/185 (4%)

Query: 43  GVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHIT 102
           GV+      SP  P P         L  P        +         +      ++P  T
Sbjct: 43  GVAEFPWDPSPQAPPPVAQLP--QPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPAT 100

Query: 103 QHTGPGIFQSP--PPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVL 160
                G            +  +  P   + + ++  + Q     +  Q         +  
Sbjct: 101 FALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQ 160

Query: 161 SPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTS--QSSQQQSDLIKTNGTGDVNPDEF 218
            PS  Q   + D+   S +Q  P  +   QQ D +    S+    L + +GT +      
Sbjct: 161 LPSQQQSAQKNDE---SQLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGA 217

Query: 219 NQLLA 223
               A
Sbjct: 218 EGGGA 222


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 48.9 bits (116), Expect = 9e-06
 Identities = 83/428 (19%), Positives = 135/428 (31%), Gaps = 49/428 (11%)

Query: 377 SIDRPYCCSVCMKTFARKEHLENHTRCHTGETPYKCQY--CGKNFSRKEHMVNHVRKHTG 434
           +  RP  C  C  +F+R EHL  H R HTGE P +C Y  C K+FSR   +  H+R H  
Sbjct: 29  NAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88

Query: 435 ETPHRCEICKKAFTRK-------------------EHYVNHVMWHTGETPHVCQVC--GK 473
                          K                     +         + P +  +     
Sbjct: 89  NPSDLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRN 148

Query: 474 KYTRKEHLANHMRSHTNDTPFKCEICTKSFTRKEHFTNHIMW--HTGETPHRCDFCSKTF 531
                 + ++     +N         + S     + +  I     T       +    + 
Sbjct: 149 NPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSPLSSS 208

Query: 532 TRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTRKE 591
                  +        S       ++   +     +     + DS    +   +S     
Sbjct: 209 YSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTA 268

Query: 592 HLVNHVRQHTGE-------TPYQCQYCPKAFSRKDHLVNHVRQ--HTGES--PHKC--SY 638
              +     +          P + + C  +FSR   L  H+R   H+GES  P  C  S 
Sbjct: 269 SSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSL 328

Query: 639 CKQSFTRKDHLNNHVRLHTGESPHKCHF--CSKSFTRKEHLNNHIRLHTGESPHKCD--- 693
           C + F+R D L  H+ LHT  SP K      S  F+   +      L   +         
Sbjct: 329 CGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSE 388

Query: 694 ----FCLKTFTRKEHLTNHLKQHAGETEHVCN--ICSKPFLKKEHLVNHSRSHTGERPFA 747
                C++ F R  +L+ H+  H     + C    CSK F +  +L+ H + HT   P  
Sbjct: 389 TLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLL 448

Query: 748 CTDCGKSF 755
           C+      
Sbjct: 449 CSILKSFR 456



 Score = 42.8 bits (100), Expect = 6e-04
 Identities = 59/325 (18%), Positives = 101/325 (31%), Gaps = 22/325 (6%)

Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNY--CAKSFTRKEHLVNHVRQHTGDSP 577
            P  C  C+ +F+R EHL  H+R HTGE P +C+Y  C KSF+R   L  H+R H  +  
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 578 HKCNYCAKS--FTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHVRQHTGESPHK 635
              +              ++                    S +D  +  +   +    + 
Sbjct: 92  DLNSKSLPLSNSKASSSSLSSSS-SNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNP 150

Query: 636 CSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFC 695
                 S       N+         P   +  SK  +    L     + T       +  
Sbjct: 151 LPGNNSSSVNTPQSNSL------HPPLPANSLSKDPSSNLSLLISSNVSTSIPSSSENSP 204

Query: 696 LKTFTRKEHLTNHLKQHAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSF 755
           L +       ++        +       S   L  + L++ S S       + +      
Sbjct: 205 LSSSYSIPSSSSDQNLENSSSSLPLTTNS--QLSPKSLLSQSPSSLSSSDSSSSASESPR 262

Query: 756 PLKGNLLFHQRSHRKQNGEVLER---PFKCDVCPKDFTCKGHLVSHLRSHL----NERPH 808
                      S  + +    +    P K   C   F+    L  HLRS      + +P 
Sbjct: 263 SSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPF 322

Query: 809 VCPH--CSKSFVEHTCMIKHIRKNH 831
            CP+  C K F  +  + +HI  + 
Sbjct: 323 SCPYSLCGKLFSRNDALKRHILLHT 347



 Score = 40.8 bits (95), Expect = 0.003
 Identities = 77/429 (17%), Positives = 134/429 (31%), Gaps = 33/429 (7%)

Query: 436 TPHRCEICKKAFTRKEHYVNHVMWHTGETPHVCQV--CGKKYTRKEHLANHMRSHTNDTP 493
            P  C  C  +F+R EH   H+  HTGE P  C    C K ++R   L+ H+R+H N+  
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 494 FKCEIC---TKSFTRKEHFTNHIMWHTGETPHRCDFCSKTFTRKEHLLNHVRQHTGESPH 550
                    + S       ++                S   + ++  L  +   +    +
Sbjct: 92  DLNSKSLPLSNSKASSSSLSSSSSNSNDNNL--LSSHSLPPSSRDPQLPDLLSISNLRNN 149

Query: 551 KCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQY 610
                  S        +       +S  K      S     ++   +             
Sbjct: 150 PLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSS------SENS 203

Query: 611 CPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKD------HLNNHVRLHTGESPHKC 664
              +        +        S    +   Q   +         L++     +     + 
Sbjct: 204 PLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263

Query: 665 HFCSKSFTRKEHLNNHIRLHTGES-PHKCDFCLKTFTRKEHLTNHLKQH----AGETEHV 719
              + S        +      G S P K   C  +F+R   LT HL+             
Sbjct: 264 SLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFS 323

Query: 720 C--NICSKPFLKKEHLVNHSRSHTGERPFAC--TDCGKSFPLKGNLLFHQRSHRKQNGEV 775
           C  ++C K F + + L  H   HT   P      +    F    N    Q S ++     
Sbjct: 324 CPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQ-SLQQYKDLK 382

Query: 776 LERPFKCDV--CPKDFTCKGHLVSHLRSHLNERPHVC--PHCSKSFVEHTCMIKHIRKNH 831
            ++  +     C ++F    +L  H+ +HL+ RP+ C  P CSKSF  H  +I H + + 
Sbjct: 383 NDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT 442

Query: 832 TLANVNVSP 840
             A +  S 
Sbjct: 443 NHAPLLCSI 451



 Score = 37.4 bits (86), Expect = 0.032
 Identities = 52/180 (28%), Positives = 74/180 (41%), Gaps = 18/180 (10%)

Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQ--HTGES--PHKC--NYCAKSFTRKEHLVNHVRQHT 573
            P +   C+ +F+R   L  H+R   H+GES  P  C  + C K F+R + L  H+  HT
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347

Query: 574 GDSPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQY---------CPKAFSRKDHLVNH 624
             SP K      S      L N   Q   +                C + F R  +L  H
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLH 407

Query: 625 VRQHTGESPH--KCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIR 682
           +  H    P+  K   C +SF R  +L  H ++HT  +P       KSF R   L+NH +
Sbjct: 408 IITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL-LCSILKSFRRDLDLSNHGK 466


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 46.3 bits (110), Expect = 7e-05
 Identities = 39/204 (19%), Positives = 51/204 (25%), Gaps = 35/204 (17%)

Query: 6   ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
            L +  A  +          PP        PP          F    P +  P +     
Sbjct: 155 SLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQ------AAFPQQGPPEQPPGYPQPPQ 208

Query: 66  TSLTSPHQ---MPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP-----TS 117
                      +P PS     PP P        Q  P + Q   PG+ Q  PP       
Sbjct: 209 GHPEQVQPQQFLPAPSQAPAQPPLPPQL----PQQPPPLQQPQFPGLSQQMPPPPPQPPQ 264

Query: 118 HISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHS 177
              + P P +  P  +      G   G N       Q               Q       
Sbjct: 265 QQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQP--------------QLLPLVQQ 310

Query: 178 TVQSPPGQQQQYQQMDTSQSSQQQ 201
                 G Q + Q +   Q SQQQ
Sbjct: 311 PQGQQRGPQFREQLV---QLSQQQ 331



 Score = 39.8 bits (93), Expect = 0.007
 Identities = 33/191 (17%), Positives = 42/191 (21%), Gaps = 26/191 (13%)

Query: 53  PSQPSPQHGHHTLTSL-------TSPHQMP-PPSPIATSPPPPSASDKHDSQSSPHITQH 104
            SQP PQ     + SL           Q P  P P     P                 Q 
Sbjct: 141 ESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMP------PRQAAFPQQ 194

Query: 105 TGPGI---FQSPPPTSHISKIPSPSSSSPSSSSQLM----QQGGGGGSNGQTHLHIQNIG 157
             P     +  PP        P     +PS +        Q         Q      +  
Sbjct: 195 GPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQ 254

Query: 158 AVLSPSY-GQHCIQQDDTQHST----VQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGD 212
               P    Q   Q    Q         +P     Q Q        Q Q   +     G 
Sbjct: 255 MPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQ 314

Query: 213 VNPDEFNQLLA 223
               +F + L 
Sbjct: 315 QRGPQFREQLV 325



 Score = 39.0 bits (91), Expect = 0.014
 Identities = 21/157 (13%), Positives = 30/157 (19%), Gaps = 9/157 (5%)

Query: 48  FISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP 107
            +  +  +P PQ                 P P   +    S  +                
Sbjct: 124 SLWGTAPKPEPQP---------PQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ 174

Query: 108 GIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQH 167
              Q  P      +   P    P       Q   G     Q    +       +      
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234

Query: 168 CIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDL 204
            + Q        Q P   QQ           QQQ   
Sbjct: 235 QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQ 271


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 45.7 bits (108), Expect = 1e-04
 Identities = 31/142 (21%), Positives = 45/142 (31%), Gaps = 12/142 (8%)

Query: 2    PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
            P Q +    P             PPPP   R   P +P T     G    S + P P  G
Sbjct: 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG--EPSGAVPQPWLG 2965

Query: 62   HHTLTSLTSPH-QMPPPSPIATSPPPPSASDKHDSQS-----SPHITQHTGPGIFQSPPP 115
                  +  P  ++P P+P   +P   +      S S     +  +  H        PPP
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEET----DPPP 3021

Query: 116  TSHISKIPSPSSSSPSSSSQLM 137
             S    +  P  +  S +  L 
Sbjct: 3022 VSLKQTLWPPDDTEDSDADSLF 3043



 Score = 44.2 bits (104), Expect = 4e-04
 Identities = 23/134 (17%), Positives = 38/134 (28%), Gaps = 14/134 (10%)

Query: 11   PASHKTSTSMEVDTPPPPSSSRGSEP--------PSPNTGGVSGGFISTSPSQPSPQHGH 62
            P     +       PP P+++R + P        P+   G  + G  +     P+     
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT---- 2763

Query: 63   HTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISK 121
                   +P   P   P      P  AS     +S P       P      P      + 
Sbjct: 2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822

Query: 122  IPSPSSSSPSSSSQ 135
             P+     P+S+  
Sbjct: 2823 SPAGPLPPPTSAQP 2836



 Score = 41.5 bits (97), Expect = 0.003
 Identities = 27/138 (19%), Positives = 38/138 (27%), Gaps = 14/138 (10%)

Query: 8    SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVS--GGFISTSPSQPSPQH----G 61
            +  PA      +    T PPP       P  P  G V+  G      PS+          
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPA 2879

Query: 62   HHTLTSLTSPHQMPPPSPIA----TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
               +  L  P         A        PP        Q  P       P   Q PPP  
Sbjct: 2880 RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP---QPPPPPP 2936

Query: 118  HISKIPSPSSSSPSSSSQ 135
               + P   ++ P+ + +
Sbjct: 2937 PRPQPPLAPTTDPAGAGE 2954



 Score = 41.1 bits (96), Expect = 0.004
 Identities = 24/109 (22%), Positives = 33/109 (30%), Gaps = 5/109 (4%)

Query: 25   PPPPSSSRGSEPP--SPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
             PP +  R   PP  +P     +    +  P  P PQ        +        P+P + 
Sbjct: 2559 APPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAP-PQSAR-PRAPVDDRGDPRGPAPPSP 2616

Query: 83   SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPS 131
             PP   A D      SP   +         PPP       P+P   S  
Sbjct: 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP-RDDPAPGRVSRP 2664



 Score = 39.5 bits (92), Expect = 0.010
 Identities = 26/125 (20%), Positives = 36/125 (28%), Gaps = 14/125 (11%)

Query: 11   PASHKTSTSMEVDTPPPPSSSRGSEPP--SPNTGGVSGG---FISTSPSQPSPQHGHHTL 65
            PA+++         PPP        P   S        G     S+ P +P  +    T+
Sbjct: 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV 2692

Query: 66   TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
             SLTS    PPP P     P                    GP   +   P    +  P  
Sbjct: 2693 GSLTSLADPPPPPPTPEPAPHA---------LVSATPLPPGPAAARQASPALPAAPAPPA 2743

Query: 126  SSSSP 130
              + P
Sbjct: 2744 VPAGP 2748



 Score = 38.8 bits (90), Expect = 0.019
 Identities = 26/111 (23%), Positives = 37/111 (33%), Gaps = 20/111 (18%)

Query: 26  PPPSSSRGSEP--PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
             P+  R S     +P   G  GG   T P+ P P          + P   P P P +  
Sbjct: 379 SLPTRKRRSARHAATPFARG-PGGDDQTRPAAPVPA---------SVPTPAPTPVPASAP 428

Query: 84  PPP--PSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
           PPP  P  S +  S   P       P   + PP  +       P  ++  +
Sbjct: 429 PPPATPLPSAEPGSDDGP------APPPERQPPAPATEPAPDDPDDATRKA 473



 Score = 37.2 bits (86), Expect = 0.051
 Identities = 26/137 (18%), Positives = 43/137 (31%), Gaps = 16/137 (11%)

Query: 19   SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
            S+       PS    ++PP+      +    + SP+ P P       T+   P   PPPS
Sbjct: 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849

Query: 79   -----------PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP-----PPTSHISKI 122
                        +   PP  S + K  + + P + +   P + +S      PP       
Sbjct: 2850 LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909

Query: 123  PSPSSSSPSSSSQLMQQ 139
               +   P    Q    
Sbjct: 2910 QPQAPPPPQPQPQPPPP 2926



 Score = 36.8 bits (85), Expect = 0.070
 Identities = 26/126 (20%), Positives = 36/126 (28%), Gaps = 16/126 (12%)

Query: 11   PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
                  +T      P  P ++ G   P+P     +G           P     + +  + 
Sbjct: 2743 AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG----PPRRLTRPAVASLSESRESL 2798

Query: 71   PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
            P    P  P A    P +A     S + P             PPPTS     P P    P
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGP------------LPPPTSAQPTAPPPPPGPP 2846

Query: 131  SSSSQL 136
              S  L
Sbjct: 2847 PPSLPL 2852



 Score = 36.5 bits (84), Expect = 0.085
 Identities = 23/135 (17%), Positives = 35/135 (25%), Gaps = 21/135 (15%)

Query: 10   FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
                   S        P P       PPSP+               P  +          
Sbjct: 2608 PRGPAPPSPLPPDTHAPDPP------PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661

Query: 70   SPHQ---MPPPSPIATSPP----PPSASDKHDSQSS---PHITQHTGPGIFQSPPPTSHI 119
            S  +       +  A+SPP      +A     S +S   P       P     P P + +
Sbjct: 2662 SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP-----PPPPTPEPAPHALV 2716

Query: 120  SKIPSPSSSSPSSSS 134
            S  P P   + +  +
Sbjct: 2717 SATPLPPGPAAARQA 2731



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 24/118 (20%), Positives = 36/118 (30%), Gaps = 9/118 (7%)

Query: 25   PPPPSSSRGSEPPS-PNTGGVSGGFISTSPSQPSPQHGHHTLTSLT-------SPHQMPP 76
                S  +     +   T G         P  P+P+   H L S T       +  Q  P
Sbjct: 2674 AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733

Query: 77   PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
              P A +PP   A        +      T  G     PP +  +  P    + P+ +S
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG-PPRRLTRPAVAS 2790



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 25/121 (20%), Positives = 37/121 (30%), Gaps = 11/121 (9%)

Query: 24   TPPPPSSSRG--SEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS--P 79
             PP P+      + PP   T         +  S PSP        ++ +P    PP+  P
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP 2824

Query: 80   IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP-------PPTSHISKIPSPSSSSPSS 132
                PPP SA                  G   +P       PP+   +  P+  +  P  
Sbjct: 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884

Query: 133  S 133
             
Sbjct: 2885 R 2885



 Score = 35.3 bits (81), Expect = 0.21
 Identities = 23/133 (17%), Positives = 39/133 (29%), Gaps = 15/133 (11%)

Query: 11   PASHKTSTSMEVDTPPPPSSSRGSEPP-SPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
            PA+ + ++      P PP+   G   P  P          +  P  P+P           
Sbjct: 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAA----G 2778

Query: 70   SPHQMPPPS--------PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
             P ++  P+            SP  P+         +  +     P     PP ++  + 
Sbjct: 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838

Query: 122  IPSPSSSSPSSSS 134
             P P    P S  
Sbjct: 2839 PPPPPGPPPPSLP 2851



 Score = 34.1 bits (78), Expect = 0.48
 Identities = 25/106 (23%), Positives = 38/106 (35%), Gaps = 17/106 (16%)

Query: 12   ASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSP 71
            +S  +S ++  +T PPP S + +  P  +T       +  S S+ S             P
Sbjct: 3005 SSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSD-------LEALDP 3057

Query: 72   HQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
                P  P A  P P +      ++ SP      G      PPP S
Sbjct: 3058 LPPEPHDPFAHEPDPATPEA--GARESPS--SQFG------PPPLS 3093



 Score = 32.2 bits (73), Expect = 1.5
 Identities = 23/117 (19%), Positives = 36/117 (30%), Gaps = 19/117 (16%)

Query: 23   DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP----- 77
            D PPPP      EP            +S +P  P P        +L +    P       
Sbjct: 2700 DPPPPP---PTPEPAPHAL-------VSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749

Query: 78   SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS----KIPSPSSSSP 130
            +P   + P    +       +P      GP    + P  + +S     +PSP   + 
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPAD 2806



 Score = 31.8 bits (72), Expect = 2.1
 Identities = 26/118 (22%), Positives = 37/118 (31%), Gaps = 25/118 (21%)

Query: 27  PPSSSRGSEPPSPNTGGVSGGFISTSPSQPS----PQHGHHTLTSLTSPHQMPPPSPIAT 82
            P ++RG+  P P              + P+    P  G        +P  +P P     
Sbjct: 268 APETARGATGPPP----------PPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPD--- 314

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH----ISKIPSPSSSSPSSSSQL 136
            PPP  A D  +        +   P     P P  H      K   P+ + PSS   L
Sbjct: 315 PPPPAPAGDAEEEDDEDGAMEVVSP----LPRPRQHYPLGFPKRRRPTWTPPSSLEDL 368



 Score = 31.1 bits (70), Expect = 3.3
 Identities = 24/120 (20%), Positives = 34/120 (28%), Gaps = 12/120 (10%)

Query: 26   PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH-----HTLTSLTSPHQMPPPSPI 80
            P P      +P +P         I        P H         L  L S     PP P+
Sbjct: 2496 PDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPL 2555

Query: 81   ATSPPPPSASDKHDSQSSPHITQHTGPGIFQ-----SPPPTSHISKIPSPSSSSPSSSSQ 135
              + PP  A D+      P   + + P +         PP S   + P      P   + 
Sbjct: 2556 PPAAPPA-APDRSVPPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAP 2613


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 44.3 bits (105), Expect = 2e-04
 Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 23/126 (18%)

Query: 17  STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
           S   E   P   S    S+PPS + G          PS PSP     + +  + P   P 
Sbjct: 171 SFPGEDADPASASP---SDPPSSSPG---------VPSFPSPPEDPSSPSDSSLP---PA 215

Query: 77  PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           PS   +  PPPS      + S P       PG    PPP        S +  +P S+S  
Sbjct: 216 PSSFQSDTPPPSPESPT-NPSPP-------PGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267

Query: 137 MQQGGG 142
               GG
Sbjct: 268 PAPIGG 273



 Score = 43.1 bits (102), Expect = 4e-04
 Identities = 26/111 (23%), Positives = 33/111 (29%), Gaps = 12/111 (10%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
             PPSSS G           S    S+ P  PS         S  SP    PP   A  P
Sbjct: 185 SDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPP 244

Query: 85  PPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
           PPP       S + P            +PP  S         +    + ++
Sbjct: 245 PPPVQQVPPLSTAKP------------TPPSASATPAPIGGITLDDDAIAK 283



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 29/130 (22%), Positives = 43/130 (33%), Gaps = 28/130 (21%)

Query: 6   ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
           E      +   S +        P+S+  S+PPS + G          PS PSP     + 
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPG---------VPSFPSPPEDPSSP 207

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
           +  + P   P PS   +  PPPS     +                 SPPP       P  
Sbjct: 208 SDSSLP---PAPSSFQSDTPPPSPESPTN----------------PSPPPGPAAPPPPPV 248

Query: 126 SSSSPSSSSQ 135
               P S+++
Sbjct: 249 QQVPPLSTAK 258



 Score = 37.4 bits (87), Expect = 0.024
 Identities = 27/159 (16%), Positives = 35/159 (22%), Gaps = 25/159 (15%)

Query: 24  TPPPPSSSRGSEPPSPNTGG-VSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
             P P      E    +     S        + P+            SP   P  SP   
Sbjct: 146 EDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPAS----------ASPSDPPSSSPGVP 195

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
           S P P       S SS              P P+S  S  P PS    S ++     G  
Sbjct: 196 SFPSPPEDPSSPSDSSL------------PPAPSSFQSDTPPPSP--ESPTNPSPPPGPA 241

Query: 143 GGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQS 181
                          A  +P                  +
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDA 280



 Score = 33.1 bits (76), Expect = 0.56
 Identities = 19/88 (21%), Positives = 28/88 (31%), Gaps = 7/88 (7%)

Query: 9   QFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHT-LTS 67
            FP+  +  +S    + PP  SS  S+ P P+          T+PS P            
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPES------PTNPSPPPGPAAPPPPPVQ 249

Query: 68  LTSPHQMPPPSPIATSPPPPSASDKHDS 95
              P     P+P + S  P         
Sbjct: 250 QVPPLSTAKPTPPSASATPAPIGGITLD 277


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 26/116 (22%), Positives = 39/116 (33%), Gaps = 5/116 (4%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFIS--TSPSQPSPQHGHHTLTSLTSPHQ---MPPPS 78
            PP     R +   SP++       IS    P   +P  G    +S  S      +  PS
Sbjct: 149 GPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPS 208

Query: 79  PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
            +  +   P       S+SS   +      I     P S  S   +P+S S S + 
Sbjct: 209 NLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPTSGSSSINP 264


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 42.9 bits (101), Expect = 7e-04
 Identities = 32/147 (21%), Positives = 46/147 (31%), Gaps = 30/147 (20%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH----HTLTSLTS 70
           KTST ME    PP  S+  S P +P+T        S +PS     +          SL S
Sbjct: 452 KTSTVME----PPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLIS 507

Query: 71  P-----HQMPPPSPIATSPPPPSAS---------------DKHDSQSSPHITQHTGPGIF 110
                 H+   P  + T P  P+AS                  D  S   I   +     
Sbjct: 508 KSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSV 567

Query: 111 --QSPPPTSHISKIPSPSSSSPSSSSQ 135
                      +       S+PS++ +
Sbjct: 568 PLVRVFDIHLRASTTKGRHSTPSTNEK 594


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 41.3 bits (97), Expect = 0.003
 Identities = 21/133 (15%), Positives = 32/133 (24%), Gaps = 12/133 (9%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTS-PSQPSPQHGHHTLTSLT 69
           PAS  +S   E    P PSS      PS      S      S  S  S            
Sbjct: 284 PASSSSSPR-ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSS--------E 334

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
           S          + S  P  +     +  S    +           P +   +     + +
Sbjct: 335 SSRGAAVSPGPSPSRSPSPSRPPPPADPSS--PRKRPRPSRAPSSPAASAGRPTRRRARA 392

Query: 130 PSSSSQLMQQGGG 142
             +     +   G
Sbjct: 393 AVAGRARRRDATG 405



 Score = 40.9 bits (96), Expect = 0.003
 Identities = 27/117 (23%), Positives = 40/117 (34%), Gaps = 16/117 (13%)

Query: 23  DTPPPPSSSRGSEP-----PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
            + P P+SS  S       PSP++ G SG   S+  +  S      + +S TS       
Sbjct: 279 SSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSSPRASSSSSSSRESSSSSTSS-SSESS 336

Query: 78  SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
              A SP P  +     S+  P               P        +PSS + S+  
Sbjct: 337 RGAAVSPGPSPSRSPSPSRPPP---------PADPSSPRKRPRPSRAPSSPAASAGR 384



 Score = 40.5 bits (95), Expect = 0.005
 Identities = 19/127 (14%), Positives = 39/127 (30%), Gaps = 8/127 (6%)

Query: 10  FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH--GHHTLTS 67
              +   +         PP       P + +    +    + +P+ P+ +        +S
Sbjct: 54  TVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSS 113

Query: 68  LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
              P   PPP+    SP P  +       S       + P         S  + + S ++
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPP-----AAGASP-AAVASDAA 167

Query: 128 SSPSSSS 134
           SS  ++ 
Sbjct: 168 SSRQAAL 174



 Score = 40.2 bits (94), Expect = 0.007
 Identities = 27/139 (19%), Positives = 35/139 (25%), Gaps = 28/139 (20%)

Query: 9   QFPASHKTSTSM-EVDTPPPPSSSRGSEPPSPN--------TGGVSGGFISTSPSQPSPQ 59
                   +        PPPP+    S PPSP           G  G   + SP      
Sbjct: 99  SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158

Query: 60  HGHHTLTSLTSPHQMPP---PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
                  + +S     P   P   A +P  P A                 P     P   
Sbjct: 159 PAAVASDAASSRQAALPLSSPEETARAPSSPPAE----------------PPPSTPPAAA 202

Query: 117 SHISKIPSPSSSSPSSSSQ 135
           S      S   S+ +SS  
Sbjct: 203 SPRPPRRSSPISASASSPA 221



 Score = 39.4 bits (92), Expect = 0.009
 Identities = 16/130 (12%), Positives = 37/130 (28%), Gaps = 4/130 (3%)

Query: 6   ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
           E        +         P    ++         +G  +     T+  + S  H     
Sbjct: 770 EALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGG 829

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
           +  + P + P  +  A  PP  S+       ++         G  +  PP       P  
Sbjct: 830 SESSGPARPPGAA--ARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEP--RARPGA 885

Query: 126 SSSSPSSSSQ 135
           ++   ++++ 
Sbjct: 886 AAPPKAAAAA 895



 Score = 38.2 bits (89), Expect = 0.026
 Identities = 27/114 (23%), Positives = 36/114 (31%), Gaps = 14/114 (12%)

Query: 22  VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
           +  P     + G   PS   G  S    S     PSP          + P    P +  +
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSS-SSPRERSPSPSPSS----PGSGPAPSSPRASSS 317

Query: 82  TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS-SSPSSSS 134
           +S    S+S    S S         PG    P P+      PSPS    P+  S
Sbjct: 318 SSSSRESSSSSTSSSSESSRGAAVSPG----PSPSRS----PSPSRPPPPADPS 363



 Score = 36.7 bits (85), Expect = 0.065
 Identities = 9/67 (13%), Positives = 17/67 (25%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
             PP  +  S P        +    + S  +P+ +     +             P     
Sbjct: 354 SRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413

Query: 85  PPPSASD 91
           P P  + 
Sbjct: 414 PSPLDAG 420



 Score = 36.3 bits (84), Expect = 0.096
 Identities = 22/157 (14%), Positives = 37/157 (23%), Gaps = 28/157 (17%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
            ++   S  P    +  +         S +  S   +            T+ +  SP   
Sbjct: 137 MLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE----ETARAPSSPPAE 192

Query: 62  HHTLTS--LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHIT----------------- 102
               T     SP      SPI+ S   P+ +    +      +                 
Sbjct: 193 PPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPE 252

Query: 103 -----QHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
                    P    +    +     PS      SSSS
Sbjct: 253 NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSS 289



 Score = 35.9 bits (83), Expect = 0.10
 Identities = 35/143 (24%), Positives = 47/143 (32%), Gaps = 18/143 (12%)

Query: 11  PASHKTSTSMEVDTPPPP------SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHT 64
           PA    ST     +P PP      S+S  S  P+P          S+S S  S   G   
Sbjct: 190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGW 249

Query: 65  LTSLTSPHQMPPPSPIATSP------------PPPSASDKHDSQSSPHITQHTGPGIFQS 112
                 P   P P  + T              P P++S     + SP  +  +       
Sbjct: 250 GPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP 309

Query: 113 PPPTSHISKIPSPSSSSPSSSSQ 135
             P +  S   S  SSS S+SS 
Sbjct: 310 SSPRASSSSSSSRESSSSSTSSS 332



 Score = 33.6 bits (77), Expect = 0.53
 Identities = 28/131 (21%), Positives = 38/131 (29%), Gaps = 9/131 (6%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFISTS--PSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
           P P ++ G       +G  S G + +                     P   PPP P   +
Sbjct: 22  PRPPATPGDAADDLLSG--SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEA 79

Query: 84  PPPPSASDKHDSQS-----SPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQ 138
           P   S S    S S     SP       P    SP P        SP  S     S++++
Sbjct: 80  PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLR 139

Query: 139 QGGGGGSNGQT 149
             G  G     
Sbjct: 140 PVGSPGPPPAA 150



 Score = 33.6 bits (77), Expect = 0.59
 Identities = 24/106 (22%), Positives = 33/106 (31%), Gaps = 9/106 (8%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
            PPP  SS  S+      GG + G       +P          +        PP+     
Sbjct: 844 RPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAP 903

Query: 84  PPPPSASDKHDSQSSPHITQHTGP---GIFQSPPPTSHISKIPSPS 126
            P P+   K      P      GP   G F+  PP    +  PS +
Sbjct: 904 RPRPAPRVK-LGPMPPG-----GPDPRGGFRRVPPGDLHTPAPSAA 943


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 35.8 bits (83), Expect = 0.003
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 648 HLNNHVRLHTGESPHKCHFCSKSFTR 673
           +L  H+R HTGE P+KC  C KSF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 35.4 bits (82), Expect = 0.004
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 676 HLNNHIRLHTGESPHKCDFCLKTFTR 701
           +L  H+R HTGE P+KC  C K+F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 35.4 bits (82), Expect = 0.004
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 480 HLANHMRSHTNDTPFKCEICTKSFTR 505
           +L  HMR+HT + P+KC +C KSF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 35.1 bits (81), Expect = 0.005
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 592 HLVNHVRQHTGETPYQCQYCPKAFSR 617
           +L  H+R HTGE PY+C  C K+FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 34.3 bits (79), Expect = 0.008
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 732 HLVNHSRSHTGERPFACTDCGKSF 755
           +L  H R+HTGE+P+ C  CGKSF
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSF 24



 Score = 34.3 bits (79), Expect = 0.008
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 536 HLLNHVRQHTGESPHKCNYCAKSFTR 561
           +L  H+R HTGE P+KC  C KSF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 34.3 bits (79), Expect = 0.009
 Identities = 16/26 (61%), Positives = 18/26 (69%)

Query: 396 HLENHTRCHTGETPYKCQYCGKNFSR 421
           +L  H R HTGE PYKC  CGK+FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 33.9 bits (78), Expect = 0.010
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 424 HMVNHVRKHTGETPHRCEICKKAFTR 449
           ++  H+R HTGE P++C +C K+F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 33.1 bits (76), Expect = 0.019
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 794 HLVSHLRSHLNERPHVCPHCSKSFV 818
           +L  H+R+H  E+P+ CP C KSF 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 33.1 bits (76), Expect = 0.020
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 620 HLVNHVRQHTGESPHKCSYCKQSFTR 645
           +L  H+R HTGE P+KC  C +SF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 33.1 bits (76), Expect = 0.025
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 564 HLVNHVRQHTGDSPHKCNYCAKSFTR 589
           +L  H+R HTG+ P+KC  C KSF+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 32.4 bits (74), Expect = 0.037
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 508 HFTNHIMWHTGETPHRCDFCSKTFTR 533
           +   H+  HTGE P++C  C K+F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 32.4 bits (74), Expect = 0.039
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 368 SLQRHTKRHSIDRPYCCSVCMKTFAR 393
           +L+RH + H+ ++PY C VC K+F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 30.8 bits (70), Expect = 0.15
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 315 VHRRYHTERKPFTCHVCGKAF 335
            H R HT  KP+ C VCGK+F
Sbjct: 4   RHMRTHTGEKPYKCPVCGKSF 24



 Score = 28.5 bits (64), Expect = 0.89
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 452 HYVNHVMWHTGETPHVCQVCGKKYTR 477
           +   H+  HTGE P+ C VCGK ++ 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 27.4 bits (61), Expect = 2.7
 Identities = 13/32 (40%), Positives = 17/32 (53%), Gaps = 6/32 (18%)

Query: 760 NLLFHQRSHRKQNGEVLERPFKCDVCPKDFTC 791
           NL  H R+H        E+P+KC VC K F+ 
Sbjct: 1   NLRRHMRTHTG------EKPYKCPVCGKSFSS 26



 Score = 27.0 bits (60), Expect = 3.4
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 704 HLTNHLKQHAGETEHVCNICSKPF 727
           +L  H++ H GE  + C +C K F
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSF 24


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 40.7 bits (96), Expect = 0.003
 Identities = 24/98 (24%), Positives = 30/98 (30%), Gaps = 6/98 (6%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
            I+  L     S   S S    TPPPP  S     P+P     +      +P  P P   
Sbjct: 505 SIKLNLE----SQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTAT 560

Query: 62  HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSP 99
               +S          SP    P  P+ S   DS    
Sbjct: 561 Q--ASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596



 Score = 36.9 bits (86), Expect = 0.051
 Identities = 26/108 (24%), Positives = 34/108 (31%), Gaps = 24/108 (22%)

Query: 29  SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
            S  GS   +  T           P Q SP        + T P   P  +    +PPPP 
Sbjct: 511 ESQSGSASNTAKT---------PPPPQKSPPP-----PAPTPPLPQPTATAPPPTPPPP- 555

Query: 89  ASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
                     P  TQ +     Q P  +S    IP   + SP+  S  
Sbjct: 556 ---------PPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSP 594



 Score = 31.1 bits (71), Expect = 3.3
 Identities = 19/95 (20%), Positives = 31/95 (32%), Gaps = 14/95 (14%)

Query: 48  FISTSPSQPSPQHGHHTLTSLTSPHQMPPPSP-IATSPPPPSASDKHDSQSSPHITQHTG 106
           FIS   +  +P +          P  + P +P    +   PS      S   P       
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIP------- 416

Query: 107 PGIFQSPPPTSHISKIPSPSSSSPSSS--SQLMQQ 139
                   PT      P+ ++++P S    +L QQ
Sbjct: 417 ----VPAEPTEPSPTPPANAANAPPSLNLEELWQQ 447


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 40.0 bits (94), Expect = 0.006
 Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 7/136 (5%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSP-NTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
           PA   +    E   P  P++      P+P           +      +    H    ++ 
Sbjct: 601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP 660

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
                    P       P+A       +          G   +P   +       P+  +
Sbjct: 661 DASDGGDGWPAKAGGAAPAA----PPPAPAPAAPAAPAG--AAPAQPAPAPAATPPAGQA 714

Query: 130 PSSSSQLMQQGGGGGS 145
              ++Q  Q   G  +
Sbjct: 715 DDPAAQPPQAAQGASA 730



 Score = 39.2 bits (92), Expect = 0.012
 Identities = 14/114 (12%), Positives = 36/114 (31%), Gaps = 2/114 (1%)

Query: 22  VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
                 PS++  +   +P     +    + +   P+         +       P  +  A
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAP--AAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPA 450

Query: 82  TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
              P P  +    +Q +P       P    +P P +  +   +P++ +  ++  
Sbjct: 451 GGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504



 Score = 38.0 bits (89), Expect = 0.027
 Identities = 22/129 (17%), Positives = 35/129 (27%), Gaps = 4/129 (3%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
              H             P+ + G+ P +P          + + + P+        T    
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712

Query: 71  PHQMPPPSP----IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPS 126
               P   P       S P P+A D       P           Q PPP +        +
Sbjct: 713 QADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772

Query: 127 SSSPSSSSQ 135
           +  PS  S+
Sbjct: 773 APPPSPPSE 781



 Score = 36.5 bits (85), Expect = 0.076
 Identities = 9/105 (8%), Positives = 19/105 (18%), Gaps = 5/105 (4%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP 85
              ++         +         ++      P        +   P   P          
Sbjct: 638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA 697

Query: 86  PPSASDKHDSQSSP-----HITQHTGPGIFQSPPPTSHISKIPSP 125
           P   +    +            Q        S P  +    +P P
Sbjct: 698 PAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742



 Score = 32.7 bits (75), Expect = 1.00
 Identities = 21/128 (16%), Positives = 40/128 (31%), Gaps = 18/128 (14%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFIS--TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
           P P ++ G  PP+P          +   +P+ P+                 P P+  A +
Sbjct: 590 PAPGAAGGEGPPAP-ASSGPPEEAARPAAPAAPAAP-------------AAPAPAGAAAA 635

Query: 84  PPPPSASDKHDSQSSPHITQHTG-PGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
           P   SA+      +  H  +H   P            +   +P++  P+ +         
Sbjct: 636 PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPA-APAAPA 694

Query: 143 GGSNGQTH 150
           G +  Q  
Sbjct: 695 GAAPAQPA 702



 Score = 31.5 bits (72), Expect = 2.8
 Identities = 14/78 (17%), Positives = 22/78 (28%), Gaps = 4/78 (5%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL---TSLTSPHQMPPPSPIA 81
           P   +    ++PP    G  S    +     P P           +   P   P P+P A
Sbjct: 710 PAGQADDPAAQPPQAAQGA-SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768

Query: 82  TSPPPPSASDKHDSQSSP 99
                P  S   + +   
Sbjct: 769 APAAAPPPSPPSEEEEMA 786


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 38.5 bits (89), Expect = 0.016
 Identities = 48/230 (20%), Positives = 71/230 (30%), Gaps = 43/230 (18%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG--HHTLTSLTSP----HQMPPPS 78
            PP +   G + P      VS       P     QH    H+    +      H   P +
Sbjct: 768 RPPQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWA 827

Query: 79  PIATSPPP---PSASDKHDSQSS-PHITQHTGPGIFQ----------------------S 112
           P     PP    SA    D  S  PH+   TGP   Q                      S
Sbjct: 828 PRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWSS 887

Query: 113 PPPTSHISKIPS--PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQ 170
           P P + I  IP+  P    P   S  +    G  S+G     +       +  Y Q    
Sbjct: 888 PQPRAPIRPIPTRFPPPPMPLQDSMAV----GCDSSGTACPSM-----PFASDYSQGAFT 938

Query: 171 QDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQ 220
             D    T + P  ++  +     SQ++ +  +++  N    V P +  Q
Sbjct: 939 PLDINAQTPKRPRVEESSHGPARCSQATTEAQEILSDNSEISVFPKDAKQ 988


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 38.5 bits (89), Expect = 0.016
 Identities = 34/115 (29%), Positives = 41/115 (35%), Gaps = 16/115 (13%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP-----PPSPI 80
             PSS+    PP P     S       PSQ  P        S   P Q P     PP+P 
Sbjct: 287 QHPSSN----PPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPS 342

Query: 81  AT--SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
                PPP +   +  +QS  H     GP  F   P     S +P P +  P SS
Sbjct: 343 MPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMP-----SNLPPPPALKPLSS 392



 Score = 37.0 bits (85), Expect = 0.048
 Identities = 27/141 (19%), Positives = 38/141 (26%), Gaps = 3/141 (2%)

Query: 7   LSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
           L+        S         P ++    +P  P+   +           PSP       T
Sbjct: 189 LAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQT 248

Query: 67  SLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQSPPPT-SHISKIP 123
           +     Q P PS          P     H  Q  P   QH      Q      S +  +P
Sbjct: 249 ASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLP 308

Query: 124 SPSSSSPSSSSQLMQQGGGGG 144
            PS + P S +   Q      
Sbjct: 309 LPSQAQPHSHTPPSQSALQPQ 329



 Score = 37.0 bits (85), Expect = 0.058
 Identities = 28/124 (22%), Positives = 39/124 (31%), Gaps = 5/124 (4%)

Query: 11  PASHKTSTSMEVDTPP--PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL 68
           P       S  + + P  PP  ++    P   +     G  S  P  P  QH     TS 
Sbjct: 404 PPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQH---PFTSG 460

Query: 69  TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
             P   PPPS   ++P  P  +          +    G      P P   I + P   + 
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAE 520

Query: 129 SPSS 132
            P S
Sbjct: 521 EPES 524



 Score = 36.6 bits (84), Expect = 0.074
 Identities = 37/139 (26%), Positives = 52/139 (37%), Gaps = 17/139 (12%)

Query: 25  PPPPSSSRGSEPPSPNTGGV--SGGFISTSPSQPSPQHGHHTLTSLTS--PHQMP-PPSP 79
           PP  + +  + PP+P+   V   G  I+  P+    Q    +L S  S  P ++P P  P
Sbjct: 184 PPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243

Query: 80  I------ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
           +        SP PP+ S +H  QSS H      P   Q  P       +  PSS+ P   
Sbjct: 244 LQPQTASQQSPQPPAPSSRH-PQSSHHGPGPPMPHALQQGPV-----FLQHPSSNPPQPF 297

Query: 134 SQLMQQGGGGGSNGQTHLH 152
                Q        Q   H
Sbjct: 298 GLAQSQVPPLPLPSQAQPH 316



 Score = 33.5 bits (76), Expect = 0.61
 Identities = 32/134 (23%), Positives = 42/134 (31%), Gaps = 26/134 (19%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS---LTSPHQMPP-PSP 79
           +P PP   + +   SP     S     +S   P P   H        L  P   PP P  
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298

Query: 80  IATSPPPPSASDKHDSQSSPHITQHTGPGIFQ-------------------SPPPTSHIS 120
           +A S  PP       SQ+ PH          Q                    PPPT+ I 
Sbjct: 299 LAQSQVPPLPL---PSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIP 355

Query: 121 KIPSPSSSSPSSSS 134
           ++P+ S   P    
Sbjct: 356 QLPNQSHKHPPHLQ 369



 Score = 32.4 bits (73), Expect = 1.4
 Identities = 30/110 (27%), Positives = 38/110 (34%), Gaps = 19/110 (17%)

Query: 50  STSPSQPSPQHG---------HHTLTSLTSPHQMPPP--SPIATSPPPPSASDKHDSQSS 98
           S+SPS PSPQ              L     P    PP  +   ++PPP  ++     Q S
Sbjct: 149 SSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGS 208

Query: 99  PHITQHTGPGIFQSPPPTSHISKIPSPS---SSSPSSSSQLMQQGGGGGS 145
           P   Q         P   S +S I +PS      PS    L  Q     S
Sbjct: 209 PIAAQPAPQ-----PQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQS 253



 Score = 32.4 bits (73), Expect = 1.6
 Identities = 27/126 (21%), Positives = 38/126 (30%), Gaps = 12/126 (9%)

Query: 19  SMEVDTPPPP---------SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
            M  + PPPP         +    S  P P         + + P+QP       +L    
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKA 435

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQ-HTGPGIFQSPPPTSHISKIPSPSSS 128
           S H  P     +  P  P A     S   P I    + P    + PP +     P  S+ 
Sbjct: 436 STH--PHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSAL 493

Query: 129 SPSSSS 134
             S   
Sbjct: 494 PSSGGC 499



 Score = 30.0 bits (67), Expect = 6.3
 Identities = 30/125 (24%), Positives = 42/125 (33%), Gaps = 9/125 (7%)

Query: 81  ATSPPPPSASDKH---DSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLM 137
           ++SP  PS  D     DS +   + Q  GP   Q PP  +     P P+ S+ +   Q  
Sbjct: 149 SSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQ-- 206

Query: 138 QQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQS 197
               G     Q     Q    +   S      Q+  + H  +Q     QQ  Q    S  
Sbjct: 207 ----GSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSR 262

Query: 198 SQQQS 202
             Q S
Sbjct: 263 HPQSS 267



 Score = 30.0 bits (67), Expect = 7.0
 Identities = 26/139 (18%), Positives = 41/139 (29%), Gaps = 10/139 (7%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
             SHK    ++  +P P   S    PP+          +S+ P+   P      L  +  
Sbjct: 359 NQSHKHPPHLQGPSPFPQMPSNLPPPPALKP-------LSSLPTHHPPSAHPPPLQLMPQ 411

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
              +         PP  + S     ++S H       G  QSP      +    P+   P
Sbjct: 412 SQPLQSV---PAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPP 468

Query: 131 SSSSQLMQQGGGGGSNGQT 149
            S            S+G  
Sbjct: 469 PSLPTSTPAAPPRASSGSQ 487



 Score = 30.0 bits (67), Expect = 7.1
 Identities = 29/133 (21%), Positives = 43/133 (32%), Gaps = 10/133 (7%)

Query: 1   MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH 60
           MP    L   PA     T  +  + PP +S+       P++G  SG   S     P    
Sbjct: 409 MPQSQPLQSVPAQPPVLT--QSQSLPPKAST------HPHSGLHSGPPQSPFAQHPFTSG 460

Query: 61  GHHTLTSLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH 118
           G   +    S     P +P   S    PP ++       +          I + P   + 
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAE 520

Query: 119 ISKIPSPSSSSPS 131
             + P P   SPS
Sbjct: 521 EPESPPPPPRSPS 533


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 38.4 bits (89), Expect = 0.020
 Identities = 33/166 (19%), Positives = 43/166 (25%), Gaps = 10/166 (6%)

Query: 37  PSPNTGGVSGGFISTSPSQPSPQH--GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
               T G   G +   P +P  Q   G  T ++L     +    P+  +   P       
Sbjct: 80  TGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQ-MSRV 138

Query: 95  SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQ 154
               P   Q  G     S  P S       P        +  M QG  G          Q
Sbjct: 139 GTMQPG-GQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQ------Q 191

Query: 155 NIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ 200
               +  P       Q    Q         Q  Q QQ   +Q  QQ
Sbjct: 192 QPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQ 237



 Score = 38.1 bits (88), Expect = 0.023
 Identities = 23/71 (32%), Positives = 27/71 (38%), Gaps = 8/71 (11%)

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK-IPSPSSS 128
            P  M  PSP+       S          P +    GPG   S PP S     IPSP + 
Sbjct: 407 QPGMMSSPSPVPQVQTNQSMP----QPPQPSVPSPGGPG---SQPPQSVSGGMIPSPPAL 459

Query: 129 SPSSSSQLMQQ 139
            PS S Q+ Q 
Sbjct: 460 MPSPSPQMSQS 470



 Score = 35.7 bits (82), Expect = 0.12
 Identities = 34/198 (17%), Positives = 44/198 (22%), Gaps = 28/198 (14%)

Query: 18  TSMEVDTPPPPSSSRGSEPPSPNTGGVSGG---FISTSPSQPSPQHGHHTLTSLTSPHQ- 73
           T      P       G   P     G  G     + +   +     G     +   PHQ 
Sbjct: 80  TGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMG----AAGMGPHQM 135

Query: 74  -----MPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
                M P           S   +    +     Q  G    Q+           +    
Sbjct: 136 SRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMGPQQ--GQAQGQAGGM--------NQGQQ 185

Query: 129 SPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQ-----QDDTQHSTVQSPP 183
            P    Q  Q G  G   G     +Q  G           +Q     Q   Q    Q P 
Sbjct: 186 GPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPA 245

Query: 184 GQQQQYQQMDTSQSSQQQ 201
             Q Q  Q    Q   Q 
Sbjct: 246 DAQAQMGQQQQGQGGMQP 263



 Score = 34.6 bits (79), Expect = 0.27
 Identities = 31/177 (17%), Positives = 40/177 (22%), Gaps = 21/177 (11%)

Query: 53  PSQPSPQHGH----HTLTSLTS-PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP 107
                   G     + L +LT    + P   P+   P  P               Q  GP
Sbjct: 60  QQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQ------------QMGGP 107

Query: 108 G----IFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPS 163
           G    + QS      +    +       S    MQ GG  G   Q               
Sbjct: 108 GTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGTMQPGGQAGGMMQQSSGQPQSQQPNQMG 167

Query: 164 YGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQ 220
             Q   Q      +  Q  P  QQQ  QM                        + N 
Sbjct: 168 PQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNP 224



 Score = 33.4 bits (76), Expect = 0.65
 Identities = 35/176 (19%), Positives = 44/176 (25%), Gaps = 14/176 (7%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
                 SS   +   PN  G   G       Q   Q G           Q  PP      
Sbjct: 148 GGMMQQSSGQPQSQQPNQMGPQQG-------QAQGQAGGMNQGQQGPVGQQQPPQMGQPG 200

Query: 84  PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGG 143
            P      +   Q  P   Q   P + Q      +  +        P+ +   M Q   G
Sbjct: 201 MPGGGGQGQMQQQGQPGGQQQQNPQMQQQL---QNQQQQQMDQQQGPADAQAQMGQQQQG 257

Query: 144 GSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQ 199
              G         G +  P   Q   QQ   Q S +   P Q QQ          Q
Sbjct: 258 --QGGMQPQQMQGGQMQVPMQQQPPQQQ--PQQSQLGMLPNQMQQMPGGGQGGPGQ 309


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 37.8 bits (87), Expect = 0.025
 Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 6/129 (4%)

Query: 10  FPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLT 69
               HKT+      + PP S+        P +G  +G   ST P   SP     + T   
Sbjct: 440 HVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNA 499

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS----KIPSP 125
           +       +P ATSP     SD  ++ S   I    G     + PPT   S      P  
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV--IGVTTTATSPPTGTTSVPNATSPQV 557

Query: 126 SSSSPSSSS 134
           +  SP +++
Sbjct: 558 TEESPVNNT 566



 Score = 35.1 bits (80), Expect = 0.21
 Identities = 33/164 (20%), Positives = 51/164 (31%), Gaps = 22/164 (13%)

Query: 8   SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPS----QPSPQHGHH 63
           +    +  +    E       ++   +  PS  T  V+ G   T  S    QP      H
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH 605

Query: 64  TLTSLTSPHQMPPP-------SPIATSPPPPSASDKHDSQSSP-----HITQHTGPGIFQ 111
           +     S    P             T   P   S  H S  SP       +Q +GPG   
Sbjct: 606 STPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSS 665

Query: 112 SP--PPTSHISKI-PSP---SSSSPSSSSQLMQQGGGGGSNGQT 149
           +   P   H+++  P+P   S S+PS     +      G    +
Sbjct: 666 TSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709



 Score = 29.8 bits (66), Expect = 7.9
 Identities = 34/140 (24%), Positives = 53/140 (37%), Gaps = 20/140 (14%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTG---GVSGGFISTSPSQPSP 58
           P     S++P     +  M     P P+++  S P    T      S G  + S ++ + 
Sbjct: 661 PGNSSTSRYPGEVHVTEGM-----PNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETS 715

Query: 59  QHGHHTLTSLTSPHQMPPPSP-IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
                  TS  +       +P  AT+  PPS S    S+  P  T       F SPP T+
Sbjct: 716 GSTLMASTSPHTNEGAFRTTPYNATTYLPPSTS----SKLRPRWT-------FTSPPVTT 764

Query: 118 HISKIPSPSSSSPSSSSQLM 137
             + +P P +  P  S+  M
Sbjct: 765 KQATVPVPPTQHPDHSNLSM 784


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 37.7 bits (87), Expect = 0.030
 Identities = 32/181 (17%), Positives = 54/181 (29%), Gaps = 10/181 (5%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           P + +          PP +    ++ P+  TG       +   ++P             +
Sbjct: 691 PGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAA 750

Query: 71  PHQMPPPSPIATSPPPPSASD-----KHDSQSSPHITQ--HTGPGIFQSP--PPTSHISK 121
           P +  PP+       PP+A+      +   Q+ P   Q     P     P   PTS    
Sbjct: 751 PGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810

Query: 122 IPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQS 181
             +       +   L Q   GG   G+  L         + + G        T    VQ+
Sbjct: 811 PRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAA-GPTPSPGSGTSDKIVQA 869

Query: 182 P 182
           P
Sbjct: 870 P 870


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 37.6 bits (87), Expect = 0.035
 Identities = 36/134 (26%), Positives = 47/134 (35%), Gaps = 40/134 (29%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP------- 76
           TP  PS     EPP P          +  P   SP   +  L   TSP   PP       
Sbjct: 348 TPEAPSPPIEEEPPQP---------KAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASS 398

Query: 77  --------PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--------PPTSHIS 120
                   P+     P P SAS+  + + +    + T P    SP        PPTS   
Sbjct: 399 KSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRP---LSPYARYEDLKPPTS--- 452

Query: 121 KIPSPSSSSPSSSS 134
             PSP++ +  S S
Sbjct: 453 --PSPTAPTGVSPS 464



 Score = 34.5 bits (79), Expect = 0.28
 Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 22/124 (17%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
             S  +S     DT P  +++  + PP  N   +S   +      P+       +  +  
Sbjct: 464 SVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAP 523

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--------PPTSHISKI 122
                      ++PP   A ++H +Q  P            SP        PPTS     
Sbjct: 524 SSTNEVVKVGNSAPPTALADEQHHAQPKPRPL---------SPYTMYEDLKPPTS----- 569

Query: 123 PSPS 126
           P+PS
Sbjct: 570 PTPS 573



 Score = 31.0 bits (70), Expect = 3.0
 Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 5/81 (6%)

Query: 53  PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
           P+ PSP        S++S        P      P +A+    +    ++   +   ++  
Sbjct: 450 PTSPSPTAPTGVSPSVSS----TSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDD 505

Query: 113 -PPPTSHISKIPSPSSSSPSS 132
             PPTS     P    +  S+
Sbjct: 506 LKPPTSPSPAAPVGKVAPSST 526



 Score = 31.0 bits (70), Expect = 3.1
 Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 37/111 (33%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP 85
            PP  S  ++ P P         + T P  P            + P +  PP P A  P 
Sbjct: 328 VPPKESDAADGPKP---------VPTKPVTPEAP---------SPPIEEEPPQPKAVVPR 369

Query: 86  PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           P S    ++                   PPTS    IP+P SSSP+SS  +
Sbjct: 370 PLSPYTAYEDLK----------------PPTS---PIPTPPSSSPASSKSV 401



 Score = 31.0 bits (70), Expect = 3.3
 Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 7/108 (6%)

Query: 24  TPPPPSSSRGSEPPS-PNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
           T P   S   S   S P     +    +T  + P P +              PP SP   
Sbjct: 456 TAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPA 515

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
           +P    A       S+  + +        +     H ++ P P   SP
Sbjct: 516 APVGKVA-----PSSTNEVVKVGNSAPPTALADEQHHAQ-PKPRPLSP 557


>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
          Length = 808

 Score = 37.3 bits (86), Expect = 0.042
 Identities = 28/107 (26%), Positives = 35/107 (32%), Gaps = 20/107 (18%)

Query: 20  MEVDTPPPPSSSRG-----SEPPSPNTGGVSGGFISTSPSQPS------PQHGHHTLTSL 68
           ME +T  P   +       SE  SP T       +S SPSQ         Q+   T + L
Sbjct: 1   METETKEPEDPADNVNDVVSEASSPETD------LSLSPSQSEQNIENDGQNSPETQSPL 54

Query: 69  TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
           T     P P       P    SD    +SS             +PPP
Sbjct: 55  TELQPSPLPPNTTLDAP---VSDSQGDESSSEQQPQNPNSTEPAPPP 98


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 37.0 bits (86), Expect = 0.043
 Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 5/80 (6%)

Query: 51  TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIF 110
           +S S P+ +   H  +S +S  Q  PPS  +  P     S       +P I     P   
Sbjct: 1   SSSSNPTSRSQSHASSSSSSSSQSSPPSSTSPRPRRRKPS-ASSLLHTPSIL----PLPK 55

Query: 111 QSPPPTSHISKIPSPSSSSP 130
            S P    ++  P+ ++ +P
Sbjct: 56  LSSPSPPSVTLPPAATTQTP 75


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
           short domain found in bacterial type II/III secretory
           system proteins. The architecture of these proteins
           suggest that this family may be functionally analogous
           to pfam03958.
          Length = 95

 Score = 34.2 bits (79), Expect = 0.046
 Identities = 12/52 (23%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 239 TSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLDIKADL 290
           TS T G  S+S +    +S+  +    ++S  +++   ++ T IT + ++D 
Sbjct: 19  TSVTSGSVSSSGSNSSSSSSNSSNGGSSSS--SSSGDSSSGTRITTESESDF 68



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 4/65 (6%)

Query: 226 EVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLD 285
           +  G   +SV + + S+ G  S S++    +S   +  S ++    ++S    TT    D
Sbjct: 12  KRSGSSNTSVTSGSVSSSG--SNSSSSSSNSSNGGSSSSSSSG--DSSSGTRITTESESD 67

Query: 286 IKADL 290
              DL
Sbjct: 68  FWGDL 72


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 34.9 bits (80), Expect = 0.051
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 381 PYCCSVCMKTFARKEHLENHTRCHTGETPYKCQYCGKNFSRKEHMVNHV-RKH 432
           PY C +C+  F+    L+ H R    E    C  CGK F   +  ++HV +KH
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHVCKKH 123



 Score = 33.3 bits (76), Expect = 0.15
 Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 2/50 (4%)

Query: 604 TPYQCQYCPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKDHLNNHV 653
           +PY C  C   FS    L  H+R    E    C  C + F   D   +HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119



 Score = 32.9 bits (75), Expect = 0.22
 Identities = 20/61 (32%), Positives = 27/61 (44%), Gaps = 5/61 (8%)

Query: 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNV 838
           P+ C +C   F+    L  H+R    E   VCP C K F      + H+ K H   N+ V
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHVCKKH---NICV 127

Query: 839 S 839
           S
Sbjct: 128 S 128



 Score = 32.5 bits (74), Expect = 0.31
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 6/51 (11%)

Query: 688 SPHKCDFCLKTFTRKEHLTNHLKQHAGETEH--VCNICSKPFLKKEHLVNH 736
           SP+ C  CL  F      +  LKQH   TEH  VC +C K F   +  ++H
Sbjct: 72  SPYVCPLCLMPF----SSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTLDH 118



 Score = 31.4 bits (71), Expect = 0.80
 Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 6/52 (11%)

Query: 548 SPHKCNYCAKSFTRKEHLVNHVR--QHTGDSPHKCNYCAKSFTRKEHLVNHV 597
           SP+ C  C   F+    L  H+R  +H+      C  C K F   +  ++HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRYTEHS----KVCPVCGKEFRNTDSTLDHV 119



 Score = 31.4 bits (71), Expect = 0.82
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%)

Query: 492 TPFKCEICTKSFTRKEHFTNHIMWHTGETPHRCDFCSKTFTRKEHLLNHV 541
           +P+ C +C   F+       HI +   E    C  C K F   +  L+HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119



 Score = 31.0 bits (70), Expect = 0.89
 Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 2/50 (4%)

Query: 576 SPHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHV 625
           SP+ C  C   F+    L  H+R         C  C K F   D  ++HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119



 Score = 30.6 bits (69), Expect = 1.4
 Identities = 20/65 (30%), Positives = 26/65 (40%), Gaps = 11/65 (16%)

Query: 660 SPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFCLKTFTRKEHLTNHL-KQHAGETEH 718
           SP+ C  C   F+    L  HIR    E    C  C K F   +   +H+ K+H      
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKH------ 123

Query: 719 VCNIC 723
             NIC
Sbjct: 124 --NIC 126



 Score = 30.2 bits (68), Expect = 1.8
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%)

Query: 520 TPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHV 569
           +P+ C  C   F+    L  H+R    E    C  C K F   +  ++HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119



 Score = 29.8 bits (67), Expect = 2.9
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 408 TPYKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYVNHV 457
           +PY C  C   FS    +  H+R    E    C +C K F   +  ++HV
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIR--YTEHSKVCPVCGKEFRNTDSTLDHV 119



 Score = 29.1 bits (65), Expect = 5.3
 Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)

Query: 298 HTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSH 344
           + C LC   F     +  H RY    K   C VCGK F      + H
Sbjct: 74  YVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDH 118



 Score = 28.3 bits (63), Expect = 9.2
 Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 2/50 (4%)

Query: 632 SPHKCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHI 681
           SP+ C  C   F+    L  H+R    E    C  C K F   +   +H+
Sbjct: 72  SPYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
           family of proteins is known as the basic domain. It
           contains a high proportion of positively charged amino
           acids and interacts with microtubules.
          Length = 359

 Score = 36.3 bits (83), Expect = 0.064
 Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS-----HIS 120
           +  T+P +  PP     S PP S S       S H           SPPP S      ++
Sbjct: 18  SQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLA 77

Query: 121 KIPSPSSSSPSSSSQLMQQ 139
           K PS SSS  S+ SQ + +
Sbjct: 78  KTPSSSSSQTSTPSQPLPR 96



 Score = 30.1 bits (67), Expect = 5.6
 Identities = 29/106 (27%), Positives = 41/106 (38%), Gaps = 5/106 (4%)

Query: 28  PSSSRGSEPPSPNTGGVSGGFI-STSPSQPSP-QHGHHTLTSLTSPHQMPPPSPIATSPP 85
           P  +  S+  +P+  G       S  P  PSP Q    +L     P ++   SP   S  
Sbjct: 12  PGPANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSAT 71

Query: 86  PPSASDKHDSQSSPHITQHTGPGIFQSPPPTS---HISKIPSPSSS 128
           PP+   K  S SS   +  + P     P PT      S +P P +S
Sbjct: 72  PPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNS 117


>gnl|CDD|225751 COG3210, FhaB, Large exoproteins involved in heme utilization or
           adhesion [Intracellular trafficking and secretion].
          Length = 1013

 Score = 36.2 bits (83), Expect = 0.088
 Identities = 26/202 (12%), Positives = 51/202 (25%), Gaps = 16/202 (7%)

Query: 95  SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQ 154
           +  +  +         Q       I  I   S +     +   Q        GQ    + 
Sbjct: 269 NGGTVGLPSTNAGLSLQVSASGGVIGDINLGSGNLGDLVALGGQPIVAYLKGGQIGGQVN 328

Query: 155 NIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ-QSDLIKTNGTGDV 213
           NIG  L+     +  +      + + +                              G +
Sbjct: 329 NIGVQLASDKWDYKQEGLTGGGAAIIANAVTVLTGTAGAGLIGGGAIAGTGALAGLGGTL 388

Query: 214 NPDEFNQLLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNG---------VS 264
                  + A     G L  +     +S  G  S    +GD+ +   NG         + 
Sbjct: 389 ANAASLVIAA-----GNLTINAGALLASQ-GSVSLINTKGDIGATGGNGGLKELGAQSLL 442

Query: 265 GTNSVITNNSTENNTTSITLDI 286
             NS   +++ +N T +     
Sbjct: 443 LNNSGGRSSTVKNLTVAAATAG 464


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This
          family contains bacterial fibronectin-attachment
          proteins (FAP). Family members are rich in alanine and
          proline, are approximately 300 long, and seem to be
          restricted to mycobacteria. These proteins contain a
          fibronectin-binding motif that allows mycobacteria to
          bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 35.6 bits (82), Expect = 0.10
 Identities = 18/68 (26%), Positives = 23/68 (33%), Gaps = 12/68 (17%)

Query: 24 TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
           PPPP  S  +  P+P          +    QP              P+  PPP P   +
Sbjct: 42 APPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDD------------PNAAPPPPPADPN 89

Query: 84 PPPPSASD 91
           PPP   D
Sbjct: 90 APPPPPVD 97



 Score = 30.2 bits (68), Expect = 5.1
 Identities = 17/60 (28%), Positives = 23/60 (38%)

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
           T+   P   PPP   A + P P+A       ++P   Q   P     PPP    +  P P
Sbjct: 36  TANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPP 95


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 31.2 bits (71), Expect = 0.11
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 663 KCHFCSKSFTRKEHLNNHIRLH 684
           KC  C KSF+RK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 30.4 bits (69), Expect = 0.18
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 467 VCQVCGKKYTRKEHLANHMRSH 488
            C  CGK ++RK +L  H+R+H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.6 bits (67), Expect = 0.31
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 411 KCQYCGKNFSRKEHMVNHVRKH 432
           KC  CGK+FSRK ++  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.6 bits (67), Expect = 0.35
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 635 KCSYCKQSFTRKDHLNNHVRLH 656
           KC  C +SF+RK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.45
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 607 QCQYCPKAFSRKDHLVNHVRQH 628
           +C  C K+FSRK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.55
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 495 KCEICTKSFTRKEHFTNHIMWH 516
           KC  C KSF+RK +   H+  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.5 bits (64), Expect = 1.00
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 691 KCDFCLKTFTRKEHLTNHLKQH 712
           KC  C K+F+RK +L  HL+ H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 551 KCNYCAKSFTRKEHLVNHVRQH 572
           KC  C KSF+RK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 579 KCNYCAKSFTRKEHLVNHVRQH 600
           KC  C KSF+RK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 523 RCDFCSKTFTRKEHLLNHVRQH 544
           +C  C K+F+RK +L  H+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 27.3 bits (61), Expect = 2.2
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 748 CTDCGKSFPLKGNLLFHQRSH 768
           C DCGKSF  K NL  H R+H
Sbjct: 2   CPDCGKSFSRKSNLKRHLRTH 22



 Score = 26.9 bits (60), Expect = 3.5
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 439 RCEICKKAFTRKEHYVNHVMWH 460
           +C  C K+F+RK +   H+  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
           activation/proliferation-associated protein-1 C term.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 343 and 708 amino
           acids in length. This family is the C terminal region of
           caprin-1. Caprin-1 is a protein involved in regulating
           cellular proliferation. In mutated phenotypes, the G1
           phase of the cell cycle is greatly lengthened, impairing
           normal proliferation. The C terminal region of caprin-1
           contains RGG motifs which are characteristic of RNA
           binding domains. It is possible that caprin-1 functions
           through an RNA binding mechanism.
          Length = 319

 Score = 35.3 bits (81), Expect = 0.12
 Identities = 48/198 (24%), Positives = 70/198 (35%), Gaps = 38/198 (19%)

Query: 25  PPPPSSSRGSEPPS----------PNTGGVSGGFISTSP-------SQPSPQHG--HHTL 65
           PP  S SR S+P +          P     S G+ S+ P       ++P PQ        
Sbjct: 43  PPVHSESRLSQPSAVPVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQTDPIDPIQ 102

Query: 66  TSLT-SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPS 124
            S++ +  Q P  S +  +  P           S  I  +  P  FQS     +++  P 
Sbjct: 103 ASMSLNSEQTPTSSSLPAASQPQVFQTGSKPLHSSGINVNAAP--FQSMQTVFNMNA-PV 159

Query: 125 PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPG 184
           P  + P S   L QQ     S  Q              S  QH ++Q + Q   +Q+  G
Sbjct: 160 PPVNEPES---LKQQNQYQASYNQGF-----------SSQPQHPVEQTELQQEQLQTVVG 205

Query: 185 Q-QQQYQQMDTSQSSQQQ 201
               Q QQ   + S  QQ
Sbjct: 206 SFHSQDQQAHQAASGHQQ 223


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 35.6 bits (82), Expect = 0.13
 Identities = 18/113 (15%), Positives = 28/113 (24%), Gaps = 6/113 (5%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
            P  P +S  S  P                 Q       H       P +     P    
Sbjct: 175 PPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRD-YDHPRPDWDRPRRDRTDRP---E 230

Query: 84  PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           PPP +          P         I   P     ++  P+P+      +++L
Sbjct: 231 PPPGAGHVHRGGPGPPERDDAPVVPI--RPSAPGPLAAQPAPAPGPGEPTARL 281



 Score = 35.2 bits (81), Expect = 0.20
 Identities = 27/172 (15%), Positives = 42/172 (24%), Gaps = 50/172 (29%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
           PPPP + R SEP  P  G            +P   +G              P      + 
Sbjct: 97  PPPPHARRTSEPELPRPGR-----------RPYEGYGGP---RADDRPPGLPRQDQLPTA 142

Query: 85  PPPSASDKHDSQSSPHITQHTGPGIFQSPP--------PTSHISKIPSPSSSSPSSSSQL 136
            P          + P   Q   PG +                  + P  S +S +   + 
Sbjct: 143 RP----------AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQER 192

Query: 137 MQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQ 188
            ++    G                 P Y Q     D  +       P + + 
Sbjct: 193 DREPYDAG----------------RPEYDQRRRDYDHPRPD--WDRPRRDRT 226


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 35.4 bits (81), Expect = 0.14
 Identities = 27/131 (20%), Positives = 40/131 (30%), Gaps = 8/131 (6%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRG----SEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
              H+T+T+ +  +   P  ++G    S P        S    S SP  P         T
Sbjct: 201 TQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQST 260

Query: 67  SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPS 126
           +               +PP  S      S ++P  T         +P PT+      SP 
Sbjct: 261 TGDGQEHTQRR----KTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPP 316

Query: 127 SSSPSSSSQLM 137
            SSP       
Sbjct: 317 HSSPPGVQANP 327



 Score = 30.4 bits (68), Expect = 4.7
 Identities = 31/191 (16%), Positives = 53/191 (27%), Gaps = 12/191 (6%)

Query: 13  SHKTSTSMEVDTPPPPS--SSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           S  TS   EV  P  PS  + +       +    +   +S++    +      +      
Sbjct: 175 SKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSS----NP 230

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS--HISKIPSPSSS 128
             Q  PP     S   PS S +H   ++      TG G   +    +    S   SP S+
Sbjct: 231 EPQTEPPP----SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHST 286

Query: 129 SPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQ 188
           +    +   Q+ G             +     SP   Q      +        PP     
Sbjct: 287 ATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSI 346

Query: 189 YQQMDTSQSSQ 199
              +     + 
Sbjct: 347 CYGVGIYNEAL 357


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 34.9 bits (80), Expect = 0.14
 Identities = 22/130 (16%), Positives = 32/130 (24%), Gaps = 7/130 (5%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
                 S      TP PP         S          I T+  +P+     H L S   
Sbjct: 143 HGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPT-----HPLKSFAV 197

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
           P    P S     P  PS         +          +       S +  +  PS+   
Sbjct: 198 P--ALPASMSTIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPSAPDV 255

Query: 131 SSSSQLMQQG 140
             +   M + 
Sbjct: 256 LETGGKMLED 265


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 35.7 bits (82), Expect = 0.15
 Identities = 21/137 (15%), Positives = 33/137 (24%), Gaps = 16/137 (11%)

Query: 31  SRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS----PIATSP-- 84
           S     P       S    S   S+ +        +  T+    P PS     I T P  
Sbjct: 522 SIKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASPSTAGSPRPKPSLGKFVIGTDPFA 581

Query: 85  -PPPSASDKHDSQSSPHITQHTGPGIFQSPPPT---------SHISKIPSPSSSSPSSSS 134
                    +    +   +     G   S P T         +   K PS S    + ++
Sbjct: 582 FANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPGSDLKPATLNGKTPSSSLVGAARNA 641

Query: 135 QLMQQGGGGGSNGQTHL 151
               +       G    
Sbjct: 642 GASSKVKIPSGLGGFTS 658


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 34.6 bits (80), Expect = 0.15
 Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 1/96 (1%)

Query: 21  EVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI 80
           E D  P  + +  ++PP      V  G  +     P+             P  + PP P 
Sbjct: 52  EPDMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPP-NTPVEPEPAPVEPPKPK 110

Query: 81  ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
               P P    +   ++ P       P + +   PT
Sbjct: 111 PVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 33.8 bits (77), Expect = 0.15
 Identities = 32/105 (30%), Positives = 38/105 (36%), Gaps = 9/105 (8%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---- 80
           PPPP     + PP P   GV+G  I   P  P           L     +PPP P+    
Sbjct: 38  PPPPLPGGAAIPPPPPLPGVAG--IPPPPPLPGAT-AIPPPPPLPGAAGIPPPPPLPGGA 94

Query: 81  ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
              PPPP           P +    GPG+   PPP      IP P
Sbjct: 95  GIPPPPPPLPGGAAVPPPPPLP--GGPGVPPPPPPFPGAPGIPPP 137



 Score = 30.3 bits (68), Expect = 2.7
 Identities = 20/64 (31%), Positives = 21/64 (32%), Gaps = 15/64 (23%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
           PPP        PP P  GG   G     P  P                 +PPP P   SP
Sbjct: 100 PPPLPGGAAVPPPPPLPGG--PGVPPPPPPFPGAP-------------GIPPPPPGMGSP 144

Query: 85  PPPS 88
           PPP 
Sbjct: 145 PPPP 148



 Score = 29.9 bits (67), Expect = 3.9
 Identities = 32/111 (28%), Positives = 36/111 (32%), Gaps = 9/111 (8%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---A 81
           PPPP       PP P   G  G  I   P  P    G      L     +PPP P+   A
Sbjct: 2   PPPPLPGGVGIPPPPPLPG--GVCIPPPPPLPGGT-GIPPPPPLPGGAAIPPPPPLPGVA 58

Query: 82  TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
             PPPP           P +    G      PPP    + IP P    P  
Sbjct: 59  GIPPPPPLPGATAIPPPPPL---PGAAGIPPPPPLPGGAGIPPPPPPLPGG 106


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 35.3 bits (81), Expect = 0.16
 Identities = 31/129 (24%), Positives = 45/129 (34%), Gaps = 12/129 (9%)

Query: 21  EVDTPPPPSSSRGSEPPSPNTGGV---SGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
           E  T  PPS +     PS N       SG   S+S S+ S      + +S +   +  PP
Sbjct: 371 EQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPP 430

Query: 78  SPIATSPPPPSAS----DKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
              +  P PPS +    D   ++ +PH           S PP     +         SS 
Sbjct: 431 RTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESV--SSNPPIKQPMEK---EGKVKSSG 485

Query: 134 SQLMQQGGG 142
           SQ   +   
Sbjct: 486 SQYHPESKE 494



 Score = 31.8 bits (72), Expect = 1.8
 Identities = 21/93 (22%), Positives = 32/93 (34%), Gaps = 11/93 (11%)

Query: 62  HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
            +    +  P    P SP     P      +  S  S H + HT       P P+ +IS+
Sbjct: 69  SNQPHLVGIPKNSVPQSPNNKGKPSFFGDQRGRSIPSSHQSTHTSA---PGPAPSGNISQ 125

Query: 122 IPSPS--------SSSPSSSSQLMQQGGGGGSN 146
               S        +  P S+    Q+ G  GS+
Sbjct: 126 SQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSS 158



 Score = 30.3 bits (68), Expect = 7.0
 Identities = 23/114 (20%), Positives = 33/114 (28%), Gaps = 5/114 (4%)

Query: 26  PPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP-----PSPI 80
               S R  E    +        +    SQ S Q+   +L S +S  Q  P     P   
Sbjct: 171 CRSKSPRDLEVNVQSPERPPDSLVLLPSSQHSVQNFPPSLMSKSSAMQQKPTAYVRPMDG 230

Query: 81  ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
               P  S   K   + S               P  + ++K+  PS     S S
Sbjct: 231 QDQAPDESPELKSSIEESYGQQSFGKTMDELKSPAKAKLTKLKIPSQPVEQSYS 284


>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
          Length = 55

 Score = 31.6 bits (71), Expect = 0.16
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 410 YKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYV 454
           Y+C  CG+ + +++ M+ H+RKH   T  +   CK+   R   Y+
Sbjct: 6   YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48



 Score = 27.3 bits (60), Expect = 5.7
 Identities = 7/24 (29%), Positives = 15/24 (62%)

Query: 468 CQVCGKKYTRKEHLANHMRSHTND 491
           C +CG+ Y +++ +  H+R H  +
Sbjct: 8   CPICGEIYIKRKSMITHLRKHNTN 31



 Score = 26.9 bits (59), Expect = 6.7
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 810 CPHCSKSFVEHTCMIKHIRKNHT 832
           CP C + +++   MI H+RK++T
Sbjct: 8   CPICGEIYIKRKSMITHLRKHNT 30


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 35.0 bits (80), Expect = 0.17
 Identities = 37/170 (21%), Positives = 50/170 (29%), Gaps = 26/170 (15%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQ------ 55
           PI C  ++ P   +   S   D  P  S   G     P    +      T PS       
Sbjct: 123 PIPC-FAEVPVFPRPYQSSGDDDGPSTSRGSGVARVRPTV--IQHRVDKTRPSDYENHRP 179

Query: 56  -----PSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS----------ASDKHDSQSSPH 100
                 +P        +   P Q P PS    SP  P+           +       SP 
Sbjct: 180 RPFAMANPSWVDEPDAAAQRPPQ-PGPSGQNRSPRTPTLSNVRVLDAPVATNRGEAPSPP 238

Query: 101 ITQHTGPGIFQSPPPTSHI-SKIPSPSSSSPSSSSQLMQQGGGGGSNGQT 149
            T    P    + P  +   +  P PSS  P    +   Q    G+N QT
Sbjct: 239 RTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQT 288


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 35.3 bits (81), Expect = 0.18
 Identities = 20/66 (30%), Positives = 22/66 (33%), Gaps = 13/66 (19%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
           P  P   R   P  P   G            PSP        S  +P Q PP +  A  P
Sbjct: 397 PDLPQPDRHPGPAKPEAPGAR------PAELPSPA-------SAPTPEQQPPVARSAPLP 443

Query: 85  PPPSAS 90
           P P AS
Sbjct: 444 PSPQAS 449



 Score = 33.0 bits (75), Expect = 0.85
 Identities = 22/112 (19%), Positives = 31/112 (27%), Gaps = 18/112 (16%)

Query: 55  QPSPQHGHHTLTSLTSPHQMPPPSPIATSPP--PPSASDKHDSQSSPHITQHTGPGIFQS 112
            PSP            P     P P     P  P +   +     SP     + P   Q 
Sbjct: 381 APSPAGSPDVKKKAPEP---DLPQPDRHPGPAKPEAPGARPAELPSPA----SAPTPEQQ 433

Query: 113 PPPTSHISKIPSPSSSSPSSSSQL---------MQQGGGGGSNGQTHLHIQN 155
           PP        PSP +S+P + +             +      NG     +Q 
Sbjct: 434 PPVARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTRNGSRKQPVQA 485


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 35.1 bits (81), Expect = 0.20
 Identities = 29/148 (19%), Positives = 42/148 (28%), Gaps = 37/148 (25%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
             Q  L    A   +  ++       P +S+ S P +               S+P    G
Sbjct: 95  ERQLGLLGVKA-KDSQFTVVSQAKKSPPASKTSTPMNT--------------SEPL-VPG 138

Query: 62  HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISK 121
           H + +   SP +   PS                   SP  T    P   Q  P     S 
Sbjct: 139 HSSFSD--SPSRSASPSRK----------------FSPSSTIQQSP---QLTPSNKPASP 177

Query: 122 IPSPSSSSPSSSSQLMQQGGGGGSNGQT 149
             S  S S SSS   +   G   +   +
Sbjct: 178 SSSYQSPSYSSSLGPVNSSGNRSNLRSS 205


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 34.7 bits (79), Expect = 0.22
 Identities = 22/125 (17%), Positives = 43/125 (34%), Gaps = 15/125 (12%)

Query: 16  TSTSMEVDTPPPPSSSRGSEPPSPNTGGVS-----GGFISTSPSQPSPQHGHHTLTSLTS 70
           T  ++ V    P S+   S   +P++  +         ++T  +  +        ++   
Sbjct: 295 THGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVL 354

Query: 71  P--------HQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKI 122
           P          +P   P+   P P S ++   S +    +   GP       P S+I   
Sbjct: 355 PSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIP-- 412

Query: 123 PSPSS 127
            SP+S
Sbjct: 413 VSPTS 417



 Score = 29.7 bits (66), Expect = 8.6
 Identities = 16/95 (16%), Positives = 26/95 (27%)

Query: 51  TSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIF 110
           T P          T +S  +P     PSP        + S      S+  ++        
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSD 357

Query: 111 QSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGS 145
            + P T  +      +      S+   QQ   G  
Sbjct: 358 VTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNI 392


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 34.9 bits (81), Expect = 0.22
 Identities = 15/72 (20%), Positives = 18/72 (25%), Gaps = 23/72 (31%)

Query: 137 MQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQ 196
            Q+G   G  GQ                G    Q         Q   GQQQ  Q      
Sbjct: 617 AQRGEQQGQQGQ----------------GG---QGQGQPGQQGQQGQGQQQGQQGQGGQG 657

Query: 197 SS----QQQSDL 204
                 ++Q  L
Sbjct: 658 GQGSLAERQQAL 669


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 34.4 bits (79), Expect = 0.23
 Identities = 29/127 (22%), Positives = 43/127 (33%), Gaps = 13/127 (10%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGG--FISTSPSQPSPQHGHHTLTSL 68
             S K  T + +   PPP S   S+ PS      SG     S+  S+ +        TS 
Sbjct: 195 TNSFKEVTPVGLMRTPPPGS--HSKSPS-----KSGIPDLPSSRDSEKTKPEKPQQETSS 247

Query: 69  TSPHQMPPPSPIATSPP-PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
               +   P P  T  P  P  +   D+       +       +S   ++     PS  S
Sbjct: 248 MDTEKSSAPKPRETLDPKSPEKAPPIDTTEEE--LKSPEASPKESEEASARKR-SPSLLS 304

Query: 128 SSPSSSS 134
            SP + S
Sbjct: 305 PSPKAES 311


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 30.1 bits (68), Expect = 0.24
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 662 HKCHFCSKSFTRKEHLNNHIRLH 684
           ++C  C K F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.7 bits (67), Expect = 0.39
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 410 YKCQYCGKNFSRKEHMVNHVRKH 432
           Y+C  CGK F  K  +  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.4 bits (66), Expect = 0.45
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 522 HRCDFCSKTFTRKEHLLNHVRQH 544
           +RC  C K F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.4 bits (66), Expect = 0.47
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 466 HVCQVCGKKYTRKEHLANHMRSH 488
           + C  CGK +  K  L  HMR+H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.4 bits (66), Expect = 0.48
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 606 YQCQYCPKAFSRKDHLVNHVRQH 628
           Y+C  C K F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.0 bits (65), Expect = 0.66
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 634 HKCSYCKQSFTRKDHLNNHVRLH 656
           ++C  C + F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 28.6 bits (64), Expect = 0.81
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 550 HKCNYCAKSFTRKEHLVNHVRQH 572
           ++C  C K F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 28.6 bits (64), Expect = 0.81
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 578 HKCNYCAKSFTRKEHLVNHVRQH 600
           ++C  C K F  K  L  H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.8 bits (62), Expect = 1.4
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 690 HKCDFCLKTFTRKEHLTNHLKQH 712
           ++C  C K F  K  L  H++ H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 5.3
 Identities = 10/23 (43%), Positives = 10/23 (43%)

Query: 382 YCCSVCMKTFARKEHLENHTRCH 404
           Y C  C K F  K  L  H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 26.3 bits (58), Expect = 5.8
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 298 HTCDLCGKFFPYRYQMIVHRRYH 320
           + C  CGK F  +  +  H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 25.9 bits (57), Expect = 6.8
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
           ++C  C K F  K  L  H+R+H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 25.9 bits (57), Expect = 7.4
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 354 YTCAVCFHVFANEESLQRHTKRH 376
           Y C  C  VF ++ +L+ H + H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 34.9 bits (80), Expect = 0.24
 Identities = 33/140 (23%), Positives = 44/140 (31%), Gaps = 14/140 (10%)

Query: 12  ASHKTSTSMEVDTPP-PPSSSRGSEPPSPNTGGVSGGFISTSPSQP--------SPQHGH 62
           A+  TS+    D     P+S   S  PS        G    SP Q         S  HG 
Sbjct: 98  ATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGT 157

Query: 63  HTLTSLTSPHQMPPPSPIATSPPP-PSASDKHDSQSSPHITQHTG----PGIFQSPPPTS 117
           H     T  +     +P A S    PS   +     SPH  Q          F  P    
Sbjct: 158 HPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRK 217

Query: 118 HISKIPSPSSSSPSSSSQLM 137
           +     +   S P SS+ ++
Sbjct: 218 NSRHSSNSMPSFPHSSTAVL 237



 Score = 31.0 bits (70), Expect = 3.6
 Identities = 29/117 (24%), Positives = 40/117 (34%), Gaps = 11/117 (9%)

Query: 28  PSSSRGSEPPSPNT---GGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
           PSSS     P        G   G  +    +  P  G  ++  L SPH     S   TS 
Sbjct: 150 PSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSM-QLPSPHFRQKFSSSDTSN 208

Query: 85  PPPSASDKHDSQ-------SSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
                S + +S+       S PH +              S IS   +PSSS+  + S
Sbjct: 209 GFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMS 265


>gnl|CDD|177663 PLN00026, PLN00026, aquaporin  NIP; Provisional.
          Length = 298

 Score = 34.5 bits (79), Expect = 0.24
 Identities = 15/70 (21%), Positives = 21/70 (30%), Gaps = 7/70 (10%)

Query: 27 PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT------SLTSPHQMPPPSPI 80
            +    S P +P T G  GG +       S  +G   L        + +      P   
Sbjct: 1  MDNEEVPSAPSTPATPGTPGGPLFGGFRVDSLSYGDRKLLPSCKCFPVEALEWGRLP-TC 59

Query: 81 ATSPPPPSAS 90
              PPP  S
Sbjct: 60 FCDLPPPDVS 69


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 34.7 bits (80), Expect = 0.27
 Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 18/143 (12%)

Query: 15  KTSTSMEVDTPP--PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT--SLTS 70
           K+S   +  TPP   P +S      SP+    S  F      + +           S ++
Sbjct: 638 KSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSA 697

Query: 71  PHQMP------------PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ--SPPPT 116
           P   P             P   + +  P +A++ + S+S    +      + Q  +  P 
Sbjct: 698 PAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQ 757

Query: 117 SHISKIPSPSSSSPSSSSQLMQQ 139
                    S+++ + +S  +Q 
Sbjct: 758 VQAEAQSPASTTALTQTSSEVQD 780


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 34.0 bits (78), Expect = 0.30
 Identities = 18/62 (29%), Positives = 24/62 (38%), Gaps = 11/62 (17%)

Query: 73  QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
           Q PPP+P+    PPP           PH      P   Q  P     S  P+P+S  P+ 
Sbjct: 143 QPPPPAPVVMMQPPP-----------PHAMPPASPPAAQPAPSAPASSPPPTPASPPPAK 191

Query: 133 SS 134
           + 
Sbjct: 192 AP 193



 Score = 33.7 bits (77), Expect = 0.39
 Identities = 19/65 (29%), Positives = 22/65 (33%), Gaps = 25/65 (38%)

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS 129
            PH MPP SP A  P P + +                     SPPPT       SP  + 
Sbjct: 157 PPHAMPPASPPAAQPAPSAPA--------------------SSPPPTP-----ASPPPAK 191

Query: 130 PSSSS 134
              SS
Sbjct: 192 APKSS 196


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 34.1 bits (78), Expect = 0.35
 Identities = 23/118 (19%), Positives = 38/118 (32%), Gaps = 12/118 (10%)

Query: 25  PPPPSSSRGSEPPSPNTGGVS------GGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
              P+ +  +  P+P              F S +P+QP          S+  P    P  
Sbjct: 477 AAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDD 536

Query: 79  PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
              T  P P+A+       +P     T P +   PP  S            P+ +++L
Sbjct: 537 AFETLAPAPAAA------PAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARL 588



 Score = 30.6 bits (69), Expect = 4.6
 Identities = 17/115 (14%), Positives = 39/115 (33%), Gaps = 15/115 (13%)

Query: 27  PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPP 86
           P  S  G+ P +     V+    + +    +         +  +P   P  +P A +   
Sbjct: 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAP-------APAAPPAAPAAAPAAAAAAR 417

Query: 87  PSASDKHDSQSSPHIT------QHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQ 135
             A+    ++ SP         Q +  G   +P P    +  P+ ++   ++  +
Sbjct: 418 AVAAA--PARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPR 470


>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is about 100
           amino acids in length. This domain is found associated
           with pfam00096. This domain has two conserved sequence
           motifs: YPSPV and PSP.
          Length = 100

 Score = 31.7 bits (71), Expect = 0.36
 Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 16/112 (14%)

Query: 22  VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
            D   P S+       SPN    S   +S+ PS  +  +     TS +SP     PSP+ 
Sbjct: 1   ADKAAPVST------ASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVH 54

Query: 82  TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
           TS P PS +           T  +    FQ+   TS  S + + S SSP ++
Sbjct: 55  TSFPSPSIA----------TTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 34.0 bits (78), Expect = 0.38
 Identities = 22/128 (17%), Positives = 30/128 (23%), Gaps = 11/128 (8%)

Query: 21  EVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL----------TS 70
                 P   +  S   SP  G     F +      +P       TS           T 
Sbjct: 3   GKPRGQPGRWNYDSSDESP-EGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG 61

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
                  S I T P PP   ++   +          P      PP            +  
Sbjct: 62  SGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADS 121

Query: 131 SSSSQLMQ 138
           +S   L Q
Sbjct: 122 TSPRFLYQ 129



 Score = 30.5 bits (69), Expect = 4.4
 Identities = 14/69 (20%), Positives = 21/69 (30%), Gaps = 4/69 (5%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
           TP PP   RG+  P       S  F+     Q +       +    S  +        + 
Sbjct: 102 TPVPPGGFRGASSPRLGADSTSPRFLY----QVNFPVILAPIGESNSSSEELSEEEEHSR 157

Query: 84  PPPPSASDK 92
           PPP  +   
Sbjct: 158 PPPSESLKV 166


>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein.  This small,
           uncommon, poorly conserved protein is found primarily in
           the Firmicutes. It features are pair of CxxC motifs
           separated by about 20 amino acids, followed by a highly
           hydrophobic region of about 45 amino acids. It has no
           conserved gene neighborhood, and its function is
           unknown.
          Length = 94

 Score = 31.2 bits (71), Expect = 0.44
 Identities = 12/50 (24%), Positives = 20/50 (40%), Gaps = 7/50 (14%)

Query: 466 HVCQVCGKKYTRKEHLANHMRSHTNDTPFKCEICTKSFTRKEHFTNHIMW 515
            +C+ C +K++ KE L +         P KC  C      K++ T     
Sbjct: 1   PICKNCNEKFSYKELLKSLFSL---YRPIKCPNCGT----KQYLTAKSRK 43



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 10/42 (23%), Positives = 17/42 (40%), Gaps = 3/42 (7%)

Query: 438 HRCEICKKAFTRKEHYVNHVMWHTGETPHVCQVCGKKYTRKE 479
             C+ C + F+ KE   +  ++ +   P  C  CG K     
Sbjct: 1   PICKNCNEKFSYKELLKS--LF-SLYRPIKCPNCGTKQYLTA 39



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 10/45 (22%), Positives = 17/45 (37%), Gaps = 8/45 (17%)

Query: 411 KCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYVN 455
            C+ C + FS KE + +         P +C  C       + Y+ 
Sbjct: 2   ICKNCNEKFSYKELLKSLFSL---YRPIKCPNCG-----TKQYLT 38


>gnl|CDD|223965 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75
           kD subunit (chain G) [Energy production and conversion].
          Length = 693

 Score = 33.9 bits (78), Expect = 0.45
 Identities = 29/145 (20%), Positives = 43/145 (29%), Gaps = 42/145 (28%)

Query: 705 LTNHLKQHAGETEHVCNIC-------------------SKPFLKKEHLVNHS----RSHT 741
           L NH           C +C                   S+    K    +        + 
Sbjct: 91  LINH--------PLDCPVCDKGGECELQDLAVKYGVSHSRYRETKRTHRDKDLGPLVKYD 142

Query: 742 GERPFACTDC---GKSFPLKGNLLFHQRSHRKQNGEVLERPFK------C-DVCPKD-FT 790
             R   CT C    K       L   +R    + G  L++P +      C DVCP    T
Sbjct: 143 MNRCILCTRCVRFCKEIAGTHELGVIKRGENSEIGTYLDQPLESELCGNCVDVCPVGALT 202

Query: 791 CKGHLVSHLRSHLNERPHVCPHCSK 815
            K    +  +  L + P +C HC+ 
Sbjct: 203 SKPFAFTARKWELKKTPSICVHCAV 227


>gnl|CDD|221463 pfam12211, LMWSLP_N, Low molecular weight S layer protein N
           terminal.  This family of proteins is found in bacteria.
           Proteins in this family are typically between 328 and
           381 amino acids in length. There is a conserved LGDG
           sequence motif. Clostridial species have a layer of
           surface proteins surrounding their membrane. This layer
           is comprised of a high molecular weight protein and a
           low molecular weight protein. This domain is the N
           terminal domain of the low molecular weight protein. It
           is a structural domain.
          Length = 253

 Score = 33.2 bits (76), Expect = 0.45
 Identities = 11/53 (20%), Positives = 20/53 (37%)

Query: 239 TSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITLDIKADLD 291
           T ST G+     +      AAD  +  ++       T+++   + L I A   
Sbjct: 112 TGSTAGIVKAKDSSSTKTVAADAPLKVSDVFTFAVVTDSDKVGLKLTITAKAS 164


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 33.2 bits (76), Expect = 0.60
 Identities = 23/146 (15%), Positives = 43/146 (29%), Gaps = 18/146 (12%)

Query: 1   MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPS----QP 56
            P+   L  FP          +D P P + S  S   +     ++   +  S +      
Sbjct: 255 TPLDISLFNFPLIPLLPPE-ALDLPAPTAVSSSSSTFA--VPALTSSSVDQSATPLDQGF 311

Query: 57  S-PQHGHHT----LTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGP--GI 109
           S      ++     T+ +  + +P  S    S           + +S      +      
Sbjct: 312 SNFGSNMYSEPLNPTNDSLLYGLPSSS----SLYANRTIFPAWASTSVSPLDFSTLFNQP 367

Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQ 135
             SP  +  I     P+S SP +   
Sbjct: 368 SPSPMASQSILAPAQPTSPSPVALPS 393



 Score = 30.2 bits (68), Expect = 6.3
 Identities = 22/121 (18%), Positives = 36/121 (29%), Gaps = 1/121 (0%)

Query: 8   SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS 67
           S   +S  +         P P++S   E    +   +    +S   +             
Sbjct: 129 SDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRR 188

Query: 68  LTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
           L+   Q+P P    TS  P + SD     ++          +  SP        IP P S
Sbjct: 189 LSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGD-IPPPVS 247

Query: 128 S 128
            
Sbjct: 248 V 248


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 33.2 bits (76), Expect = 0.62
 Identities = 18/102 (17%), Positives = 28/102 (27%), Gaps = 23/102 (22%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
            P P          +P            SP +P+P     T     +   +PP  P+  +
Sbjct: 361 VPVPAPQPAKPTAAAP------------SPVRPTP--APSTRPKAAAAANIPPKEPVRET 406

Query: 84  PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
             PP    +  +   PH              P    + IP  
Sbjct: 407 ATPPPVPPRPVAPPVPHT---------PESAPKLTRAAIPVD 439



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 17/100 (17%), Positives = 25/100 (25%), Gaps = 13/100 (13%)

Query: 16  TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
            +      T   PS  R +  PS      +   I                          
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETA-----------TPPPV 412

Query: 76  PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
           PP P+A   P    S    ++++  I     P      PP
Sbjct: 413 PPRPVAPPVPHTPESAPKLTRAA--IPVDEKPKYTPPAPP 450



 Score = 30.5 bits (69), Expect = 4.7
 Identities = 18/82 (21%), Positives = 22/82 (26%), Gaps = 9/82 (10%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           PA      +      PP    R +  P P            +P  P        LT    
Sbjct: 383 PAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP------VAPPVPHTPESAPKLTRAAI 436

Query: 71  PHQMPPPSPIATSPPPPSASDK 92
           P       P  T P PP   +K
Sbjct: 437 P---VDEKPKYTPPAPPKEEEK 455


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 33.4 bits (76), Expect = 0.63
 Identities = 32/132 (24%), Positives = 50/132 (37%), Gaps = 15/132 (11%)

Query: 6   ELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTL 65
           +L +   S K    +E    P P+ +  + P S             +P+ PS +    T 
Sbjct: 62  DLEKMVLSIKEKKDLEPLALPMPNLAPITPPSS-----------EPTPAPPSSES---TA 107

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKI-PS 124
           T    P+Q    S  +TS      +D   S +    +  T      + P TS +    PS
Sbjct: 108 TRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPS 167

Query: 125 PSSSSPSSSSQL 136
             SSS  S++QL
Sbjct: 168 HISSSYRSTAQL 179


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 33.2 bits (76), Expect = 0.72
 Identities = 41/251 (16%), Positives = 73/251 (29%), Gaps = 36/251 (14%)

Query: 54  SQPSPQHGHHTLTSL--TSPHQMPP-------PSPIATSPPPPSASDKHDSQSSPHITQH 104
           S+   Q   H+L S    +  +MP        P  I  +PP          QSS   T  
Sbjct: 32  SESPTQRASHSLASEGKKNRKKMPKVFQKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLR 91

Query: 105 TGPGIFQSPPPTSHI--SKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSP 162
               + +    T     S      + S   +   M+   G                V+ P
Sbjct: 92  ELLALPEDDGETQAPESSPSARRLTRSEGVARHEMEDLAG--------------RPVVKP 137

Query: 163 SYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGD-VNPDEFNQL 221
              +   Q    + S+ + PP  +++        ++   + L K +     V+    +Q 
Sbjct: 138 DADRQLRQDILNKSSSSRRPPVSKEEGTSSKMPATALASAALFKDDEIRQEVDAARSDQA 197

Query: 222 LAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTS 281
                          +  +        +  Q  +  +A     G +  +  N    +   
Sbjct: 198 SQ--------SRLSRSRGNPPAIPPDAAPRQPMLTRSAGGRFEGEDENLERNLQPQSP-- 247

Query: 282 ITLDIKADLDL 292
           ITLD K  LD 
Sbjct: 248 ITLDKKGKLDF 258


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 32.7 bits (74), Expect = 0.73
 Identities = 30/129 (23%), Positives = 39/129 (30%), Gaps = 16/129 (12%)

Query: 24  TPPPPSSSR-------GSEPPSPNTGGVSGGFISTSPSQPSP------QHGHHTLTSLTS 70
           +P  PSSS        G              F S SP   SP      QH    +T  + 
Sbjct: 86  SPSSPSSSSANWHEVEGGMSSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSG 145

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS--- 127
               P P+P     P      +  S   PH      P +    P    +  +  PSS   
Sbjct: 146 ISASPHPTPSPLHFPTSPILPQQPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQA 205

Query: 128 SSPSSSSQL 136
             P+ S Q 
Sbjct: 206 GQPNGSGQG 214



 Score = 31.2 bits (70), Expect = 2.1
 Identities = 19/73 (26%), Positives = 26/73 (35%), Gaps = 4/73 (5%)

Query: 27  PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPP 86
            PSS +  +P     G V   F+ T    P P        +   P  MP   P  TS   
Sbjct: 199 DPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPP----MARPVPLPMPDTKPPTTSTEG 254

Query: 87  PSASDKHDSQSSP 99
            + S    + S+P
Sbjct: 255 GATSPTSPTYSTP 267


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 32.8 bits (75), Expect = 0.89
 Identities = 18/71 (25%), Positives = 24/71 (33%), Gaps = 1/71 (1%)

Query: 29  SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
           SS++ S           G    TS    S  H     +S  S H + P  P +   PPP 
Sbjct: 136 SSAQRSSAQHLFYIIPGGPSWRTSIKPFSSSHYGSHNSSAYSDH-LRPLQPYSGELPPPL 194

Query: 89  ASDKHDSQSSP 99
             +   S S  
Sbjct: 195 GPNWQASNSKK 205


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 32.7 bits (75), Expect = 0.89
 Identities = 22/125 (17%), Positives = 38/125 (30%), Gaps = 34/125 (27%)

Query: 17  STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
            T      P P          +  +   +G  +++S    +P     TL           
Sbjct: 205 GTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPT----TL----------A 250

Query: 77  PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           P  +  S P P+A                      S P T+  +  P  +++S S+S   
Sbjct: 251 PGVLEGSEPTPTAP--------------------SSAPATAPAAAAPQAAATSSSASGNF 290

Query: 137 MQQGG 141
           + Q G
Sbjct: 291 VVQVG 295


>gnl|CDD|132765 cd07303, Porin3, Eukaryotic porin family that forms channels in the
           mitochondrial outer membrane.  The porin family 3
           contains two sub-families that play vital roles in the
           mitochondrial outer membrane, a translocase for unfolded
           pre-proteins (Tom40) and the voltage-dependent anion
           channel (VDAC) that regulates the flux of mostly anionic
           metabolites through the outer mitochondrial membrane.
          Length = 274

 Score = 32.2 bits (73), Expect = 0.93
 Identities = 24/127 (18%), Positives = 41/127 (32%), Gaps = 9/127 (7%)

Query: 235 VVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENN----TTSITLDIKADL 290
             T   +T           D  S      S  +S  + N+ + N    T    +++  D+
Sbjct: 66  TFTEKWNTDNTLGLEITVEDQLSRGLK--STFDSSFSPNTGKKNAKIKTGYKRINLGCDV 123

Query: 291 DLPVMG---NHTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSHGKC 347
           D  + G        L  + +   YQM+        +  F         +FQA    +   
Sbjct: 124 DFDIAGPLIRGALVLGYEGWLAGYQMVFETVSRVTQSNFAVGYKTDYNEFQAHTNVNDGT 183

Query: 348 HLGGSMY 354
             GGS+Y
Sbjct: 184 EFGGSIY 190


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 32.6 bits (74), Expect = 0.94
 Identities = 34/206 (16%), Positives = 49/206 (23%), Gaps = 29/206 (14%)

Query: 6   ELSQFPASHKTSTSMEVDT------PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQ 59
           +LS   A    + S  VD+      P   +    ++ P            +    QP  Q
Sbjct: 28  QLSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQ 87

Query: 60  HG-----HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP 114
                   H   S   P Q+       T    P        Q  P           Q  P
Sbjct: 88  QQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQA----QQPQP 143

Query: 115 PTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDT 174
           P         P      S  Q  Q         Q+   +    + L P    +  Q    
Sbjct: 144 PPQV------PQQQQYQSPPQQPQYQQNPPPQAQSAPQV----SGLYPEESPYQPQ---- 189

Query: 175 QHSTVQSPPGQQQQYQQMDTSQSSQQ 200
            +   +  P           +  SQQ
Sbjct: 190 SYPPNEPLPSSMAMQPPYSGAPPSQQ 215



 Score = 29.9 bits (67), Expect = 6.4
 Identities = 26/153 (16%), Positives = 38/153 (24%), Gaps = 26/153 (16%)

Query: 49  ISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPG 108
           +S   +Q S  H  H  ++   P             P    +      + P         
Sbjct: 29  LSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVN-----AALPPAPA----- 78

Query: 109 IFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQHC 168
             QSP P           S +P S     Q              +Q++    +P    + 
Sbjct: 79  -PQSPQPD------QQQQSQAPPSHQYPSQL---------PPQQVQSVPQQPTPQQEPYY 122

Query: 169 IQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQ 201
                 Q    Q P  QQ Q       Q   Q 
Sbjct: 123 PPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQS 155



 Score = 29.9 bits (67), Expect = 6.5
 Identities = 30/170 (17%), Positives = 36/170 (21%), Gaps = 27/170 (15%)

Query: 35  EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPP--PPSASDK 92
           E    +         +  P  P     H            P     A  PP   P +   
Sbjct: 32  EEAQSSEAHSFHVDSTKQPPAPEQVAKHEL-------ADAPLQQVNAALPPAPAPQSPQP 84

Query: 93  HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLH 152
              Q S     H  P    S  P   +  +P   +           Q             
Sbjct: 85  DQQQQSQAPPSHQYP----SQLPPQQVQSVPQQPTPQQEPYYPPPSQ--------PQPPP 132

Query: 153 IQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQS 202
            Q   A       Q        Q    QSPP Q Q  Q       S  Q 
Sbjct: 133 AQQPQAQQPQPPPQV------PQQQQYQSPPQQPQYQQNPPPQAQSAPQV 176


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 31.2 bits (71), Expect = 0.95
 Identities = 12/70 (17%), Positives = 19/70 (27%), Gaps = 7/70 (10%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
           +  T      PP  + + G  PP P          + + S P P+               
Sbjct: 51  EDVTVQTTPPPPASAITNGGPPPPPPAR-------AEAASPPPPEAPAEPPAEPEPEAPA 103

Query: 75  PPPSPIATSP 84
                +A  P
Sbjct: 104 ENTVTVAKDP 113


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 32.4 bits (74), Expect = 0.97
 Identities = 25/107 (23%), Positives = 32/107 (29%), Gaps = 9/107 (8%)

Query: 14  HKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSL---TS 70
                ++ VD P     S  S P       V       +P QP P     + TS    TS
Sbjct: 176 DDYGANLAVDEP---EQSTMSRP-QEVKQSVPAQQAPPNPQQPMPSASSESATSKSASTS 231

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
               P P        P    D    ++   IT      +FQ  P  S
Sbjct: 232 RESSPQPQSPPPRRVPAPTVDSTAKEAE-QITYQAK-RLFQEYPERS 276



 Score = 30.1 bits (68), Expect = 5.9
 Identities = 13/67 (19%), Positives = 24/67 (35%), Gaps = 7/67 (10%)

Query: 73  QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
            +P          P  ++    + S    T        QSPPP     ++P+P+  S   
Sbjct: 201 SVPAQQAPPNPQQPMPSASSESATSKSASTSRESSPQPQSPPP----RRVPAPTVDST-- 254

Query: 133 SSQLMQQ 139
            ++  +Q
Sbjct: 255 -AKEAEQ 260


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 32.9 bits (76), Expect = 0.98
 Identities = 13/99 (13%), Positives = 24/99 (24%), Gaps = 4/99 (4%)

Query: 35  EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
           EP  P            + +  +             P   P  +P    P   S      
Sbjct: 367 EPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ----L 422

Query: 95  SQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
             +   + +  G    +   P +     P  S+    +S
Sbjct: 423 LAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLAS 461



 Score = 29.8 bits (68), Expect = 6.7
 Identities = 13/81 (16%), Positives = 20/81 (24%), Gaps = 6/81 (7%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSP-SQPSPQHGHHTLTSLT 69
            ++  T+       PP   +       +P          +TS       Q     L    
Sbjct: 379 ASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ-----LQRAQ 433

Query: 70  SPHQMPPPSPIATSPPPPSAS 90
              +     P A S   P  S
Sbjct: 434 GATKAKKSEPAAASRARPVNS 454


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 32.8 bits (74), Expect = 1.0
 Identities = 25/119 (21%), Positives = 38/119 (31%), Gaps = 6/119 (5%)

Query: 16  TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
           TS + +   P P      S  P P     S        +Q              +PHQ  
Sbjct: 33  TSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAA 92

Query: 76  PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--PPTSHISKIPSPSSSSPSS 132
             +P     P  +A+ K D          +     ++P    TS  SK P P++ +  S
Sbjct: 93  SRAPDPAVAPQLAAAPKPD----AAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHS 147



 Score = 30.1 bits (67), Expect = 7.1
 Identities = 19/121 (15%), Positives = 29/121 (23%), Gaps = 6/121 (4%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           PA      +     P    +S  S  P               P+    Q          +
Sbjct: 42  PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVA 101

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
           P     P P A      +A        +            + P P +H    P P + + 
Sbjct: 102 PQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAAS------KKPDPAAHTQHSPPPFAYTR 155

Query: 131 S 131
           S
Sbjct: 156 S 156


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 32.5 bits (74), Expect = 1.1
 Identities = 28/102 (27%), Positives = 39/102 (38%), Gaps = 8/102 (7%)

Query: 35  EPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHD 94
           EP SP+    S    S S  QP   +G   L S ++  + P       +PP    S  + 
Sbjct: 90  EPQSPSKELNS----SCSQKQPPYPYGEKCLYSYSAYDRKPASGFKPPTPPSTPCSPVNP 145

Query: 95  SQSSPHITQHTGPGIFQSPPPTS---HISKIPSPSSSSPSSS 133
            ++   + Q +GP    SPP             P  SSP SS
Sbjct: 146 QETVRQL-QPSGPLSNSSPPSPHTPLPNQSPLPPPMSSPDSS 186


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 76  PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISKIPSPSSSSPSSSS 134
            P     SPP P+ S    S  SP     T  G+  +P  P +    +P P+SS+P   +
Sbjct: 247 HPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPRASRGHLPPPTSSAPPRPN 306

Query: 135 QLMQ 138
             ++
Sbjct: 307 GGLR 310


>gnl|CDD|237887 PRK15046, PRK15046, 2-aminoethylphosphonate ABC transporter
           substrate-binding protein; Provisional.
          Length = 349

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 15/53 (28%), Positives = 19/53 (35%), Gaps = 13/53 (24%)

Query: 222 LAYHEVFGKLQSSVVTHTSSTLGLAST------STAQGD--MN--SAADNGVS 264
             Y     KLQ++ V  + ST  L           A GD  MN   A   G +
Sbjct: 192 FDY---LAKLQANNVGPSKSTGKLTPLVSKGEIYVANGDLQMNLAQAEHGGPN 241


>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
           protein. 
          Length = 226

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 16/111 (14%)

Query: 29  SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
           + +R   P S  T   SGGF S+    P       +  S  S       S  + S    S
Sbjct: 101 NGTRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSS-SYSSSSAS 159

Query: 89  A--SD---KHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
           +  S      +S   P +     PG  Q    T+ ++  PS ++SS  + S
Sbjct: 160 SRTSSWVRSQNSNLEPFM-----PGALQ----TAWVTP-PSSTASSSGTVS 200


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 16/76 (21%), Positives = 18/76 (23%), Gaps = 4/76 (5%)

Query: 12  ASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSP 71
           A      S E    PPP +    EP        +       P       G       T P
Sbjct: 34  AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPG----VYPTPP 89

Query: 72  HQMPPPSPIATSPPPP 87
                       PPPP
Sbjct: 90  PPNSGYMADPQEPPPP 105



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 13/78 (16%), Positives = 20/78 (25%), Gaps = 13/78 (16%)

Query: 6   ELSQFPASHKTSTSMEVDTPPPPSSSR----GSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
            +S+    +    +     P P            PP P      G + +  P        
Sbjct: 39  PVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMAD 98

Query: 62  HHTLTSLTSPHQMPPPSP 79
                    P + PPP P
Sbjct: 99  ---------PQEPPPPYP 107



 Score = 28.5 bits (64), Expect = 6.2
 Identities = 9/45 (20%), Positives = 13/45 (28%), Gaps = 6/45 (13%)

Query: 73  QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
               P P       P           P       PG++ +PPP +
Sbjct: 54  VHLEPLPAYGQYAAPPPY------GPPPPYYPAPPGVYPTPPPPN 92


>gnl|CDD|238156 cd00253, PL_Passenger_AT, Pertactin-like passenger domains
           (virulence factors) of autotransporter proteins of the
           type V secretion system. Autotransporters are proteins
           used by Gram-negative bacteria to transport proteins
           across their outer membranes. The C-terminal (beta)
           domain of autotransporters forms a pore in the outer
           membrane through which the N-terminal passenger domain
           is transported. Following transport, the passenger
           domain is generally cleaved by an outer membrane
           protease with the passenger domain either remaining in
           contact with the surface via a noncovalent interaction
           with the beta domain or cleaved to release a soluble
           protein. These proteins are highly diverse and perform a
           variety of functions that promote virulence, including
           catalyzing proteolysis, serving as an adhesin, mediating
           actin-promoted motility, or serving as a cytotoxin.
           Proteins in this family share similarity in the
           C-terminal region of the passenger domain as seen in the
           pertactin structure P.69, a Bordetella pertussis
           agglutinogen responsible for human pertussis. The  P.69
           protein consists of a 16-stranded parallel beta-helix
           with a V-shaped cross-section, and is one of the largest
           beta-helix known to date.
          Length = 186

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 11/58 (18%), Positives = 23/58 (39%), Gaps = 2/58 (3%)

Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTENNTTSITL 284
             G   +  +T  S  LG          +NS +   ++G ++V  N +  N+  ++  
Sbjct: 16  TLGGGGALTLTGNSVWLGTLQAGGGTVSLNSNSLWTLTGDSTV--NGNLTNSGGTVDF 71


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.6 bits (74), Expect = 1.1
 Identities = 26/159 (16%), Positives = 45/159 (28%), Gaps = 23/159 (14%)

Query: 27  PPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH-QMPPPSPIATSPP 85
           P   +             S   ++++      Q+     +   S + Q P          
Sbjct: 202 PKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKD 261

Query: 86  PPSASDK--HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSS-------------- 129
              + +   H    + H ++H    + Q+     + S    PSSSS              
Sbjct: 262 HHHSHNHQHHSIGINNHHSKHADSKL-QTIEVIENHSNKSRPSSSSTNGSKETTSNSSSA 320

Query: 130 -----PSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPS 163
                 S SS+  +      SN     H    G+V S S
Sbjct: 321 AAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSS 359


>gnl|CDD|218693 pfam05687, DUF822, Plant protein of unknown function (DUF822).
           This family consists of the N terminal regions of
           several plant proteins of unknown function.
          Length = 151

 Score = 30.9 bits (70), Expect = 1.3
 Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 4/60 (6%)

Query: 32  RGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASD 91
           +G +P     G  S    S + S  S        ++  SP      SP ++S P PS+ D
Sbjct: 71  KGCKPLERMEGAGS----SATASPCSSYQLSPVSSAFPSPVPSYSASPASSSFPSPSSLD 126


>gnl|CDD|216908 pfam02159, Oest_recep, Oestrogen receptor. 
          Length = 139

 Score = 31.1 bits (70), Expect = 1.3
 Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 13/120 (10%)

Query: 31  SRGSEPPSPNTGGVSGGFISTSPS--QPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
              +E  +P     S G+         P    G  +L S+       PPSP+   P  P 
Sbjct: 22  GAPTETSAPVYSSASLGYYPAPEDVHGPPSLGGLQSLGSV-------PPSPLMFLPSSPQ 74

Query: 89  ASD--KHDSQSSPH--ITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGG 144
            S       Q  P+    + +   + ++     + S + +   +     S   ++G GG 
Sbjct: 75  LSPFLHPHGQQVPYYLENEPSPAAVRETSSTAFYRSSVDNRRQAGRERCSAAEERGSGGM 134


>gnl|CDD|240610 cd12813, LbR-like, Left-handed beta-roll, including virulence
           factors and various other proteins.  This family
           contains a variety of protein domains with a left-handed
           beta-roll structure including cell surface adhesion
           proteins, bacterial virulence factors, and ice-binding
           proteins, and other activities. UspA1 Head And Neck
           Domain and YadA of Yersinia are part of a class of
           pathogenicity factors that act as cell surface adhesion
           molecules, in which N-terminal head and neck domains
           extend from the bacterial outer membrane. The UspA1 head
           domain of Moraxella catarrhalis, is formed from trimeric
           beta-rolls of 14-16 amino acid repeats. The UspA1 head
           domain connects to a neck region of large extended,
           charged loops that maybe be ligand binding, which is in
           turn connected to an extended coiled coil domain that
           tethers the head and neck region to the cell surface via
           a transmembrane region. The collagen-binding domain
           virulence factor YadA an adhesion proteins of several
           Yersinia species, and related cell surface proteins. The
           collagen-binding portion is found in the hydrophobic
           N-terminal region. YadA forms a matrix on the bacterial
           outer membrane, which mediates binding to collagen and
           epithelial cells. YadA inhibits the
           complement-activating pathway with the coating of the
           cell surface with factor H, which impedes C3b molecules.
           The ice-binding protein of the grass Lolium perenne
           (LpIBP) discourages the recrystallization of ice.
           Ice-binding proteins produced by organisms to prevent
           the growing of ice are termed to anti-freeze proteins.
           LpIBP consists of an unusual left-handed beta roll.
           Ice-binding is mediated by a flat beta-sheet on one side
           of the helix. These domains form a left handed beta roll
           made up of a series of short repeated elements. .
          Length = 99

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 16/59 (27%), Positives = 28/59 (47%), Gaps = 1/59 (1%)

Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSV-ITNNSTENNTTSITL 284
           V G   +S     S+ +G A+T+T    + S  +  V+  N+V   NN+  + + S  L
Sbjct: 26  VIGGDNNSASGSNSTAVGGANTATGSNAVASGTNAIVTDDNAVASGNNNLASGSNSTAL 84


>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2. 
          Length = 466

 Score = 31.9 bits (73), Expect = 1.5
 Identities = 27/114 (23%), Positives = 41/114 (35%), Gaps = 7/114 (6%)

Query: 17  STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
           S+ +E   P PP    GS  P   +   +   +  +P+  +    H + T   +P    P
Sbjct: 108 SSIIESGAPIPPIPGDGSGFPITTSSTTTPAILDVTPTTRT---VHVSRTQYNNPLFTDP 164

Query: 77  PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
                  PP P+    H   S   I  H+   I       S  +  P P SS+P
Sbjct: 165 SV---LQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTT-PPPISSTP 214


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 32.3 bits (73), Expect = 1.6
 Identities = 34/123 (27%), Positives = 46/123 (37%), Gaps = 35/123 (28%)

Query: 28  PSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPP 87
           PS S+G+ P     GG S    ++S S             L S  Q PPPS    S    
Sbjct: 145 PSRSQGTRP----AGGTSAVAATSSSSH------------LVSQ-QTPPPSLRGVSSGYR 187

Query: 88  SASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNG 147
           S+ + +  +           G+F    P        SP   SPS SS+L+   G  GS  
Sbjct: 188 SSVEYNACRMK---------GVFV---PA------SSPYDVSPSQSSELVVVMGDRGSQN 229

Query: 148 QTH 150
           + H
Sbjct: 230 ENH 232


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 28.0 bits (62), Expect = 1.6
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 410 YKCQYCGKNFSRKEHMVNHVRKH 432
           +KC  CGK+FS K+ +  H+RKH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 27.6 bits (61), Expect = 1.8
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 606 YQCQYCPKAFSRKDHLVNHVRQH 628
           ++C  C K+FS KD L  H+R+H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 27.6 bits (61), Expect = 1.9
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 522 HRCDFCSKTFTRKEHLLNHVRQH 544
            +C  C K+F+ K+ L  H+R+H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 27.6 bits (61), Expect = 2.1
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 662 HKCHFCSKSFTRKEHLNNHIRLH 684
            KC  C KSF+ K+ L  H+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.4 bits (58), Expect = 5.2
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 466 HVCQVCGKKYTRKEHLANHMRSH 488
             C +CGK ++ K+ L  H+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.1 bits (57), Expect = 6.1
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 634 HKCSYCKQSFTRKDHLNNHVRLH 656
            KC  C +SF+ KD L  H+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.1 bits (57), Expect = 7.0
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
           FKC +C K F+ K  L  HLR H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.1 bits (57), Expect = 7.1
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 550 HKCNYCAKSFTRKEHLVNHVRQH 572
            KC  C KSF+ K+ L  H+R+H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.1 bits (57), Expect = 7.1
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 578 HKCNYCAKSFTRKEHLVNHVRQH 600
            KC  C KSF+ K+ L  H+R+H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.7 bits (56), Expect = 9.7
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 494 FKCEICTKSFTRKEHFTNHIMWH 516
           FKC +C KSF+ K+    H+  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 25.7 bits (56), Expect = 10.0
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 690 HKCDFCLKTFTRKEHLTNHLKQH 712
            KC  C K+F+ K+ L  HL++H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 20/76 (26%), Positives = 25/76 (32%), Gaps = 20/76 (26%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
           TPP P      EP +P T       + T P   +P                 PP+P A  
Sbjct: 83  TPPAPP-----EPVTPPTAQSPAPAVPTPPPTSTP---------------AVPPAPAAAV 122

Query: 84  PPPPSASDKHDSQSSP 99
           P P +A    D    P
Sbjct: 123 PAPAAAPPPSDPPQPP 138


>gnl|CDD|188717 cd09331, LIM1_PINCH, The first LIM domain of protein PINCH.  The
           first LIM domain of paxillin: Paxillin is an adaptor
           protein, which recruits key components of the
           signal-transduction machinery to specific sub-cellular
           locations to respond to environmental changes rapidly.
           The C-terminal region of paxillin contains four LIM
           domains which target paxillin to focal adhesions,
           presumably through a direct association with the
           cytoplasmic tail of beta-integrin. The N-terminal of
           paxillin is leucine-rich LD-motifs. Paxillin is found at
           the interface between the plasma membrane and the actin
           cytoskeleton. The binding partners of paxillin are
           diverse and include protein tyrosine kinases, such as
           Src and FAK, structural proteins, such as vinculin and
           actopaxin, and regulators of actin organization.
           Paxillin recruits these proteins to their function sites
           to control the dynamic changes in cell adhesion,
           cytoskeletal reorganization and gene expression. LIM
           domains are 50-60 amino acids in size and share two
           characteristic zinc finger motifs. The two zinc fingers
           contain eight conserved residues, mostly cysteines and
           histidines, which coordinately bond to two zinc atoms.
           LIM domains function as adaptors or scaffolds to support
           the assembly of multimeric protein complexes.
          Length = 59

 Score = 28.8 bits (65), Expect = 1.7
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 720 CNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRK 770
           C  C + F   E +VN +     E+ F C  C + FP   + LF++   RK
Sbjct: 1   CERCREGFEPDEKIVNSNGELYHEQCFVCAQCFQPFP---DGLFYEFEGRK 48


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 15/95 (15%), Positives = 26/95 (27%), Gaps = 14/95 (14%)

Query: 24  TPPPPSSSR-GSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
           T PP   ++   +        V    ++  P + +P            P     P P+  
Sbjct: 68  TQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGV---------PVAAQTPKPVKP 118

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
              PP+ +        P       P    +  PT 
Sbjct: 119 PKQPPAGAVPAKPTPKPEPK----PVAEPAAAPTG 149


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 31.4 bits (70), Expect = 1.8
 Identities = 22/119 (18%), Positives = 39/119 (32%), Gaps = 3/119 (2%)

Query: 24  TPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATS 83
              P + +     P+P          +  P    P           S  Q PP  P+ T 
Sbjct: 97  ACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTP 156

Query: 84  PPPPSA--SDKHDSQSSPHITQHTGPGIFQSPPPTSHI-SKIPSPSSSSPSSSSQLMQQ 139
            P P+A     H+    P     + P I  +P  +  +  +IP     + +    L+++
Sbjct: 157 KPAPAAKPIFLHNQLPPPDYPAASCPTIETAPAASPVLEPRIPDKIIDADNDDKDLIKK 215


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 32.0 bits (72), Expect = 1.8
 Identities = 28/141 (19%), Positives = 41/141 (29%), Gaps = 4/141 (2%)

Query: 1   MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEP-PSPNTGGVSGGFISTSPSQPSPQ 59
            P+Q +  Q  A  +     +    P P   +  +P             ++  P    PQ
Sbjct: 750 EPVQ-QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 808

Query: 60  HGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP--PPTS 117
                      P Q   P P    P  P A    D+   P + ++        P  P  S
Sbjct: 809 QPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868

Query: 118 HISKIPSPSSSSPSSSSQLMQ 138
                P PS   P  +  L Q
Sbjct: 869 LDLLTPPPSEVEPVDTFALEQ 889



 Score = 30.8 bits (69), Expect = 4.7
 Identities = 24/114 (21%), Positives = 29/114 (25%), Gaps = 7/114 (6%)

Query: 94  DSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHI 153
           D    P  T    P      P              +P    Q  QQ            + 
Sbjct: 738 DGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQ-----PVAPQPQYQ 792

Query: 154 QNIGAVLSPSYGQHCIQ--QDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLI 205
           Q    V      Q   Q      Q+   Q P   Q QYQQ     + Q Q  L+
Sbjct: 793 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLL 846


>gnl|CDD|221395 pfam12048, DUF3530, Protein of unknown function (DUF3530).  This
           family of proteins is functionally uncharacterized. This
           protein is found in bacteria. Proteins in this family
           are typically between 272 to 336 amino acids in length.
           These proteins are distantly related to alpa/beta
           hydrolases so they may act as enzymes.
          Length = 294

 Score = 31.6 bits (72), Expect = 1.9
 Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 1/43 (2%)

Query: 58  PQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPH 100
           P  G  TL SLT P       P   +             SSP 
Sbjct: 107 PDAGWATL-SLTPPDPPSAAPPNRPTEAEEVTKAGAKQLSSPS 148


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 31.8 bits (72), Expect = 2.0
 Identities = 22/73 (30%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 24  TPPPPSSSRGS-------EPPSPNTGGVSGGFISTSP--SQPSPQHGHHTLTSLTSPHQM 74
            PPPPSS  G         P + +T G     +S +P    PSP        S+ +P   
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIP 246

Query: 75  PPPSPIATSPPPP 87
           P    +A  P PP
Sbjct: 247 PVIQYVAPPPVPP 259


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 31.6 bits (71), Expect = 2.0
 Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 17/111 (15%)

Query: 29  SSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH-----HTLTSLTSPHQMPPPSPIATS 83
           S+  GS   S  T          +PS  SP  GH      T + + SP   PP S + + 
Sbjct: 571 STPTGSAASSNTTFSSDSPSTVVAPST-SPPAGHLGSPPATPSKIVSPSTSPPASHLGSP 629

Query: 84  PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
              PS+ +     +S            ++  P S I    + SS S  S S
Sbjct: 630 STTPSSPESSIKVAST-----------ETASPESSIKVASTESSVSMVSMS 669


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 31.5 bits (72), Expect = 2.1
 Identities = 20/110 (18%), Positives = 25/110 (22%), Gaps = 13/110 (11%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQP-SPQHGHHTLTSLT 69
              H+ +         PP +S     P               P     PQH         
Sbjct: 78  AQEHEAARPSPQHQYQPPYASAQPRQPVQQ-----------PPEAQVPPQHAPRPAQPAP 126

Query: 70  SPHQMPPPSPIATSPPPPSASDKHDSQSSP-HITQHTGPGIFQSPPPTSH 118
            P Q P   P    P     S +      P H         FQ   P + 
Sbjct: 127 QPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAA 176


>gnl|CDD|236170 PRK08166, PRK08166, NADH dehydrogenase subunit G; Validated.
          Length = 791

 Score = 31.9 bits (73), Expect = 2.2
 Identities = 18/56 (32%), Positives = 23/56 (41%), Gaps = 16/56 (28%)

Query: 769 RKQNGEVLERPFK---CDVCPKD-FTCKGHLVSHLRSHLNER------PHVCPHCS 814
           R ++G  LE  F     +VCP   FT K H         N +      P +C HCS
Sbjct: 183 RPEDG-TLESEFSGNLVEVCPTGVFTDKTH-----SERYNRKWDMQFAPSICQHCS 232


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
           family contains the human KIAA1549 protein which has
           been found to be fused fused to BRAF gene in many cases
           of pilocytic astrocytomas. The fusion is due mainly to a
           tandem duplication of 2 Mb at 7q34. Although nothing is
           known about the function of KIAA1549 protein, the BRAF
           protein is a well characterized oncoprotein. It is a
           serine/threonine protein kinase which is implicated in
           MAP/ERK signalling, a critical pathway for the
           regulation of cell division, differentiation and
           secretion.
          Length = 684

 Score = 31.4 bits (71), Expect = 2.2
 Identities = 27/116 (23%), Positives = 38/116 (32%), Gaps = 30/116 (25%)

Query: 23  DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
              P    S GS   S  +     G  ST+PS  + Q                       
Sbjct: 383 RKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQQKPTK--------------EEGRK 428

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQ 138
            P PPS +D+  S +S          IF+      H+ ++  PSS     SS  +Q
Sbjct: 429 KPAPPSGTDEQLSSAS----------IFE------HVDRLSRPSSDPYDRSSGKIQ 468


>gnl|CDD|150310 pfam09602, PhaP_Bmeg, Polyhydroxyalkanoic acid inclusion protein
           (PhaP_Bmeg).  This entry describes a protein found in
           polyhydroxyalkanoic acid (PHA) gene regions and
           incorporated into PHA inclusions in Bacillus cereus and
           Bacillus megaterium. The role of the protein may include
           amino acid storage.
          Length = 168

 Score = 30.5 bits (68), Expect = 2.3
 Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 4/86 (4%)

Query: 170 QQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIK--TNGTGDVNPDEFNQLL-AYHE 226
           QQ D  H   ++    + + +Q   ++    Q D ++      G+   D+  +     HE
Sbjct: 41  QQQDALHKLTKAVDELEAEDKQF-KAEFEDLQQDTVENLRKTAGNALADQIEEWQEKTHE 99

Query: 227 VFGKLQSSVVTHTSSTLGLASTSTAQ 252
               LQ      + S+L +   +  Q
Sbjct: 100 ALAHLQELFFNPSKSSLSILKQAQEQ 125


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 30.5 bits (69), Expect = 2.4
 Identities = 18/90 (20%), Positives = 29/90 (32%), Gaps = 19/90 (21%)

Query: 1   MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQH 60
           M I   L+ +    K          P  SSS+G+    P+ G  +    +++P  P    
Sbjct: 122 MRIYKALNTYELVVKAPAE---YYSPKHSSSQGTSTTRPSDGSATPN--TSAPPTPGNPA 176

Query: 61  GHHTLTSLTSPHQMPPPSPIATSPPPPSAS 90
                          P  P A +P    +S
Sbjct: 177 AQ-------------PEKP-AETPKGNGSS 192


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 31.2 bits (70), Expect = 2.4
 Identities = 19/81 (23%), Positives = 24/81 (29%), Gaps = 1/81 (1%)

Query: 36  PPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDS 95
           PP+P     + G   T  S    +H H    +     Q P   P   +P           
Sbjct: 173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQ 232

Query: 96  QSSPHITQHTGPGIFQSPPPT 116
              P   Q T     Q PP T
Sbjct: 233 PVQPA-QQPTPQNPAQQPPQT 252



 Score = 30.0 bits (67), Expect = 5.8
 Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 8/58 (13%)

Query: 145 SNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQS 202
           S+G  H H        +P   Q    Q+  Q  TVQ+P  Q QQ  Q    Q +QQ +
Sbjct: 192 SHGHRHRH--------APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPT 241


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 31.5 bits (71), Expect = 2.5
 Identities = 17/100 (17%), Positives = 34/100 (34%), Gaps = 6/100 (6%)

Query: 7   LSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
           + +FP   +   S         ++   +     +T     G I T  +Q S Q    +L 
Sbjct: 555 VVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQASAQPAGLSLP 614

Query: 67  ------SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPH 100
                 + ++P   PPP      P P +++   ++     
Sbjct: 615 APAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQ 654



 Score = 31.5 bits (71), Expect = 2.5
 Identities = 13/90 (14%), Positives = 22/90 (24%), Gaps = 6/90 (6%)

Query: 52  SPSQPSPQHGHHTLTSLTSPHQMP-----PPSPIATSPPPPSASDKHDSQSSPHITQHTG 106
             +  +    H        P +        P+    +  PP      D            
Sbjct: 365 KVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSP-YNPQS 423

Query: 107 PGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           PG    P P   +   P+     P S + +
Sbjct: 424 PGTSYGPEPVGPVPPQPTNPYVMPISMANM 453


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 31.6 bits (71), Expect = 2.5
 Identities = 35/163 (21%), Positives = 61/163 (37%), Gaps = 33/163 (20%)

Query: 37   PSPNTGGVSGGFISTSPSQP------SP--QHGHHTLT---SLTSPHQM----PPPSPIA 81
                T   +  +++++P  P      +P   +  + L+   S+T+   +      P+P  
Sbjct: 1127 FGAGTVASAFLYMNSTPQSPVLALLLAPYISYKFNALSLGFSITADAAIFSLFGIPAPQL 1186

Query: 82   TSPPPPSASDKHDSQS----SPHITQHTGPGIFQ----------SPPPTSHISKIPSPSS 127
             S   P+ S  +         P I   +G   F           +  P +  +  P PSS
Sbjct: 1187 LSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSS 1246

Query: 128  SSPSSSSQLMQQGGGGGS--NGQTHLHIQNI--GAVLSPSYGQ 166
            SS SS+  +    G      NG T +HI  +   A+ +P  GQ
Sbjct: 1247 SSSSSAQSISTSPGQIQIVLNGSTTIHINFLFFPALSTPKIGQ 1289



 Score = 30.5 bits (68), Expect = 4.9
 Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 9/74 (12%)

Query: 66   TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ---SPPPTSHISKI 122
              L       PP+    +PPP S+S    S S+  I+  T PG  Q   +   T HI+ +
Sbjct: 1224 AQLQLSAASSPPAATTPTPPPSSSS----SSSAQSIS--TSPGQIQIVLNGSTTIHINFL 1277

Query: 123  PSPSSSSPSSSSQL 136
              P+ S+P     L
Sbjct: 1278 FFPALSTPKIGQIL 1291


>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
          Length = 544

 Score = 31.4 bits (71), Expect = 2.5
 Identities = 20/79 (25%), Positives = 28/79 (35%), Gaps = 2/79 (2%)

Query: 73  QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
           Q   P PI  SPPPP +S    S S          G    P   +  +K   P      +
Sbjct: 15  QSLLPLPI--SPPPPHSSSLSSSPSPTQRFLTPSQGSRLPPRRRALCTKSQDPRWRRAMA 72

Query: 133 SSQLMQQGGGGGSNGQTHL 151
           S  ++      GS   ++L
Sbjct: 73  SLAVIPNMQDSGSPPLSYL 91


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 31.5 bits (72), Expect = 2.5
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 5/43 (11%)

Query: 163 SYGQHCIQQDDTQHSTVQSPPGQ----QQQYQQMDTSQSSQQQ 201
            +G   + Q +   S +QS P Q    +Q YQQ   +Q   +Q
Sbjct: 915 QHGNA-LAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQ 956


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 30.9 bits (70), Expect = 2.6
 Identities = 19/109 (17%), Positives = 33/109 (30%), Gaps = 4/109 (3%)

Query: 93  HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLH 152
           + S S   I+  +   I  S    ++      P   +PSSS       G      +    
Sbjct: 1   NISNSLSSISSGSSAPIGTSALRGTNTPAAAKP---APSSSEASNAGSGSSEQKAKLKGQ 57

Query: 153 IQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQ 201
                   S    +   + +D +    +  P + Q       SQ  Q+Q
Sbjct: 58  ASTAAGSASQELQKQASESND-EEVVGEEEPEKDQAEAGKQLSQKQQEQ 105


>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
           Members of this family include high molecular weight
           subunits of glutenin. This group of gluten proteins is
           thought to be largely responsible for the elastic
           properties of gluten, and hence, doughs. Indeed,
           glutenin high molecular weight subunits are classified
           as elastomeric proteins, because the glutenin network
           can withstand significant deformations without breaking,
           and return to the original conformation when the stress
           is removed. Elastomeric proteins differ considerably in
           amino acid sequence, but they are all polymers whose
           subunits consist of elastomeric domains, composed of
           repeated motifs, and non-elastic domains that mediate
           cross-linking between the subunits. The elastomeric
           domain motifs are all rich in glycine residues in
           addition to other hydrophobic residues. High molecular
           weight glutenin subunits have an extensive central
           elastomeric domain, flanked by two terminal non-elastic
           domains that form disulphide cross-links. The central
           elastomeric domain is characterized by the following
           three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
           possesses overlapping beta-turns within and between the
           repeated motifs, and assumes a regular helical secondary
           structure with a diameter of approx. 1.9 nm and a pitch
           of approx. 1.5 nm.
          Length = 779

 Score = 31.6 bits (70), Expect = 2.6
 Identities = 39/177 (22%), Positives = 48/177 (27%), Gaps = 11/177 (6%)

Query: 27  PPSSSRGSEPPSPNTGGV-SGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI---AT 82
                +G +P      G    G+  TS  QP      H   SL  P Q   P  +   A 
Sbjct: 441 LQQPGQGQQPGQWQQPGQGQPGYYPTSLLQPGQGQPGHDPASLQQPGQGQQPGQLQQPAQ 500

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
             P    +     Q    + Q   P   Q         +   P      +  +  QQ G 
Sbjct: 501 GQPGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQGQQGQQPGQGQHPAQGEQGQQPGQ 560

Query: 143 GGSNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQ 199
           G    Q        G    P  GQ        Q S     PGQ QQ  Q       Q
Sbjct: 561 GQQGQQP-------GQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQWQQPGQGQ 610


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.0 bits (70), Expect = 2.6
 Identities = 21/111 (18%), Positives = 32/111 (28%), Gaps = 29/111 (26%)

Query: 50  STSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGI 109
             +P   +P       TS  +P   PP SP +     P+++ +  S S            
Sbjct: 83  KVAPPAATP-------TSAPTPTPSPPASPASGMSAAPASAVEEKSPS------------ 123

Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG---GGSNGQTHLHIQNIG 157
            +            +P S S S  S           G     T   I  +G
Sbjct: 124 -EESATA------TAPESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMG 167



 Score = 30.2 bits (68), Expect = 5.5
 Identities = 16/79 (20%), Positives = 25/79 (31%), Gaps = 14/79 (17%)

Query: 13  SHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH 72
           S   + + +V  P    +S     P+P           + P+ P+         S     
Sbjct: 75  SKPKTGTGKVAPPAATPTS----APTPTP---------SPPASPASGMSAAP-ASAVEEK 120

Query: 73  QMPPPSPIATSPPPPSASD 91
                S  AT+P  PS S 
Sbjct: 121 SPSEESATATAPESPSTSV 139


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 31.4 bits (72), Expect = 2.8
 Identities = 8/77 (10%), Positives = 22/77 (28%), Gaps = 10/77 (12%)

Query: 23  DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
            T  P +++  +   +             +P+   P           +     PP P A 
Sbjct: 40  STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPA----------AAAPAAPPKPAAA 89

Query: 83  SPPPPSASDKHDSQSSP 99
           +    + +    + ++ 
Sbjct: 90  AAAAAAPAAPPAAAAAA 106


>gnl|CDD|219902 pfam08558, TRF, Telomere repeat binding factor (TRF).  Telomere
           repeat binding factor (TRF) family proteins are
           important for the regulation of telomere stability. The
           two related human TRF proteins hTRF1 and hTRF2 form
           homodimers and bind directly to telomeric TTAGGG repeats
           via the myb DNA binding domain pfam00249 at the carboxy
           terminus. TRF1 is implicated in telomere length
           regulation and TRF2 in telomere protection. Other
           telomere complex associated proteins are recruited
           through their interaction with either TRF1 or TRF2. The
           fission yeast protein Taz1p (telomere-associated in
           Schizosaccharomyces pombe) has similarity to both hTRF1
           and hTRF2 and may perform the dual functions of TRF1 and
           TRF2 at fission yeast telomeres. This domain is composed
           of multiple alpha helices arranged in a solenoid
           conformation similar to TPR repeats. The fungal members
           have now also been found to carry two double strand
           telomeric repeat binding factors.
          Length = 239

 Score = 30.8 bits (70), Expect = 2.9
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 519 ETPHRCDFCSKTFTRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHVRQHTG 574
            TP   DF  +   R+E LLN +R+     P   NY  +SF ++  L++++ ++ G
Sbjct: 172 LTPSETDFVERCKKRREKLLNIIREKDDAHPLLQNYSWESFLKE--LLSYLEKNLG 225


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 30.6 bits (69), Expect = 2.9
 Identities = 21/85 (24%), Positives = 25/85 (29%), Gaps = 10/85 (11%)

Query: 53  PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
            S  +P HG H L +L       P   +  S  P SA         PH          QS
Sbjct: 26  NSLAAPNHGPHDLRALMGTLLHYPGGGMDASGRPRSAMSGPMGGGHPH----------QS 75

Query: 113 PPPTSHISKIPSPSSSSPSSSSQLM 137
            P     +    P         QLM
Sbjct: 76  MPAYMMFNPSSKPQPFMLVPGPQLM 100


>gnl|CDD|185230 PRK15330, PRK15330, cell invasion protein SipD; Provisional.
          Length = 343

 Score = 30.9 bits (69), Expect = 3.0
 Identities = 17/61 (27%), Positives = 25/61 (40%)

Query: 75  PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
           P P  +A  P  PSAS+  +    P  T+H G  I       + I +      S+P  S 
Sbjct: 11  PHPGIVAERPQTPSASEHVEIAVVPSTTEHRGTDIISLSQAATKIQQAQQTLQSTPPISE 70

Query: 135 Q 135
           +
Sbjct: 71  E 71


>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 402

 Score = 30.9 bits (70), Expect = 3.0
 Identities = 10/67 (14%), Positives = 20/67 (29%), Gaps = 7/67 (10%)

Query: 802 HLNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNVSPMNSVGTQHNMLPHTMSSIPLS 861
            L +R + C  C K F E    +    +  T      +          +     +     
Sbjct: 72  FLRKRRYKCCRCGKRFAEKVDFVDRA-QRITRRWAQAAA------IALIKGKAFTDTAEL 124

Query: 862 HSMSTIT 868
           + +S +T
Sbjct: 125 NLISRLT 131


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 30.7 bits (69), Expect = 3.0
 Identities = 16/100 (16%), Positives = 38/100 (38%), Gaps = 2/100 (2%)

Query: 37  PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQ 96
           P+P+   V+    + +P++P      H +    +  Q P PS +A   P    +    + 
Sbjct: 79  PTPHVPAVALPGDTDAPAEPGA--APHVVAERAAAMQAPLPSALAADDPQAPPAGATAAD 136

Query: 97  SSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           +       T P    + PP +  +   + ++ +   +  +
Sbjct: 137 AGDAAPDATPPAAGDASPPAAAQAAASAAAALTDLDAQHI 176



 Score = 29.5 bits (66), Expect = 7.6
 Identities = 17/111 (15%), Positives = 31/111 (27%), Gaps = 10/111 (9%)

Query: 25  PPPPSSSRGSEPPSP--------NTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
           P P  SS                    V G   + S +               +   +P 
Sbjct: 21  PVPARSSSADAAGPHDDAAEPVLTDQIVPGAEQAASAAPVHAAREATADPEFVAVEPVPT 80

Query: 77  PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSS 127
           P   A + P  + +      ++PH+         Q+P P++  +  P    
Sbjct: 81  PHVPAVALPGDTDAPAEPG-AAPHVVAERAAA-MQAPLPSALAADDPQAPP 129


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 31.1 bits (70), Expect = 3.0
 Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 6/81 (7%)

Query: 53  PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS 112
           P+ P P     T  +  SP   P PS   TSPP  +      + ++P            +
Sbjct: 205 PATPRP-----TPRTTASPETTPTPST-TTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA 258

Query: 113 PPPTSHISKIPSPSSSSPSSS 133
           PP        P+ ++ +P +S
Sbjct: 259 PPTPGGGEAPPANATPAPEAS 279



 Score = 29.5 bits (66), Expect = 9.0
 Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 13/68 (19%)

Query: 23  DTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIAT 82
           +T P PS++    PPS      S        +  +PQ G    T+  +     PP+P   
Sbjct: 219 ETTPTPSTTT--SPPSTTIPAPS-------TTIAAPQAG----TTPEAEGTPAPPTPGGG 265

Query: 83  SPPPPSAS 90
             PP +A+
Sbjct: 266 EAPPANAT 273


>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
           protein. 
          Length = 342

 Score = 30.9 bits (70), Expect = 3.1
 Identities = 12/80 (15%), Positives = 21/80 (26%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
            T        P     S    PP  ++   +G    ++  Q +         S  +    
Sbjct: 82  STEDKALQFGPSFQRESSEDSPPPSSSTYSAGTRTVSTQGQSAYLSDPKRRPSSEASESE 141

Query: 75  PPPSPIATSPPPPSASDKHD 94
                 +  PPP     + D
Sbjct: 142 TVAFDESDLPPPDPWRCRRD 161


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 30.9 bits (69), Expect = 3.1
 Identities = 24/97 (24%), Positives = 43/97 (44%), Gaps = 17/97 (17%)

Query: 12  ASHKTSTSMEVDTPPP---------PSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGH 62
           A+ KT  + +   PPP         P+  +  + PS + G   G F++  P+ PS     
Sbjct: 46  ATAKTRVARQ--NPPPNPLGQLMNWPALPQDFQLPSMDLGPQVGSFLAQLPAMPSMP--- 100

Query: 63  HTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSP 99
             L    +P  +P P+P   + PP + +   D+ ++P
Sbjct: 101 -GLLGAAAP--VPAPAPAPAAAPPAAPAPAADTPAAP 134


>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
          Length = 796

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 18/105 (17%), Positives = 28/105 (26%), Gaps = 10/105 (9%)

Query: 46  GGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHT 105
           G   ST  S  +      +     S          AT  P  S+  K  + +S   +  +
Sbjct: 1   GRSSSTMVSAAASTSLS-SSRPQLSSFS-SRSPQSATRSPRASSI-KCSASASASSSATS 57

Query: 106 GPGIFQSPPPTSHISK---IPSPSSSSPSSSSQLMQQGGGGGSNG 147
                 +    +         + S             GGGGG  G
Sbjct: 58  SS----ASLVANGAVALLSASAISGGGGGGGGGFGGFGGGGGGGG 98


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 31.2 bits (71), Expect = 3.2
 Identities = 13/94 (13%), Positives = 27/94 (28%), Gaps = 3/94 (3%)

Query: 47  GFISTSPSQPSPQHGHHTLTSLTS-PHQMPP-PSPIATSPPPPSASDKHDS-QSSPHITQ 103
           G +++         G     ++ S    +PP  +        P A+    +  ++  I  
Sbjct: 234 GLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIEL 293

Query: 104 HTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLM 137
             G       P T         + S   S+   +
Sbjct: 294 SAGVLPNAISPATPVRPASNGVTPSQAPSAEPQL 327


>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score = 30.9 bits (70), Expect = 3.2
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 692 CDFCLKTFTRKEHLTNHLK 710
           C+FCLK   RKE L  H+K
Sbjct: 201 CEFCLKFMKRKEQLQRHMK 219


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.0 bits (70), Expect = 3.2
 Identities = 15/81 (18%), Positives = 24/81 (29%), Gaps = 4/81 (4%)

Query: 54  SQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSP 113
           S  +P        S      M        +P    A    D Q+      +T   + +  
Sbjct: 276 SWQTPAQTPARRIST----PMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELLRER 331

Query: 114 PPTSHISKIPSPSSSSPSSSS 134
            P+   +    P S S +S S
Sbjct: 332 FPSDLFAISSLPDSDSEASDS 352


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 31.1 bits (70), Expect = 3.2
 Identities = 17/117 (14%), Positives = 29/117 (24%), Gaps = 16/117 (13%)

Query: 2   PIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHG 61
           P++      PA+   + +  V T P    +       P        F++T+P   +P   
Sbjct: 129 PVRAASIPSPAAQALAHAAAVRTAPRQEHA---LSAVPEQLFAD--FLTTAPVPRAPVQA 183

Query: 62  HHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQ---SSPHITQHTGPGIFQSPPP 115
                        P P P   +     A+   D                     P  
Sbjct: 184 P--------VVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDALPQILPPAA 232


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.9 bits (70), Expect = 3.5
 Identities = 23/117 (19%), Positives = 35/117 (29%), Gaps = 26/117 (22%)

Query: 8   SQFPASH-KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT 66
            + P  H K   +     P P +++  S  PS ++        +  PS P          
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSS-------AAAQPSAPQSATQPAG-- 422

Query: 67  SLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQS--PPPTSHISK 121
              +P  +    P A    PPS              Q   P  F+     P S +S 
Sbjct: 423 ---TPPTVSVDPPAAVPVNPPST-----------APQAVRPAQFKEEKKIPVSKVSS 465


>gnl|CDD|147099 pfam04771, CAV_VP3, Chicken anaemia virus VP-3 protein.  This
           protein is found in the nucleus of infected cells and
           may act as a transcriptional regulator. It induces
           apoptosis, and is also known as apoptin.
          Length = 121

 Score = 29.4 bits (65), Expect = 3.5
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQP 56
           K    +  D P PPS  R  +P       +    I+T+PS+P
Sbjct: 71  KNVPDLRTDQPKPPSKKRSCDPSEYRVSELKESLITTTPSRP 112


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 31.2 bits (70), Expect = 3.5
 Identities = 18/72 (25%), Positives = 23/72 (31%), Gaps = 11/72 (15%)

Query: 53  PSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTG------ 106
           PS  S   G H       P + P P P+  + P PS          P  + H G      
Sbjct: 621 PSSLSASMGFHP-----PPFRHPFPLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSP 675

Query: 107 PGIFQSPPPTSH 118
           PG  +  P    
Sbjct: 676 PGKDRDSPDLPR 687


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 30.1 bits (68), Expect = 3.5
 Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 2/80 (2%)

Query: 30  SSRGSEP-PSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPS 88
           S  G+ P    ++  V+        +  S    + T T+ TS    P  S   T+   PS
Sbjct: 56  SGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPS 115

Query: 89  ASDKHD-SQSSPHITQHTGP 107
            +D  + + +    T   G 
Sbjct: 116 ETDTEEATTTVSTETPTEGG 135


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 30.5 bits (69), Expect = 3.6
 Identities = 31/143 (21%), Positives = 37/143 (25%), Gaps = 42/143 (29%)

Query: 19  SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
              +         R         GG  GG      S P P                PPP 
Sbjct: 175 KFFLSCGGVRGGPRPERAGYGGGGGGGGGGGGGGGSGPGP----------------PPPG 218

Query: 79  PIATSPPPPSASDKHDSQSSPHIT----QHTGPGIF-------------------QSPPP 115
             ++ PPP        S      T       GPG +                    S   
Sbjct: 219 FKSSFPPPYGPGAGPSSGYGSGGTRSGQGGWGPGFWTGLGAGGALGYLFGSRRNNNSSYG 278

Query: 116 TSHISKIPS---PSSSSPSSSSQ 135
            S+ S  PS    SSS+ SSSS 
Sbjct: 279 RSYGSGSPSYSPSSSSNSSSSSS 301



 Score = 30.1 bits (68), Expect = 5.5
 Identities = 16/69 (23%), Positives = 19/69 (27%), Gaps = 7/69 (10%)

Query: 83  SPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIP-------SPSSSSPSSSSQ 135
              P      +            G G    PPP    S  P        PSS   S  ++
Sbjct: 184 RGGPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGPSSGYGSGGTR 243

Query: 136 LMQQGGGGG 144
             Q G G G
Sbjct: 244 SGQGGWGPG 252


>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
           C2H2 Zn-finger [General function prediction only].
          Length = 65

 Score = 28.0 bits (62), Expect = 3.8
 Identities = 14/45 (31%), Positives = 20/45 (44%)

Query: 406 GETPYKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRK 450
           GE   +C  CG  F R++  + HV K  G    R +   K   +K
Sbjct: 14  GEEFLRCPRCGMVFRRRKDYIRHVNKAHGWLFGRGKPKGKRLKKK 58



 Score = 27.7 bits (61), Expect = 5.6
 Identities = 12/29 (41%), Positives = 15/29 (51%)

Query: 429 VRKHTGETPHRCEICKKAFTRKEHYVNHV 457
           VR   GE   RC  C   F R++ Y+ HV
Sbjct: 9   VRDRDGEEFLRCPRCGMVFRRRKDYIRHV 37


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 29.8 bits (67), Expect = 3.8
 Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 1/61 (1%)

Query: 55  QPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP 114
               +      TS T     P P P +T+P  P    +      P       P   Q PP
Sbjct: 107 CRPQRPVMTRATSTTVQTT-PLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPP 165

Query: 115 P 115
           P
Sbjct: 166 P 166


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 30.2 bits (68), Expect = 3.9
 Identities = 16/62 (25%), Positives = 22/62 (35%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           P +  TS + E  T PP +++    P    T   +      SP Q  P  G     S   
Sbjct: 115 PEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPD 174

Query: 71  PH 72
            H
Sbjct: 175 AH 176



 Score = 30.2 bits (68), Expect = 4.0
 Identities = 18/93 (19%), Positives = 30/93 (32%)

Query: 50  STSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGI 109
           S + SQ SP           +  Q  PP   +TS    +A+D   + ++           
Sbjct: 88  SDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQ 147

Query: 110 FQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGG 142
             +P       + P  S   P+ S+      G 
Sbjct: 148 PATPDERRSPRQRPPVSGEPPTPSTPDAHVAGT 180


>gnl|CDD|219744 pfam08204, V-set_CD47, CD47 immunoglobulin-like domain.  This
           family represents the CD47 leukocyte antigen V-set like
           Ig domain.
          Length = 93

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 11/40 (27%), Positives = 18/40 (45%), Gaps = 4/40 (10%)

Query: 267 NSVITNNSTENNTTSITLDIKADLDLPVMGNHTCDLCGKF 306
           N  I   +  NN +++  D    L     GN+TC++   F
Sbjct: 43  NKDIFIFNGTNNISTLKFDFSDALS----GNYTCEVTELF 78


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 30.4 bits (69), Expect = 4.0
 Identities = 12/84 (14%), Positives = 28/84 (33%), Gaps = 3/84 (3%)

Query: 8   SQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTS 67
           ++   +   S  ++  T   P+++     P   T   S      +   P+         +
Sbjct: 152 AELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV---DPQQNA 208

Query: 68  LTSPHQMPPPSPIATSPPPPSASD 91
           + +P Q    +    +P  P+  D
Sbjct: 209 VVAPSQANVDTAATPAPAAPATPD 232


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 30.4 bits (68), Expect = 4.1
 Identities = 14/136 (10%), Positives = 38/136 (27%), Gaps = 13/136 (9%)

Query: 22  VDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIA 81
                  +S+  ++  + +         +TS +  +P         +   +    P+   
Sbjct: 120 NQLGNAGASASITKTSNGSDAA------TTSSTANTPAAA-----KVLKANAASAPNTTT 168

Query: 82  TSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS--KIPSPSSSSPSSSSQLMQQ 139
                 + +    + ++   T    P   +    T   +  K  + + +   +S+     
Sbjct: 169 GVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMG 228

Query: 140 GGGGGSNGQTHLHIQN 155
             G   N      I N
Sbjct: 229 SLGVPQNTSMLAQIPN 244



 Score = 30.4 bits (68), Expect = 5.2
 Identities = 33/258 (12%), Positives = 71/258 (27%), Gaps = 55/258 (21%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
             +   S S+   +    +++  S   +P    V     + + S P+   G  +  +  +
Sbjct: 122 LGNAGASASITKTSNGSDAATTSSTANTPAAAKVLK---ANAASAPNTTTGVGSAATTAA 178

Query: 71  PHQMPPPSPIATSPPP------------------PSASDKHDSQS----SPHITQHTGPG 108
                  +P  T   P                   SA  +  + +               
Sbjct: 179 ISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSM 238

Query: 109 IFQSPPPTSHISKIP--SPSSS--------SPSSSSQLMQQ-------------GGGGGS 145
           + Q P PT  +  +   SP+++        SP  +   M               G     
Sbjct: 239 LAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQ 298

Query: 146 NGQTHLHIQNIGAVLSPSYGQHCIQQDDTQH--STVQSPPGQQQQYQQMDTSQSSQQQSD 203
           + +  ++        +PS     +      +  +   +      Q Q     Q   QQ+ 
Sbjct: 299 SRENSMNQGM-----TPSASMINLNNITPANILNMSLNLAFDLNQNQTPQQLQPQNQQNM 353

Query: 204 LIKTNGTGDVNPDEFNQL 221
            +  N   + +  + N L
Sbjct: 354 NMGMNDVNNFDLLDLNNL 371


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 30.8 bits (69), Expect = 4.2
 Identities = 39/195 (20%), Positives = 61/195 (31%), Gaps = 20/195 (10%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSP 84
           P      R +  P P   G        S SQ              +P  MP P P+   P
Sbjct: 485 PGVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGV--------AFAPV-MPQPMPVEPVP 535

Query: 85  PPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGG 144
            P  A ++    + P I    GPG            +    + + P S SQ+  +     
Sbjct: 536 VPTVALERPVCPAPPLIAM-QGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLAR 594

Query: 145 SNGQTHLHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDL 204
              +   +  ++  V  P           TQ S  Q      +  QQM       Q +D+
Sbjct: 595 LRAEAQPYQASV-EVQPPQL---------TQVSPQQPMEYPLEPEQQMFPGSPFSQVADV 644

Query: 205 IKTNGTGDVNPDEFN 219
           ++  G   + P  F+
Sbjct: 645 MRAGGVPAMQPQYFD 659


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 30.5 bits (69), Expect = 4.5
 Identities = 14/81 (17%), Positives = 25/81 (30%), Gaps = 13/81 (16%)

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP-PTSHISKIPS 124
                      P+P A + PPP+A        +P +          + P P       P 
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAA--------APPVPPAAPARPAAARPAPAP----APP 428

Query: 125 PSSSSPSSSSQLMQQGGGGGS 145
            +++ P+ S+        G  
Sbjct: 429 AAAAPPARSADPAAAASAGDR 449


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 30.5 bits (69), Expect = 4.6
 Identities = 35/213 (16%), Positives = 55/213 (25%), Gaps = 18/213 (8%)

Query: 6   ELSQFPAS--HKTSTSMEVDTPPPPSSSRGSEPPSPNTGG---VSGGFISTSPSQPSPQH 60
            LSQF     H +    +    P PSS+R  +  S +             TS    +P  
Sbjct: 209 PLSQFHGFTPHPSLPQPQSPLKPSPSSARPQQSESFSDVWPASTQSPREETSAEPLAPAS 268

Query: 61  GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS 120
                 S     Q+   SP A       +     S S P              PP ++  
Sbjct: 269 PSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKK 328

Query: 121 KIPS-------------PSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSPSYGQH 167
                             + S       +    G    +GQ+      + + L+ +    
Sbjct: 329 AKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGA 388

Query: 168 CIQQDDTQHSTVQSPPGQQQQYQQMDTSQSSQQ 200
             Q        V     +    +Q  T  SS  
Sbjct: 389 AHQPASLLPPAVVQQQLEDASDKQPPTPGSSLV 421


>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
          Length = 509

 Score = 30.3 bits (68), Expect = 4.7
 Identities = 24/107 (22%), Positives = 49/107 (45%), Gaps = 4/107 (3%)

Query: 172 DDTQHSTVQSPPGQQQQYQQMDTSQSSQQQSDLIKTNGTGDVNPDEFNQLLAYHEV--FG 229
           D    S ++  P + +  +  D  + SQ Q+D  ++N    V   + + + +   +    
Sbjct: 135 DTKSQSKLRPVPSKAKSLEIKDPQEQSQNQADAQESNKESVVLQPQSDSMPSPSSIGTED 194

Query: 230 KLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNSTE 276
           K   S+V H S+T G+  T T Q  +N++ D+ +   +S   +  +E
Sbjct: 195 KEPQSIVNHHSTTSGMGETQTQQ--LNASGDSPIRELDSSAGDPPSE 239


>gnl|CDD|177547 PHA03164, PHA03164, hypothetical protein; Provisional.
          Length = 88

 Score = 28.4 bits (63), Expect = 4.8
 Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 16/54 (29%)

Query: 85  PPPSASDK-----HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSS 133
           PPPS S+       D Q SPH               TS +  +P P  S  + +
Sbjct: 7   PPPSYSEVLMMDVMDGQVSPHQEN-----------DTSFVECLPPPQISRTAWN 49


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 27.4 bits (61), Expect = 4.9
 Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 9/51 (17%)

Query: 75  PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
           PPP     SPPPP  S  + S           P +++SPPP  ++ K P P
Sbjct: 15  PPPPYYYKSPPPPVKSPVYKSPP---------PPVYKSPPPPKYVYKSPPP 56


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 29.8 bits (67), Expect = 5.0
 Identities = 18/78 (23%), Positives = 26/78 (33%), Gaps = 5/78 (6%)

Query: 71  PHQMPPPSPIATSPPPPSASDK-HDSQSSPHITQHTGPGIFQSPP---PTSHISKIPSPS 126
           P    PPS   T  P P+  ++  DS +       T P   + PP   P   +   PS  
Sbjct: 143 PAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDT-PVTLELPPAPQPPPPVVPQPSTM 201

Query: 127 SSSPSSSSQLMQQGGGGG 144
                S  +  +  G   
Sbjct: 202 VVHRRSRIKRTRSSGWQA 219


>gnl|CDD|188836 cd09452, LIM1_Enigma, The first LIM domain of Enigma.  The first
           LIM domain of Enigma: Enigma was initially characterized
           in humans as a protein containing three LIM domains at
           the C-terminus and a PDZ domain at N-terminus.  The
           third LIM domain specifically interacts with the insulin
           receptor and the second LIM domain interacts with the
           receptor tyrosine kinase Ret and the adaptor protein
           APS. Thus Enigma is implicated in signal transduction
           processes such as mitogenic activity, insulin related
           actin organization, and glucose metabolism. Enigma is
           expressed in multiple tissues, such as skeletal muscle,
           heart, bone and brain.  LIM domains are 50-60 amino
           acids in size and share two characteristic zinc finger
           motifs. The two zinc fingers contain eight conserved
           residues, mostly cysteines and histidines, which
           coordinately bond to two zinc atoms. LIM domains
           function as adaptors or scaffolds to support the
           assembly of multimeric protein complexes.
          Length = 52

 Score = 27.1 bits (60), Expect = 5.2
 Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 2/37 (5%)

Query: 300 CDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFD 336
           C  C K    RY + + R YH E   FTC  C K  D
Sbjct: 1   CAQCNKIIRGRYLVALGRSYHPEE--FTCSQCKKVLD 35


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 30.4 bits (68), Expect = 5.3
 Identities = 13/60 (21%), Positives = 21/60 (35%), Gaps = 2/60 (3%)

Query: 11 PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGF-ISTSPSQPSPQHGHHTLTSLT 69
            +   ST+    TP  P+ +  S   SP   G       + + S       H   T++T
Sbjct: 34 IENSDNSTAEMQSTPATPTHTT-SNLTSPFGTGTDNSTNANGTESTTQASQPHSHETTIT 92


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 30.1 bits (67), Expect = 5.5
 Identities = 24/126 (19%), Positives = 35/126 (27%), Gaps = 13/126 (10%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
            +SH  S S +       S+S G E         S  F     S P  QH        ++
Sbjct: 204 ASSH--SNSEDCSLSAGFSNSGGEE-----FTRRSTDFTRRQLSLPDRQHQPALPRQNSA 256

Query: 71  PHQMPPPSPIATSPP------PPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPS 124
             Q     P            PPS      S+ S   ++    G       +S     P 
Sbjct: 257 GPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPE 316

Query: 125 PSSSSP 130
              ++ 
Sbjct: 317 LRPAAG 322



 Score = 30.1 bits (67), Expect = 6.2
 Identities = 21/129 (16%), Positives = 35/129 (27%), Gaps = 11/129 (8%)

Query: 11  PASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS 70
           P S +  +  E  +     S+      S       G   +   +  + +       SL  
Sbjct: 189 PVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQ-----LSLPD 243

Query: 71  PHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSP 130
               P        P   SA  +            +  G       +S  S+    SS  P
Sbjct: 244 RQHQP------ALPRQNSAGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWP 297

Query: 131 SSSSQLMQQ 139
            S ++  QQ
Sbjct: 298 QSGARPRQQ 306



 Score = 30.1 bits (67), Expect = 6.3
 Identities = 28/161 (17%), Positives = 39/161 (24%), Gaps = 35/161 (21%)

Query: 66  TSLTSPH--QMPPPSPIATSPPPPSAS----------DKHDSQSSPHITQ---------- 103
            +L +P   Q     P   +P PPS S             D  SS   T+          
Sbjct: 6   AALANPTPVQQQSRRPHERNPSPPSVSCGLSSGLQKIMMDDLDSSVDFTRLPSPTPENKD 65

Query: 104 ----HTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL---------MQQGGGGGSNGQTH 150
                       SP  +S  S+I  P    P+    L           Q           
Sbjct: 66  LFFVTRPSLAQSSPAYSSSYSEINEPELGMPNGGRSLSLVDLQDAASGQVHSSPMGLPGA 125

Query: 151 LHIQNIGAVLSPSYGQHCIQQDDTQHSTVQSPPGQQQQYQQ 191
           L ++      S   GQ  I           +P  +      
Sbjct: 126 LGLRRASVPWSARSGQSSIAGGLRLSQPQSAPTVRLLHAAP 166


>gnl|CDD|148271 pfam06566, Chon_Sulph_att, Chondroitin sulphate attachment domain. 
           This family represents the chondroitin sulphate
           attachment domain of vertebrate neural transmembrane
           proteoglycans that contain EGF modules. Evidence has
           been accumulated to support the idea that neural
           proteoglycans are involved in various cellular events
           including mitogenesis, differentiation, axonal outgrowth
           and synaptogenesis. This domain contains several
           potential sites of chondroitin sulphate attachment, as
           well as potential sites of N-linked glycosylation.
          Length = 253

 Score = 30.0 bits (67), Expect = 5.5
 Identities = 17/111 (15%), Positives = 31/111 (27%), Gaps = 2/111 (1%)

Query: 19  SMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPS 78
           S E    P      G E  +  T        S      + + G      L +     P  
Sbjct: 50  SEEAPVGPGVGPEEGLEASAAVTPTAWLEASSPGLGGVTAEAGSGDSQGLPATLPT-PDE 108

Query: 79  PIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS-KIPSPSSS 128
            +  S P  +  +  ++ + P       P +    P  S +   +    S+
Sbjct: 109 ALGNSNPSLALPEATEASNPPSPGPGDKPSLLPELPKESPVEVWLNLGGST 159


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 30.5 bits (69), Expect = 5.5
 Identities = 25/127 (19%), Positives = 32/127 (25%), Gaps = 29/127 (22%)

Query: 16  TSTSMEVDTPPP--------PSSSRGSEPPSPNTGGV--SGGFISTSPSQPSPQHGHHTL 65
           T   M +   PP        P  +RG E   P   G    G    T+      +      
Sbjct: 626 TVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRP 685

Query: 66  TSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSP 125
           T     H     +P A  P  P                  G G+   P   +     P  
Sbjct: 686 T----AHHAALRAPQAPRPGGPPGG---------------GGGLPPPPDLPAAAGPAPCG 726

Query: 126 SSSSPSS 132
           SS   S 
Sbjct: 727 SSLIASP 733



 Score = 30.2 bits (68), Expect = 6.9
 Identities = 19/99 (19%), Positives = 27/99 (27%), Gaps = 14/99 (14%)

Query: 19  SMEVDTPPPPSSSRGSE-PPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPP 77
            M   T  P   +RG    P+ +             +  +PQ               PPP
Sbjct: 666 GMPAHTARPSRVARGDPVRPTAHHA-----------ALRAPQAPRPGGPPGGGGGLPPPP 714

Query: 78  SPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPT 116
              A + P P  S      S     +   PG  Q+    
Sbjct: 715 DLPAAAGPAPCGSSL--IASPTAPPEPEPPGAEQADGAE 751


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
           Provisional.
          Length = 408

 Score = 30.2 bits (68), Expect = 5.9
 Identities = 22/96 (22%), Positives = 32/96 (33%), Gaps = 8/96 (8%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQM 74
            TS  +     P  +    ++P +P            S   P         +   +PHQ 
Sbjct: 199 LTSAQLT-TAQPDDAPGTPAQPLTPLAAEAQAKAEVISTPSPVT----AAASPTITPHQT 253

Query: 75  PPPSPIATSPPPPSASDKHDSQS--SPHITQHTGPG 108
             P P A +P   +    H+ Q   S HIT  T  G
Sbjct: 254 -QPLPTAAAPVLSAPLGSHEWQQSLSQHITLFTRQG 288


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 30.2 bits (68), Expect = 6.1
 Identities = 15/111 (13%), Positives = 33/111 (29%), Gaps = 8/111 (7%)

Query: 52  SPSQPSPQHGHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQ 111
           +P  P      + L  + +  + P  +    +  PP           P+           
Sbjct: 424 TPMGPGGPLRPNGLAPM-NAVRAPSRNAQNAAQKPPMQ----PVMYPPNYQSLPLSQDLP 478

Query: 112 SPPPTSHISKIPSPSSSSPSSSSQLMQQGGGGGSNGQTHLHIQNIGAVLSP 162
            P  T+         +   +S++  MQ+   G    +    ++ I   L+ 
Sbjct: 479 QPQSTASQGGQNKKLAQVLASATPQMQKQVLG---ERLFPLVEAIEPALAA 526



 Score = 29.8 bits (67), Expect = 6.9
 Identities = 26/148 (17%), Positives = 42/148 (28%), Gaps = 14/148 (9%)

Query: 64  TLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIP 123
            L       Q         SP   +          P+  Q   P    +  P        
Sbjct: 370 HLQDQFMQLQPRMRQLPMGSPMGGAMGQP------PYYGQG--PQQQFNGQPLGWPRMSM 421

Query: 124 SPSSSSPSSSSQ-----LMQQGGGGGSNGQTHLHIQNIGAVL-SPSYGQHCIQQDDTQHS 177
            P+   P    +      M        N Q       +  V+  P+Y    + QD  Q  
Sbjct: 422 MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQ 481

Query: 178 TVQSPPGQQQQYQQMDTSQSSQQQSDLI 205
           +  S  GQ ++  Q+  S + Q Q  ++
Sbjct: 482 STASQGGQNKKLAQVLASATPQMQKQVL 509


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 30.1 bits (67), Expect = 6.2
 Identities = 23/94 (24%), Positives = 32/94 (34%), Gaps = 21/94 (22%)

Query: 462 GETPHVCQV--CGKKYTRKEHLANHMR-SHTND--TPFKCEICTKSFTRKEHFTNHIMWH 516
              P+ C V  C KKY  +  L  HM   H N              F+ K+         
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKD--------- 396

Query: 517 TGETPHRCDFCSKTFTRKEHLLNHVRQHTGESPH 550
               P+RC+ C K +      LN ++ H   S  
Sbjct: 397 ---KPYRCEVCDKRYKN----LNGLKYHRKHSHD 423


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 30.2 bits (67), Expect = 6.2
 Identities = 26/130 (20%), Positives = 47/130 (36%)

Query: 17  STSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPP 76
           +T  + + P   S S   +  S      +       PS  SPQ    +  +  SP +  P
Sbjct: 12  ATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAP 71

Query: 77  PSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQL 136
           P     + P  + + + +++SS   +   G     +   TS     P   +S    +S +
Sbjct: 72  PVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTV 131

Query: 137 MQQGGGGGSN 146
                G G+N
Sbjct: 132 TPGPSGKGAN 141


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 29.9 bits (67), Expect = 6.4
 Identities = 13/41 (31%), Positives = 16/41 (39%), Gaps = 4/41 (9%)

Query: 75  PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPP 115
            PP      P PP  +      +SP  T  + PG    PPP
Sbjct: 10  SPPRR----PSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 29.6 bits (66), Expect = 6.8
 Identities = 16/73 (21%), Positives = 31/73 (42%), Gaps = 9/73 (12%)

Query: 61  GHHTLTSLTSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHIS 120
           G  T T+  +    P  + +      P ++     QS P+ T         +PP T   +
Sbjct: 19  GVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNAT---------TPPSTKVEA 69

Query: 121 KIPSPSSSSPSSS 133
              +P++++PSS+
Sbjct: 70  PQQTPNATTPSST 82


>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
           Provisional.
          Length = 170

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 8/62 (12%), Positives = 16/62 (25%)

Query: 75  PPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSS 134
              +   +S P  ++S    S      +  +           +    +PS      S   
Sbjct: 1   SSSASSGSSAPAVNSSPSSSSSGKSSNSSKSASSAVNPASVVAGYEPLPSFRDVPNSEKQ 60

Query: 135 QL 136
            L
Sbjct: 61  NL 62


>gnl|CDD|150725 pfam10083, DUF2321, Uncharacterized protein conserved in bacteria
           (DUF2321).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 158

 Score = 28.7 bits (64), Expect = 7.3
 Identities = 20/68 (29%), Positives = 26/68 (38%), Gaps = 6/68 (8%)

Query: 423 EHMVNHVRKHTGETPHRCEICKKAFTRKEHYVNHVMW--HTGETPHVCQVCGKKY---TR 477
           E M N   K    T   C  C     R  +YV+ V+      + P  C  CGK +    R
Sbjct: 25  ELMENFCSKCGAATITSCPHCSTPI-RGFYYVDGVLGLGSDYDAPSYCHNCGKPFPWTKR 83

Query: 478 KEHLANHM 485
           K   AN +
Sbjct: 84  KLEAANEL 91


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.1 bits (65), Expect = 7.3
 Identities = 17/82 (20%), Positives = 25/82 (30%), Gaps = 3/82 (3%)

Query: 69  TSPHQMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSS 128
           T     PP +P  +S PPP           P         +   PPP     +     + 
Sbjct: 47  TGYPFCPPTTPHPSSQPPPCPP---SPGHPPQTNDTHEKDLALQPPPGGKKKEKKKKETE 103

Query: 129 SPSSSSQLMQQGGGGGSNGQTH 150
            P+   +   QG      G+ H
Sbjct: 104 KPAQGGEKPDQGPEAKGEGEGH 125


>gnl|CDD|173376 PTZ00082, PTZ00082, L-lactate dehydrogenase; Provisional.
          Length = 321

 Score = 29.7 bits (67), Expect = 7.7
 Identities = 10/32 (31%), Positives = 18/32 (56%), Gaps = 5/32 (15%)

Query: 609 QYCPKAF-----SRKDHLVNHVRQHTGESPHK 635
           +YCP AF     +  D +V  +++H+G   +K
Sbjct: 117 KYCPNAFVIVITNPLDVMVKLLQEHSGLPKNK 148


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 29.3 bits (66), Expect = 7.9
 Identities = 16/111 (14%), Positives = 33/111 (29%), Gaps = 8/111 (7%)

Query: 15  KTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLT----SLTS 70
           +  T   + T         +E    ++               +P+      +    SL S
Sbjct: 65  RVRTPASIKTSKRLIEVPEAEESLLDSYTTPSDKGGMLRILSTPELPKRKRSFSASSLES 124

Query: 71  PHQMPPPSP----IATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
           P     P+      A S P P+++      +   + +   P + Q+P    
Sbjct: 125 PSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGG 175


>gnl|CDD|218994 pfam06333, Med13_C, Mediator complex subunit 13 C-terminal.
           Mediator is a large complex of up to 33 proteins that is
           conserved from plants through fungi to humans - the
           number and representation of individual subunits varying
           with species. It is arranged into four different
           sections, a core, a head, a tail and a kinase-activity
           part, and the number of subunits within each of these is
           what varies with species. Overall, Mediator regulates
           the transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Med13 is part of the
           ancillary kinase module, together with Med12, CDK8 and
           CycC, which in yeast is implicated in transcriptional
           repression, though most of this activity is likely
           attributable to the CDK8 kinase. The large Med12 and
           Med13 proteins are required for specific developmental
           processes in Drosophila, zebrafish, and Caenorhabditis
           elegans but their biochemical functions are not
           understood.
          Length = 406

 Score = 29.6 bits (67), Expect = 7.9
 Identities = 20/103 (19%), Positives = 29/103 (28%), Gaps = 7/103 (6%)

Query: 16  TSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMP 75
           T   +  D PP           S ++   +      + S PSP    +  T     +   
Sbjct: 227 TDPKLVPD-PPTQPIPFSRFNTSNSS---TPQSTPCTLSFPSPDQFGNAPTPPLGENGND 282

Query: 76  PPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSH 118
              P     PP S S+         I   T   I     P S+
Sbjct: 283 NAPP---VTPPGSPSESDPDSILVDIDDETWGVILSHRLPLSN 322


>gnl|CDD|205996 pfam13825, Paramyxo_PNT, Paramyxovirus structural protein V/P
           N-terminus.  This family consists of several
           Paramyxoviridae structural protein P and V sequences.
           From a structural point of view, P is the
           best-characterized protein of the replicative complex. P
           is organised into two moieties that are functionally and
           structurally distinct: a C-terminal moiety (PCT) and an
           N-terminal moiety (PNT). PCT is the most conserved in
           sequence and contains all regions required for virus
           transcription, whereas PNT, which is poorly conserved,
           provides several additional functions required for
           replication. P protein plays a crucial role in the
           enzyme by positioning L onto the N/RNA template through
           an interaction with the C-terminal domain of N. Without
           P, L is not functional. The N, P, and L proteins of SeV
           and measles and mumps viruses are functionally
           equivalent. However, sequence identity between proteins
           from these viruses is limited, and the viruses have been
           placed in different genera (Respirovirus, Morbilivirus,
           and Rubulavirus, respectively). SeV P protein (568 aa)
           is a modular protein with distinct functional domains.
           The N-terminal part of P (PNT) is a chaperone for N and
           prevents it from binding to non-viral RNA in the
           infected cell.
          Length = 309

 Score = 29.4 bits (66), Expect = 8.0
 Identities = 22/94 (23%), Positives = 28/94 (29%), Gaps = 10/94 (10%)

Query: 25  PPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTS-PHQMPPPSPIATS 83
           PP P   RG     P   G      S          G  T +  T    Q    S   +S
Sbjct: 210 PPIPDVKRGDPSCKPIKKGTEERSAS---------SGTETESLSTGGATQSALKSTWGSS 260

Query: 84  PPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTS 117
            P  SA +   S S+  + Q        +  P S
Sbjct: 261 EPNASAGNVRQSASNAKMIQKCKQESGTTASPRS 294


>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
           secretion system, contains a FHA domain [Intracellular
           trafficking, secretion, and vesicular    transport;
           Signal transduction mechanisms].
          Length = 430

 Score = 29.4 bits (66), Expect = 8.9
 Identities = 26/126 (20%), Positives = 32/126 (25%), Gaps = 24/126 (19%)

Query: 13  SHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPH 72
           S       + DT  P       +    +T           P    P          T P 
Sbjct: 105 SRAEPDESDPDTGSP-------QSAVSSTDMDDDFLADIEPEPRLPPGL-------TDPL 150

Query: 73  QMPPPSPIATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSS 132
           Q+  P P   S  P SA      QS   I Q    G   S    S      +P S   S+
Sbjct: 151 QLADPVPNGFSADPLSA-----LQSESLIAQPDPAGGAPSISRNS-----EAPLSDPASA 200

Query: 133 SSQLMQ 138
                 
Sbjct: 201 GGIDTP 206


>gnl|CDD|171610 PRK12615, PRK12615, galactose-6-phosphate isomerase subunit LacB;
           Reviewed.
          Length = 171

 Score = 28.8 bits (64), Expect = 9.1
 Identities = 24/95 (25%), Positives = 41/95 (43%), Gaps = 8/95 (8%)

Query: 202 SDLIKTNGTGDVNPDEFNQLLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADN 261
           SD +K+ G   ++   ++    ++ +FGK     V +  + LG+    T  G  N  A N
Sbjct: 19  SDFLKSKGYDVIDCGTYDHTRTHYPIFGKKVGEAVVNGQADLGVCICGTGVGINN--AVN 76

Query: 262 GVSGTNSVITNNSTENNTTSITLDIKADLDLPVMG 296
            V G  S +  + T        L  K +L+  V+G
Sbjct: 77  KVPGIRSALVRDMT------TALYAKEELNANVIG 105


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 25.5 bits (57), Expect = 9.2
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 780 FKCDVCPKDFTCKGHLVSHLRSH 802
           F C++C   FT +  L SHLR  
Sbjct: 1   FYCELCNVTFTSESQLKSHLRGK 23


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 29.7 bits (66), Expect = 9.6
 Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 6/73 (8%)

Query: 221 LLAYHEVFGKLQSSVVTHTSSTLGLASTSTAQGDMNSAADNGVSGTNSVITNNST-ENNT 279
           +  +  +   L++      S+     S ST+    NS   N  + TNS  + NST  +N+
Sbjct: 263 VAEWEAILAALEAFANGSASANSTSNSNSTSNSTTNS---NSTTTTNSTTSTNSTSSSNS 319

Query: 280 TSI--TLDIKADL 290
           ++I   +DI A+ 
Sbjct: 320 STIAGCIDIAANF 332


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 29.0 bits (65), Expect = 9.9
 Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 10/58 (17%)

Query: 68  LTSPHQMPPPSPIATSPPPPSASDKH----------DSQSSPHITQHTGPGIFQSPPP 115
           ++      PP P+   P  PS    +                 I +H  P  FQ PPP
Sbjct: 121 ISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPSFQQPPP 178


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.130    0.417 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 42,341,700
Number of extensions: 3839931
Number of successful extensions: 6990
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5911
Number of HSP's successfully gapped: 546
Length of query: 896
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 790
Effective length of database: 6,236,078
Effective search space: 4926501620
Effective search space used: 4926501620
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.6 bits)