RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy834
         (926 letters)



>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 58.6 bits (142), Expect = 1e-08
 Identities = 58/372 (15%), Positives = 101/372 (27%), Gaps = 41/372 (11%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVN-----------PNPNSRANPNPNPNSRA 612
           P T  P ++  S     LS   P                    P     A+P P+P    
Sbjct: 75  PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL 134

Query: 613 NPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSR 672
           +  L            P     ++  + + V S   +    Q++L + S E+ +    S 
Sbjct: 135 SEMLRPVGS---PGPPPAASPPAAGASPAAVASDAASSR--QAALPLSSPEETARAPSSP 189

Query: 673 VQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRV 732
                        SP    R                   ++         + +P   +  
Sbjct: 190 PAEPPPSTPPAAASPRPPRR-------------------SSPISASASSPAPAPGRSAAD 230

Query: 733 QSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAE 792
            +  ++     S  +       +     R           + S       +      ++ 
Sbjct: 231 DAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG---PASS 287

Query: 793 QGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAE 852
             SP  RS      +  S    S    S+   S    S+   +    S   ++ +   + 
Sbjct: 288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSP 347

Query: 853 QRSPNTRSRVQSSATTRSTEQRSRS-RSPLTRSTEQGSPNIRSTE-QRASRTRSADQASQ 910
            RSP + SR    A   S  +R R  R+P + +   G P  R      A R R  D   +
Sbjct: 348 SRSP-SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR 406

Query: 911 SVVTRSRSKLPS 922
               R R     
Sbjct: 407 FPAGRPRPSPLD 418



 Score = 57.1 bits (138), Expect = 4e-08
 Identities = 54/356 (15%), Positives = 98/356 (27%), Gaps = 28/356 (7%)

Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQ 627
           PP             P +P P+P   ++       +P P P +      +  A   + A 
Sbjct: 108 PPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAA 167

Query: 628 NPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSP 687
           +         +     ++  +  +    S         SP    R    +   S+   +P
Sbjct: 168 SSRQAALPLSSPEETARAPSSPPAEPPPS---TPPAAASPRPPRRSSPISASASSPAPAP 224

Query: 688 ISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPT 747
                          S+ S  ++++     P  E          + +     S      +
Sbjct: 225 GRSAADDA----GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS 280

Query: 748 TRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPN 807
                 SS + R R  S          +P++     SS++ R +   S S  S   R   
Sbjct: 281 RPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340

Query: 808 TRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSAT 867
                           G   +RS     P         P   S+ ++ P       S A 
Sbjct: 341 VSP-------------GPSPSRSPSPSRP-------PPPADPSSPRKRPRPSRAPSSPAA 380

Query: 868 TRSTEQRSRSR-SPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPS 922
           +     R R+R +   R+  + +       +       A  AS +   R     PS
Sbjct: 381 SAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436



 Score = 49.8 bits (119), Expect = 8e-06
 Identities = 55/310 (17%), Positives = 94/310 (30%), Gaps = 11/310 (3%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P +P P         G   PP     P +  +P   +    +    +    +    A+A 
Sbjct: 127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAP 186

Query: 624 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAE 683
           +S     P  +       R      +     S+ +   A   S    +   SS +  S  
Sbjct: 187 SSPPAEPPPSTPPAAASPR--PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244

Query: 684 QGS---PISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRS 740
            G    P +     +    +  + +   +  N  +  P P  SSS   R R  S   +  
Sbjct: 245 SGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSS-PRERSPSPSPSSP 303

Query: 741 RVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRS 800
               +P++     SS ++R    SS +  S               +  RS     P   +
Sbjct: 304 GSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPSPSRPPPPA 360

Query: 801 AVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTV-QGAPRIHSREQSSPNTHSAEQRSPNTR 859
               SP  R R   + ++ +A  G PT R      A R   R+ +        + SP   
Sbjct: 361 DP-SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419

Query: 860 SRVQSSATTR 869
                +   R
Sbjct: 420 GAASGAFYAR 429



 Score = 46.3 bits (110), Expect = 9e-05
 Identities = 46/267 (17%), Positives = 81/267 (30%), Gaps = 21/267 (7%)

Query: 563 RPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQA 622
           R  + PP   P S      SP  P  +        P S +  +P P    +      A +
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSS-------PISASASSPAPAPGRSAADDAGASS 236

Query: 623 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSA 682
           ++S+ +        P     +          +           S    +   SS   RS 
Sbjct: 237 SDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSP 296

Query: 683 EQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRV 742
                   +     +     S+ S   ++++S+ +   E S              +RS  
Sbjct: 297 SPSPSSPGSGPAP-SSPRASSSSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPS 352

Query: 743 QSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAV 802
            S P   +   SSP  R R   + +  +A  G P TR R +++   R+  + +       
Sbjct: 353 PSRPPPPADP-SSPRKRPRPSRAPSSPAASAGRP-TRRRARAAVAGRARRRDATGRF--- 407

Query: 803 QRSPNTRSRVQSSLTTHSAEQGSPTTR 829
                   R + S     A  G+   R
Sbjct: 408 -----PAGRPRPSPLDAGAASGAFYAR 429



 Score = 39.0 bits (91), Expect = 0.015
 Identities = 52/361 (14%), Positives = 88/361 (24%), Gaps = 26/361 (7%)

Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQ 627
               D  SG+QG+L                  + A           P      +A  +  
Sbjct: 30  DAADDLLSGSQGQLVSD-----SAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANES 84

Query: 628 NPLPEVSSSPNT--RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQG 685
              P  S S         + SPT            +    SP        S  +R     
Sbjct: 85  RSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSP 144

Query: 686 SPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSS 745
            P                     + A +S Q  LP  S     R+              +
Sbjct: 145 GPPPAASPPAAGASPAAV----ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPA 200

Query: 746 PTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPS-IRSAVQR 804
             +         +     S+ +   A   S    +   SS +  S   G      +    
Sbjct: 201 AASPR---PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPL 257

Query: 805 SPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQS 864
                  + + +   S   G  +            +   SSP   S      +  S   +
Sbjct: 258 PRPAPITLPTRIWEASGWNGPSSRPG--------PASSSSSPRERSPSPSPSSPGSG-PA 308

Query: 865 SATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKK 924
            ++ R++   S SR   + ST   S + R        + S   +         +   S +
Sbjct: 309 PSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSR--PPPPADPSSPR 366

Query: 925 K 925
           K
Sbjct: 367 K 367



 Score = 31.7 bits (72), Expect = 2.2
 Identities = 30/192 (15%), Positives = 56/192 (29%), Gaps = 12/192 (6%)

Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
           +   PL  P      +          P   P    + +     +P+P+P+S  +      
Sbjct: 252 ENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSS 311

Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 679
            +A++S+ +     SSS ++ S                           + SR    A+ 
Sbjct: 312 PRASSSSSSSRESSSSSTSSSSESSRGAAVSP---------GPSPSRSPSPSRPPPPADP 362

Query: 680 RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTR 739
            S  +    SR         S      RRA+A  + +      ++      R + SP   
Sbjct: 363 SSPRKRPRPSRA--PSSPAASAGRPTRRRARAAVAGRA-RRRDATGRFPAGRPRPSPLDA 419

Query: 740 SRVQSSPTTRSR 751
                +   R  
Sbjct: 420 GAASGAFYARYP 431



 Score = 29.8 bits (67), Expect = 9.2
 Identities = 17/106 (16%), Positives = 28/106 (26%), Gaps = 1/106 (0%)

Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
           +     D  S + G   PP     P    +   +          +R   N  RR +    
Sbjct: 821 SRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGK-NGRRRPRPPEP 879

Query: 626 AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRS 671
              P           +    +P  R R    + +     G P+ R 
Sbjct: 880 RARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRG 925


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 57.0 bits (138), Expect = 4e-08
 Identities = 29/161 (18%), Positives = 57/161 (35%), Gaps = 45/161 (27%)

Query: 395  QLNKEQQRLAPLLDRLRRYEADLKALYKRRWILQKKFDKVKHTRRQKFENMYDFVNDNID 454
            +  + ++R   L  +    E   + L        +  +++   +R++F+  +D +N+N  
Sbjct: 971  EYEEVEERYEELKSQREDLEEAKEKL-------LEVIEELDKEKRERFKETFDKINENFS 1023

Query: 455  ATYKVHLKSSACQVYLIVSGWPKKKENLVIVVQSNLFSPQALTQSFGHAFLDITGAEQEP 514
              +K        ++ L                                          +P
Sbjct: 1024 EIFKELFGGGTAELELTEPD--------------------------------------DP 1045

Query: 515  YLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
              AG+  S  P G+    +  LSGGE++L ALA++F I K 
Sbjct: 1046 LTAGIEISARPPGKKLQSLSLLSGGEKSLTALALLFAIQKY 1086


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 52.2 bits (125), Expect = 1e-06
 Identities = 51/261 (19%), Positives = 87/261 (33%), Gaps = 34/261 (13%)

Query: 568 PPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPN---------PNPNSRANPNLSR 618
             Q  P    Q +L+P +         N   NS +  +            +  ++P L +
Sbjct: 28  SKQLLPPRRLQRKLNPISIRNGA---DNDIINSESKESFGKYALGHQIFSSFSSSPKLFQ 84

Query: 619 RAQA--NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
           R  +    +        +SS N+    Q SP + S   +  + QS +   P   S VQ  
Sbjct: 85  RRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKR 144

Query: 677 ANI-----RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSR 731
            N       +     PI  T        +  +   +   +  S    LP    SP+ R +
Sbjct: 145 KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLP----SPHFRQK 200

Query: 732 VQSSPTTRSRVQSSPTTRSR-VQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRS 790
             SS T  S   S P+ R     SS +  S   SS  +        +  S + S+     
Sbjct: 201 FSSSDT--SNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNI---- 254

Query: 791 AEQGSPSIRSAVQRSPNTRSR 811
               +PS  ++   S +++  
Sbjct: 255 ----TPSSSNSEAMSTSSKRP 271



 Score = 44.9 bits (106), Expect = 2e-04
 Identities = 57/295 (19%), Positives = 105/295 (35%), Gaps = 27/295 (9%)

Query: 639 TRSRVQSSPTTRSRVQSSL---TIQSAEQ-GSPNTRSRVQSSANIRSAEQGSPISRTRFT 694
           + + V        R+Q  L   +I++       N+ S+          +  S        
Sbjct: 23  SDAFVSKQLLPPRRLQRKLNPISIRNGADNDIINSESKESFGKYALGHQIFS-------- 74

Query: 695 KYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQS 754
                S K    R +    +        +SS N+    Q SP + S   +  +T+SR  S
Sbjct: 75  -SFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDS 133

Query: 755 SPTTRSRVQSS---LTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSR 811
            P   S VQ     L   S+  G+        ++     +  G+P+ RS  +  P+  S+
Sbjct: 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNG----SHAGAPNARS-RKEIPSLGSQ 188

Query: 812 VQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRST 871
                + H  ++ S +  S     P I    + S N+  +    P++ + V     + S+
Sbjct: 189 SMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPS---FPHSSTAVLLKRHSGSS 245

Query: 872 EQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKKKD 926
              S   S +T S+        S+++       A  +  +V  + R +L   KKD
Sbjct: 246 -GASLISSNITPSSSNSEAMSTSSKRP--YIYPALLSRVAVEFKMRLQLGDHKKD 297


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 51.9 bits (124), Expect = 2e-06
 Identities = 61/364 (16%), Positives = 102/364 (28%), Gaps = 16/364 (4%)

Query: 557  VRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNL 616
            V  +  RP  PP  + P      +      GP P S + P+ ++   P P+P+  AN   
Sbjct: 2582 VTSRARRPDAPPQSARPR--APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPD 2639

Query: 617  SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
                      + P  + +    +R R       R+   SS   +   + +  T   + S 
Sbjct: 2640 PHPPPTVPPPERPRDDPAPGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSL 2698

Query: 677  ANIRSAE---QGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ 733
            A+        + +P +    T        +  +  A     A   +P   ++P   +R  
Sbjct: 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA 2758

Query: 734  SSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQ 793
              PTT      +P                 +SL+       SP   +   ++    +A  
Sbjct: 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818

Query: 794  GSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQ 853
               +  +     P +                 P   S   G          SP    A  
Sbjct: 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878

Query: 854  RSPNTRSRVQSSATTRSTE---------QRSRSRSPLTRSTEQGSPNIRSTEQRASRTRS 904
              P  R R+   A +RSTE         +R            Q  P      Q       
Sbjct: 2879 ARPPVR-RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937

Query: 905  ADQA 908
              Q 
Sbjct: 2938 RPQP 2941



 Score = 41.8 bits (98), Expect = 0.002
 Identities = 55/344 (15%), Positives = 83/344 (24%), Gaps = 20/344 (5%)

Query: 563  RPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQA 622
            R       S P    +   + PT G        P P     P P+    A P     A A
Sbjct: 2669 RLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA 2728

Query: 623  NNS--------AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQ 674
              +        A   +P   ++P   +R    PTT      +     A            
Sbjct: 2729 RQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788

Query: 675  SSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQS 734
            +S +       SP          L    +     + A         + ++ P        
Sbjct: 2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848

Query: 735  SPTTRSRV------QSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 788
            S      V      +  P +RS   + P   +R        +    S +T S        
Sbjct: 2849 SLPLGGSVAPGGDVRRRPPSRSPA-AKPAAPARPPVRRL--ARPAVSRSTESFALPPDQP 2905

Query: 789  RSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNT 848
                Q         Q  P    + Q         Q      +   GA             
Sbjct: 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965

Query: 849  HSAEQRSPNTRSRVQSSATTRSTEQRS---RSRSPLTRSTEQGS 889
                 R    R RV   A +R     S    +   L+R +   S
Sbjct: 2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWAS 3009



 Score = 35.3 bits (81), Expect = 0.19
 Identities = 22/102 (21%), Positives = 32/102 (31%), Gaps = 6/102 (5%)

Query: 560  QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
            Q  RP  P     P+   Q    PP P P P     P P     P  +P     P+ +  
Sbjct: 2904 QPERPPQPQAPPPPQPQPQ---PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960

Query: 620  AQANNS---AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLT 658
                 +    +  +P         SR   + +T      SL+
Sbjct: 2961 QPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002



 Score = 32.2 bits (73), Expect = 1.9
 Identities = 15/77 (19%), Positives = 23/77 (29%), Gaps = 8/77 (10%)

Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
            Q RP  P P S P          P P   P     P P++      + +  A P   + 
Sbjct: 403 DQTRPAAPVPASVPTPA-----PTPVPASAPPPPATPLPSAEP---GSDDGPAPPPERQP 454

Query: 620 AQANNSAQNPLPEVSSS 636
                      P+ ++ 
Sbjct: 455 PAPATEPAPDDPDDATR 471



 Score = 31.4 bits (71), Expect = 2.7
 Identities = 28/150 (18%), Positives = 42/150 (28%), Gaps = 8/150 (5%)

Query: 563  RPLTPPPQSDPESGTQ---GELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
            RP +  P + P +  +     L+ P    +  S   P       P P       P     
Sbjct: 2865 RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924

Query: 620  AQANNSAQ---NPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
                        P P+   +P T       P+          +       P  R RV   
Sbjct: 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP--RFRVPQP 2982

Query: 677  ANIRSAEQGSPISRTRFTKYTLQSTKSNLS 706
            A  R A   S    T  +   + S  S+L+
Sbjct: 2983 APSREAPASSTPPLTGHSLSRVSSWASSLA 3012


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 46.5 bits (110), Expect = 7e-05
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 513  EPYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTI 552
            +P+  G+  S  P G+    + +LSGGE+TL ALA+IF I
Sbjct: 1056 DPFSGGIEISARPPGKGVKNLDNLSGGEKTLVALALIFAI 1095


>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
           This family consists of several Neisseria meningitidis
           TspB virulence factor proteins.
          Length = 502

 Score = 44.6 bits (105), Expect = 2e-04
 Identities = 24/84 (28%), Positives = 31/84 (36%), Gaps = 3/84 (3%)

Query: 567 PPPQSDPESGTQGELSP---PTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P P   P S    E  P    +P  NP +  NP  N    PNP P+   NP+ +      
Sbjct: 315 PRPDLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQ 374

Query: 624 NSAQNPLPEVSSSPNTRSRVQSSP 647
              +   P V   PN R R +   
Sbjct: 375 PGTRPDSPAVPDRPNGRHRKERKE 398



 Score = 36.5 bits (84), Expect = 0.059
 Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 8/67 (11%)

Query: 249 RP-LTPPPQSDPESGTQGELSP-------PTPGPNPNSRANPNPNPNSRANPNLSRRAQA 300
           RP LTP     PE+    E+SP       P P  NP +R NP P+P+   + N     Q 
Sbjct: 316 RPDLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQP 375

Query: 301 NNSAQNP 307
                +P
Sbjct: 376 GTRPDSP 382


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 7/51 (13%)

Query: 514  PYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRP 564
            P+  G+  S  P+G+    ++ +SGGE++L AL+ IF I       Q+ +P
Sbjct: 1053 PFAGGLELSAKPKGKPVQRLEAMSGGEKSLTALSFIFAI-------QRYKP 1096


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 39.4 bits (92), Expect = 0.008
 Identities = 28/141 (19%), Positives = 42/141 (29%), Gaps = 25/141 (17%)

Query: 555 LVVRVQQLRPLTP-PPQSDPESGTQGELSPPTPGPNPHSRVNP---------------NP 598
           L  +  QL+P     P   P  G  G+  PP  G  P  + N                 P
Sbjct: 371 LQDQFMQLQPRMRQLPMGSPMGGAMGQ--PPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP 428

Query: 599 NSRANPN---PNPNSRANPNLSRRAQANNSAQNPLPEVSSS---PNTRSRVQSSPTTRSR 652
                PN   P    RA    ++ A        P+    +    P ++   Q   T    
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPP-MQPVMYPPNYQSLPLSQDLPQPQSTASQG 487

Query: 653 VQSSLTIQSAEQGSPNTRSRV 673
            Q+    Q     +P  + +V
Sbjct: 488 GQNKKLAQVLASATPQMQKQV 508


>gnl|CDD|213245 cd03278, ABC_SMC_barmotin, ATP-binding cassette domain of barmotin,
           a member of the SMC protein family.  Barmotin is a tight
           junction-associated protein expressed in rat epithelial
           cells which is thought to have an important regulatory
           role in tight junction barrier function. Barmotin
           belongs to the SMC protein family. SMC proteins are
           large (approximately 110 to 170 kDa), and each is
           arranged into five recognizable domains. Amino-acid
           sequence homology of SMC proteins between species is
           largely confined to the amino- and carboxy-terminal
           globular domains. The amino-terminal domain contains a
           'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the
           single-letter amino-acid code), which by mutational
           studies has been shown to be essential in several
           proteins. The carboxy-terminal domain contains a
           sequence (the DA-box) that resembles a 'Walker B' motif,
           and a motif with homology to the signature sequence of
           the ATP-binding cassette (ABC) family of ATPases. The
           sequence homology within the carboxy-terminal domain is
           relatively high within the SMC1-SMC4 group, whereas SMC5
           and SMC6 show some divergence in both of these
           sequences. In eukaryotic cells, the proteins are found
           as heterodimers of SMC1 paired with SMC3, SMC2 with
           SMC4, and SMC5 with SMC6 (formerly known as Rad18).
          Length = 197

 Score = 37.8 bits (89), Expect = 0.010
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 7/29 (24%)

Query: 536 LSGGERTLAALAMIFTIWKLVVRVQQLRP 564
           LSGGE+ L ALA++F I        ++RP
Sbjct: 114 LSGGEKALTALALLFAI-------FRVRP 135


>gnl|CDD|213242 cd03275, ABC_SMC1_euk, ATP-binding cassette domain of eukaryotic
           SMC1 proteins.  The structural maintenance of
           chromosomes (SMC) proteins are large (approximately 110
           to 170 kDa), and each is arranged into five recognizable
           domains. Amino-acid sequence homology of SMC proteins
           between species is largely confined to the amino- and
           carboxy-terminal globular domains. The amino-terminal
           domain contains a 'Walker A' nucleotide-binding domain
           (GxxGxGKS/T, in the single-letter amino-acid code),
           which by mutational studies has been shown to be
           essential in several proteins. The carboxy-terminal
           domain contains a sequence (the DA-box) that resembles a
           'Walker B' motif, and a motif with homology to the
           signature sequence of the ATP-binding cassette (ABC)
           family of ATPases. The sequence homology within the
           carboxy-terminal domain is relatively high within the
           SMC1-SMC4 group, whereas SMC5 and SMC6 show some
           divergence in both of these sequences. In eukaryotic
           cells, the proteins are found as heterodimers of SMC1
           paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
           (formerly known as Rad18).
          Length = 247

 Score = 38.3 bits (90), Expect = 0.012
 Identities = 14/29 (48%), Positives = 22/29 (75%)

Query: 524 TPQGRSYVGIQHLSGGERTLAALAMIFTI 552
            P G+ +  + +LSGGE+T+AALA++F I
Sbjct: 144 NPPGKRFRDMDNLSGGEKTMAALALLFAI 172


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 38.0 bits (88), Expect = 0.022
 Identities = 39/185 (21%), Positives = 64/185 (34%), Gaps = 23/185 (12%)

Query: 563 RPLTPPPQSDPESGTQGELSPPT-PGPNPHSRVNPNPNS--------RANPNPNPNSRAN 613
           RPL+P    +       +L PPT P P P S    +  S          +  P+P S +N
Sbjct: 369 RPLSPYTAYE-------DLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASN 421

Query: 614 PNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV 673
                 AQ       PL   +   + +     SPT  + V  S++  S+    P+T    
Sbjct: 422 VPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAP-- 479

Query: 674 QSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ 733
            ++A   +A       R      +  +   +L      + +A       SS+        
Sbjct: 480 -ATAATDAAAPPPANMR----PLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGN 534

Query: 734 SSPTT 738
           S+P T
Sbjct: 535 SAPPT 539


>gnl|CDD|213206 cd03239, ABC_SMC_head, The SMC head domain belongs to the
           ATP-binding cassette superfamily.  The structural
           maintenance of chromosomes (SMC) proteins are essential
           for successful chromosome transmission during
           replication and segregation of the genome in all
           organisms. SMCs are generally present as single proteins
           in bacteria, and as at least six distinct proteins in
           eukaryotes. The proteins range in size from
           approximately 110 to 170 kDa, and each has five distinct
           domains: amino- and carboxy-terminal globular domains,
           which contain sequences characteristic of ATPases, two
           coiled-coil regions separating the terminal domains ,
           and a central flexible hinge. SMC proteins function
           together with other proteins in a range of chromosomal
           transactions, including chromosome condensation,
           sister-chromatid cohesion, recombination, DNA repair,
           and epigenetic silencing of gene expression.
          Length = 178

 Score = 36.9 bits (86), Expect = 0.022
 Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 3/31 (9%)

Query: 525 PQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
            QG+     Q LSGGE++L+ALA+IF + ++
Sbjct: 87  LQGKVE---QILSGGEKSLSALALIFALQEI 114


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 37.2 bits (87), Expect = 0.045
 Identities = 43/244 (17%), Positives = 67/244 (27%), Gaps = 31/244 (12%)

Query: 560 QQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
                 TP P   P        +P T          P   S   P P   +  +P     
Sbjct: 372 SAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPAN 431

Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSS---PTTRSRVQS-----SLTIQSAE-QGSPNTR 670
           A     + N L E+        ++ +    P+TR  +       SL    A    SPN  
Sbjct: 432 AANAPPSLN-LEEL------WQQILAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPNWL 484

Query: 671 SRVQSSAN-IRSAEQ---GSPISRTRFTKYTLQSTKSNLSRRAQ----ANNSAQNPLPEV 722
             VQS    +  A     G  I      K  L+S   + S  A+       S   P P  
Sbjct: 485 GMVQSRKPLLEQAFAKVLGRSI------KLNLESQSGSASNTAKTPPPPQKSPPPPAPTP 538

Query: 723 SSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV 782
                T +    +P       +  ++       P   S         +      ++   +
Sbjct: 539 PLPQPTATAPPPTPPPPPPTATQASSN-APAQIPADSSPPPPIPEEPTPSPTKDSSPEEI 597

Query: 783 QSSA 786
             +A
Sbjct: 598 DKAA 601



 Score = 29.9 bits (68), Expect = 8.4
 Identities = 12/78 (15%), Positives = 22/78 (28%), Gaps = 3/78 (3%)

Query: 554 KLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPN---PNS 610
                     P  P PQ    +       PP       S       + ++P P      +
Sbjct: 526 PPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPT 585

Query: 611 RANPNLSRRAQANNSAQN 628
            +    S   + + +A+N
Sbjct: 586 PSPTKDSSPEEIDKAAKN 603


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 37.0 bits (85), Expect = 0.061
 Identities = 68/377 (18%), Positives = 130/377 (34%), Gaps = 47/377 (12%)

Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRA--QANN 624
            P      SG     +PPT G    S      +   + N N  + A  N +  +     N
Sbjct: 152 APHAQHNHSGKTNGDNPPTGGQYHQSGGTSRNHQMMDSNKNCPADALFNETNPSGEHKRN 211

Query: 625 SAQNPLPE---VSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGS------PNTRSRVQS 675
           S    +P    + S PN    VQ +   + +  +S T   A + +       N + R  S
Sbjct: 212 SIDGDIPSDIYIDSQPN-EGDVQKTNPWQGKQGNSATSPPANENNAVTLSCSNDQQRGAS 270

Query: 676 SANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQ-- 733
           SA    AE G    R         S  ++ +      ++A NPL   +++   R+ +Q  
Sbjct: 271 SA----AESGYAHHRG--------SNIASHTPNDNIMHAANNPLN--NTNDAQRNAIQGD 316

Query: 734 ---SSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRV-QSSANIR 789
               +P  ++          ++       S   +S        G+      + Q   + R
Sbjct: 317 LVRGAPNDKNSFDRGNEKTYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDAR 376

Query: 790 SAEQGSP--------SIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSR 841
            A  G P        +  S   +S   +S    S   +S       +      AP  ++ 
Sbjct: 377 GAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYS---NPGNSNPGYNNAPNSNTP 433

Query: 842 EQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASR 901
             + PN+++     PN+     +   + +      S +PL+ +    + +  S    A +
Sbjct: 434 YNNPPNSNTPYSNPPNSNPPYSNLPYSNT----PYSNAPLSNAPPSSAKDHHSAYHAAYQ 489

Query: 902 TRSADQASQSVVTRSRS 918
            R+A+Q + ++ T ++ 
Sbjct: 490 HRAANQPAANLPTANQP 506



 Score = 33.1 bits (75), Expect = 0.94
 Identities = 51/277 (18%), Positives = 104/277 (37%), Gaps = 14/277 (5%)

Query: 601 RANPNPNPN-SRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTI 659
           R  PN   +  R N    +  Q      +  P  +S+    + + +        Q     
Sbjct: 319 RGAPNDKNSFDRGN---EKTYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDA 375

Query: 660 QSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPL 719
           + A  G P++ +    +A   +A+  +  S   F+     +  SN        N+A N  
Sbjct: 376 RGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFS----NAGYSNPGNSNPGYNNAPNSN 431

Query: 720 PEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR 779
              ++ PN+ +   + P +     + P + +   ++P + +   S+    SA   +   R
Sbjct: 432 TPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHR 491

Query: 780 SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAE-QGSPTTRSTVQGAPRI 838
           +  Q +AN+ +A Q  P+  +    + N+     +S    SA   G+  T +   G  + 
Sbjct: 492 AANQPAANLPTANQ--PAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKR 549

Query: 839 HSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRS 875
               +   N    EQ    +   V+SS++  S+E  +
Sbjct: 550 EDHPEGGTNRQKYEQSDEES---VESSSSENSSENEN 583



 Score = 30.4 bits (68), Expect = 6.1
 Identities = 45/276 (16%), Positives = 91/276 (32%), Gaps = 33/276 (11%)

Query: 648 TTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSR 707
              + + +  +  ++ + +   R    SS N+ +A   +  +R    +       +    
Sbjct: 63  ANENHINTRSSDNNSCENANMNRDANSSSENVGNANGNNYDNRAHVGR--AYGAGACEQS 120

Query: 708 RAQANNSAQNPLPEVSSSPN---TRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQS 764
             Q+N    +P+  +   PN    R R + +       Q + + ++   + PT     QS
Sbjct: 121 NQQSNIPIGDPVNHLRGHPNFGEPRERAEDAAP---HAQHNHSGKTNGDNPPTGGQYHQS 177

Query: 765 SLTIQSAEQGSPNTRSRVQSSAN--IRSAEQGSPSIRS---------------AVQRSPN 807
             T ++ +    N      +  N    S E    SI                  VQ++  
Sbjct: 178 GGTSRNHQMMDSNKNCPADALFNETNPSGEHKRNSIDGDIPSDIYIDSQPNEGDVQKTNP 237

Query: 808 TRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSAT 867
            + +  +S T+  A + +  T S      R  S    S   H    R  N  S   +   
Sbjct: 238 WQGKQGNSATSPPANENNAVTLSCSNDQQRGASSAAESGYAH---HRGSNIASHTPNDNI 294

Query: 868 TRSTEQRSRSRSPLTRSTEQ-----GSPNIRSTEQR 898
             +      + +   R+  Q     G+PN +++  R
Sbjct: 295 MHAANNPLNNTNDAQRNAIQGDLVRGAPNDKNSFDR 330


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 36.7 bits (84), Expect = 0.062
 Identities = 45/273 (16%), Positives = 83/273 (30%), Gaps = 11/273 (4%)

Query: 620 AQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANI 679
            + + +   P        +T   V ++  T      + +    E  SP +R+   +SA  
Sbjct: 441 VEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRT---TSATP 497

Query: 680 RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTR 739
            +      ++    T  T Q T S+                  +S P   + V  + T+ 
Sbjct: 498 NATSPTPAVTTPNATSPTTQKT-SDTPNATSPTPIVIGVTTTATSPPTGTTSV-PNATSP 555

Query: 740 SRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIR 799
              + SP   +      +  S + S++T      GS  T  +      I S+   +P   
Sbjct: 556 QVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQ----PGIPSSSHSTPRSN 611

Query: 800 SAVQRSPNTRSRVQSSLTTHSAEQGSPTTR--STVQGAPRIHSREQSSPNTHSAEQRSPN 857
           S       T +               P+T   ST+   P   +  Q S   +S+  R P 
Sbjct: 612 STSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPG 671

Query: 858 TRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSP 890
                +      +T   + S       T   + 
Sbjct: 672 EVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704



 Score = 36.7 bits (84), Expect = 0.065
 Identities = 54/316 (17%), Positives = 97/316 (30%), Gaps = 22/316 (6%)

Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
           T  P S   +GT     P    P   +  +  PN+ +         A    +++     +
Sbjct: 466 TADPTSGTPTGTTSSTLPEDTSPTSRT-TSATPNATSPTPAVTTPNATSPTTQKTSDTPN 524

Query: 626 AQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAE---QGSPNTRSRVQSSANIRSA 682
           A +P P V     T     S PT  + V ++ + Q  E     + NT     + + + SA
Sbjct: 525 ATSPTPIVIGVTTT---ATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSA 581

Query: 683 EQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVS---SSPNTRSRVQSSPTTR 739
                               S+     ++N+++  PL   +      N      S P+T 
Sbjct: 582 VTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTT 641

Query: 740 SRVQSSPTTRSRV--QSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPS 797
                SP        Q S    S            +G PN  +         ++      
Sbjct: 642 HVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNA---------TSPSAPSG 692

Query: 798 IRSAVQRSPNTRSRVQSSLTTHSAEQ-GSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSP 856
            ++AV    +T  +  S+    S     + T+  T +GA R      ++    S   +  
Sbjct: 693 QKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLR 752

Query: 857 NTRSRVQSSATTRSTE 872
              +      TT+   
Sbjct: 753 PRWTFTSPPVTTKQAT 768



 Score = 30.9 bits (69), Expect = 4.4
 Identities = 38/254 (14%), Positives = 72/254 (28%), Gaps = 22/254 (8%)

Query: 677 ANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSP 736
           AN  +  +   I+RT  T  T  + K    +      S       V   P+  + V ++P
Sbjct: 395 ANPVADAKTLIITRTA-TNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTP 453

Query: 737 ------TTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRS 790
                 T  +   + PT+ +   ++ +T     S  +  ++   +  + +   ++ N  S
Sbjct: 454 SLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATS 513

Query: 791 AEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGA---PRIHSREQSSPN 847
                 S              V ++ T+      S    ++ Q     P  ++      +
Sbjct: 514 PTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTS 573

Query: 848 THSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRST------------EQGSPNIRST 895
             S    +  T      S+ T        S     RS               G  NI   
Sbjct: 574 APSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE 633

Query: 896 EQRASRTRSADQAS 909
                 T      S
Sbjct: 634 TPSVPSTTHVSTLS 647


>gnl|CDD|213239 cd03272, ABC_SMC3_euk, ATP-binding cassette domain of eukaryotic
           SMC3 proteins.  The structural maintenance of
           chromosomes (SMC) proteins are large (approximately 110
           to 170 kDa), and each is arranged into five recognizable
           domains. Amino-acid sequence homology of SMC proteins
           between species is largely confined to the amino- and
           carboxy-terminal globular domains. The amino-terminal
           domain contains a 'Walker A' nucleotide-binding domain
           (GxxGxGKS/T, in the single-letter amino-acid code),
           which by mutational studies has been shown to be
           essential in several proteins. The carboxy-terminal
           domain contains a sequence (the DA-box) that resembles a
           'Walker B' motif, and a motif with homology to the
           signature sequence of the ATP-binding cassette (ABC)
           family of ATPases. The sequence homology within the
           carboxy-terminal domain is relatively high within the
           SMC1-SMC4 group, whereas SMC5 and SMC6 show some
           divergence in both of these sequences. In eukaryotic
           cells, the proteins are found as heterodimers of SMC1
           paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
           (formerly known as Rad18).
          Length = 243

 Score = 35.7 bits (83), Expect = 0.072
 Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 10/40 (25%)

Query: 525 PQGR----------SYVGIQHLSGGERTLAALAMIFTIWK 554
           PQG+              +Q LSGG+++L ALA+IF I K
Sbjct: 138 PQGKINSLTNMKQDEQQEMQQLSGGQKSLVALALIFAIQK 177


>gnl|CDD|213241 cd03274, ABC_SMC4_euk, ATP-binding cassette domain of eukaryotic
           SMC4 proteins.  The structural maintenance of
           chromosomes (SMC) proteins are large (approximately 110
           to 170 kDa), and each is arranged into five recognizable
           domains. Amino-acid sequence homology of SMC proteins
           between species is largely confined to the amino- and
           carboxy-terminal globular domains. The amino-terminal
           domain contains a 'Walker A' nucleotide-binding domain
           (GxxGxGKS/T, in the single-letter amino-acid code),
           which by mutational studies has been shown to be
           essential in several proteins. The carboxy-terminal
           domain contains a sequence (the DA-box) that resembles a
           'Walker B' motif, and a motif with homology to the
           signature sequence of the ATP-binding cassette (ABC)
           family of ATPases. The sequence homology within the
           carboxy-terminal domain is relatively high within the
           SMC1-SMC4 group, whereas SMC5 and SMC6 show some
           divergence in both of these sequences. In eukaryotic
           cells, the proteins are found as heterodimers of SMC1
           paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
           (formerly known as Rad18).
          Length = 212

 Score = 35.3 bits (82), Expect = 0.087
 Identities = 12/23 (52%), Positives = 20/23 (86%)

Query: 528 RSYVGIQHLSGGERTLAALAMIF 550
           +S+  I +LSGGE+TL++LA++F
Sbjct: 120 KSWKNISNLSGGEKTLSSLALVF 142


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 34.7 bits (80), Expect = 0.20
 Identities = 22/131 (16%), Positives = 38/131 (29%), Gaps = 9/131 (6%)

Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
            P P    +   +      T   N     + +P S +  +P  +S   P+     +  +S
Sbjct: 147 DPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS 206

Query: 626 AQNP-LPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQ 684
             +  LP   SS  +     + P        S T  S   G               S  +
Sbjct: 207 PSDSSLPPAPSSFQS----DTPPP----SPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258

Query: 685 GSPISRTRFTK 695
            +P S +    
Sbjct: 259 PTPPSASATPA 269



 Score = 33.1 bits (76), Expect = 0.60
 Identities = 17/76 (22%), Positives = 23/76 (30%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P  P   S PE  +    S   P P+      P P+  +  NP+P            Q  
Sbjct: 192 PGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQV 251

Query: 624 NSAQNPLPEVSSSPNT 639
                  P   S+  T
Sbjct: 252 PPLSTAKPTPPSASAT 267



 Score = 30.8 bits (70), Expect = 2.9
 Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 6/99 (6%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
              P   S  +  +     P  P P      +   +S   P P+      P  S  +  N
Sbjct: 176 DADPASASPSDPPSSSPGVPSFPSPPEDP--SSPSDSSLPPAPSSFQSDTPPPSPESPTN 233

Query: 624 NSAQNPLPEVSSSPNTRSRVQSS---PTTRSRVQSSLTI 659
            S   P P     P  +     S   PT  S   +   I
Sbjct: 234 PSPP-PGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPI 271



 Score = 30.4 bits (69), Expect = 4.2
 Identities = 9/84 (10%), Positives = 16/84 (19%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
                P            S  +  P P S  +  P        NP+    P         
Sbjct: 190 SSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQ 249

Query: 624 NSAQNPLPEVSSSPNTRSRVQSSP 647
                   + +    + +      
Sbjct: 250 QVPPLSTAKPTPPSASATPAPIGG 273


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.21
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 501  GHAFLDITGAEQEPYLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIFTIWKL 555
            G A L +T  E +   AG+     P G+    +  LSGGE+ L ALA++F I+K+
Sbjct: 1056 GEAELRLTDPE-DLLEAGIEIFAQPPGKKNQNLSLLSGGEKALTALALLFAIFKV 1109


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 35.1 bits (81), Expect = 0.21
 Identities = 50/220 (22%), Positives = 72/220 (32%), Gaps = 22/220 (10%)

Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA- 771
            + Q      + +P    +   SPT R+    +   +   +  P    +  +   IQ+A 
Sbjct: 12  TAVQPEATPSAGAPTGLQQSSESPTQRASHSLASEGKKNRKKMPKVFQKSSAPRQIQAAP 71

Query: 772 EQGSPNTRSRVQSSANIR----SAEQGSPSIRSAVQRSPNTRSRVQSSLTT-HSAEQGSP 826
            Q    T +  QSS         A         A + SP+ R   +S     H  E  + 
Sbjct: 72  PQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRLTRSEGVARHEMEDLA- 130

Query: 827 TTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTE 886
             R  V+       R Q   N  S+ +R P     V     T S         P T    
Sbjct: 131 -GRPVVKPDADRQLR-QDILNKSSSSRRPP-----VSKEEGTSSKM-------PATALAS 176

Query: 887 QGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLPSKKKD 926
                     Q     RS DQASQS ++RSR   P+   D
Sbjct: 177 AALFKDDEIRQEVDAARS-DQASQSRLSRSRGNPPAIPPD 215


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 34.3 bits (78), Expect = 0.34
 Identities = 29/145 (20%), Positives = 43/145 (29%), Gaps = 1/145 (0%)

Query: 524 TPQGRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRPLTP-PPQSDPESGTQGELS 582
           TP   ++  I H+        A  M+   W         R  TP  P + P    Q   +
Sbjct: 660 TPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAA 719

Query: 583 PPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSR 642
                  P +         A P       A P  +R   A      P      +P  +  
Sbjct: 720 ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779

Query: 643 VQSSPTTRSRVQSSLTIQSAEQGSP 667
            Q+ P  + R + + T Q   Q  P
Sbjct: 780 PQAPPAPQQRPRGAPTPQPPPQAGP 804


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 34.2 bits (78), Expect = 0.35
 Identities = 18/82 (21%), Positives = 31/82 (37%), Gaps = 9/82 (10%)

Query: 573 PESGTQGELSPPTPGPNPHSRVNPNPNSRANPN---------PNPNSRANPNLSRRAQAN 623
           PE+   G+  P  P P  + R   N N + NP            P+ R N N +   + +
Sbjct: 427 PENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNRHNSEDRYTRPHGRNNENRNYNNKNS 486

Query: 624 NSAQNPLPEVSSSPNTRSRVQS 645
           +  ++P       P  + +  S
Sbjct: 487 DIPKHPERSEHEQPEDKKKKSS 508


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 34.0 bits (77), Expect = 0.35
 Identities = 29/139 (20%), Positives = 58/139 (41%), Gaps = 13/139 (9%)

Query: 660 QSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQ----ANNSA 715
           Q+A   +P    + ++ A   + +Q +P+ +      T  + K   + + Q    A   A
Sbjct: 245 QTANTATPKAEVKTEAPA---AEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEA 301

Query: 716 QNPLPEVSSSPNTRS-RVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQG 774
             P P  S++ N       ++  T +   S+P+  +   ++  T +   ++     A QG
Sbjct: 302 AKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTN-----ANQG 356

Query: 775 SPNTRSRVQSSANIRSAEQ 793
           S N  S   +SA I  A++
Sbjct: 357 SSNNNSNSSASAIIAEAQK 375



 Score = 34.0 bits (77), Expect = 0.40
 Identities = 30/143 (20%), Positives = 54/143 (37%), Gaps = 23/143 (16%)

Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
           P   ++  P P+ N+ AN   +      N+     P  +++ NT S   ++  T      
Sbjct: 298 PTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNT------ 351

Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQG------------SPISRTRFTKYTLQSTKS 703
                +A QGS N  S   +SA I  A++             +    + +TKY       
Sbjct: 352 -----NANQGSSNNNSNSSASAIIAEAQKHLGKAYSWGGNGPTTFDCSGYTKYVFAKAGI 406

Query: 704 NLSRRAQANNSAQNPLPEVSSSP 726
           +L R + A  ++   + E  + P
Sbjct: 407 SLPRTSGAQYASTTRISESQAKP 429


>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
          Length = 746

 Score = 34.2 bits (78), Expect = 0.37
 Identities = 20/77 (25%), Positives = 34/77 (44%)

Query: 249 RPLTPPPQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL 308
            P   P  + P S T+G    P PG    SR  P   P++  +P  +  A+ + +A+N  
Sbjct: 439 CPPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPASPPSNIVSPRTNPAAEGSTAAKNGQ 498

Query: 309 FSKGEHLCGTLNFQDCT 325
            ++   +  +  F DC 
Sbjct: 499 GAETIQVRSSGEFNDCI 515


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 33.6 bits (77), Expect = 0.48
 Identities = 20/90 (22%), Positives = 29/90 (32%), Gaps = 3/90 (3%)

Query: 701 TKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT-- 758
             S L ++      +  P P       T     S  + R  +Q   T   +  S  T   
Sbjct: 236 KISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSRRTPSL 295

Query: 759 -RSRVQSSLTIQSAEQGSPNTRSRVQSSAN 787
            RSR  SSL      + S N    + S  +
Sbjct: 296 LRSRTNSSLIEFLKRESSENLLPSLSSRTS 325


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 33.1 bits (75), Expect = 0.55
 Identities = 23/94 (24%), Positives = 31/94 (32%), Gaps = 5/94 (5%)

Query: 558 RVQQLRPLTPP-PQSDPESGTQGELSPPTPGPNPHSR---VNPNPNSRANPNPNPNSRAN 613
           R+   R   P  P      G   ++ P  PG  P+      NP P     P  N N    
Sbjct: 168 RINPQRFGFPMQPNMGMRPGFN-QMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMM 226

Query: 614 PNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSP 647
           PN++R            P     PN + R   +P
Sbjct: 227 PNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNP 260


>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
          Length = 467

 Score = 33.4 bits (76), Expect = 0.60
 Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 3/79 (3%)

Query: 847 NTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSAD 906
           N  S      N  +      T+RST    RS S   R T   + + R+     + T    
Sbjct: 25  NALSGNSPKANNSAS-TGQTTSRSTNSARRSGSKRDRETATSTDSGRTKSHEGAATT--K 81

Query: 907 QASQSVVTRSRSKLPSKKK 925
           QA+ +  T      P KKK
Sbjct: 82  QATTTPTTNVEVAPPPKKK 100


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 32.9 bits (75), Expect = 0.71
 Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 5/75 (6%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P  PP  +           PP P   P  + +    +   P  +PN+   P         
Sbjct: 44  PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPP-----PVDP 98

Query: 624 NSAQNPLPEVSSSPN 638
           N+   P PE     N
Sbjct: 99  NAPPPPAPEPGRIDN 113


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 31.9 bits (71), Expect = 0.76
 Identities = 14/65 (21%), Positives = 16/65 (24%), Gaps = 5/65 (7%)

Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNS 625
               + +PE     E     P P       P P     P P P     P     A    S
Sbjct: 75  EGEEEPEPE-----EEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKS 129

Query: 626 AQNPL 630
              P 
Sbjct: 130 VALPF 134



 Score = 29.2 bits (64), Expect = 6.1
 Identities = 10/48 (20%), Positives = 13/48 (27%), Gaps = 1/48 (2%)

Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANP 614
             P+ + E   + E     P P       P P       P P     P
Sbjct: 62  DEPEEEEEPEPE-EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 32.8 bits (74), Expect = 0.95
 Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 5/107 (4%)

Query: 684 QGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTR---SRVQSSPTTRS 740
           +G P     F      +  +     A +N +  +  P    +P+T      + S P T S
Sbjct: 553 KGVPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPS 612

Query: 741 RVQSSPTT--RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 785
           ++ S  T+   S + S  TT S  +SS+ + S E  SP +  +V S+
Sbjct: 613 KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVAST 659



 Score = 32.8 bits (74), Expect = 1.0
 Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 3/106 (2%)

Query: 712 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA 771
           N  + N  P  S++ +  +    SP+T     +SP     + S P T S++ S  T   A
Sbjct: 565 NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSPSTSPPA 623

Query: 772 EQ-GSPNTR-SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSS 815
              GSP+T  S  +SS  + S E  SP     V  + ++ S V  S
Sbjct: 624 SHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMS 669


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 32.8 bits (74), Expect = 1.0
 Identities = 54/310 (17%), Positives = 88/310 (28%), Gaps = 21/310 (6%)

Query: 597 NPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSS 656
           NP      +  P+ R     S     ++  Q  + +   S    +R+ S PT  ++    
Sbjct: 10  NPTPVQQQSRRPHERNPSPPSVSCGLSSGLQKIMMDDLDSSVDFTRLPS-PTPENKDLFF 68

Query: 657 LTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQ 716
           +T  S  Q SP   +   S + I   E G P      +   LQ   S     +       
Sbjct: 69  VTRPSLAQSSP---AYSSSYSEINEPELGMPNGGRSLSLVDLQDAASGQVHSSPMGLPGA 125

Query: 717 NPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQ------- 769
             L   S   + RS   S        Q       R+  +   + ++   L+ Q       
Sbjct: 126 LGLRRASVPWSARSGQSSIAGGLRLSQPQSAPTVRLLHAAPAQPQLLQPLSFQNPVYHMA 185

Query: 770 -------SAEQGSPNTRSRVQSSANIRSAEQGSPSIRSA--VQRSPNTRSRVQSSLTTHS 820
                    + GS N  S    S +   +     S        R     +R Q SL    
Sbjct: 186 APLPVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQ 245

Query: 821 AEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSP 880
            +   P   S      R+         +H          S     +   S   +S +R  
Sbjct: 246 HQPALPRQNSAGP-QRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPR 304

Query: 881 LTRSTEQGSP 890
              S+ +G  
Sbjct: 305 QQSSSSKGDS 314


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 3/36 (8%)

Query: 515 YLAGMNFSCTPQGRSYVGIQHLSGGERTLAALAMIF 550
           YL    F+   Q +    +  LSGGE+    LA + 
Sbjct: 422 YLGRFGFTGEDQEKP---VGVLSGGEKARLLLAKLL 454


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 32.6 bits (74), Expect = 1.3
 Identities = 30/201 (14%), Positives = 61/201 (30%), Gaps = 9/201 (4%)

Query: 29  EIHIPTVDSTSSVSSEEIRRHESNLNLLRRKTQQRRTIQHTESSTSSSNSEQNLRELTKA 88
             H    +  + +S  E + +ESNL+              + SS +  +   +  E    
Sbjct: 551 SDHPEQFEFQNELSPLETKNNESNLSSDAETEGSPNPEMPSWSSVTVPSEALDNYE---- 606

Query: 89  TTYIQNIESLLSQASDLIERRQLEAHRILYVLLKFRTELEGTHQSYDSGSDQAGFISPME 148
           TT   N + + + A   I+   +     L ++           +  +S SD     S +E
Sbjct: 607 TTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESD----GSFIE 662

Query: 149 TQDENPPSEHSSEFPGVERCLRRIESSLLNVRDLLENRPSDTLSPAQSEVQAEFLSGESR 208
               +   E               E++   V  +      +       E + + + G   
Sbjct: 663 VDSVSSTLELQVPSKSQPTDESE-ENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIE 721

Query: 209 EELTFANESSETNSQGVPRAE 229
           EE    +  +E     +   E
Sbjct: 722 EEKDADDFKNEWQDISLEELE 742


>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
           domain. 
          Length = 379

 Score = 32.1 bits (73), Expect = 1.4
 Identities = 20/124 (16%), Positives = 34/124 (27%), Gaps = 7/124 (5%)

Query: 806 PNTRSRVQSSLTTHSAEQGSPTTRSTVQGA---PRIHSREQSSPNTHSAEQRSPNTRSRV 862
              R     +  T      S +     Q      R+  +       H  + RS + R R 
Sbjct: 215 SENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAHGQQGRSGSIRGRK 274

Query: 863 QSSATTRSTEQR---SRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSK 919
            S+ T R        S   +    S+          E   S   ++++ S S        
Sbjct: 275 HST-TRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAVELRS 333

Query: 920 LPSK 923
           +P  
Sbjct: 334 IPGG 337



 Score = 30.9 bits (70), Expect = 3.1
 Identities = 25/155 (16%), Positives = 47/155 (30%), Gaps = 16/155 (10%)

Query: 666 SPNTRSRVQSSANI--RSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNS--------A 715
                     S+ I  RS  + +       ++ +  S      R++Q   S         
Sbjct: 198 RHGDEPFGAQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQG 257

Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGS 775
           +    +   S + R R  S+      V+ S +  +  ++S ++    QS+         S
Sbjct: 258 KLAHGQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSA---VRETAYS 314

Query: 776 PNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRS 810
             + S   SS+          SI      S N   
Sbjct: 315 SLSTSERHSSS---GHAVELRSIPGGSVSSQNAGP 346



 Score = 29.4 bits (66), Expect = 8.7
 Identities = 34/152 (22%), Positives = 54/152 (35%), Gaps = 14/152 (9%)

Query: 609 NSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPN 668
            SR+  N   R  ANN+++      S  P  +S++Q S       Q  L     +  S +
Sbjct: 212 LSRSE-NRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAH-GQQGRSGS 269

Query: 669 TRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNT 728
            R R  S+   R    G   S +  T     S+ S            Q+ + E + S  +
Sbjct: 270 IRGRKHSTTR-RP--FGVEPSSSGVTTNRASSSSSCFH---------QSAVRETAYSSLS 317

Query: 729 RSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRS 760
            S   SS      ++S P      Q++    S
Sbjct: 318 TSERHSSSGHAVELRSIPGGSVSSQNAGPLLS 349


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 31.9 bits (72), Expect = 1.5
 Identities = 21/106 (19%), Positives = 41/106 (38%)

Query: 800 SAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTR 859
           +  Q      S   +  +  S    + +T +T   A  + +   S+PNT +    +  T 
Sbjct: 118 TYNQLGNAGASASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTA 177

Query: 860 SRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSA 905
           +   ++ATT +T Q+   +   T+ T   +        +A    SA
Sbjct: 178 AISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASA 223



 Score = 30.0 bits (67), Expect = 6.0
 Identities = 18/184 (9%), Positives = 53/184 (28%), Gaps = 5/184 (2%)

Query: 559 VQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSR 618
            Q          +   +G+    +  +    P +      N+ + PN      +    + 
Sbjct: 120 NQLGNAGASASITKTSNGSDAA-TTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAA 178

Query: 619 RAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSAN 678
            +    +      +    P    +   +   +++  +    Q++      +    Q+++ 
Sbjct: 179 ISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSM 238

Query: 679 IRSAEQGSPISRTRFTKYTLQSTKSNLSR----RAQANNSAQNPLPEVSSSPNTRSRVQS 734
           +      +P+ +         +  S L+     R       QN   +++ S N  +    
Sbjct: 239 LAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQ 298

Query: 735 SPTT 738
           S   
Sbjct: 299 SREN 302


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 31.6 bits (72), Expect = 1.7
 Identities = 11/60 (18%), Positives = 18/60 (30%), Gaps = 2/60 (3%)

Query: 572 DPESGTQGELSPPTPGPNPHSRVNPNPNSRAN--PNPNPNSRANPNLSRRAQANNSAQNP 629
           +P +     L      P           SR++   +   NSR     S R  + N   + 
Sbjct: 128 NPNNDEAIPLRFSPGLPKGFYIEGFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSS 187


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 19/89 (21%), Positives = 34/89 (38%)

Query: 835 APRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRS 894
             R HS  +S  +  S    +  + S      +T S    S S SP   S+ Q  P+   
Sbjct: 52  GHRSHSSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTG 111

Query: 895 TEQRASRTRSADQASQSVVTRSRSKLPSK 923
              ++S   +  +  + +V R ++ L   
Sbjct: 112 VRPQSSAENANSEKRKLLVIRVQTALVVF 140


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 31.4 bits (71), Expect = 2.2
 Identities = 21/86 (24%), Positives = 33/86 (38%), Gaps = 8/86 (9%)

Query: 841 REQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRAS 900
           RE+        ++ S   R R +  +  R   +RSR RS    S        R  ++R  
Sbjct: 7   REREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDS--------RPRDRRRY 58

Query: 901 RTRSADQASQSVVTRSRSKLPSKKKD 926
            +RS      S V RSR +   + + 
Sbjct: 59  DSRSPRSLRYSSVRRSRDRPRRRSRS 84


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 31.8 bits (72), Expect = 2.3
 Identities = 31/260 (11%), Positives = 70/260 (26%), Gaps = 11/260 (4%)

Query: 605 NPNPNSRANPNLSRRAQANNSAQNPLPE---VSSSPNTRSRVQSSPTTRSRVQSSLTIQS 661
             + +S+        + + +   NP       SSSP++   +    +      SS  + +
Sbjct: 366 RDSQSSKIQQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVSSRRVGT 425

Query: 662 AEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPE 721
               S    + +    NI+     S            + ++++ +      +       E
Sbjct: 426 PGSKSDRVGAVLLRRMNIKPTL-ASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGE 484

Query: 722 VSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSR 781
            S              T S +      +S +     +R   + S    S      +    
Sbjct: 485 ESGHLPNNPFFNKLKLTLSSIPPLSPRQSIITLPTPSRPASRIS----SLSLRLGSYSGS 540

Query: 782 VQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSR 841
           + S     +      +   +  RS    S ++           +       +      SR
Sbjct: 541 IVSPPPYPTLVSRKGAAGLSFNRS---VSDIEGERIGRYNLLPTRIPALPFKAESTTSSR 597

Query: 842 EQSSPNTHSAEQRSPNTRSR 861
             SS  + +     P +  R
Sbjct: 598 RSSSLPSPTGVIGFPGSVPR 617


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 31.6 bits (71), Expect = 2.3
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 582 SPPTPGPNPHSRVNPNPNSRANPNPNP 608
           SPP P P P     P+P+   +P P P
Sbjct: 922 SPPDPDPTPDPDPTPDPDPTPDPEPTP 948



 Score = 31.2 bits (70), Expect = 3.2
 Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 580 ELSPPTP-GPNPHSRVNPNPNSRANPNPNPNSRANPNLSRR 619
           E++PP+P  P+P    +P P+    P+P P     P L+ +
Sbjct: 917 EVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAK 957



 Score = 30.4 bits (68), Expect = 6.2
 Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 11/47 (23%)

Query: 246 LELRPLTPPPQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANP 292
           L  + +TPP   DP+         PTP P+P    +P+P P+    P
Sbjct: 913 LRSQEVTPPSPPDPD---------PTPDPDPT--PDPDPTPDPEPTP 948


>gnl|CDD|213194 cd03227, ABC_Class2, ATP-binding cassette domain of non-transporter
           proteins.  ABC-type Class 2 contains systems involved in
           cellular processes other than transport. These families
           are characterized by the fact that the ABC subunit is
           made up of duplicated, fused ABC modules (ABC2). No
           known transmembrane proteins or domains are associated
           with these proteins.
          Length = 162

 Score = 30.4 bits (69), Expect = 2.4
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 536 LSGGERTLAALAMIFTIWKL 555
           LSGGE+ L+ALA+I  +  L
Sbjct: 78  LSGGEKELSALALILALASL 97


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 31.4 bits (71), Expect = 2.6
 Identities = 44/357 (12%), Positives = 92/357 (25%), Gaps = 24/357 (6%)

Query: 571 SDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL 630
            +P            PG      ++    S  +   +P  +    +        +     
Sbjct: 369 FEPGLSPSPNSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKT 428

Query: 631 PEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR--SRVQSSANIRSAEQGSPI 688
           P ++ +      V S                    S +    S    S    SA      
Sbjct: 429 PGLAVNDERSIAVSSDG----ITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPASRLD 484

Query: 689 SRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTT 748
              R      + T +  S  + +   A     E SS   +   V S    R R       
Sbjct: 485 DAKRP-----EVTATPESSGSDSEGGASGREDETSSDAES---VVSIKELRPR---IGFI 533

Query: 749 RSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNT 808
                     +SR ++ +   S    S     R + S           +  + V+ + N 
Sbjct: 534 NKSPPPKSPPKSR-RTLIVALSLASPSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNM 592

Query: 809 RSRVQSSLTTHSAEQGSPTT----RSTVQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQS 864
           R       +       S        S ++ A   + +  SS    +A     +++ ++ S
Sbjct: 593 RGGNGVGSSVKPKGSASSKPLTGPGSDLKPATL-NGKTPSSSLVGAARNAGASSKVKIPS 651

Query: 865 SATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQRASRTRSADQASQSVVTRSRSKLP 921
                ++       +     T   S  ++          + ++A     + S + + 
Sbjct: 652 GLGGFTSPISLLESALEDVLTSATSTPVKK-NDPYLWDTNGEKAGGGTESASTTDVF 707


>gnl|CDD|171912 PRK13223, PRK13223, phosphoglycolate phosphatase; Provisional.
          Length = 272

 Score = 31.0 bits (70), Expect = 2.7
 Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 23/65 (35%)

Query: 376 LRWQDRGGVCRQHDVKMQLQLNKEQQRLAPLLDRLR--RYEADLKALYKRRWIL------ 427
           L+W  + GV      +M L  NK ++ +APLLD+++  RY          RWI+      
Sbjct: 110 LKWLKKQGV------EMALITNKPERFVAPLLDQMKIGRY---------FRWIIGGDTLP 154

Query: 428 QKKFD 432
           QKK D
Sbjct: 155 QKKPD 159


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
           subunit [Transcription].
          Length = 548

 Score = 31.2 bits (70), Expect = 3.0
 Identities = 32/178 (17%), Positives = 71/178 (39%), Gaps = 8/178 (4%)

Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT-RSRVQSSLTIQSA 771
           +S+ N  P+  ++  + S ++SS     + + SP  ++  +    + R+    +  ++SA
Sbjct: 203 SSSNNEAPKEGNNQTSLSSIRSS----KKQERSPKKKAPQRDVSISDRATTPIAPGVESA 258

Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
            Q   +T + V +   + + +  S    ++   +P T   ++   + + +EQ     + +
Sbjct: 259 SQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQLNFPKDS 318

Query: 832 VQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGS 889
                 I    Q    T++A Q            A+ R   Q  +  SP   ST + S
Sbjct: 319 TD---EIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSMVSTLENS 373



 Score = 30.8 bits (69), Expect = 4.1
 Identities = 25/127 (19%), Positives = 50/127 (39%), Gaps = 7/127 (5%)

Query: 699 QSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTT 758
           Q++ S++    +   S +   P+   S + R+    +P   S  QS  +T + V +    
Sbjct: 216 QTSLSSIRSSKKQERSPKKKAPQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPL 275

Query: 759 RSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQG------SPS-IRSAVQRSPNTRSR 811
            +    S+   ++  G+P T   ++   +   +EQ       S   IR  +Q    T + 
Sbjct: 276 HTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAA 335

Query: 812 VQSSLTT 818
            Q+ L  
Sbjct: 336 FQNPLFN 342


>gnl|CDD|220096 pfam09052, SipA, Salmonella invasion protein A.  Salmonella
           invasion protein A is an actin-binding protein that
           contributes to host cytoskeletal rearrangements by
           stimulating actin polymerisation and counteracting
           F-actin destabilising proteins. Members of this family
           possess an all-helical fold consisting of eight
           alpha-helices arranged so that six long, amphipathic
           helices form a compact fold that surrounds a final,
           predominantly hydrophobic helix in the middle of the
           molecule.
          Length = 674

 Score = 31.2 bits (70), Expect = 3.1
 Identities = 42/211 (19%), Positives = 68/211 (32%), Gaps = 29/211 (13%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P  P P   P  G  G+ S P    + H  ++ +  S  N   +         +RR   N
Sbjct: 277 PPEPMPDGGPTPGGNGKTSQPV-EIHYHINIHNDNRSYDNRVFDNRGDTYLGGARRHYDN 335

Query: 624 -------NSAQNPLPEVSSSPNTRSRVQSSPTT-----RSRVQS---------SLTIQSA 662
                  N AQ+P  + +      + + S P +      + VQ          S T+ + 
Sbjct: 336 SYHENSENDAQSPTSQTNDLSRNGNSLLSPPASPAAGQHALVQKVTSVLPHSISGTVDTF 395

Query: 663 EQGS---PNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPL 719
              S       +   S   +R A  GS    T   + +  ++ S   R     NS+    
Sbjct: 396 ANNSAEKVFNHTPDNSDGAVRLAGIGSDGLTTSSQERSANNSLSRGGRPLNIQNSSVTD- 454

Query: 720 PEVSSSPNTRSRVQSSPTTRSRVQSSPTTRS 750
                 P   +   +     S   SS TT+S
Sbjct: 455 ---PLHPVLTAADGAEGVKSSTDNSSDTTKS 482


>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of
           iron-siderophores, vitamin B12 and hemin transporters
           and related proteins.  ABC transporters, involved in the
           uptake of siderophores, heme, and vitamin B12, are
           widely conserved in bacteria and archaea. Only very few
           species lack representatives of the siderophore family
           transporters. The E. coli BtuCD protein is an ABC
           transporter mediating vitamin B12 uptake. The two
           ATP-binding cassettes (BtuD) are in close contact with
           each other, as are the two membrane-spanning subunits
           (BtuC); this arrangement is distinct from that observed
           for the E. coli lipid flippase MsbA. The BtuC subunits
           provide 20 transmembrane helices grouped around a
           translocation pathway that is closed to the cytoplasm by
           a gate region, whereas the dimer arrangement of the BtuD
           subunits resembles the ATP-bound form of the Rad50 DNA
           repair enzyme. A prominent cytoplasmic loop of BtuC
           forms the contact region with the ATP-binding cassette
           and represent a conserved motif among the ABC
           transporters.
          Length = 180

 Score = 30.1 bits (69), Expect = 3.1
 Identities = 12/32 (37%), Positives = 14/32 (43%), Gaps = 8/32 (25%)

Query: 525 PQGRSYVGIQH--------LSGGERTLAALAM 548
           PQ    +G+ H        LSGGER    LA 
Sbjct: 79  PQALELLGLAHLADRPFNELSGGERQRVLLAR 110


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 31.2 bits (71), Expect = 3.2
 Identities = 32/166 (19%), Positives = 55/166 (33%), Gaps = 26/166 (15%)

Query: 527 GRSYVGIQHLSGGERTLAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTP 586
           GR  V  + +    R  A L        L  R ++     P  + + +   + E   P P
Sbjct: 288 GRRRVTAEDV----REAAELV-------LPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDP 336

Query: 587 -----GPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPL-----PEVSSS 636
                G +   +       R +P P  +    P+    A    S+ +P        V + 
Sbjct: 337 EKPDEGEDDAEQ----SGPRGHPTPGNDDEKEPDPQEEADGQGSSTDPAGDIFRIRVLAP 392

Query: 637 PNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSA 682
           P  R+R  S   +R+R  S      A +        +   A +R+A
Sbjct: 393 PQARARGASGRRSRTRSDSRGRYVRARRNRGPP-DDLAVDATLRAA 437


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.5 bits (69), Expect = 3.3
 Identities = 19/89 (21%), Positives = 28/89 (31%), Gaps = 6/89 (6%)

Query: 559 VQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSR 618
                P  P P+  P+   + +   P P P P     P P  +  P P P    +   ++
Sbjct: 74  ETPPEPTPPKPKEKPK--PEKKPKKPKPKPKPK----PKPKPKVKPQPKPKKPPSKTAAK 127

Query: 619 RAQANNSAQNPLPEVSSSPNTRSRVQSSP 647
              A N    P    S+S        S  
Sbjct: 128 APAAPNQPARPPSAASASGAATGPSASYL 156


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 31.0 bits (70), Expect = 3.3
 Identities = 12/44 (27%), Positives = 14/44 (31%), Gaps = 4/44 (9%)

Query: 252 TPP---PQSDPESGTQGELSPPTPGPNPNSRANPNPNPNSRANP 292
           T P     S P  G     +PP   P P +   P   P     P
Sbjct: 904 TAPGGDAASAPPPGAGPP-APPQAVPPPRTTQPPAAPPRGPDVP 946


>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
          Length = 808

 Score = 31.1 bits (70), Expect = 3.3
 Identities = 32/126 (25%), Positives = 47/126 (37%), Gaps = 23/126 (18%)

Query: 184 ENRPSDTLSPAQSEVQAEFLSGESREELTFANESSETNSQGVPRAERIMQRIESSLLQVR 243
           E +  +  +   ++V +E  S E+  +L+ +   SE N +   +     Q   + L    
Sbjct: 4   ETKEPEDPADNVNDVVSEASSPET--DLSLSPSQSEQNIENDGQNSPETQSPLTELQPS- 60

Query: 244 NLLELRPLTPP-PQSDPESGTQGELSPPTPGP-NPNSRANPNPNPNSR-----------A 290
                 PL P      P S +QG+ S     P NPNS   P P P  R           A
Sbjct: 61  ------PLPPNTTLDAPVSDSQGDESSSEQQPQNPNS-TEPAPPPKKRRRRKRFFTEINA 113

Query: 291 NPNLSR 296
           NP   R
Sbjct: 114 NPAFRR 119



 Score = 30.3 bits (68), Expect = 5.6
 Identities = 17/56 (30%), Positives = 20/56 (35%), Gaps = 2/56 (3%)

Query: 566 TPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSR--ANPNLSRR 619
           +P  QS         L P T    P S    + +S      NPNS   A P   RR
Sbjct: 47  SPETQSPLTELQPSPLPPNTTLDAPVSDSQGDESSSEQQPQNPNSTEPAPPPKKRR 102


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 31.1 bits (69), Expect = 3.3
 Identities = 27/152 (17%), Positives = 62/152 (40%), Gaps = 7/152 (4%)

Query: 706 SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSS 765
           +++ Q +NSA+       ++P T S+  + P  +   +         QS    +    SS
Sbjct: 462 NKKEQQDNSAKK-----ETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDASS 516

Query: 766 LTIQSAEQGSPNTRSRVQSSANI--RSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQ 823
            + +     +   +  V+SS+    +         +     S  T+  VQ+S  +  A+ 
Sbjct: 517 ESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKD 576

Query: 824 GSPTTRSTVQGAPRIHSREQSSPNTHSAEQRS 855
            +P  ++ ++     H++ Q++ NT   + +S
Sbjct: 577 SAPLQKANIKNTNDGHTQSQNNKNTQENKAKS 608



 Score = 31.1 bits (69), Expect = 3.7
 Identities = 29/148 (19%), Positives = 61/148 (41%), Gaps = 8/148 (5%)

Query: 574 ESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANN-SAQNPLPE 632
           ++  + E +P TP     S+    P  + +   +     N       + N+ S+++   +
Sbjct: 468 DNSAKKETTPATP-----SKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDASSESGKDK 522

Query: 633 VSSSPNTRSRVQSSPTTRSRVQSSL--TIQSAEQGSPNTRSRVQSSANIRSAEQGSPISR 690
             ++   +  V+SS TT ++V S+     +     S  T+  VQ+SA    A+  +P+ +
Sbjct: 523 TPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKDSAPLQK 582

Query: 691 TRFTKYTLQSTKSNLSRRAQANNSAQNP 718
                     T+S  ++  Q N +   P
Sbjct: 583 ANIKNTNDGHTQSQNNKNTQENKAKSLP 610


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 31.3 bits (71), Expect = 3.4
 Identities = 46/269 (17%), Positives = 80/269 (29%), Gaps = 34/269 (12%)

Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
               S A  +    S++       +  N S  +     S+  +T+    +S  T S   +
Sbjct: 81  SGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTA 140

Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSA 715
           S    S +   PN       + +  +    S  S       T  S+ S        ++  
Sbjct: 141 SSMFNSNKLPLPN------PNHSNSATTNQSGSS----FINTPASSSSQPLTNLVVSSIK 190

Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRV-QSSPTTRSRVQSSLTI---QSA 771
           + P    S SP     +  S  + +    +  + +     SP           +   QS 
Sbjct: 191 RFPY-LTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSV 249

Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
           E       S   S   + S     PSIR+                  +SA        S+
Sbjct: 250 ENNIILNSS--SSINELTSIYGSVPSIRNLRG--------------LNSALVSFLNVSSS 293

Query: 832 VQGAPRIHSREQS---SPNTHSAEQRSPN 857
                 ++ +E S   SP+T S  +  P 
Sbjct: 294 SLAFSALNGKEVSPTGSPSTRSFARVLPK 322


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 31.1 bits (70), Expect = 3.6
 Identities = 39/245 (15%), Positives = 85/245 (34%), Gaps = 50/245 (20%)

Query: 702 KSNLSRRAQANNSAQN---PLPEVSSSP--NTRSRVQSSPTTRSRVQSSPTTRSRVQSSP 756
             NL    Q  N+  +    L E S +P  N  +        +    +  ++ S   SS 
Sbjct: 74  NLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSS 133

Query: 757 TTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSL 816
            T     SS           +  S + +S     + + SP +   ++ + +T   + S  
Sbjct: 134 LTNISFFSSP---------TSIYSCLSNS----LSSKHSPKV---IKENQSTHVNISSDN 177

Query: 817 TTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQ----RSPNTRSRVQSSATTRSTE 872
           +  + E  +                ++ +  TH+       RSP   S ++++   ++ E
Sbjct: 178 SPRNKEISN-------------KQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKE 224

Query: 873 QRSRSRSPLTRSTEQGSPNIRSTEQRA------------SRTRSADQASQSVVTRSRSKL 920
           +       + +  +      R++E+              S T+   +  ++VV+  RS +
Sbjct: 225 KNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSV 284

Query: 921 PSKKK 925
             K+K
Sbjct: 285 LLKRK 289


>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 780

 Score = 30.8 bits (69), Expect = 3.6
 Identities = 22/112 (19%), Positives = 39/112 (34%), Gaps = 6/112 (5%)

Query: 565 LTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANN 624
           +T     D E+G +   +PPT   +        P    N   NP +          + NN
Sbjct: 652 ITKKNDDDKETGGENPTTPPTGEGDNGENPTTPPTGEGNNGENPTTPPT------GEGNN 705

Query: 625 SAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
                 P      N  S   ++    S+  ++++    E+  P T + V S+
Sbjct: 706 GGNPTTPSTDEGNNAGSGQTTTDNQNSKETTTVSENKEERDLPKTGTSVAST 757


>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
          Length = 230

 Score = 30.5 bits (68), Expect = 3.8
 Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 3/68 (4%)

Query: 564 PLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQAN 623
           P  P PQ DP+        PP P   P     PN +++      P         +   A 
Sbjct: 39  PHHPNPQEDPDKNPS---PPPDPPLTPRPPAQPNGHNKPPVTKQPGGEGTEEDHQAPLAA 95

Query: 624 NSAQNPLP 631
           ++  +P P
Sbjct: 96  DADDDPRP 103


>gnl|CDD|182276 PRK10160, PRK10160, taurine transporter subunit; Provisional.
          Length = 275

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 542 TLAALAMIFTIWKLVVRVQQLRPL-TPPPQ 570
           ++  LA++  +W  V  +Q + PL  PPPQ
Sbjct: 27  SIGTLAVLLAVWWAVAALQLISPLFLPPPQ 56


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 30.5 bits (69), Expect = 3.9
 Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 4/74 (5%)

Query: 543 LAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPH-SRVNPNPNSR 601
           LA LA I  ++K  +    +R   P P+     G  G       G         P   S 
Sbjct: 166 LAVLAFI--LYKFFLSCGGVRG-GPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSS 222

Query: 602 ANPNPNPNSRANPN 615
             P   P +  +  
Sbjct: 223 FPPPYGPGAGPSSG 236


>gnl|CDD|128420 smart00110, C1Q, Complement component C1q domain.  Globular domain
           found in many collagens and eponymously in complement
           C1q. When part of full length proteins these domains
           form a 'bouquet' due to the multimerization of
           heterotrimers. The C1q fold is similar to that of tumour
           necrosis factor.
          Length = 135

 Score = 29.6 bits (67), Expect = 4.0
 Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 18/63 (28%)

Query: 430 KFDKVKHTRRQKFEN-----------MYDFVNDNIDATYKVHLKSSACQVYLIVSGWPKK 478
           +FDKV + ++  ++            +Y F       +Y V  K    +V L+ +G    
Sbjct: 26  RFDKVLYNQQGHYDPRTGKFTCPVPGVYYF-------SYHVESKGRNVKVSLMKNGIQVM 78

Query: 479 KEN 481
              
Sbjct: 79  STY 81


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 30.5 bits (68), Expect = 4.4
 Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 7/110 (6%)

Query: 701 TKSNLSRRAQANNSAQNPLPEVSSSPNTRS-RVQSSPTTRSRVQSSPTTRSRVQSSPTTR 759
           T    S   QA N+ Q   P++  +PNT   +  +S   +         +S   S+    
Sbjct: 55  TAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAP 114

Query: 760 SRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTR 809
            + QS  T +S     P T+     S N     Q   S +S   +SP  +
Sbjct: 115 KQEQSQTTTESTT---PKTKVTTPPSTNT---PQPMQSTKSDTPQSPTIK 158


>gnl|CDD|177475 PHA02693, PHA02693, hypothetical protein; Provisional.
          Length = 710

 Score = 30.8 bits (69), Expect = 4.7
 Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 14/114 (12%)

Query: 721 EVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTR- 779
           + S + ++ SR  S+ +TRS   +  T       + TT   +      +S ++  P TR 
Sbjct: 271 DESDTADSCSRSFSTQSTRSTRSTRSTRSGA--ETDTTDPDLDPDDD-ESFDEVGPLTRR 327

Query: 780 ----------SRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQ 823
                     S   SSA++R   +GS  I   +  S     +V  + T  +AE+
Sbjct: 328 FTATSFAPRASVRSSSASMRLHARGSTRISEPLMSSAARVPKVSMAPTLDTAEE 381


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 30.5 bits (69), Expect = 5.4
 Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 17/209 (8%)

Query: 569 PQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSAQN 628
            +S P S T   ++   P    HS  + +P+  A+P+   +  +    S +   +N    
Sbjct: 117 KKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNK--- 173

Query: 629 PLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPI 688
                 +SP++  +  S  ++   V SS            +  R    A   S ++    
Sbjct: 174 -----PASPSSSYQSPSYSSSLGPVNSSGN---------RSNLRSSPWALRSSGDKKDIT 219

Query: 689 SRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTT 748
           +  ++ +  L            +      P   ++S  ++     +     S    S   
Sbjct: 220 TDEKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKK 279

Query: 749 RSRVQSSPTTRSRVQSSLTIQSAEQGSPN 777
           RS   S     S+ ++S + +  E   PN
Sbjct: 280 RSYQLSPSPVPSKQKASTSPKKGEGEPPN 308


>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
            alpha; Validated.
          Length = 1220

 Score = 30.3 bits (69), Expect = 5.6
 Identities = 10/71 (14%), Positives = 28/71 (39%)

Query: 641  SRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQS 700
            + V S+  TR R  + + +      +    +   +++    +++ +  +         ++
Sbjct: 1113 TNVVSTGLTRGRKPTLVVVTGGGAAAGAAVAAAPAASAATFSDKAAAAAAAAAAAAAAEA 1172

Query: 701  TKSNLSRRAQA 711
             K    RRA+A
Sbjct: 1173 KKVEAERRAEA 1183


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 30.2 bits (68), Expect = 5.7
 Identities = 42/205 (20%), Positives = 69/205 (33%), Gaps = 7/205 (3%)

Query: 539 GERTLAALAMIFTIWKLVVRVQQLRPLTPPPQSDPESGTQGELSPPTPGPNPHSRVNPNP 598
           G   L  +A I   W+   R ++ R       ++  S         T    P S  +   
Sbjct: 419 GVPLLERMAQIEAQWERH-RQEKQRAKARKKLANKTSTVMEPPYGSTESSVP-STPSTRR 476

Query: 599 NSRANPNPNPNSRANPNL---SRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQS 655
           N R   +  P+ +  PNL   S   +A+  +++       S   R          SR   
Sbjct: 477 NDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSK 536

Query: 656 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSA 715
              I+S   G+ ++      S N +S E   P+ R         +TK   S  +  N   
Sbjct: 537 GNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPST-NEKK 595

Query: 716 QNPLPEVSSSPNTRSRVQSSPTTRS 740
           +  L     SP  +  V ++P   S
Sbjct: 596 KRLLKRSPLSP-PKESVATTPRLNS 619


>gnl|CDD|237854 PRK14898, PRK14898, DNA-directed RNA polymerase subunit A'';
           Provisional.
          Length = 858

 Score = 30.2 bits (68), Expect = 5.9
 Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 29  EIHIPTVDSTSSVS---SEEIRRHESNLNLLRRKTQQRRTIQHT 69
           EI+  ++D    V       + RH++N  L++ KT+  RTI+ T
Sbjct: 123 EIYALSLDQDEKVHWKRIISVIRHKANGKLIKIKTESGRTIRAT 166


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 30.1 bits (68), Expect = 6.0
 Identities = 30/187 (16%), Positives = 55/187 (29%), Gaps = 13/187 (6%)

Query: 713 NSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAE 772
           N+  +  P +            S         S   R R   +PT   R  SS       
Sbjct: 143 NAWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPR-SVTPTRGRRPSSS------- 194

Query: 773 QGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTV 832
             S +  + ++S +N++      P   +   RS    S    S+ +  A   S  +  + 
Sbjct: 195 PRSLSNPTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST 254

Query: 833 QGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNI 892
              P   S    +P +   E        ++     T +T+   R    LT    +     
Sbjct: 255 ---PTSGSS-SINPVSGLDEAEEDRIDEQLFLKLRT-ATDMALRVLPQLTEQFSKSLIAS 309

Query: 893 RSTEQRA 899
            ++    
Sbjct: 310 TTSRNIT 316


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 30.3 bits (68), Expect = 6.2
 Identities = 58/374 (15%), Positives = 115/374 (30%), Gaps = 45/374 (12%)

Query: 573  PESGTQGELSPPTPGPNPHSRVNP-NPNSRANPNPNPNSRANPNLSR------------- 618
            P   TQG+            +  P N  + A     P+++ +    +             
Sbjct: 2645 PALATQGDQDGFVSKVGSAMKYKPPNVTAMAQSIVQPDTQKSSIKKKTDIRLAGYRTMPA 2704

Query: 619  --RAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQSS 676
              R + N     P    S +      V  SP   +R  ++        G        Q S
Sbjct: 2705 DNRQKGNGLRHKPKTGRSHTTIKNVEVYPSPPRWTRTAAAQKRHGEILG--------QVS 2756

Query: 677  ANIRSAEQGSPISRTRFTKYTLQSTKSNLSRRAQANNSAQNPLPEVSSSPNTR------- 729
              I + +  +P++  R  K+++   + +  R      S    +   S S   +       
Sbjct: 2757 PPIVTLDNATPVNGIRHEKHSIPYDRLDEHRTDPDALSTPPVIAAPSPSLIYKELSGGIG 2816

Query: 730  ------SRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSAEQGSPNTRSRVQ 783
                   R Q +  +RS+ + S    +  + +  +RS     + + S          R  
Sbjct: 2817 KNILKMGRKQIN--SRSKERMSSKASNSPRKNNISRSTADILVGMASTTSSHSPLLQRAD 2874

Query: 784  ----SSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRSTVQGAP-RI 838
                +  NI   E          ++  N    V++ L  + +    P   ++ +    R 
Sbjct: 2875 DMYINIKNIPVPEVKEEGNWKGCKKRNNHVPEVETRLYDNESFPPPPNIIASWKQRVVRF 2934

Query: 839  HSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPNIRSTEQR 898
                  +P  ++  ++   T  +   +ATT   +  S  R P+    E G     +    
Sbjct: 2935 WEILPHTPRGNAHRKKVSQTTHK-PPNATTAKHDHLSMYRGPVLTEAESGENKFNNVNVD 2993

Query: 899  ASRTRSADQASQSV 912
              R RS  ++  S 
Sbjct: 2994 PIRKRSESESLSST 3007


>gnl|CDD|234058 TIGR02916, PEP_his_kin, putative PEP-CTERM system histidine kinase.
            Members of this protein family have a novel N-terminal
           domain, a single predicted membrane-spanning helix, and
           a predicted cystosolic histidine kinase domain. We
           designate this protein PrsK, and its companion
           DNA-binding response regulator protein (TIGR02915) PrsR.
           These predicted signal-transducing proteins appear to
           enable enhancer-dependent transcriptional activation.
           The prsK gene is often associated with exopolysaccharide
           biosynthesis genes [Protein fate, Protein and peptide
           secretion and trafficking, Signal transduction,
           Two-component systems].
          Length = 679

 Score = 30.1 bits (68), Expect = 6.4
 Identities = 29/100 (29%), Positives = 41/100 (41%), Gaps = 18/100 (18%)

Query: 616 LSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLT--IQSAEQGSPNT---- 669
           L RRA A+  AQ P PEVS   +   R       R R++  L   +Q+A + +P      
Sbjct: 548 LLRRAIASKRAQGPRPEVSIDTDLSVRAD-----RERLERVLGHLVQNALEATPGEGRVA 602

Query: 670 --RSRVQSSANIRSAEQGSPIS----RTRFTKYTLQSTKS 703
               R   +A I   + G  +S    R R  K    +TK 
Sbjct: 603 IRVERECGAARIEIEDSGCGMSPAFIRERLFK-PFDTTKG 641


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 30.1 bits (67), Expect = 7.3
 Identities = 18/97 (18%), Positives = 34/97 (35%), Gaps = 5/97 (5%)

Query: 567 PPPQSDPESGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANPNLSRRAQANNSA 626
                 PE  T      P P P PH   +  P+    P    + + +      AQA   A
Sbjct: 23  NTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDL-----AQAPTPA 77

Query: 627 QNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTIQSAE 663
            +   + + +P+  +     P    ++ ++    +AE
Sbjct: 78  ASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAE 114


>gnl|CDD|233376 TIGR01363, strep_his_triad, streptococcal histidine triad protein. 
           This model represents the N-terminal half of a family of
           Streptococcal proteins that contain a signal peptide and
           then up to five repeats of a region that includes a
           His-X-X-His-X-His (histidine triad) motif. Three repeats
           are found in the seed alignment. Members of this family
           from Streptococcus pneumoniae are suggested to cleave
           human C3, and the member PhpA has been shown in vaccine
           studies to be a protective antigen in mice [Cellular
           processes, Pathogenesis].
          Length = 348

 Score = 29.5 bits (66), Expect = 7.9
 Identities = 8/48 (16%), Positives = 15/48 (31%), Gaps = 2/48 (4%)

Query: 264 QGELSPPTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLFSK 311
           +             SR      P+   NP      + N+++QN    +
Sbjct: 235 KQGRGARPSDYRQGSR--KASIPDVTPNPGQPAPPRPNDASQNKTQRQ 280



 Score = 29.5 bits (66), Expect = 8.6
 Identities = 9/44 (20%), Positives = 14/44 (31%), Gaps = 3/44 (6%)

Query: 270 PTPGPNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLFSKGE 313
              G    S  +  PNP     P   R   A+ +       +G+
Sbjct: 245 YRQGSRKASIPDVTPNP---GQPAPPRPNDASQNKTQRQEFQGK 285


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 29.7 bits (67), Expect = 8.3
 Identities = 33/204 (16%), Positives = 70/204 (34%), Gaps = 17/204 (8%)

Query: 596 PNPNSRANPNPNPNSRANPNLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQ- 654
           P  +     + N  +  N +LS  +  ++  Q+      ++  +     S     +  Q 
Sbjct: 61  PTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSA--ANNTSDGHDASGVKNTAPPQD 118

Query: 655 ------SSLTIQSAEQGSPNTRSRVQSSANIRSA--EQGSPISRTRFTKYTLQSTKSNLS 706
                 S    Q+A   +PN + R++   NI  A  +Q   ++    ++    +T +  +
Sbjct: 119 ISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNA--ASQNAQGNTSTLPT 176

Query: 707 RRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSL 766
             A    S    +P  + +  T            +   +    + V   P T  + +S  
Sbjct: 177 APATVAPSKGAKVPATAETHPT----PPQKPATKKPAVNHHKTATVAVPPATSGKPKSGA 232

Query: 767 TIQSAEQGSPNTRSRVQSSANIRS 790
               A   +P +   +Q S+  RS
Sbjct: 233 ASARALSSAPASHYTLQLSSASRS 256


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 29.6 bits (66), Expect = 9.7
 Identities = 66/341 (19%), Positives = 126/341 (36%), Gaps = 49/341 (14%)

Query: 558 RVQQLRPLTPPPQSDPE---SGTQGELSPPTPGPNPHSRVNPNPNSRANPNPNPNSRANP 614
           R  ++ P TP     P    +    E +P TP    +  ++ +   R   +  P  R   
Sbjct: 117 RTAKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRVK-SYTPKPRGEV 175

Query: 615 NLSRRAQANNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSLTI--QSAEQGSP--NTR 670
             S   Q     +   P    SP  R     +P+T   +  S  I  ++  + S    T 
Sbjct: 176 KSSSPTQTREKVRKYTP----SPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITATY 231

Query: 671 SRVQSSANIRSAEQGSPISRTRFTKYTLQ-----------------STKSNLS--RRAQA 711
             ++++ + R+A + +P      T  T                     K+ L+  RR ++
Sbjct: 232 KMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVES 291

Query: 712 NNSAQNPLPEVSSSPNT--RSRVQSSP-TTRSRVQSSPTTRSRVQ----SSPTTRSRVQS 764
           N+S  +      ++  T   + ++ +P T+  +V  S  T S       S+   + R   
Sbjct: 292 NSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPL 351

Query: 765 SLTIQSAEQGSPNTRSRVQSSANIRSAEQGSPSIRSA----------VQRSPNTRSRVQS 814
           S T   A + +  T   ++ + +   +   +P +R+           V+ +P   +    
Sbjct: 352 SRTSAPAVRIASATFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSP 411

Query: 815 SLTTHS-AEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQR 854
           SLTT    E  SP+  +   G P +H + +  P+  S E+R
Sbjct: 412 SLTTALFPEAPSPSPSALPPGQPDLHPKAEYPPDLFSVEER 452


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 29.1 bits (65), Expect = 10.0
 Identities = 18/106 (16%), Positives = 33/106 (31%), Gaps = 11/106 (10%)

Query: 809 RSRVQSSLTTHSAEQGSPTTRSTVQGAPRIHSREQSSPNTHSAEQRSPNTRS------RV 862
             R +  L    A++      + ++      SR + +  + +A + S   RS        
Sbjct: 96  LRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLERNSYSSAARRSSGEHRSAASSQGSR 155

Query: 863 QSSATTRSTEQRSRSRSPLT-----RSTEQGSPNIRSTEQRASRTR 903
            +    R      + RSP        S    SP++  T  R     
Sbjct: 156 HTKLKKRKNIHNLKRRSPELSSKREVSFTLESPSVPDTATRVKEGN 201


>gnl|CDD|227436 COG5105, MIH1, Mitotic inducer, protein phosphatase [Cell division
           and chromosome partitioning].
          Length = 427

 Score = 29.2 bits (65), Expect = 10.0
 Identities = 22/211 (10%), Positives = 58/211 (27%), Gaps = 14/211 (6%)

Query: 712 NNSAQNPLPEVSSSPNTRSRVQSSPTTRSRVQSSPTTRSRVQSSPTTRSRVQSSLTIQSA 771
             +  + L +  ++ +     ++S   +S           + +    +++   +L     
Sbjct: 1   MKNIFHGLEDECANEDVFFFQKAS--KKSIFGDKKNIFRNIATFFKPKAKH--ALADDDL 56

Query: 772 EQGSPNTRSRVQSSANIRSAEQGSPSIRSAVQRSPNTRSRVQSSLTTHSAEQGSPTTRST 831
                    +    +N RS E   P          N +             +G   T   
Sbjct: 57  INKENFAFDKRPLLSNHRSKEIAGPF--------LNIKQLGHRDELDEKENEGDDATLHL 108

Query: 832 VQGAPRIHSREQSSPNTHSAEQRSPNTRSRVQSSATTRSTEQRSRSRSPLTRSTEQGSPN 891
                R+ S   ++ + +       +     +      S+ +      P        S N
Sbjct: 109 HFALQRMTSSSANASSDNEQCPADVDQMYIKKFYEIPWSSSENIEFEDPGHDPFVDNSDN 168

Query: 892 IRSTEQRAS--RTRSADQASQSVVTRSRSKL 920
            +    R S  + +  ++ + +V T  +   
Sbjct: 169 SKMNHLRGSGKQPKCREKIAFAVWTSLQGMR 199


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.122    0.338 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 43,769,469
Number of extensions: 4116647
Number of successful extensions: 5523
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4820
Number of HSP's successfully gapped: 418
Length of query: 926
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 820
Effective length of database: 6,236,078
Effective search space: 5113583960
Effective search space used: 5113583960
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 64 (28.6 bits)