RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6065
         (316 letters)



>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score =  280 bits (718), Expect = 8e-94
 Identities = 118/316 (37%), Positives = 152/316 (48%), Gaps = 18/316 (5%)

Query: 13  KAIQHFLNVASEHDNRDLTVSYWCRLHALQKALKIDKSSDGAKMLLLKLMDWLEQEKKTH 72
           K+IQ +L  A E +  D  V+Y+CRL+A+Q  LKI          L KL+D LE  KK  
Sbjct: 1   KSIQPYLKRAQELEKADPVVAYYCRLYAVQLILKIKLKDGEVTKFLTKLLDQLEAFKKEL 60

Query: 73  RDNDTIMNETAAQAYIENYALKLFQWADSMDRASTFNKNVVKAFYTASILMDVLQTFGET 132
            DND I +E AAQAY+EN+ALKLF  AD  DRA    KN  K FY AS   +VL  FGE 
Sbjct: 61  GDNDAITDEVAAQAYVENFALKLFNRADKQDRAGRATKNTAKTFYAASTFFEVLAIFGEL 120

Query: 133 SEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPP-----ESGEGSEGPSLSSQNSNDQLPS 187
            E++ Q  KYAKWKAA IH  LK GE P PGPP     E  + +   S +S    D  P+
Sbjct: 121 DEEVAQKIKYAKWKAARIHKALKEGEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPA 180

Query: 188 PPRENSNEDVTPSFPAPTSSGGT-----LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
               +     +P  P+  S           S P  P +    +         +    P P
Sbjct: 181 SASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGP 240

Query: 243 AVTPSSP-------STPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKFASSALNYDD 295
           A  P  P       ST  P P   +    P   I+ ++ + I K QK+ K+A SALN++D
Sbjct: 241 AAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT-LDDDAIAKAQKHAKWAISALNFED 299

Query: 296 YKEAKSNLIKVLNILN 311
              A   L K L +L 
Sbjct: 300 VPTAVKELRKALKLLG 315


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 44.0 bits (104), Expect = 9e-05
 Identities = 27/113 (23%), Positives = 41/113 (36%), Gaps = 9/113 (7%)

Query: 153 CLKSGETPIPGPPESGEGSEG------PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTS 206
           C      PI  P    E S        P  +S +S+ +  SP    S+    P+  +P +
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314

Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           S  +  S  S   +    S S     G ++   PSP+ +PS    P P     
Sbjct: 315 SSSSSSSRES---SSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS 364



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 26/112 (23%), Positives = 34/112 (30%), Gaps = 7/112 (6%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTP-------SFPAPTSSG 208
               P P    S      PS S  +            S+   +        +  +  SS 
Sbjct: 278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337

Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           G   SP   P   P PS   P     S    P P+  PSSP+  A +P    
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389



 Score = 39.4 bits (92), Expect = 0.002
 Identities = 28/111 (25%), Positives = 38/111 (34%), Gaps = 7/111 (6%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLP-SPPRENSNEDVTPS----FPAPTSSGGT 210
               P PGP      +E  S  + + +   P SP RE S     PS     P        
Sbjct: 67  PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP 126

Query: 211 LPSPPS-MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
            PSP   + + L      GP        +  SPA   S  ++   Q A+P 
Sbjct: 127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASS-RQAALPL 176



 Score = 36.3 bits (84), Expect = 0.024
 Identities = 26/108 (24%), Positives = 32/108 (29%), Gaps = 8/108 (7%)

Query: 158 ETPIPGPPESGEGSEGPSLS-------SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGT 210
             P   PP S   S  P LS       S         P    S   V     +   +   
Sbjct: 116 PPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALP 175

Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV-TPSSPSTPAPQPA 257
           L SP    +    P    P    P+  S   P   +P S S  +P PA
Sbjct: 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA 223



 Score = 35.9 bits (83), Expect = 0.030
 Identities = 24/103 (23%), Positives = 29/103 (28%), Gaps = 14/103 (13%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
              P  P  G G+E P                  S    + S  AP S        P  P
Sbjct: 65  FEPPTGPPPGPGTEAP-------------ANESRSTPTWSLSTLAPASPAREGSPTPPGP 111

Query: 219 QNLPG-PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
            +    P    P  P PS     S  + P     P P  + PA
Sbjct: 112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPA 154



 Score = 35.1 bits (81), Expect = 0.047
 Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 7/106 (6%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGT--LPS 213
           S   P    P S   S       +++ D         S+ D + S  +    G     P 
Sbjct: 203 SPRPPRRSSPISASASSPAPAPGRSAAD-----DAGASSSDSSSSESSGCGWGPENECPL 257

Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           P   P  LP          GPS    P+ + +     +P+P P+ P
Sbjct: 258 PRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSP 303



 Score = 33.6 bits (77), Expect = 0.14
 Identities = 23/99 (23%), Positives = 29/99 (29%), Gaps = 3/99 (3%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
                 G G E      + +   LP+   E S  +   S P P SS          P   
Sbjct: 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSS--SPRERSPSPS 299

Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           P    SGP    P   S  S     SS S+ +       
Sbjct: 300 PSSPGSGPAPSSPRASSSSSS-SRESSSSSTSSSSESSR 337



 Score = 32.8 bits (75), Expect = 0.27
 Identities = 23/110 (20%), Positives = 31/110 (28%), Gaps = 7/110 (6%)

Query: 162 PGPPESGEGSEGP------SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
           P PP        P       LS        P PP   S      + PA  +S        
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPA-AGASPAAVASDAASSRQA 172

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
           ++P + P  +   P  P    P    PA     P   +   +  A    P
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAP 222



 Score = 32.5 bits (74), Expect = 0.34
 Identities = 27/117 (23%), Positives = 37/117 (31%), Gaps = 7/117 (5%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP--PRENSNEDVTPSFPAPTSSGGTLPS 213
           S  +P P P  S     G S S  +S++       P              PT        
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275

Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
                +  P  S S P    PS    PSP+ +P S   P+   A  +  +  E   S
Sbjct: 276 NGPSSRPGPASSSSSPRERSPS----PSPS-SPGSGPAPSSPRASSSSSSSRESSSS 327



 Score = 32.5 bits (74), Expect = 0.38
 Identities = 27/128 (21%), Positives = 38/128 (29%), Gaps = 17/128 (13%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDV-TPSFPAPTSSGGTLPSPPSMPQN 220
             PP +         S  + +    +  R         P  P P +  G    P +    
Sbjct: 836 ARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAA 895

Query: 221 LPGPSISGPLIPGPSIPSFPSPA-----------VTPSSPSTPAPQPAMPALVTQPEQEI 269
            P  + +    P P +   P P            V P    TPAP  A  A    PE   
Sbjct: 896 PPAGAPAPRPRPAPRVKLGPMPPGGPDPRGGFRRVPPGDLHTPAPSAAALAAYCPPE--- 952

Query: 270 SYIEAELI 277
             + AEL+
Sbjct: 953 --VVAELV 958



 Score = 31.7 bits (72), Expect = 0.65
 Identities = 19/101 (18%), Positives = 27/101 (26%), Gaps = 5/101 (4%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
            +      G        ++  S   + S    +       + P   S   T  +P S P 
Sbjct: 137 MLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ----AALPLS-SPEETARAPSSPPA 191

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
             P  +      P P   S P  A   S    P    A  A
Sbjct: 192 EPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232



 Score = 31.3 bits (71), Expect = 0.77
 Identities = 18/108 (16%), Positives = 28/108 (25%), Gaps = 6/108 (5%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
           S                  S            PP   S+E  + S PA         +  
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSES-SKSKPAAAGGRARGKNGR 871

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
             P+     +  G   P  +     + A  P+    P P+PA    + 
Sbjct: 872 RRPRPPEPRARPGAAAPPKA-----AAAAPPAGAPAPRPRPAPRVKLG 914



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 20/117 (17%), Positives = 28/117 (23%), Gaps = 13/117 (11%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRE----NSNEDVTPSFPAPTSSGGTLPSPP-- 215
           P   E+     G    S  + D       +    +   D       P    G    PP  
Sbjct: 789 PVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPA 848

Query: 216 -------SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
                  S P    G +        P  P   +     + P   A  P   A   +P
Sbjct: 849 RSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905



 Score = 28.2 bits (63), Expect = 8.3
 Identities = 14/91 (15%), Positives = 22/91 (24%), Gaps = 3/91 (3%)

Query: 173 GPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP---PSMPQNLPGPSISGP 229
            P+   + +    P                 P +   +  +            G   SGP
Sbjct: 776 EPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGP 835

Query: 230 LIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
             P  +    P    + SS S PA       
Sbjct: 836 ARPPGAAARPPPARSSESSKSKPAAAGGRAR 866


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 41.3 bits (96), Expect = 4e-04
 Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 4/109 (3%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSP---PRENSNEDVTPSFPAPTSSGGTLPSPPS 216
           P P P ++   +  P+++  ++  + P     P   ++E   P+ PAP  +    P P  
Sbjct: 42  PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPA-PAPHQAASRAPDPAV 100

Query: 217 MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
            PQ    P          +  +  +PA   +S ++  P PA     + P
Sbjct: 101 APQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPP 149


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 41.1 bits (96), Expect = 6e-04
 Identities = 29/114 (25%), Positives = 38/114 (33%), Gaps = 4/114 (3%)

Query: 160  PIPGPPESGEGSEGPS--LSSQNSNDQLPSPPRENSNEDVTP-SFPAPTSSGGTLPSPPS 216
              P P     GS  P   +  +  +    + P   +   V   + PA + S  +   PP 
Sbjct: 2844 GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903

Query: 217  MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
             P+  P P    P  P P  P  P P   P  P  P P  A P        E S
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA-PTTDPAGAGEPS 2956



 Score = 38.4 bits (89), Expect = 0.005
 Identities = 26/101 (25%), Positives = 34/101 (33%), Gaps = 4/101 (3%)

Query: 165  PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
            P S   +  P+  ++    +L  P    S E    SF  P       P P + P   P P
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTE----SFALPPDQPERPPQPQAPPPPQPQP 2921

Query: 225  SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
                P  P P  P  P P    +  + PA        V QP
Sbjct: 2922 QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962



 Score = 37.2 bits (86), Expect = 0.012
 Identities = 21/91 (23%), Positives = 32/91 (35%), Gaps = 2/91 (2%)

Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV 244
           LP+  R ++    TP F               +P ++P P+ +      P  P+ P P+ 
Sbjct: 380 LPTRKRRSARHAATP-FARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSA 438

Query: 245 TPSSPSTPAPQPA-MPALVTQPEQEISYIEA 274
            P S   PAP P   P             +A
Sbjct: 439 EPGSDDGPAPPPERQPPAPATEPAPDDPDDA 469



 Score = 36.5 bits (84), Expect = 0.019
 Identities = 26/114 (22%), Positives = 36/114 (31%), Gaps = 7/114 (6%)

Query: 159  TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            T       S      PS             P        +P+ P P  +    P+ P  P
Sbjct: 2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS-AQPTAPPPP 2842

Query: 219  QNLPGPSIS--GPLIPGPSIPSFPSPAVTPSSPSTPAPQP----AMPALVTQPE 266
               P PS+   G + PG  +   P      + P+ PA  P    A PA+    E
Sbjct: 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896



 Score = 36.5 bits (84), Expect = 0.020
 Identities = 21/78 (26%), Positives = 25/78 (32%), Gaps = 6/78 (7%)

Query: 186  PSPPRENSNEDVTPSFPAPTSSG------GTLPSPPSMPQNLPGPSISGPLIPGPSIPSF 239
            P+ P    +  V P  PAP  S          P  P        P        GP+ PS 
Sbjct: 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP 2616

Query: 240  PSPAVTPSSPSTPAPQPA 257
              P      P  P+P PA
Sbjct: 2617 LPPDTHAPDPPPPSPSPA 2634



 Score = 36.5 bits (84), Expect = 0.021
 Identities = 22/115 (19%), Positives = 34/115 (29%), Gaps = 10/115 (8%)

Query: 162  PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTS----------SGGTL 211
            P PP    G   P   ++ +     + P   +      + P              S  +L
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798

Query: 212  PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
            PSP                +P  + P+ P P  T + P+ P P P  P       
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853



 Score = 36.5 bits (84), Expect = 0.022
 Identities = 25/126 (19%), Positives = 38/126 (30%), Gaps = 17/126 (13%)

Query: 157  GETPIPGPPESGEGSEGPSLS-----SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL 211
                +P P    +      +S      +       S P +              +S    
Sbjct: 2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701

Query: 212  PSPPSMPQNLPGPSISG-PLIPGPSI-----------PSFPSPAVTPSSPSTPAPQPAMP 259
            P PP  P+  P   +S  PL PGP+            P+ P+    P++P  PA     P
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761

Query: 260  ALVTQP 265
                 P
Sbjct: 2762 TTAGPP 2767



 Score = 36.1 bits (83), Expect = 0.024
 Identities = 21/104 (20%), Positives = 34/104 (32%), Gaps = 6/104 (5%)

Query: 157  GETPIPGPP-ESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
            G      PP  +G  +  P  +      +  + P   S  +   S P+P       P+ P
Sbjct: 2753 GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD-----PADP 2807

Query: 216  SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
                  P  ++     P   +P   S   T   P    P P++P
Sbjct: 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851



 Score = 36.1 bits (83), Expect = 0.028
 Identities = 25/115 (21%), Positives = 34/115 (29%), Gaps = 9/115 (7%)

Query: 160  PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTP---SFPAPTSSGGTLPSPPS 216
            P    P+    S  P+ +  + +     PP E   +D  P   S P      G      S
Sbjct: 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678

Query: 217  MPQNLPGPSISGPLIP------GPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
             PQ     +    +         P  P  P PA      +TP P     A    P
Sbjct: 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733



 Score = 36.1 bits (83), Expect = 0.030
 Identities = 28/91 (30%), Positives = 38/91 (41%), Gaps = 13/91 (14%)

Query: 173  GPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLI 231
            G   S  +     P+P P  ++    TP  P P ++    P+ P+ P   P    +GP  
Sbjct: 2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP--APPAVPAGPAT 2750

Query: 232  PGPSIPSFPSPAVTPSSPSTPA--PQPAMPA 260
            PG        PA  P+ P T A  P PA PA
Sbjct: 2751 PG-------GPA-RPARPPTTAGPPAPAPPA 2773



 Score = 34.5 bits (79), Expect = 0.080
 Identities = 30/135 (22%), Positives = 42/135 (31%), Gaps = 30/135 (22%)

Query: 159  TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPA--PTSSGGTLPSPPS 216
            TP+P  P +   +              P            P+ PA  PT++G   P+PP+
Sbjct: 2719 TPLPPGPAAARQASPAL-----PAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773

Query: 217  MPQNLPGP----------SISGPLIPGPSIPSFPSPAVTPSSPSTPA----------P-- 254
             P   P            S S   +P P  P+ P  AV   + + P           P  
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833

Query: 255  -QPAMPALVTQPEQE 268
             QP  P     P   
Sbjct: 2834 AQPTAPPPPPGPPPP 2848



 Score = 34.1 bits (78), Expect = 0.12
 Identities = 27/127 (21%), Positives = 37/127 (29%), Gaps = 18/127 (14%)

Query: 162  PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
              PP         S S+++       P R    +   P  P P       P PP  P   
Sbjct: 2879 ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938

Query: 222  PGPSI---SGPLIPGPSIPSFPSPA---------------VTPSSPSTPAPQPAMPALVT 263
            P P +   + P   G    + P P                V   +PS  AP  + P L  
Sbjct: 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTG 2998

Query: 264  QPEQEIS 270
                 +S
Sbjct: 2999 HSLSRVS 3005



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 26/103 (25%), Positives = 31/103 (30%), Gaps = 7/103 (6%)

Query: 160  PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
              P  P           +   S +    PP      D     P P +     P P   P 
Sbjct: 2873 AKPAAPARPPVRRLARPAVSRSTESFALPP------DQPERPPQPQAPPPPQPQPQPPPP 2926

Query: 220  NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
              P P    P  P P +     PA     PS   PQP + ALV
Sbjct: 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAG-EPSGAVPQPWLGALV 2968



 Score = 33.4 bits (76), Expect = 0.21
 Identities = 16/60 (26%), Positives = 21/60 (35%), Gaps = 2/60 (3%)

Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           S P P  +     +PP  P   P PS       GP+ P    P    + P+   P  A  
Sbjct: 414 SVPTPAPTPVPASAPP--PPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471



 Score = 33.0 bits (75), Expect = 0.25
 Identities = 23/103 (22%), Positives = 29/103 (28%), Gaps = 3/103 (2%)

Query: 160  PIPGPPESGEGSEGPSLSSQN-SNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
              P    S E    P    +     Q P PP+        P  P P       P PP  P
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPPLAP 2945

Query: 219  QNLPGP-SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
               P         +P P + +     V       P P P+  A
Sbjct: 2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988



 Score = 31.4 bits (71), Expect = 0.67
 Identities = 28/120 (23%), Positives = 34/120 (28%), Gaps = 27/120 (22%)

Query: 160  PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
            P  GPP         SLS   S + LPSP             PA   +    P+    P 
Sbjct: 2775 PAAGPPRRLTRPAVASLSE--SRESLPSPWD-----------PADPPAAVLAPAAALPPA 2821

Query: 220  NLPGPSISGPLIPGPSIPSFPSPAVTPSSP--------------STPAPQPAMPALVTQP 265
              P   +  P    P+ P  P     PS P                     A PA   +P
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881



 Score = 31.4 bits (71), Expect = 0.71
 Identities = 22/121 (18%), Positives = 32/121 (26%), Gaps = 18/121 (14%)

Query: 160  PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVT-PSFPAPTSSGGTLPSPPSMP 218
            P P   E            +   D  P+P R +           A  SS    P   +  
Sbjct: 2631 PSPAANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR 2689

Query: 219  QNL---------PGPSISGPLIPGPSIPSFPSPAVT-------PSSPSTPAPQPAMPALV 262
              +         P P  +    P   + + P P          P+ P+ PAP        
Sbjct: 2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749

Query: 263  T 263
            T
Sbjct: 2750 T 2750



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 26/117 (22%), Positives = 38/117 (32%), Gaps = 17/117 (14%)

Query: 160  PIPGPPESGEGSEG--PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
            P P P E    S    P    Q++    P  P ++  +   P+ P+P       P PP  
Sbjct: 2573 PAPRPSEPAVTSRARRPDAPPQSAR---PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP- 2628

Query: 218  PQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP-------APQPAMPALVTQPEQ 267
                P PS +         P+ P P      P+         A +    A  + P Q
Sbjct: 2629 ----PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 6/108 (5%)

Query: 160  PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
                PP+  E    P           P PP +       P  P P  +  T P+    P 
Sbjct: 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956

Query: 220  NLPGPSISGPLIPG------PSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
                    G L+PG        +P        P+S + P    ++  +
Sbjct: 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004



 Score = 29.1 bits (65), Expect = 4.4
 Identities = 24/120 (20%), Positives = 41/120 (34%), Gaps = 17/120 (14%)

Query: 118 TASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPPES-GEGSEGPSL 176
            A +    L  +G  SE   Q++ + + +    H     G+   P PP   GEG++    
Sbjct: 214 PADLTAAALHLYG-ASETYLQDEPFVERRVVISHPL--RGDIAAPAPPPVVGEGADRAPE 270

Query: 177 SSQNSNDQLPSPPRENSNEDVTPSFPAPTSSG-------GTLPSPPSMPQNLPGPSISGP 229
           +++ +    P P      E   P+  A    G       G   + P+ P   P       
Sbjct: 271 TARGATGPPPPP------EAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDA 324


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 8/90 (8%)

Query: 177 SSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSI 236
           S++ S +   S P + S      + PAP           + P N   P+++    P P++
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPA------APVDTTPTNSQTPAVATA--PAPAV 202

Query: 237 PSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
               +  V PS  +        PA    P+
Sbjct: 203 DPQQNAVVAPSQANVDTAATPAPAAPATPD 232


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 40.4 bits (95), Expect = 0.001
 Identities = 19/125 (15%), Positives = 39/125 (31%), Gaps = 2/125 (1%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPA--PTSSGGTLPS 213
               P P  P +  G+     +   +         + + +    +  A  P+ +      
Sbjct: 681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740

Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIE 273
            P  P + P P+ +    P P  P+  +       PS P+ +  M         +    +
Sbjct: 741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800

Query: 274 AELIN 278
           AE + 
Sbjct: 801 AEEVA 805



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 2/105 (1%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            P P        +  P+ ++       P+P     +           S      +P + P
Sbjct: 409 APAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPA-AAPSAQP 466

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
              P  +      P P+ P+ P+PA  P++P+ PA         T
Sbjct: 467 APAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511



 Score = 37.7 bits (88), Expect = 0.007
 Identities = 25/113 (22%), Positives = 39/113 (34%), Gaps = 3/113 (2%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
                      +   +  P+ ++  +    P+P             PAP S  G  P+  
Sbjct: 394 PAAAAPSAAAAAPAAAPAPA-AAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
           + P   P  + S    P P+    P+ A  P+ P+ PAP  A PA    P   
Sbjct: 453 A-PSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA-PAAPAAPAAP 503



 Score = 36.9 bits (86), Expect = 0.012
 Identities = 20/122 (16%), Positives = 33/122 (27%), Gaps = 7/122 (5%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P      +          +  +     +P +       TP            P   +   
Sbjct: 670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPP-AGQADDPAAQPPQAAQGA 728

Query: 220 NLPGPSISG--PLIPGPSIPSFPSPAVTPS----SPSTPAPQPAMPALVTQPEQEISYIE 273
           + P P+     PL P P  P  P+ A        +P+  A   A P      E+E    +
Sbjct: 729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788

Query: 274 AE 275
             
Sbjct: 789 DA 790



 Score = 36.5 bits (85), Expect = 0.019
 Identities = 16/126 (12%), Positives = 30/126 (23%)

Query: 143 AKWKAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP 202
           A   +       +      P  P +   +   +  ++ S    P       +        
Sbjct: 602 APASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPD 661

Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
           A     G            P P+ +      P+  +   PA  P++           A  
Sbjct: 662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721

Query: 263 TQPEQE 268
            Q  Q 
Sbjct: 722 PQAAQG 727



 Score = 35.0 bits (81), Expect = 0.060
 Identities = 22/120 (18%), Positives = 38/120 (31%), Gaps = 14/120 (11%)

Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS--------- 213
             P +  G   P+ +S    ++   P    +        PA  ++     S         
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAA 649

Query: 214 PPSMPQNLPGPSIS-----GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
           P   P+++  P  S      P   G + P+ P PA  P++P+ PA               
Sbjct: 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709



 Score = 34.6 bits (80), Expect = 0.064
 Identities = 22/106 (20%), Positives = 29/106 (27%), Gaps = 7/106 (6%)

Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
                G G+   +  S  +     +P          P+  A  +     P P   P   P
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPA----AAAPAAAAAPAPAAA-PQPAPAPAPAP 439

Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
            P  S            P PA  PS+   PAP  A       P   
Sbjct: 440 APP-SPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEP-TAAPAPA 483



 Score = 32.7 bits (75), Expect = 0.28
 Identities = 18/117 (15%), Positives = 33/117 (28%), Gaps = 12/117 (10%)

Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF------------P 202
            +   P        E S  P+                ++++                  P
Sbjct: 624 PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683

Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           AP  +    P+  +  Q  P P+ + P        + P  A   +S  +PA    +P
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740



 Score = 27.6 bits (62), Expect = 9.9
 Identities = 9/68 (13%), Positives = 17/68 (25%), Gaps = 6/68 (8%)

Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
              + G   P+  +       P+        P+  +    A    +P+        PA  
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPA------AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPA 438

Query: 263 TQPEQEIS 270
             P     
Sbjct: 439 PAPPSPAG 446


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 39.3 bits (91), Expect = 0.002
 Identities = 34/123 (27%), Positives = 47/123 (38%), Gaps = 12/123 (9%)

Query: 157 GETPIPG---PPESGEGSEGP-SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLP 212
           G+  IPG   P   G  +  P S +S  +     S    +S+   T   P+ +   G L 
Sbjct: 546 GDAWIPGKGVPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLG 605

Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYI 272
           SPP+ P  +  PS S P           SP+ TPSSP +     +       PE  I   
Sbjct: 606 SPPATPSKIVSPSTSPP------ASHLGSPSTTPSSPESSIKVASTET--ASPESSIKVA 657

Query: 273 EAE 275
             E
Sbjct: 658 STE 660


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 39.1 bits (91), Expect = 0.002
 Identities = 29/114 (25%), Positives = 38/114 (33%), Gaps = 2/114 (1%)

Query: 154 LKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLP 212
           LK   +PIP PP S   S     +     +    P P   SN                L 
Sbjct: 380 LKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL- 438

Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
           SP +  ++L  P+   P  P    PS  S +  P+ P T     A  A    P 
Sbjct: 439 SPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPA 492



 Score = 32.2 bits (73), Expect = 0.36
 Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 4/108 (3%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P   P  +      PS+SS +S   +P      +  D      AP  +     SP ++  
Sbjct: 449 PPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAA----APPPANMRPLSPYAVYD 504

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
           +L  P+   P  P   +    +  V     S P    A      QP+ 
Sbjct: 505 DLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 38.1 bits (88), Expect = 0.005
 Identities = 27/107 (25%), Positives = 40/107 (37%), Gaps = 11/107 (10%)

Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
               SG  +EGP+     +   +P  P     +   P+     SS    P+ P+ P   P
Sbjct: 378 ASAPSGSAAEGPASGGAAT---IP-TPGTQGPQGTAPAAGMTPSSA--APATPA-PSAAP 430

Query: 223 GPSI----SGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
            P +    + P  P   IP  P+P +  +SP   AP     A    P
Sbjct: 431 SPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPP 477


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 37.3 bits (87), Expect = 0.007
 Identities = 22/97 (22%), Positives = 35/97 (36%), Gaps = 5/97 (5%)

Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
           +P  P+   G   PS+    +  Q    P  NS         AP +S G L +P ++   
Sbjct: 194 LPARPDLSGGMGTPSVQPAPA-PQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLA-- 250

Query: 221 LPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
            PG        P     + P+ A   ++P   A   +
Sbjct: 251 -PGVLEGSEPTPTAPSSA-PATAPAAAAPQAAATSSS 285


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 37.2 bits (86), Expect = 0.009
 Identities = 23/91 (25%), Positives = 32/91 (35%), Gaps = 12/91 (13%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            P P     G      SL+   S+      PR + N  +    PA          P + P
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPA---------QPSAPP 237

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSP 249
            ++P P I  P+I   + P  P P   P  P
Sbjct: 238 ASIPAPPI-PPVIQYVAPP--PVPPPQPIIP 265



 Score = 35.7 bits (82), Expect = 0.027
 Identities = 28/79 (35%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN--LPGPSISGPLIPGPSIPSFPSP 242
            P PP          S  AP++S       P +  N  LPGPS + P  P  SIP+ P P
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIP 246

Query: 243 AVTP--SSPSTPAPQPAMP 259
            V    + P  P PQP +P
Sbjct: 247 PVIQYVAPPPVPPPQPIIP 265


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 37.1 bits (86), Expect = 0.012
 Identities = 27/142 (19%), Positives = 40/142 (28%), Gaps = 20/142 (14%)

Query: 164 PPESGEGSEGPSLSSQNSNDQLPS---PPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
                +    P  +   +  Q  +        S         AP S+     +PP++  +
Sbjct: 371 GGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVD 430

Query: 221 LPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA-----LVTQPEQE----ISY 271
            P      P+ P    PS    AV P+        P            +P QE     + 
Sbjct: 431 PPAAV---PVNP----PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483

Query: 272 IEAELINKCQKYIKFASSALNY 293
              E     QK I F    L Y
Sbjct: 484 NIKEAPTGTQKEI-FTEEDLQY 504



 Score = 32.8 bits (75), Expect = 0.25
 Identities = 18/111 (16%), Positives = 32/111 (28%), Gaps = 1/111 (0%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P   P  S   +  PS S  ++  Q  +P          P+      +   +  P + PQ
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445

Query: 220 NLPGPSISGPLIPGPS-IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
            +             S + S     + P           +    T  ++EI
Sbjct: 446 AVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKEI 496



 Score = 31.7 bits (72), Expect = 0.54
 Identities = 16/93 (17%), Positives = 23/93 (24%), Gaps = 8/93 (8%)

Query: 192 NSNEDVTPSF--PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSP 249
              +D +             T P+    P      S S       + PS P  A  P+  
Sbjct: 364 QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423

Query: 250 ST------PAPQPAMPALVTQPEQEISYIEAEL 276
                   PA  P  P          +  + E 
Sbjct: 424 PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEK 456


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.0 bits (85), Expect = 0.012
 Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 9/116 (7%)

Query: 155 KSGETPIPGPPESGEGS---EGPSLSSQNSNDQLPSPPR--ENSNEDVTPSFPAPTSSGG 209
           K+ E   P  P  GEG    EG S  S++ N++  S P+  +  N   +PS P+P  +  
Sbjct: 103 KTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNES 162

Query: 210 TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
              S  S  Q L  P    P I  P   +  +P+  P +PS  A  P    +  QP
Sbjct: 163 D--SDSSAQQQLLQPQ-GPPSIQVPPGAA-LAPSAPPPTPSAQAVPPQGSPIAAQP 214



 Score = 33.9 bits (77), Expect = 0.13
 Identities = 28/122 (22%), Positives = 35/122 (28%), Gaps = 19/122 (15%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            P P  P        PSL  Q    +LPSP               P ++    P PP+  
Sbjct: 215 APQPQQPSPLSLISAPSLHPQ----RLPSPHPPLQ----------PQTASQQSPQPPAPS 260

Query: 219 QNLPGPSISGPLIPGPSI-----PSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIE 273
              P  S  GP  P P            P+  P  P   A     P  +    Q  S+  
Sbjct: 261 SRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTP 320

Query: 274 AE 275
             
Sbjct: 321 PS 322



 Score = 33.5 bits (76), Expect = 0.16
 Identities = 31/109 (28%), Positives = 40/109 (36%), Gaps = 5/109 (4%)

Query: 164 PPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAP--TSSGGTLPSPPSMPQNL 221
            P S          SQ     LPS  + +S+   + S   P        LP  PSMP   
Sbjct: 288 HPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIK 347

Query: 222 PGPSISGPLIPGPS---IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
           P P+   P +P  S    P    P+  P  PS   P PA+  L + P  
Sbjct: 348 PPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTH 396



 Score = 32.7 bits (74), Expect = 0.24
 Identities = 28/110 (25%), Positives = 36/110 (32%), Gaps = 7/110 (6%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSND-QLPSPPRENSNEDVTPSFPA----PTSSGGT 210
           S   P P   ES   S       Q      +  PP         P  P+    P      
Sbjct: 151 SPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPI 210

Query: 211 LPSPPSMPQNLPGPSI-SGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
              P   PQ     S+ S P +    +PS P P + P + S  +PQP  P
Sbjct: 211 AAQPAPQPQQPSPLSLISAPSLHPQRLPS-PHPPLQPQTASQQSPQPPAP 259


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 36.1 bits (83), Expect = 0.021
 Identities = 27/114 (23%), Positives = 33/114 (28%), Gaps = 21/114 (18%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAP----TSSGGTLPSP 214
                PP    GS  P           P   R  +  +  P         T   G  P P
Sbjct: 262 KGYEPPPAPSGGSPAP-----------PGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEP 310

Query: 215 PSMPQNLPGPSISGPLIPGPSI------PSFPSPAVTPSSPSTPAPQPAMPALV 262
                   G    GP  P P        PS  +    P +P+TPA   A P +V
Sbjct: 311 AGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVPRARPVIV 364


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 36.2 bits (84), Expect = 0.022
 Identities = 16/104 (15%), Positives = 33/104 (31%), Gaps = 2/104 (1%)

Query: 165 PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
           P +   +  P+     +  +  +P      +      PA   +              P P
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425

Query: 225 SISGPLI-PGPSIPSFPSP-AVTPSSPSTPAPQPAMPALVTQPE 266
             +     P  +  + P+  A+ P+ P+  AP+     +   PE
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPE 469



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 19/110 (17%), Positives = 30/110 (27%), Gaps = 14/110 (12%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P P    +   S   +          P  P          + PA   +   L   P    
Sbjct: 401 PAPAAAPAAAASAPAA-----PPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQA 455

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
                +I   + P P++          S+   PA  PA   L    E ++
Sbjct: 456 APETVAIPVRVAPEPAVA---------SAAPAPAAAPAAARLTPTEEGDV 496



 Score = 28.5 bits (64), Expect = 5.9
 Identities = 11/67 (16%), Positives = 20/67 (29%), Gaps = 2/67 (2%)

Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPL-IPGPSIPSFPSPAVTPSSPSTPAPQP-AMPAL 261
           P ++              P  +      +   +    P+ A   ++ +  AP   A PA 
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425

Query: 262 VTQPEQE 268
           V  P   
Sbjct: 426 VAAPAAA 432



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 10/75 (13%), Positives = 20/75 (26%), Gaps = 2/75 (2%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLI--PGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           PA   +     +P +     P P+ +         + P+   PA   +  +         
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA 440

Query: 260 ALVTQPEQEISYIEA 274
           A            +A
Sbjct: 441 APAAVALAPAPPAQA 455


>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2. 
          Length = 466

 Score = 36.1 bits (84), Expect = 0.022
 Identities = 21/99 (21%), Positives = 29/99 (29%), Gaps = 7/99 (7%)

Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISG----- 228
            S  +    D L     +    DV     +            S  Q +   S +      
Sbjct: 343 HSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTT 402

Query: 229 -PLIPGPSIPSFPSPAVT-PSSPSTPAPQPAMPALVTQP 265
            PL   P +P +  P +  P SP T    P  P L T  
Sbjct: 403 IPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPTVI 441



 Score = 35.0 bits (81), Expect = 0.052
 Identities = 17/82 (20%), Positives = 22/82 (26%), Gaps = 14/82 (17%)

Query: 171 SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPL 230
                L  + + D   S    +S+   T +   P SS                P   GP 
Sbjct: 371 YSDDLLLDETTEDFSTSQLVSSSSRTSTTNTTIPLSSTPD------------VPVYYGPD 418

Query: 231 IPGPSIPSFPSPAVTPSSPSTP 252
           I  P  P        P  P  P
Sbjct: 419 IVLPESPGTTPI--VPVPPDLP 438


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 35.9 bits (83), Expect = 0.027
 Identities = 17/110 (15%), Positives = 29/110 (26%), Gaps = 8/110 (7%)

Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPS 238
                    P        + P+  AP ++      P    QN        P++  P+  S
Sbjct: 414 LGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRA---PSRNAQNAAQKPPMQPVMYPPNYQS 470

Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQ-----PEQEISYIEAELINKCQKY 283
            P     P   ST +       L        P+ +   +   L    +  
Sbjct: 471 LPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEAI 520


>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
          Length = 336

 Score = 35.7 bits (82), Expect = 0.027
 Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 9/93 (9%)

Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSIS 227
           G         S+ +   LP+ P   ++     +   P ++     + PS+PQ      + 
Sbjct: 249 GGEDLIIKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVL- 307

Query: 228 GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
                 P +P   +P   P++P  PA  P +PA
Sbjct: 308 ------PDVP-QVAPVAAPAAPEVPA-VPVVPA 332


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 35.5 bits (82), Expect = 0.039
 Identities = 24/125 (19%), Positives = 37/125 (29%), Gaps = 9/125 (7%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
             +P  P   +  + P LS Q      P PP+    +      P P +       P   P
Sbjct: 234 PQLPQQPPPLQQPQFPGLSQQ----MPPPPPQPPQQQ---QQPPQPQAQPPPQNQPTPHP 286

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELIN 278
               G +   P    P     P           P  +  +  L  Q  + +S  EA+   
Sbjct: 287 GLPQGQNAPLPPPQQP--QLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAK 344

Query: 279 KCQKY 283
           +  K 
Sbjct: 345 RRHKI 349



 Score = 33.2 bits (76), Expect = 0.17
 Identities = 22/123 (17%), Positives = 32/123 (26%), Gaps = 15/123 (12%)

Query: 160 PIPGPPESGEG--SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS---- 213
           P P PP++ E             S +++ +  ++       P  P      G  P     
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF 191

Query: 214 ----PPSMPQNLPGPSISGPLIPGPS-----IPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
               PP  P   P P    P    P          P+    P       P    P     
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL 251

Query: 265 PEQ 267
            +Q
Sbjct: 252 SQQ 254



 Score = 28.6 bits (64), Expect = 6.2
 Identities = 13/68 (19%), Positives = 15/68 (22%), Gaps = 1/68 (1%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPS-ISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
              P        LP+P   P   P P  +     P             P  P  P  Q  
Sbjct: 208 QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQ 267

Query: 258 MPALVTQP 265
            P      
Sbjct: 268 QPPQPQAQ 275


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 34.9 bits (80), Expect = 0.048
 Identities = 20/74 (27%), Positives = 29/74 (39%), Gaps = 2/74 (2%)

Query: 197 VTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIP-SFPSPAVTPSSPSTPAPQ 255
           V P    PTS+    PSPP+ P +    + +  +        S  + A    S S P+  
Sbjct: 84  VAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSG 143

Query: 256 P-AMPALVTQPEQE 268
             A   LV   E+E
Sbjct: 144 SDAASTLVVGSERE 157



 Score = 33.7 bits (77), Expect = 0.097
 Identities = 16/57 (28%), Positives = 25/57 (43%)

Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           P +  G +  P + P + P P+ S P  P   + + P+ AV   SPS  +     P 
Sbjct: 77  PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 34.5 bits (80), Expect = 0.062
 Identities = 13/68 (19%), Positives = 19/68 (27%), Gaps = 1/68 (1%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
           P     GG  P         P  + +            P+ A+    P   A   A PA 
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAA-VPDAAAAAAAPPAP 449

Query: 262 VTQPEQEI 269
              P+  +
Sbjct: 450 AAAPQPAV 457



 Score = 34.1 bits (79), Expect = 0.089
 Identities = 10/84 (11%), Positives = 21/84 (25%), Gaps = 2/84 (2%)

Query: 183 DQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
            + P  P                 +    P   +      GP+ +   +P  +  +   P
Sbjct: 388 QEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPP 447

Query: 243 AVTPSSPSTPAPQ--PAMPALVTQ 264
           A   +            + AL  +
Sbjct: 448 APAAAPQPAVRLNSFEDIVALAEE 471


>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter.  The Epithelial Na+
           Channel (ENaC) Family (TC 1.A.06)The ENaC family
           consists of sodium channels from animals and has no
           recognizable homologues in other eukaryotes or bacteria.
           The vertebrate ENaC proteins from epithelial cells
           cluster tightly together on the phylogenetic tree:
           voltage-insensitive ENaC homologues are also found in
           the brain. Eleven sequenced C. elegans proteins,
           including the degenerins, are distantly related to the
           vertebrate proteins as well as to each other. At least
           some ofthese proteins form part of a mechano-transducing
           complex for touch sensitivity. Other members of the ENaC
           family, the acid-sensing ion channels, ASIC1-3,are homo-
           or hetero-oligomeric neuronal H+-gated channels that
           mediate pain sensation in response to tissue acidosis.
           The homologous Helix aspersa(FMRF-amide)-activated Na+
           channel is the first peptide neurotransmitter-gated
           ionotropic receptor to be sequenced.Mammalian ENaC is
           important for the maintenance of Na+ balance and the
           regulation of blood pressure. Three homologous ENaC
           subunits, a, b and g, havebeen shown to assemble to form
           the highly Na+-selective channel.This model is designed
           from the vertebrate members of the ENaC family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 595

 Score = 34.3 bits (79), Expect = 0.072
 Identities = 15/53 (28%), Positives = 20/53 (37%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSF 239
              R     +      A T     L  PP+ P  LP P  SG  +PG   P++
Sbjct: 540 QRRRGPPYAEPPEPVSADTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPPNY 592


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 33.3 bits (76), Expect = 0.072
 Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 8/66 (12%)

Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           ++       G   S        P P       P P++PS P+P     +   PAP  A  
Sbjct: 33  TYEVEAKGLGIDLSAVQEQVPTPAP------APAPAVPSAPTP--VAPAAPAPAPASAGE 84

Query: 260 ALVTQP 265
            +VT P
Sbjct: 85  NVVTAP 90


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 34.4 bits (79), Expect = 0.081
 Identities = 17/107 (15%), Positives = 30/107 (28%), Gaps = 2/107 (1%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP--Q 219
           P PP + +  +  +        +  +    +S  D   + P   S   + P+  + P   
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
             P P  + P    P+           S    PA           P 
Sbjct: 490 FEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT 536



 Score = 33.3 bits (76), Expect = 0.19
 Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 202 PAPTSSGGTL-PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           PA T  G      P  +   +P P        G S       AVT ++ +  AP+ A  A
Sbjct: 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGAS-AVPAVTAVTGAAGAALAPKAAAAA 418

Query: 261 LVTQPE 266
             T+ E
Sbjct: 419 AATRAE 424



 Score = 29.8 bits (67), Expect = 2.2
 Identities = 15/80 (18%), Positives = 26/80 (32%), Gaps = 8/80 (10%)

Query: 189 PRENSNEDVTPSFPAPTSSGG------TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
               +  +  P+ PAP ++           +P     N    + S         P+    
Sbjct: 417 AAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGS 476

Query: 243 AVTPSSPSTPAP--QPAMPA 260
           A  P+S + P    +PA  A
Sbjct: 477 ASAPASDAPPDAAFEPAPRA 496



 Score = 27.9 bits (62), Expect = 9.6
 Identities = 11/83 (13%), Positives = 21/83 (25%), Gaps = 1/83 (1%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP-SIPSFPSPAVT 245
           +P    + +    +            +  S            P   G  S P+  +P   
Sbjct: 429 APAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA 488

Query: 246 PSSPSTPAPQPAMPALVTQPEQE 268
              P+  A  P+       P+  
Sbjct: 489 AFEPAPRAAAPSAATPAAVPDAR 511


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 33.4 bits (77), Expect = 0.092
 Identities = 26/115 (22%), Positives = 37/115 (32%), Gaps = 13/115 (11%)

Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
           IP  P+ G+  E   + +      LP+ P E + E V     A  S      +PP+ P  
Sbjct: 41  IPLVPKPGDRDEPDMMPAATQ--ALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVE 98

Query: 221 LPGPSISGPLI---PGPSIPSFPSPAVTPSSPSTPAPQPAMP--------ALVTQ 264
                +  P       P     P   V       P P+P +         A V Q
Sbjct: 99  PEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPTGKAYVVQ 153


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 33.9 bits (78), Expect = 0.099
 Identities = 16/66 (24%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 198 TPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP--SPAVTPSSPSTPAPQ 255
           + +  + +SS  + P   + P+       +  L+  PSI   P  S    PS    PA  
Sbjct: 12  SHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAAT 71

Query: 256 PAMPAL 261
              P L
Sbjct: 72  TQTPQL 77


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 16/95 (16%), Positives = 23/95 (24%), Gaps = 4/95 (4%)

Query: 183 DQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
              PSP R        P   A  +     P   +       P    P +P     +    
Sbjct: 373 AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLT 432

Query: 243 AVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELI 277
                    P   P  P      E+E + I    +
Sbjct: 433 RAAIPVDEKPKYTPPAP----PKEEEKALIADGDV 463



 Score = 33.6 bits (77), Expect = 0.14
 Identities = 21/89 (23%), Positives = 30/89 (33%), Gaps = 22/89 (24%)

Query: 201 FPAPTSSGGTLP----------SPPSMPQ-NLPGPSISGPLIPGPSIPSFPSPAVT---- 245
           F   T+S G LP           P   PQ   P  +   P+ P P+  + P  A      
Sbjct: 339 FQLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIP 398

Query: 246 -------PSSPSTPAPQPAMPALVTQPEQ 267
                   ++P    P+P  P +   PE 
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPES 427



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 22/116 (18%), Positives = 34/116 (29%), Gaps = 7/116 (6%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            P+P P  +   +  PS                 +N           +     P P + P
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP 420

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEA 274
             +P    S P +   +IP    P  TP     PAP       +      +  +EA
Sbjct: 421 --VPHTPESAPKLTRAAIPVDEKPKYTP-----PAPPKEEEKALIADGDVLEQLEA 469


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 32.0 bits (73), Expect = 0.12
 Identities = 17/60 (28%), Positives = 20/60 (33%)

Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           S  +P  S      PP      P      P+     +P    PA  P  P  P P PA P
Sbjct: 27  SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGP 86



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 6/71 (8%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTP 246
            PP            P P S+   LP P       P P       P    P+ P   + P
Sbjct: 39  DPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPE------PPVPGPAGPPSPLAP 92

Query: 247 SSPSTPAPQPA 257
            +P+   P P 
Sbjct: 93  PAPARKPPLPP 103



 Score = 27.7 bits (62), Expect = 4.1
 Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 8/75 (10%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP-----AVTPSSPSTPA 253
            + P      G    P     + P P  +  ++P PS P+ P P        P+ P +P 
Sbjct: 34  SAHPDDPPPVGD---PRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPL 90

Query: 254 PQPAMPALVTQPEQE 268
             PA       P   
Sbjct: 91  APPAPARKPPLPPPR 105


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 33.7 bits (77), Expect = 0.15
 Identities = 15/99 (15%), Positives = 34/99 (34%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
               GP      +      +  +    P+       E++ P F +P  +         + 
Sbjct: 464 PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
           +++P P+ + P     ++   P+ A  P + +   P  A
Sbjct: 524 ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562



 Score = 32.5 bits (74), Expect = 0.28
 Identities = 20/104 (19%), Positives = 36/104 (34%), Gaps = 4/104 (3%)

Query: 159 TPIPGPPESGEGSEGPSLS---SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
            P P  P +   +   + +   +  +     SP  E        S   P  +    P+P 
Sbjct: 396 APAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPA 455

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           + P     P+ +GP     +  + P+ A  P++   PA     P
Sbjct: 456 AAPAAAARPAAAGPRPVAAAAAAAPARA-APAAAPAPADDDPPP 498



 Score = 31.4 bits (71), Expect = 0.78
 Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 1/113 (0%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
           +   P    P +   +     ++  +     +  R  +      S PAP +      +  
Sbjct: 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS-PAPEALAAARQASA 441

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
             P   P P+ +    P  +     +     ++ +  AP  A PA    P  +
Sbjct: 442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADD 494



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 19/93 (20%), Positives = 30/93 (32%), Gaps = 12/93 (12%)

Query: 178 SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIP 237
            Q+     P+           P+  AP ++     +PP+ P     P+ +       + P
Sbjct: 366 GQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPA--AAPAAAAAARAVAAAP 423

Query: 238 SFPSPAVTP----------SSPSTPAPQPAMPA 260
           +  SPA                  PAP PA  A
Sbjct: 424 ARRSPAPEALAAARQASARGPGGAPAPAPAPAA 456



 Score = 29.8 bits (67), Expect = 1.9
 Identities = 17/113 (15%), Positives = 32/113 (28%), Gaps = 2/113 (1%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
            G         +      P+ ++  +    P+ P          +  A   +       P
Sbjct: 369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP 428

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPS-SPSTPAPQPAMPALVTQPEQ 267
           + P+ L     +    PG +    P+PA  P+ +    A  P   A       
Sbjct: 429 A-PEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAP 480



 Score = 29.5 bits (66), Expect = 2.6
 Identities = 17/87 (19%), Positives = 24/87 (27%), Gaps = 7/87 (8%)

Query: 186 PSPPRENSNEDVTP---SFPAPTSSGGTLPSPP--SMPQNL--PGPSISGPLIPGPSIPS 238
            + PR  +         + PA   +      PP   +P     P P+       G    S
Sbjct: 466 AAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES 525

Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQP 265
            P PA      +     PA  A     
Sbjct: 526 IPDPATADPDDAFETLAPAPAAAPAPR 552



 Score = 28.7 bits (64), Expect = 4.4
 Identities = 17/111 (15%), Positives = 32/111 (28%), Gaps = 9/111 (8%)

Query: 159 TPIPGPPESGEGSEG-PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
            P      +  G     + ++       P+     +++D  P    P        SP   
Sbjct: 457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP----PEFASPAPA 512

Query: 218 PQNLPGPSISGPLIPGPSI----PSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
             +          IP P+      +F + A  P++   P    A   +V  
Sbjct: 513 QPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563



 Score = 28.3 bits (63), Expect = 6.7
 Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 1/70 (1%)

Query: 200 SFPAPTSSGGTLPSPP-SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
           +F    S GG  P+   + P   P P+ + P    P+  + P+      + +  A   A 
Sbjct: 362 AFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA 421

Query: 259 PALVTQPEQE 268
                 P  E
Sbjct: 422 APARRSPAPE 431



 Score = 27.9 bits (62), Expect = 8.8
 Identities = 11/95 (11%), Positives = 23/95 (24%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
                  +   +  P+       ++LP      +      +     +     P+      
Sbjct: 477 AAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDD 536

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
                + +    P P   +   P V P  P   A 
Sbjct: 537 AFETLAPAPAAAPAPRAAAATEPVVAPRPPRASAS 571


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 33.4 bits (76), Expect = 0.15
 Identities = 20/78 (25%), Positives = 29/78 (37%), Gaps = 1/78 (1%)

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
              PR  ++ + TP+    TS   T    PS     P    + P   G   P  P     
Sbjct: 209 RPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTT-PEAEGTPAPPTPGGGEA 267

Query: 246 PSSPSTPAPQPAMPALVT 263
           P + +TPAP+ +   L  
Sbjct: 268 PPANATPAPEASRYELTV 285



 Score = 31.8 bits (72), Expect = 0.47
 Identities = 31/119 (26%), Positives = 41/119 (34%), Gaps = 7/119 (5%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLP---SP 214
             P  G   +    EG +  S +      S PR    +   P+ P PT      P     
Sbjct: 165 AFPAEGTLAAPPLGEGSADGSCDP-ALPLSAPRLGPADVFVPATPRPTPRTTASPETTPT 223

Query: 215 PSMPQNLPGPSISGP--LIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISY 271
           PS   + P  +I  P   I  P   + P    TP+ P TP    A PA  T   +   Y
Sbjct: 224 PSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPP-TPGGGEAPPANATPAPEASRY 281


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 33.1 bits (75), Expect = 0.17
 Identities = 26/99 (26%), Positives = 33/99 (33%), Gaps = 22/99 (22%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P PGP         P+LS+    D                  P  T+ G   PSPP    
Sbjct: 201 PQPGPSGQNRSPRTPTLSNVRVLDA-----------------PVATNRGEA-PSPPRTDT 242

Query: 220 NLPGPSISGPLIPG----PSIPSFPSPAVTPSSPSTPAP 254
             P P+I+GP           PS P P +     + P P
Sbjct: 243 LDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDP 281


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 33.1 bits (75), Expect = 0.18
 Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 3/119 (2%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENS-NEDVTPSFPAPTSSGG--TLP 212
           + +TP+     +   SEG +    ++ +    PP E++ +E  T +  A TS+    T  
Sbjct: 402 TAQTPVQTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTAS 461

Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISY 271
              + P+N    + +        +   PS     ++P+  A +    A     E    Y
Sbjct: 462 EAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGY 520


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 33.0 bits (75), Expect = 0.19
 Identities = 22/116 (18%), Positives = 33/116 (28%), Gaps = 9/116 (7%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENS------NEDVTPSFPAPTSSGGTLP 212
            PI  P      +   S S+            E++      + D   S  + T +  +  
Sbjct: 87  APITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSD 146

Query: 213 SPPSMPQNLPGPSISGPLIPGP---SIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
           +  S    LP  S+     P     S  S       PS   +  P  A  A    P
Sbjct: 147 TSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQAKPELP 202


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 23/112 (20%), Positives = 34/112 (30%), Gaps = 6/112 (5%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLP----SPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
                     E  +      +   P    + PR+   +      P   +     P+P  +
Sbjct: 70  RVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPV 129

Query: 218 PQNLPGPSISGPLIPG--PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
            Q    P    PL     P +   P P  +   P+  A QPA P    QPE 
Sbjct: 130 QQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEP 181



 Score = 30.8 bits (70), Expect = 0.93
 Identities = 22/96 (22%), Positives = 28/96 (29%), Gaps = 13/96 (13%)

Query: 186 PSPPRENSNEDVTPSF---PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG-------PS 235
           P+  +E+     +P     P   S+    P        +P      P  P          
Sbjct: 75  PANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAY 134

Query: 236 IPSFPSPAVTPSSPS-TPAPQPAM--PALVTQPEQE 268
            P    P   P SP   PAPQP    P    Q  Q 
Sbjct: 135 QPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQP 170


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 32.6 bits (74), Expect = 0.20
 Identities = 16/62 (25%), Positives = 22/62 (35%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
           P+ P P  S       P+ P   P P+      P     + P P   P++P  P   P  
Sbjct: 41  PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100

Query: 259 PA 260
           P 
Sbjct: 101 PP 102



 Score = 32.6 bits (74), Expect = 0.23
 Identities = 15/67 (22%), Positives = 20/67 (29%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
           PAP     +  +    P   P P                +P   P+ P+ P P P  P  
Sbjct: 41  PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100

Query: 262 VTQPEQE 268
              P  E
Sbjct: 101 PPPPAPE 107



 Score = 29.9 bits (67), Expect = 1.7
 Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 3/71 (4%)

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
           P+PP    +       PA        P+ P  PQ  P    + P  P     + P P V 
Sbjct: 41  PAPPPPPPSTAAAAPAPAA-PPPPPPPAAPPAPQ--PDDPNAAPPPPPADPNAPPPPPVD 97

Query: 246 PSSPSTPAPQP 256
           P++P  PAP+P
Sbjct: 98  PNAPPPPAPEP 108



 Score = 29.1 bits (65), Expect = 2.8
 Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 5/92 (5%)

Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP---- 240
           +  P   N++    P  P+  ++     +PP  P     P+   P  P  + P  P    
Sbjct: 31  IALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPA-PQPDDPNAAPPPPPADPN 89

Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYI 272
           +P   P  P+ P P    P  +       SY+
Sbjct: 90  APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYV 121


>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 211

 Score = 32.5 bits (74), Expect = 0.21
 Identities = 18/83 (21%), Positives = 29/83 (34%), Gaps = 9/83 (10%)

Query: 189 PRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTP-- 246
           PR    ++  P   +P S+     +P       P PS   P  P     S     + P  
Sbjct: 90  PRRTEPQEPDPLDESPASAAPVASAPA------PAPSPQSP-KPASRRASRDMRRIAPFG 142

Query: 247 SSPSTPAPQPAMPALVTQPEQEI 269
            + S PA + A  +     + E+
Sbjct: 143 MNASAPAQEAAQASSDEDEDAEL 165


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 32.9 bits (76), Expect = 0.23
 Identities = 15/61 (24%), Positives = 22/61 (36%), Gaps = 1/61 (1%)

Query: 210 TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
            LP P   PQ+   P+ S      P+    P  A     P   APQ A    + +   ++
Sbjct: 364 PLPEPEVPPQSAA-PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422

Query: 270 S 270
            
Sbjct: 423 L 423



 Score = 30.2 bits (69), Expect = 1.6
 Identities = 14/109 (12%), Positives = 30/109 (27%), Gaps = 3/109 (2%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
            P+P P    + +  P+ S+Q +     +     +     P   AP  +        +  
Sbjct: 363 APLPEPEVPPQSAA-PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421

Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
             L           G +      PA    +    +    + ++   P  
Sbjct: 422 --LLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSA 468



 Score = 29.8 bits (68), Expect = 2.3
 Identities = 10/63 (15%), Positives = 18/63 (28%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
           P+ P P        + P+          +    P       P  +    +P+ P P+   
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420

Query: 259 PAL 261
             L
Sbjct: 421 QLL 423


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 32.4 bits (73), Expect = 0.23
 Identities = 16/81 (19%), Positives = 30/81 (37%), Gaps = 4/81 (4%)

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
            +PP + +     PS P  T+       P     +        P     S+P+  +   T
Sbjct: 113 TAPPNDVATP--VPSVPEATAEAAAKAGPQDDEADGEQAD-EAPAHNPESVPTRAARETT 169

Query: 246 PSSPSTPA-PQPAMPALVTQP 265
            ++  T   PQ +   ++T+P
Sbjct: 170 EANRPTATPPQSSSALVITKP 190



 Score = 28.9 bits (64), Expect = 3.6
 Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 5/77 (6%)

Query: 159 TPIPGPPESG--EGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
           TP+P  PE+     ++      +   +Q    P  N     T +    T +     +PP 
Sbjct: 121 TPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPTATPPQ 180

Query: 217 MPQNLPGPSISGPLIPG 233
               L    I+ PL  G
Sbjct: 181 SSSAL---VITKPLRSG 194


>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21). 
          Length = 326

 Score = 32.4 bits (74), Expect = 0.25
 Identities = 21/98 (21%), Positives = 36/98 (36%), Gaps = 6/98 (6%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
           PG   S + +   S S Q  +   P      +   ++ S   P +  G +     +P + 
Sbjct: 235 PGTAPSFDATPSVSPSGQPLSPAAPPGTSSVAGTALSAS---PAALFGDMVY---VPLDA 288

Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
               ++G     P  P  P+P    + P+   P PA P
Sbjct: 289 YNQLLAGQAFNQPPDPQGPAPPAELAPPAPAPPPPANP 326


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 32.4 bits (73), Expect = 0.27
 Identities = 18/95 (18%), Positives = 30/95 (31%), Gaps = 11/95 (11%)

Query: 165 PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL-PG 223
           P + E     S   + S      P +E        + PAP           +  ++  P 
Sbjct: 80  PNTNEEKTSASKIEKISQ-----PKQEEQKSLNISATPAPKQE-----QSQTTTESTTPK 129

Query: 224 PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
             ++ P       P   + + TP SP+    Q  M
Sbjct: 130 TKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDM 164


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 31.7 bits (72), Expect = 0.28
 Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 3/50 (6%)

Query: 212 PSPPSMPQNLPGPSISGPLI-PGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           P  P M +       + PL  P  + PS+P P           PQP MPA
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYPGP--QYQGYHPMPPQPGMPA 156



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 2/69 (2%)

Query: 192 NSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPST 251
                +T +      +   LP PPS   + PGP   G   P P  P  P+P  +   P  
Sbjct: 109 PQRPVMTRATSTTVQTT-PLPQPPSTAPSYPGPQYQG-YHPMPPQPGMPAPPYSLQYPPP 166

Query: 252 PAPQPAMPA 260
              QP  P 
Sbjct: 167 GLLQPQGPP 175


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 32.4 bits (74), Expect = 0.28
 Identities = 24/141 (17%), Positives = 37/141 (26%), Gaps = 21/141 (14%)

Query: 131 ETSEDIEQNKKYAKWKAAYIHNCLKSGET-----PIPGPPESGEGSEGPSLSSQNSNDQL 185
           E+ E        A+           S  T      +  P E+G  S G   +S       
Sbjct: 19  ESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG--SGGGVATSTI----- 71

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
            + PR     + T   P    +   LP         PGP+   P     +         T
Sbjct: 72  YTVPRPPRGPEQTLDKPDSLPASRELP---------PGPTPVPPGGFRGASSPRLGADST 122

Query: 246 PSSPSTPAPQPAMPALVTQPE 266
                     P + A + +  
Sbjct: 123 SPRFLYQVNFPVILAPIGESN 143


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 32.6 bits (75), Expect = 0.29
 Identities = 10/83 (12%), Positives = 21/83 (25%), Gaps = 1/83 (1%)

Query: 195 EDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
                   A  ++     +P + P      + +       +  + P PA   ++      
Sbjct: 39  GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA-AAPA 97

Query: 255 QPAMPALVTQPEQEISYIEAELI 277
            P   A    P       E   +
Sbjct: 98  APPAAAAAAAPAAAAVEDEVTPL 120



 Score = 30.6 bits (70), Expect = 1.2
 Identities = 12/80 (15%), Positives = 25/80 (31%)

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
            +     +      +  AP ++          P      + + P  P  +  +  +PA  
Sbjct: 40  STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAP 99

Query: 246 PSSPSTPAPQPAMPALVTQP 265
           P++ +  AP  A       P
Sbjct: 100 PAAAAAAAPAAAAVEDEVTP 119



 Score = 30.2 bits (69), Expect = 1.8
 Identities = 8/60 (13%), Positives = 15/60 (25%)

Query: 201 FPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
           F          P+  +        + +            P+P    +  + P P  A  A
Sbjct: 33  FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92



 Score = 27.9 bits (63), Expect = 8.1
 Identities = 16/102 (15%), Positives = 33/102 (32%), Gaps = 22/102 (21%)

Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
           GP  +   +   + ++  ++    +P  +       P+ PAP       P+ P       
Sbjct: 37  GPGSTAAPTAAAAAAAAAASAPAAAPAAK------APAAPAPAPPAAAAPAAP------- 83

Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
                    P P+  +  + A      +  A  PA  A+  +
Sbjct: 84  ---------PKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 32.7 bits (74), Expect = 0.31
 Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 4/81 (4%)

Query: 178 SQNSNDQLPSPPRENSNEDVTP-SFPAPTSSGGTLPSPPS---MPQNLPGPSISGPLIPG 233
              +N  +P PP+ +      P S P  + SGG +PSPP+    P      S +      
Sbjct: 419 QVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQ 478

Query: 234 PSIPSFPSPAVTPSSPSTPAP 254
             + S   P  TP   S  +P
Sbjct: 479 QDMVSPGGPLNTPGQSSVNSP 499



 Score = 31.1 bits (70), Expect = 0.98
 Identities = 18/62 (29%), Positives = 25/62 (40%), Gaps = 4/62 (6%)

Query: 213 SPPSMPQNLPGPSISG----PLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
            P  M    P P +      P  P PS+PS   P   P    +    P+ PAL+  P  +
Sbjct: 407 QPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQ 466

Query: 269 IS 270
           +S
Sbjct: 467 MS 468


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.1 bits (73), Expect = 0.36
 Identities = 15/68 (22%), Positives = 19/68 (27%), Gaps = 8/68 (11%)

Query: 205 TSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPST------PAPQPAM 258
             S G    PP   +    P    P         +  P      P        P  +PA 
Sbjct: 89  DPSAGEPAPPPPHARRTSEPE--LPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAY 146

Query: 259 PALVTQPE 266
           PA   +PE
Sbjct: 147 PAYQQRPE 154



 Score = 31.7 bits (72), Expect = 0.50
 Identities = 23/113 (20%), Positives = 29/113 (25%), Gaps = 9/113 (7%)

Query: 154 LKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPS-PPRENSNEDVTPSFPAPTSSGGTLP 212
            + G  P          +       +  +   P    R    +   P +  P       P
Sbjct: 170 QRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRP 229

Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
            PP        P        GP  P      V P  PS P P  A PA    P
Sbjct: 230 EPP--------PGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGP 274



 Score = 30.9 bits (70), Expect = 1.0
 Identities = 26/112 (23%), Positives = 36/112 (32%), Gaps = 10/112 (8%)

Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSS--GGTLPSPPSMP 218
           I   P +GE +  P  + + S  +LP P R          +  P +      LP    +P
Sbjct: 86  ITVDPSAGEPAPPPPHARRTSEPELPRPGRRPY-----EGYGGPRADDRPPGLPRQDQLP 140

Query: 219 QNLPG-PSISGPLIPGP-SIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
              P  P+      PG     +             P    A PA    PEQE
Sbjct: 141 TARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPA-SYAPEQE 191



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 17/122 (13%)

Query: 156 SGETPIPGP-PESGEGSEGPS--------LSSQNSNDQLPSPPRENSNEDVTPSFPAPTS 206
           +GE   P P        E P              ++D+ P  PR++      P++PA   
Sbjct: 92  AGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQ- 150

Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
                      P   P  +               +P  +P+S +    +   P    +PE
Sbjct: 151 -------QRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPE 203

Query: 267 QE 268
            +
Sbjct: 204 YD 205


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 32.1 bits (73), Expect = 0.36
 Identities = 16/78 (20%), Positives = 22/78 (28%), Gaps = 5/78 (6%)

Query: 196 DVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSF-----PSPAVTPSSPS 250
            V P  PAP      L       ++L     +     G  +P F       PA  P+  +
Sbjct: 352 AVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSA 411

Query: 251 TPAPQPAMPALVTQPEQE 268
              P  A  A        
Sbjct: 412 EAPPLVAPAAAPAGLALR 429



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 19/108 (17%), Positives = 30/108 (27%), Gaps = 13/108 (12%)

Query: 148 AYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSS 207
           A +H  L  G  P  G       +  P+  +Q  N                    +  S+
Sbjct: 334 ALLHALLALGGAPSEGVAAVAPPAPAPADLTQRLNRL-------------EKEVRSLRSA 380

Query: 208 GGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQ 255
                +    P     P   GP  P P+  +   P V P++       
Sbjct: 381 PTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLAL 428



 Score = 28.3 bits (63), Expect = 6.6
 Identities = 17/83 (20%), Positives = 25/83 (30%), Gaps = 2/83 (2%)

Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP--APTSSGGTLPSPPSMPQNLPGPS 225
            EG    +  +    D      R         S P  A T++G  LP     P+  P P 
Sbjct: 347 SEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPE 406

Query: 226 ISGPLIPGPSIPSFPSPAVTPSS 248
            +      P +    +PA     
Sbjct: 407 PARSAEAPPLVAPAAAPAGLALR 429


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 32.0 bits (73), Expect = 0.41
 Identities = 21/113 (18%), Positives = 34/113 (30%), Gaps = 5/113 (4%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQN-----SNDQLPSPPRENSNEDVTPSFPAPTSSGGT 210
           +  +  P        +   S SS        +D+L       +      S P   S+  T
Sbjct: 144 AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTT 203

Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
           L SP ++      P          S  S  +     SS +TP    +  +  T
Sbjct: 204 LESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPT 256



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 23/89 (25%), Positives = 29/89 (32%), Gaps = 2/89 (2%)

Query: 171 SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPL 230
                 SS  S    P+     SN  VT   P P S+G +  S  S   NL   S     
Sbjct: 187 RGRRPSSSPRSLSN-PTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATP 245

Query: 231 IPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
             G S  S P+ + + S         A  
Sbjct: 246 RSGESFRSTPT-SGSSSINPVSGLDEAEE 273


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 20/86 (23%), Positives = 26/86 (30%), Gaps = 15/86 (17%)

Query: 184 QLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG---------- 233
           QLP P +           P P      LP     P     PSI  P  P           
Sbjct: 358 QLPPPTK-----RRLRLLPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASS 412

Query: 234 PSIPSFPSPAVTPSSPSTPAPQPAMP 259
                 P   +     S+P+P P++P
Sbjct: 413 SKHHRLPPSVLPGPRLSSPSPSPSLP 438



 Score = 28.9 bits (65), Expect = 4.1
 Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 7/82 (8%)

Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN-------LPGPSISGPLI 231
           +N    LPSP     +  +    P    +    P  P+  ++       LP PS    L+
Sbjct: 222 ENQLHPLPSPQPVRHSPGILGPSPLHPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLL 281

Query: 232 PGPSIPSFPSPAVTPSSPSTPA 253
           P P +P      + P + S P 
Sbjct: 282 PNPRLPRASRGHLPPPTSSAPP 303



 Score = 27.8 bits (62), Expect = 8.4
 Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 1/59 (1%)

Query: 198 TPSFPAPTSSG-GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQ 255
           +   P PT      LP PP   Q LP  +++  +   PSIP    P+  PS     + +
Sbjct: 356 SSQLPPPTKRRLRLLPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASSSK 414


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 32.1 bits (73), Expect = 0.45
 Identities = 22/152 (14%), Positives = 37/152 (24%), Gaps = 28/152 (18%)

Query: 114 KAFYTASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGP-PESGEGSE 172
           + F  A++   ++  F    +       Y            ++G   +PGP P       
Sbjct: 620 RTFGRATVGEMIISGFPPVFKTALPRPDYN--------RGGEAGGPGVPGPVPVGMPAHT 671

Query: 173 GPSLSSQ-------NSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPS 225
                          ++      P+        P    P   GG LP PP         +
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRAPQ-------APRPGGPPGGGGGLPPPP-----DLPAA 719

Query: 226 ISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
                     I S  +P       +  A    
Sbjct: 720 AGPAPCGSSLIASPTAPPEPEPPGAEQADGAE 751



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 25/122 (20%), Positives = 34/122 (27%), Gaps = 12/122 (9%)

Query: 156 SGETPIPG-PPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL--- 211
            GE  I G PP        P  +          P           + P+  + G  +   
Sbjct: 627 VGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPT 686

Query: 212 -----PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
                   P  P+   GP   G  +P P  P  P+ A      S+    P  P     P 
Sbjct: 687 AHHAALRAPQAPRP-GGPPGGGGGLPPP--PDLPAAAGPAPCGSSLIASPTAPPEPEPPG 743

Query: 267 QE 268
            E
Sbjct: 744 AE 745


>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
           secretion system, contains a FHA domain [Intracellular
           trafficking, secretion, and vesicular    transport;
           Signal transduction mechanisms].
          Length = 430

 Score = 31.7 bits (72), Expect = 0.46
 Identities = 21/111 (18%), Positives = 29/111 (26%), Gaps = 9/111 (8%)

Query: 161 IPGPPESGEGSEGPS-LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           I   P    G   P  L+    N     P     +E +           G  PS     +
Sbjct: 136 IEPEPRLPPGLTDPLQLADPVPNGFSADPLSALQSESLIA----QPDPAGGAPSIS---R 188

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
           N   P  S P   G     FP     P   ++P      P +      +  
Sbjct: 189 NSEAPL-SDPASAGGIDTPFPDDLDVPPIAASPPGPQEGPTISASQPAQRD 238


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 32.0 bits (73), Expect = 0.47
 Identities = 18/99 (18%), Positives = 29/99 (29%), Gaps = 10/99 (10%)

Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT-----SSGGTLPSPPSMPQNLPGPSISG 228
           P     + +D   + P    +  +      PT         T     S+  + P P    
Sbjct: 639 PGDDPSDVDDSQATQPDVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAP---- 694

Query: 229 PLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
            L      P+  S +  P S S       +P   +Q E 
Sbjct: 695 RLAQSSQPPTSQSSSDLPPSSSQAFSLSDLPMQ-SQSES 732


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 31.8 bits (72), Expect = 0.49
 Identities = 19/97 (19%), Positives = 30/97 (30%), Gaps = 16/97 (16%)

Query: 164 PPESG-EGSEGPSLSSQNSNDQLP-----SPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
           P   G EG +    +       LP         E S++ V P   A        P   S+
Sbjct: 317 PDTEGREGDDLAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVPVAAAT-------PVDASL 369

Query: 218 PQNLPGPSISGPLIP--GPSIPSFPSPAVTPSSPSTP 252
           P    G + +            + P+ A T  + + P
Sbjct: 370 PDVRTGTAPTSLANVSHADPAVAQPTQAATL-AGAAP 405


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 32.0 bits (72), Expect = 0.50
 Identities = 19/106 (17%), Positives = 27/106 (25%), Gaps = 5/106 (4%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
           P P        G +     +  +   P           + P         P     P   
Sbjct: 701 PTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 760

Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
           PG        P  + P  P+P   P +P  P  +P        P Q
Sbjct: 761 PGR-----ARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQ 801


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.9 bits (73), Expect = 0.51
 Identities = 22/111 (19%), Positives = 32/111 (28%), Gaps = 13/111 (11%)

Query: 176 LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPS 235
           L SQ+ +    +              P P  S       P +PQ         P  P P+
Sbjct: 510 LESQSGSASNTAKTP-----------PPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPT 558

Query: 236 IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKF 286
                S A       +  P P +P   T    + S  E E+    +    F
Sbjct: 559 ATQASSNAPAQIPADSS-PPPPIPEEPTPSPTKDSSPE-EIDKAAKNLADF 607



 Score = 30.3 bits (69), Expect = 1.6
 Identities = 20/81 (24%), Positives = 27/81 (33%), Gaps = 3/81 (3%)

Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV 244
           LPS      +E    S PA  +       PP+  Q     +      P P       P  
Sbjct: 360 LPSA---FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIP 416

Query: 245 TPSSPSTPAPQPAMPALVTQP 265
            P+ P+ P+P P   A    P
Sbjct: 417 VPAEPTEPSPTPPANAANAPP 437



 Score = 29.9 bits (68), Expect = 2.1
 Identities = 19/72 (26%), Positives = 22/72 (30%), Gaps = 10/72 (13%)

Query: 209 GTLPSPPSMPQNLPGPSISG---------PLIPGPSIPSFPSPAVTPSSPSTPAPQPA-M 258
           G LPS             +          P    PS P     A TPS P   A  P  +
Sbjct: 358 GLLPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPV 417

Query: 259 PALVTQPEQEIS 270
           PA  T+P     
Sbjct: 418 PAEPTEPSPTPP 429



 Score = 28.0 bits (63), Expect = 7.2
 Identities = 16/96 (16%), Positives = 25/96 (26%), Gaps = 8/96 (8%)

Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG 233
           PS       +         +     P  P   S+  T  +  + P   P           
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATT-PSPPPAK-------AS 412

Query: 234 PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
           P IP    P     +P   A        + +  Q+I
Sbjct: 413 PPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQQI 448


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 31.6 bits (71), Expect = 0.52
 Identities = 22/95 (23%), Positives = 33/95 (34%), Gaps = 4/95 (4%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQ---LPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
            P+P  P    GSE  S +S +SN +   L +    +  E+ T      T    +LP   
Sbjct: 186 APLPVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQ 245

Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
             P  LP  + +GP             +     P 
Sbjct: 246 HQPA-LPRQNSAGPQRRVDQPSPPGGGSHRGRIPP 279



 Score = 27.8 bits (61), Expect = 8.9
 Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 4/75 (5%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF---PAPTSSGGTLPSP 214
           +  +  P   G GS    +     +  LPS     S+E            +SS G  P  
Sbjct: 259 QRRVDQPSPPGGGSHRGRIPPSLLS-SLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPEL 317

Query: 215 PSMPQNLPGPSISGP 229
                +L  PS    
Sbjct: 318 RPAAGHLQQPSPVNM 332


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 31.6 bits (72), Expect = 0.53
 Identities = 16/74 (21%), Positives = 24/74 (32%), Gaps = 16/74 (21%)

Query: 175 SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP 234
           S SS  S+    SP   N                 +     + PQ++  P IS       
Sbjct: 82  SGSSSLSSGNQSSPSAAN---------NTSDGHDASGVKNTAPPQDISAPPISPT----- 127

Query: 235 SIPSFPSPAVTPSS 248
             P+  +P  TP+ 
Sbjct: 128 --PTQAAPPQTPNG 139


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 31.6 bits (71), Expect = 0.58
 Identities = 21/107 (19%), Positives = 34/107 (31%), Gaps = 6/107 (5%)

Query: 168 GEGSEGPSLSSQNSNDQLPSPPRE---NSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
                 P+  SQ +    P+        +N+ ++ + P  T    T  +P   PQ  P P
Sbjct: 181 PTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPE--PQTEPPP 238

Query: 225 SISGPLIPGPSIPSFPSPAVTPSSPS-TPAPQPAMPALVTQPEQEIS 270
           S  GP       PS  S   + +        +   P   +      S
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHS 285


>gnl|CDD|237695 PRK14376, PRK14376, hypothetical protein; Provisional.
          Length = 176

 Score = 30.7 bits (69), Expect = 0.63
 Identities = 13/51 (25%), Positives = 15/51 (29%)

Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
           P S+GG  P P   P    GP               P  AV   +P     
Sbjct: 99  PLSAGGYDPVPGEAPGTSVGPGTVRKPRGRTRTAGGPVGAVDAGTPHRSRR 149


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
          the N-terminal 200 residues of a set of proteins
          conserved from yeasts to humans. Most of the proteins
          in this entry have an RhoGEF pfam00621 domain at their
          C-terminal end.
          Length = 158

 Score = 30.1 bits (68), Expect = 0.75
 Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 13/69 (18%)

Query: 8  CPDSLKAIQHFLNVASE---HDNRDLTVSYWCRLHALQKALKIDKSSDGAKMLLLKLMDW 64
            DSL +    L V  E    DN  L+  Y  R   L+K          A+  LL+  D 
Sbjct: 37 NLDSLASENQELEVELELLREDNERLSTQY-EREKELRKQ---------AEQKLLEFEDE 86

Query: 65 LEQEKKTHR 73
          LEQEKK  +
Sbjct: 87 LEQEKKELQ 95


>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
          Length = 955

 Score = 31.3 bits (70), Expect = 0.75
 Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 5/55 (9%)

Query: 212 PSPPSMPQN----LPGPSISGPLIPGPSIPSFPSPAV-TPSSPSTPAPQPAMPAL 261
           P  P +P      +P P I  P+ P P  P  P P +  P  P    P+P  P +
Sbjct: 560 PDEPIIPDPVDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPVDPEPVDPVI 614


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 30.9 bits (69), Expect = 0.77
 Identities = 17/54 (31%), Positives = 25/54 (46%)

Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
           LP+ PSMP  L   +      P P+     +PA    +P+ P P    PA++ Q
Sbjct: 93  LPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQ 146


>gnl|CDD|218232 pfam04731, Caudal_act, Caudal like protein activation region.  This
           family consists of the amino termini of proteins
           belonging to the caudal-related homeobox protein family.
           This region is thought to mediate transcription
           activation. The level of activation caused by mouse Cdx2
           is affected by phosphorylation at serine 60 via the
           mitogen-activated protein kinase pathway. Caudal family
           proteins are involved in the transcriptional regulation
           of multiple genes expressed in the intestinal
           epithelium, and are important in differentiation and
           maintenance of the intestinal epithelial lining. Caudal
           proteins always have a homeobox DNA binding domain
           (pfam00046).
          Length = 135

 Score = 30.1 bits (68), Expect = 0.77
 Identities = 21/65 (32%), Positives = 29/65 (44%), Gaps = 4/65 (6%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP-SIPSFPSPAVT 245
            PPRE+ +       P P+++  T  SP  M  + P  S   P  PGP  +    S +V 
Sbjct: 53  GPPREDWS--AYGPGPGPSATAATG-SPGQMAYSPPDYSPVHPPGPGPGLLQPADSGSVE 109

Query: 246 PSSPS 250
             SPS
Sbjct: 110 QLSPS 114


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.1 bits (70), Expect = 0.78
 Identities = 20/79 (25%), Positives = 24/79 (30%), Gaps = 7/79 (8%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSIS----GPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
           P+P  S       P    +LP P        P  PG      PSPA  P +P    P   
Sbjct: 382 PSPAGSPDVKKKAP--EPDLPQPDRHPGPAKPEAPGARPAELPSPASAP-TPEQQPPVAR 438

Query: 258 MPALVTQPEQEISYIEAEL 276
              L   P+       A  
Sbjct: 439 SAPLPPSPQASAPRNVASG 457



 Score = 30.3 bits (68), Expect = 1.4
 Identities = 25/95 (26%), Positives = 32/95 (33%), Gaps = 20/95 (21%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP-APTSSGGTLPSPPS- 216
           +P   P    +  E            LP P R        P+ P AP +    LPSP S 
Sbjct: 383 SPAGSPDVKKKAPEPD----------LPQPDRHPG-----PAKPEAPGARPAELPSPASA 427

Query: 217 -MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
             P+  P  + S PL P P   +     V    P 
Sbjct: 428 PTPEQQPPVARSAPLPPSPQ--ASAPRNVASGKPG 460



 Score = 29.1 bits (65), Expect = 3.8
 Identities = 22/86 (25%), Positives = 28/86 (32%), Gaps = 4/86 (4%)

Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPS 238
           +N     PSP      +   P    P       P+ P  P   P    S    P P    
Sbjct: 375 RNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPE--- 431

Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQ 264
              P V  S+P  P+PQ + P  V  
Sbjct: 432 -QQPPVARSAPLPPSPQASAPRNVAS 456


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 31.2 bits (70), Expect = 0.86
 Identities = 24/112 (21%), Positives = 35/112 (31%), Gaps = 12/112 (10%)

Query: 197 VTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS-TPAPQ 255
            T S+ AP       P   S+      P+++   +PGP       P + P+        Q
Sbjct: 329 ATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQT---GEPVIAPAPEGYPQQSQ 385

Query: 256 PAMPALV--------TQPEQEISYIEAELINKCQKYIKFASSALNYDDYKEA 299
            A PA+          QP+Q      AE   +   Y            Y  A
Sbjct: 386 YAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA 437



 Score = 30.4 bits (68), Expect = 1.7
 Identities = 22/99 (22%), Positives = 28/99 (28%), Gaps = 2/99 (2%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
           P      +  +  +   Q    Q P  P+    +   P  P P       P  P      
Sbjct: 747 PIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ 806

Query: 222 P-GPSISGPLIPGPSIPSFPSPAVT-PSSPSTPAPQPAM 258
           P  P    P    P  P  P P    P  P  P PQ  +
Sbjct: 807 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTL 845



 Score = 27.7 bits (61), Expect = 9.8
 Identities = 29/122 (23%), Positives = 43/122 (35%), Gaps = 8/122 (6%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
           + P+   P+  +  +  +   Q    Q P  P+    +   P  P P       P  P  
Sbjct: 782 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ---PQQPVA 838

Query: 218 PQNLPGPSISGPLI--PGPSIP-SFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEA 274
           PQ  P  ++  PL+   G S P   P+  +      TP P    P      EQ    +EA
Sbjct: 839 PQ--PQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEA 896

Query: 275 EL 276
            L
Sbjct: 897 RL 898


>gnl|CDD|132858 cd07219, Pat_PNPLA1, Patatin-like phospholipase domain containing
           protein 1.  Members of this family share a patatin
           domain, initially discovered in potato tubers. Some
           members of PNPLA1 subfamily do not have the lipase
           consensus sequence Gly-X-Ser-X-Gly which is essential
           for hydrolase activity.  This family includes PNPLA1
           from Homo sapiens and Gallus gallus. Currently, there is
           no literature available on the physiological role,
           structure, or enzymatic activity of PNPLA1. It is
           expressed in various human tissues in low mRNA levels.
          Length = 382

 Score = 31.0 bits (70), Expect = 0.89
 Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 12/88 (13%)

Query: 173 GPSLSSQNSNDQLPSPPREN----SNEDVTPSFPAPTSSGG-TLPSPPSMPQNLP--GPS 225
           G +   ++    L   P+ +     +  ++P   AP S+    + SP S P +     PS
Sbjct: 296 GQASLEESWQPHLARAPKGDGRGLHDPPLSPPLAAPESTAEWVVESPVSSPASPLESSPS 355

Query: 226 ISGPLIPGPSIPSFPS----PAVTPSSP 249
           + G L       S P+    P+ TP   
Sbjct: 356 LPGSLTDLSPA-SLPAVHSLPSSTPGLS 382


>gnl|CDD|225599 COG3057, SeqA, Negative regulator of replication initiationR [DNA
           replication, recombination, and repair].
          Length = 181

 Score = 30.2 bits (68), Expect = 0.97
 Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 10/72 (13%)

Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKFASSALNYDDYKEAK 300
           S A   ++P+T  P+ A+PA V          EA+     +  ++     L  D+Y E K
Sbjct: 36  SAASQTAAPATKEPRVAVPAKV----------EAKPTKTIKDKVRAMRELLLSDEYAEQK 85

Query: 301 SNLIKVLNILNT 312
             + + + IL+T
Sbjct: 86  RAVNRFMLILST 97


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 30.8 bits (69), Expect = 0.97
 Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 21/118 (17%)

Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRE-----NSNEDVTPSFPAPTSSGGTLPSPPS 216
           PGP +  + S+ P+LS +    + P  P++           +   P       T P  P 
Sbjct: 561 PGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRP-------TRPKSPK 613

Query: 217 MPQNL--------PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
           +P+ L        P    S    P P  PS P     P    +P P P  P     P+
Sbjct: 614 LPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP-PKSPKPPFDPK 670



 Score = 30.4 bits (68), Expect = 1.5
 Identities = 27/119 (22%), Positives = 37/119 (31%), Gaps = 9/119 (7%)

Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL-PS 213
           K  E P  GP  SG   + P        +   S   +   E   P        G    P+
Sbjct: 506 KHDEPP-EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPA 564

Query: 214 PPSMPQNLPGPSI--SGPLIPG----PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
               P  +P  S     P  P     P  P  P    +   P+ P   P +P L+  P+
Sbjct: 565 KEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRP-KSPKLPELLDIPK 622


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 29.5 bits (66), Expect = 1.3
 Identities = 31/121 (25%), Positives = 39/121 (32%), Gaps = 14/121 (11%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSG------- 208
            G   IP PP     +  P          +P PP       + P  P P  +G       
Sbjct: 43  PGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPP 102

Query: 209 ----GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
                 +P PP +P    GP +  P  P P  P  P P     SP  P     +PA    
Sbjct: 103 LPGGAAVPPPPPLP---GGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVL 159

Query: 265 P 265
           P
Sbjct: 160 P 160



 Score = 29.5 bits (66), Expect = 1.4
 Identities = 30/109 (27%), Positives = 35/109 (32%), Gaps = 6/109 (5%)

Query: 157 GETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
           G   IP PP    G   P          +P PP       + P  P P      +P PP 
Sbjct: 8   GGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPP--PPPLPGVAGIPPPPP 65

Query: 217 MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
               LPG +   P  P P     P P   P     P P P +P     P
Sbjct: 66  ----LPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVP 110


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 29.4 bits (66), Expect = 1.3
 Identities = 12/48 (25%), Positives = 19/48 (39%)

Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
           P  P   Q+      + P    P++P  P+ AV   + + P   P  P
Sbjct: 90  PVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 1/54 (1%)

Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
           PP+ P+ +  P+   P    P+ P   +PAV P +P+   P PA     + P Q
Sbjct: 84  PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAV-PPAPAAAVPAPAAAPPPSDPPQ 136



 Score = 29.0 bits (65), Expect = 2.0
 Identities = 14/53 (26%), Positives = 18/53 (33%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
           PAP        +    P     P  S P +P     + P+PA  P     P P
Sbjct: 85  PAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137



 Score = 27.5 bits (61), Expect = 5.3
 Identities = 13/54 (24%), Positives = 18/54 (33%)

Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
           P  P  P   P      P +P P   S P+    P++          P+   QP
Sbjct: 84  PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 22/79 (27%), Positives = 29/79 (36%), Gaps = 24/79 (30%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPA-VT 245
           S P  +++   T S+P P+ S     SP                 PG SI S PS   V 
Sbjct: 77  SVPSGDTS---TYSYPVPSPS----YSPS----------------PGSSIQSLPSTTGVR 113

Query: 246 PSSPSTPAPQPAMPALVTQ 264
           P S +  A       LV +
Sbjct: 114 PQSSAENANSEKRKLLVIR 132


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 29.5 bits (65), Expect = 1.4
 Identities = 25/98 (25%), Positives = 33/98 (33%), Gaps = 6/98 (6%)

Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSIS 227
           G+G +G  + + ++N   P    E   E      P P   G   P P    +  P P  +
Sbjct: 42  GKGEKGTKVGADDTNGTDPDDEPEEEEE------PEPEEEGEEEPEPEEEGEEEPEPEET 95

Query: 228 GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
           G   P P     P P   P     P P  A    V  P
Sbjct: 96  GEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 17/74 (22%), Positives = 24/74 (32%), Gaps = 6/74 (8%)

Query: 194 NEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPA 253
                P   A        P P    Q LP            + P+    A  P++ + P 
Sbjct: 51  PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQ-----QQHNTPTGSPAANPPATFALP- 104

Query: 254 PQPAMPALVTQPEQ 267
             PA P + T+P Q
Sbjct: 105 AGPAGPTIQTEPGQ 118


>gnl|CDD|150071 pfam09279, efhand_like, Phosphoinositide-specific phospholipase
          C, efhand-like.  Members of this family are
          predominantly found in phosphoinositide-specific
          phospholipase C. They adopt a structure consisting of a
          core of four alpha helices, in an EF like fold, and are
          required for functioning of the enzyme.
          Length = 83

 Score = 28.3 bits (64), Expect = 1.5
 Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 50 SSDGAKMLLLKLMDWLEQEKKTHRDNDTIMNETAAQAYIENY 91
          SS+   +   +L  +L +E+K     D  + E  A A IE Y
Sbjct: 10 SSNKGVLTAEELRRFLNEEQK-----DPRLTEEQALAIIEKY 46


>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
           This family consists of several Neisseria meningitidis
           TspB virulence factor proteins.
          Length = 502

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 1/71 (1%)

Query: 196 DVTP-SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
           D+TP S  AP +      SP   P N P P  +    P P      +P   P +   P  
Sbjct: 318 DLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQPGT 377

Query: 255 QPAMPALVTQP 265
           +P  PA+  +P
Sbjct: 378 RPDSPAVPDRP 388


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 17/85 (20%), Positives = 23/85 (27%), Gaps = 15/85 (17%)

Query: 176 LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPS 235
           L  +N+   +P PP+        P  P P +  G  P  P                PG  
Sbjct: 193 LHDRNNWPTIPLPPQMAGQSMYQPPGPYPNAMVGRQPFYPQ---------------PGAV 237

Query: 236 IPSFPSPAVTPSSPSTPAPQPAMPA 260
                      +     A Q A  A
Sbjct: 238 AGPPKRRGGHKAPRGHRASQAAGQA 262


>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut.  This
           domain is often found at the very N-terminal of
           argonaut-like proteins.
          Length = 102

 Score = 28.4 bits (63), Expect = 1.9
 Identities = 14/73 (19%), Positives = 23/73 (31%), Gaps = 4/73 (5%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS----TPAPQPA 257
           P      G              P  S P +   +   + +   T  +PS    +  P  +
Sbjct: 14  PPQQGGRGGGGGGRGGGSTGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPES 73

Query: 258 MPALVTQPEQEIS 270
               VTQ  Q++S
Sbjct: 74  SSLQVTQQFQQLS 86


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 30.0 bits (67), Expect = 1.9
 Identities = 19/50 (38%), Positives = 22/50 (44%), Gaps = 10/50 (20%)

Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
           +PPS P   P P       P P+    P P  TP    TPA QP + A V
Sbjct: 919 TPPSPPDPDPTPD------PDPT----PDPDPTPDPEPTPAYQPVLNAKV 958


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 15/61 (24%), Positives = 19/61 (31%)

Query: 205 TSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
            +   T P P S   N   P          S P   +PA  P+ P   AP      +   
Sbjct: 53  VTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTVAKD 112

Query: 265 P 265
           P
Sbjct: 113 P 113


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 29.8 bits (67), Expect = 2.1
 Identities = 19/94 (20%), Positives = 27/94 (28%), Gaps = 27/94 (28%)

Query: 154 LKSGETPIPGPPESGEGSEGP----SLSSQNSNDQLPSPPRENSNEDVTPSFPAPT---- 205
           L +G     G P+  EG   P    S  S       P P   ++     P +        
Sbjct: 9   LGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRH 68

Query: 206 -------------------SSGGTLPSPPSMPQN 220
                              + GG LP PP  P++
Sbjct: 69  GGYQPLGQQDPSLYAGLGQNGGGGLPPPPYSPRD 102


>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
           outer membrane].
          Length = 352

 Score = 29.5 bits (66), Expect = 2.2
 Identities = 9/37 (24%), Positives = 13/37 (35%)

Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQP 256
                       PG + P  P+P  TP+ P    P+ 
Sbjct: 187 AATPNDDKNVAQPGTTTPKTPTPQTTPNKPKVENPKT 223


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 29.5 bits (66), Expect = 2.3
 Identities = 26/90 (28%), Positives = 31/90 (34%), Gaps = 13/90 (14%)

Query: 182 NDQLPSPPRENSNEDVTPS----FPAPTSSGGTLPSPPSMPQNLPGPSISG--------P 229
           N Q P PP  N  +D  P+    FP          + P  P N+PG S S         P
Sbjct: 325 NPQDPVPP-PNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPP 383

Query: 230 LIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
             P P IP         S+   P   P  P
Sbjct: 384 NPPNPDIPEQEPNIPEDSNKEVPEDVPMEP 413


>gnl|CDD|218158 pfam04580, Pox_D3, Chordopoxvirinae D3 protein.  Chordopoxvirinae
           D3 protein conserved region. Region occupies entire
           length of D3 protein.
          Length = 246

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 2/50 (4%)

Query: 265 PEQEISYIEAELINKCQKYIK--FASSALNYDDYKEAKSNLIKVLNILNT 312
           PE EI      LI + +   K     S +NY  +K   S L ++LN + +
Sbjct: 133 PEYEIFLTNPFLIKQNRDIFKNLILRSEINYFIFKTKNSPLKELLNHIVS 182


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 29.2 bits (65), Expect = 2.4
 Identities = 25/98 (25%), Positives = 35/98 (35%), Gaps = 13/98 (13%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQ-LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
           E  +     S + S+  S S +  +    P  P    N    P FP         P PP 
Sbjct: 132 EANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWN----PRFPPGPP-----PPPPG 182

Query: 217 MPQNLPGPSISGPLIPG--PSIPSFPSPAVTPSSPSTP 252
             ++   PS   P + G  P  P  P P + P  P +P
Sbjct: 183 FGRHGEKPSGWPPFLSGWPPPFPLGP-PMIPPPPPMSP 219


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 29.4 bits (65), Expect = 2.4
 Identities = 35/128 (27%), Positives = 48/128 (37%), Gaps = 11/128 (8%)

Query: 134 EDIEQNKKYA-KWKAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPREN 192
            D++   KY+ K    Y  + LK+    +  P  SG+    PS +        P+     
Sbjct: 45  ADVDPLDKYSVKEAGRYYQSRLKANSACMQRP--SGQSPLAPSPACAAPAPACPACAPAA 102

Query: 193 SNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP 252
               VT   PAP     T P+ P      P      P  P P+ P  PS    P +P  P
Sbjct: 103 PAPAVTCPAPAPACPPATAPTCP------PPAVCPAPARPAPACP--PSTRQCPPAPPLP 154

Query: 253 APQPAMPA 260
            P+PA  A
Sbjct: 155 TPKPAPAA 162


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 27/120 (22%), Positives = 38/120 (31%), Gaps = 14/120 (11%)

Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLPS 213
           KSG   +P   +S +          +S D   S  P+     D      AP     T   
Sbjct: 221 KSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPID--TTEE 278

Query: 214 PPSMPQNLPG-----------PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
               P+  P            PS+  P     S     SP  +P  P +P P+P  P + 
Sbjct: 279 ELKSPEASPKESEEASARKRSPSLLSPSPKAESPKPLASPGKSPRDPLSPRPKPQSPPVN 338



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 29/130 (22%), Positives = 42/130 (32%), Gaps = 8/130 (6%)

Query: 146 KAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT 205
           KA ++ + L   E+P   P    +      LS    +       R NS ++VTP     T
Sbjct: 150 KATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRT 209

Query: 206 SSGGTLPSPPSMPQNLPGPSISG--------PLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
              G+    PS       PS           P     S+ +  S A  P     P     
Sbjct: 210 PPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEK 269

Query: 258 MPALVTQPEQ 267
            P + T  E+
Sbjct: 270 APPIDTTEEE 279


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.8 bits (66), Expect = 2.5
 Identities = 30/129 (23%), Positives = 40/129 (31%), Gaps = 19/129 (14%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQ--------NSNDQLPSPPRENSNEDVTPSFPAPTSS 207
            G T     P +G  S   + S Q        N+N  + +         VT       SS
Sbjct: 533 IGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSS 592

Query: 208 GGT-LPSPPSMPQNLP---GPSISGPLIPGPSIPSFPSPAVTPSSPST-------PAPQP 256
             +  P  PS   + P     S +  L              TPS PST       P P P
Sbjct: 593 PTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP 652

Query: 257 AMPALVTQP 265
              + V+ P
Sbjct: 653 GTTSQVSGP 661



 Score = 28.6 bits (63), Expect = 6.0
 Identities = 18/106 (16%), Positives = 33/106 (31%), Gaps = 5/106 (4%)

Query: 156 SGETPIPGPPESGE--GSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS 213
           +G T     P SG   G+   +L    S     +    N+            +S  T  +
Sbjct: 460 TGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKT 519

Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
             +     P P + G      +  S P+   +  + ++P      P
Sbjct: 520 SDTPNATSPTPIVIGVT---TTATSPPTGTTSVPNATSPQVTEESP 562


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 29.6 bits (66), Expect = 2.5
 Identities = 19/97 (19%), Positives = 31/97 (31%), Gaps = 18/97 (18%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
           +  +   P++  G          ++ Q P  P     + +  S PA +         P +
Sbjct: 365 KVAVIAAPQTHTGP---------ADRQRPQRP-----DGIPYSVPARSPMTA----YPPV 406

Query: 218 PQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
           PQ    P +  P  P     S+    V P  P    P
Sbjct: 407 PQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNP 443



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 21/114 (18%), Positives = 28/114 (24%), Gaps = 11/114 (9%)

Query: 161 IPGPPESGEGSEGPSLSSQNS-NDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS-----P 214
           I G     E  +  SL++ +             + +  T + PA               P
Sbjct: 338 INGLKAHNEILKTASLTAPSRVLAAAAKVAVIAAPQ--THTGPADRQRPQRPDGIPYSVP 395

Query: 215 PSMPQNLPGPSISGPLIPGPSIPSFPSPAVT--PSSPSTPAPQPAMPALVTQPE 266
              P     P       PG   P  P    T     P  P P P        P 
Sbjct: 396 ARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP-PQPTNPYVMPI 448


>gnl|CDD|137339 PRK09499, sifB, secreted effector protein SifB; Provisional.
          Length = 316

 Score = 29.2 bits (65), Expect = 2.6
 Identities = 31/112 (27%), Positives = 48/112 (42%), Gaps = 21/112 (18%)

Query: 61  LMDWLEQEKK----------------THRDNDTIMNETAAQAYIENYALKLFQWADSMDR 104
           L DW EQE+K                T  D D I N+ AA+  IEN  LK   + +    
Sbjct: 192 LCDWKEQERKAAISSRINLGIAQAGVTAID-DAIKNKIAAKV-IENTNLKNAAF-EPNYA 248

Query: 105 ASTFNKNVVKAFYTASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKS 156
            S+  + V    +   ILM++L+    +S  +    +  ++ A  +HN L S
Sbjct: 249 QSSVTQIVYSCLFKNEILMNMLEE--SSSHGLLCLNELTEYVALQVHNSLFS 298


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.6 bits (66), Expect = 2.7
 Identities = 28/104 (26%), Positives = 37/104 (35%), Gaps = 4/104 (3%)

Query: 162 PGPPESGEGS--EGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
           P PPES   S  + PS   Q S++  P  P   ++E    S   P S   T   PP  P 
Sbjct: 149 PAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPP 208

Query: 220 NLPGP--SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
           + PG   S +    P P+          P+ P    P       
Sbjct: 209 DEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252


>gnl|CDD|225145 COG2236, COG2236, Predicted phosphoribosyltransferases [General
           function prediction only].
          Length = 192

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 15/73 (20%), Positives = 23/73 (31%), Gaps = 4/73 (5%)

Query: 43  KALKIDKSSD-GAKMLLLK--LMDWLEQEKKTHRDNDTIMNETAAQAYIENYALKLFQWA 99
           K L +D   D G  + L    L      E +T        +       +  +A  +F W 
Sbjct: 89  KVLIVDDIVDTGETLELALEELKKLAPAEVRTAVLQY-KKSPDYYGEEVTAWAWIIFPWN 147

Query: 100 DSMDRASTFNKNV 112
              D A    K +
Sbjct: 148 RKEDLADLILKIL 160


>gnl|CDD|224433 COG1516, FliS, Flagellin-specific chaperone FliS [Cell motility and
           secretion / Intracellular trafficking and secretion /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 132

 Score = 28.4 bits (64), Expect = 2.9
 Identities = 9/36 (25%), Positives = 19/36 (52%)

Query: 276 LINKCQKYIKFASSALNYDDYKEAKSNLIKVLNILN 311
           L     K++K A  A+  +D +E   ++ K ++I+ 
Sbjct: 27  LYEGALKFLKRAKEAIEQEDIEEKNESIDKAIDIIT 62


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 29.1 bits (65), Expect = 3.0
 Identities = 21/117 (17%), Positives = 40/117 (34%), Gaps = 15/117 (12%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQL------------PSPPRENSNEDVTPSFPAPTSS 207
            +  PP S   ++G    +Q+   ++            P    + +N  VT + P   ++
Sbjct: 75  DLSLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPAT 134

Query: 208 GGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP-QPAMPALVT 263
               P   +       P+I+ P+           P    ++ +T A  Q A PA   
Sbjct: 135 --VAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTASPAQTP 189


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 15/67 (22%), Positives = 19/67 (28%), Gaps = 11/67 (16%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
                P     T+  P    Q  P P +   L+P    P           PS        
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEP-----------PSYEETIKPG 158

Query: 259 PALVTQP 265
           PA V +P
Sbjct: 159 PAPVEEP 165



 Score = 27.8 bits (62), Expect = 7.4
 Identities = 16/104 (15%), Positives = 30/104 (28%), Gaps = 6/104 (5%)

Query: 164 PPESGEGSEGPSLSS-QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
             E     +  +++    +    P P  E       P  P+   +    P+P   P +  
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169

Query: 223 GPSISGPLIPG----PSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
             ++     P     P  P  P P V P   +    + +     
Sbjct: 170 AIAVPAIDTPVTLELPPAPQ-PPPPVVPQPSTMVVHRRSRIKRT 212


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 25/98 (25%), Positives = 39/98 (39%), Gaps = 15/98 (15%)

Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
            T++G  +P  P+ P   PGP   G   PG +  SFP       S + P P    P    
Sbjct: 128 VTATGEPVPQMPASP---PGPE--GEPQPGNTPVSFPP----QGSVAVPPPTAFYPGNGV 178

Query: 264 QPEQEISYIEAELINKCQKYIKFASSALNYDDYKEAKS 301
            P  +++Y    + N+ Q           Y++ K+  S
Sbjct: 179 TPPPQVTYQSVPVPNRIQ------RKTFTYNEGKKGPS 210


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 28.3 bits (63), Expect = 3.2
 Identities = 15/60 (25%), Positives = 18/60 (30%), Gaps = 4/60 (6%)

Query: 158 ETPIPGPPESGEGSEGPSLSSQ-NSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
           ET       S   S   S S +  S    P   R +   D T  FP PT+          
Sbjct: 58  ETAASSKAPS---SSSKSSSQETISIPPTPPARRPSRRWDQTGRFPNPTTGAKPTLRAAR 114


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 29.3 bits (65), Expect = 3.2
 Identities = 13/67 (19%), Positives = 23/67 (34%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
             P+S+     +       +P P++    +   S  +  + AV  SS         +P  
Sbjct: 304 VTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGT 363

Query: 262 VTQPEQE 268
           V  P  E
Sbjct: 364 VALPAAE 370



 Score = 27.8 bits (61), Expect = 8.5
 Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 5/95 (5%)

Query: 161 IPGPPESGEGSEGPSLSS-QNSNDQLPSPPRENSNEDVTPSFPAP-TSSGGTLPSPPSMP 218
           IP P          S ++ Q S   L S     S+  +  +   P        P P S  
Sbjct: 323 IPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTT 382

Query: 219 QNLP---GPSISGPLIPGPSIPSFPSPAVTPSSPS 250
           +      G   S    P  S+P+ P+  +  S  S
Sbjct: 383 ETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 28.6 bits (64), Expect = 3.5
 Identities = 16/50 (32%), Positives = 19/50 (38%), Gaps = 2/50 (4%)

Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSS 248
              P  +SS GT  + PS       P+ S P  PG        PA TP  
Sbjct: 141 YYSPKHSSSQGTSTTRPSDGSA--TPNTSAPPTPGNPAAQPEKPAETPKG 188


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 29.3 bits (65), Expect = 3.5
 Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSP-PSMPQNLPGPSISGPL-IPGPSIPSFPSPAV 244
           SP   +S+        A  SS        P +PQ  P     GP+ IP P  P   +   
Sbjct: 330 SPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPS 389

Query: 245 TPSSPSTPAPQPAMPALVTQP 265
             + P   AP    P+ +  P
Sbjct: 390 LVAPPGLVAPTEINPSFLASP 410


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 28.7 bits (64), Expect = 3.5
 Identities = 25/69 (36%), Positives = 27/69 (39%), Gaps = 1/69 (1%)

Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
           PSPPR  S    TP      S   T PSPP      P    +GP  P       P   VT
Sbjct: 9   PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCP-AGVT 67

Query: 246 PSSPSTPAP 254
            SS + P P
Sbjct: 68  FSSSAPPRP 76



 Score = 28.3 bits (63), Expect = 6.0
 Identities = 21/91 (23%), Positives = 29/91 (31%), Gaps = 9/91 (9%)

Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
           +P P    S      P     +  +  PSPP         P    P        +P   P
Sbjct: 8   SPSPPRRPSPPRPTPPRSPDASPEETPPSPP--------GPGAEPPPGRAAGPAAPRRRP 59

Query: 219 QNLP-GPSISGPLIPGPSIPSFPSPAVTPSS 248
           +  P G + S    P P +    +PA TP  
Sbjct: 60  RGCPAGVTFSSSAPPRPPLGLDDAPAATPPP 90


>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
          Length = 746

 Score = 29.2 bits (65), Expect = 3.7
 Identities = 16/70 (22%), Positives = 22/70 (31%), Gaps = 2/70 (2%)

Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP 214
           ++G  P  G       S  P          LP+PP   +        P        LP+ 
Sbjct: 415 RTGSCPPQGATFGRNPSGFPGQFCCPPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPAS 474

Query: 215 PSMPQNLPGP 224
           P  P N+  P
Sbjct: 475 P--PSNIVSP 482


>gnl|CDD|218994 pfam06333, Med13_C, Mediator complex subunit 13 C-terminal.
           Mediator is a large complex of up to 33 proteins that is
           conserved from plants through fungi to humans - the
           number and representation of individual subunits varying
           with species. It is arranged into four different
           sections, a core, a head, a tail and a kinase-activity
           part, and the number of subunits within each of these is
           what varies with species. Overall, Mediator regulates
           the transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Med13 is part of the
           ancillary kinase module, together with Med12, CDK8 and
           CycC, which in yeast is implicated in transcriptional
           repression, though most of this activity is likely
           attributable to the CDK8 kinase. The large Med12 and
           Med13 proteins are required for specific developmental
           processes in Drosophila, zebrafish, and Caenorhabditis
           elegans but their biochemical functions are not
           understood.
          Length = 406

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 19/94 (20%), Positives = 31/94 (32%), Gaps = 16/94 (17%)

Query: 192 NSNEDVTPSFPAPTSSGGTL-PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
           +++  + P  P           S  S PQ+ P               SFPSP    ++P+
Sbjct: 226 DTDPKLVPDPPTQPIPFSRFNTSNSSTPQSTPCTL------------SFPSPDQFGNAPT 273

Query: 251 TP--APQPAMPALVTQPEQ-EISYIEAELINKCQ 281
            P           VT P     S  ++ L++   
Sbjct: 274 PPLGENGNDNAPPVTPPGSPSESDPDSILVDIDD 307


>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
           protein.  This model is designed to identify biotin
           carboxyl carrier protein as a peptide of acetyl-CoA
           carboxylase. Scoring below the trusted cutoff is a
           related protein encoded in a region associated with
           polyketide synthesis in the prokaryote Saccharopolyspora
           hirsuta, and a reported chloroplast-encoded biotin
           carboxyl carrier protein that may be highly derived from
           the last common ancestral sequence. Scoring below the
           noise cutoff are biotin carboxyl carrier domains of
           other enzymes such as pyruvate carboxylase.The gene name
           is accB or fabE [Fatty acid and phospholipid metabolism,
           Biosynthesis].
          Length = 156

 Score = 28.3 bits (63), Expect = 4.0
 Identities = 8/46 (17%), Positives = 14/46 (30%), Gaps = 1/46 (2%)

Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
             +     +   + P  P+      S   PAP    PA     ++ 
Sbjct: 36  AAAAKKSAVQQAAAPV-PAQVPAAPSAQAPAPAVCAPAPAKADKKG 80


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 28.6 bits (64), Expect = 4.6
 Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 8/106 (7%)

Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT-------SSG 208
           S  +    PPE+   S   S   + S       P  + +   TP+  AP+       S G
Sbjct: 133 SSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDG 192

Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
             LP+P   PQ    P      +P  +   + +      S S    
Sbjct: 193 FQLPAPKR-PQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLL 237


>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
           [Chromatin structure and dynamics].
          Length = 1163

 Score = 28.7 bits (64), Expect = 4.7
 Identities = 23/106 (21%), Positives = 34/106 (32%), Gaps = 1/106 (0%)

Query: 182 NDQLPSPPR-ENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP 240
           N  LPS  + E S      S    T+  G L SPP        P         P++PS  
Sbjct: 188 NTFLPSGYKIEGSLPQPNGSRLHVTTPQGPLSSPPVQSSYYVAPCNHDQRTSHPTLPSDS 247

Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKF 286
            P  +  S      +        Q    ++ ++    N  + Y  F
Sbjct: 248 QPEPSAPSHMPSDARGKHQVDFNQAIIFVNKVKVRFQNNPEMYYDF 293


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 28.6 bits (64), Expect = 4.7
 Identities = 14/74 (18%), Positives = 19/74 (25%), Gaps = 7/74 (9%)

Query: 185 LPSPPRENSNEDVTPSF-PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPA 243
                ++  N    PS     T     LP  P +P  LP           P +P      
Sbjct: 45  ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDL------GAPVLPDQQVEE 98

Query: 244 VTPSSPSTPAPQPA 257
                    A + A
Sbjct: 99  AKDQPRRLRAAELA 112


>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein.  This family includes the
           hamartin protein which is thought to function as a
           tumour suppressor. The hamartin protein interacts with
           the tuberin protein pfam03542. Tuberous sclerosis
           complex (TSC) is an autosomal dominant disorder and is
           characterized by the presence of hamartomas in many
           organs, such as brain, skin, heart, lung, and kidney. It
           is caused by mutation either TSC1 or TSC2 tumour
           suppressor gene. TSC1 encodes a protein, hamartin,
           containing two coiled-coil regions, which have been
           shown to mediate binding to tuberin. The TSC2 gene codes
           for tuberin pfam03542. These two proteins function
           within the same pathway(s) regulating cell cycle, cell
           growth, adhesion, and vesicular trafficking.
          Length = 667

 Score = 28.8 bits (64), Expect = 4.7
 Identities = 28/145 (19%), Positives = 45/145 (31%), Gaps = 13/145 (8%)

Query: 119 ASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPPESG-EGSEGPSLS 177
           A + +D  +T  E      ++  Y    +        +    I  P  +   GS   + +
Sbjct: 259 AKLSLDPTETSSEDGYSFSRSSAYP-NSSPRQ-----ALPPSISLPQNTSTSGSLHSAQT 312

Query: 178 SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP-GPSISGPLIPGPSI 236
           S+  N          + +D   S  +         S    P  L   PS      PG + 
Sbjct: 313 SRRPNTTFDKAASSGT-KDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTG 371

Query: 237 PSFPSPAVTPSSPSTPAPQPAMPAL 261
                    P+S STP+  P  P L
Sbjct: 372 SGKG----EPASESTPSTSPPPPGL 392


>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
          Length = 544

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 2/73 (2%)

Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
               P     +L S PS  Q    PS  G  +P P   +  + +  P      A    +P
Sbjct: 21  PISPPPPHSSSLSSSPSPTQRFLTPS-QGSRLP-PRRRALCTKSQDPRWRRAMASLAVIP 78

Query: 260 ALVTQPEQEISYI 272
            +       +SY+
Sbjct: 79  NMQDSGSPPLSYL 91


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 28.6 bits (64), Expect = 4.8
 Identities = 16/62 (25%), Positives = 22/62 (35%), Gaps = 3/62 (4%)

Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
            G   P   +     P    + P  P  + P  P  A    + + PAP PA PA    P 
Sbjct: 379 RGAPAPPSAAWGAPTPAAP-AAP--PPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA 435

Query: 267 QE 268
           + 
Sbjct: 436 RS 437


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 28.9 bits (64), Expect = 5.1
 Identities = 23/116 (19%), Positives = 38/116 (32%), Gaps = 23/116 (19%)

Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP------ 214
           +PG  + G  +  P          +P+P    +   V P   + +   G   +P      
Sbjct: 484 LPGVVQDGRPACAP----------VPAP----AGPIVRPWEASLSQVPGVAFAPVMPQPM 529

Query: 215 PSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP---APQPAMPALVTQPEQ 267
           P  P  +P  ++  P+ P P + +   P  T           P P  P     P Q
Sbjct: 530 PVEPVPVPTVALERPVCPAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQ 585


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 28.4 bits (63), Expect = 5.1
 Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 5/80 (6%)

Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSP----PSMPQNLPGPSISGPLIPGPSIPSFP 240
             +P R  S         A  ++  T P       ++P+ L    ++   +P   + +  
Sbjct: 126 AAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPVPRAPVQAPV 185

Query: 241 SPAVTPSSPSTPAPQPAMPA 260
             A  P  P+  A   A  A
Sbjct: 186 VAAPAP-VPAIAAALAAHAA 204


>gnl|CDD|132971 cd06640, STKc_MST4, Catalytic domain of the Protein
           Serine/Threonine Kinase, Mammalian Ste20-like protein
           kinase 4.  Serine/threonine kinases (STKs), mammalian
           Ste20-like protein kinase 4 (MST4) subfamily, catalytic
           (c) domain. STKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to serine/threonine
           residues on protein substrates. The MST4 subfamily is
           part of a larger superfamily that includes the catalytic
           domains of other protein STKs, protein tyrosine kinases,
           RIO kinases, aminoglycoside phosphotransferase, choline
           kinase, and phosphoinositide 3-kinase. MST4 is sometimes
           referred to as MASK (MST3 and SOK1-related kinase). It
           plays a role in mitogen-activated protein kinase (MAPK)
           signaling during cytoskeletal rearrangement,
           morphogenesis, and apoptosis. It influences cell growth
           and transformation by modulating the extracellular
           signal-regulated kinase (ERK) pathway. MST4 may also
           play a role in tumor formation and progression. It
           localizes in the Golgi apparatus by interacting with the
           Golgi matrix protein GM130 and may play a role in cell
           migration.
          Length = 277

 Score = 28.1 bits (62), Expect = 5.6
 Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 18/90 (20%)

Query: 3   IQLPPCPDSLKAIQHFLNVASEHDNRDLTVSY---------WCRLHALQKALKIDKSSDG 53
           I L    D ++ IQ  + V S+ D+  +T  Y         W  +  L     +D    G
Sbjct: 37  IDLEEAEDEIEDIQQEITVLSQCDSPYVTKYYGSYLKGTKLWIIMEYLGGGSALDLLRAG 96

Query: 54  -------AKML--LLKLMDWLEQEKKTHRD 74
                  A ML  +LK +D+L  EKK HRD
Sbjct: 97  PFDEFQIATMLKEILKGLDYLHSEKKIHRD 126


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 28.1 bits (62), Expect = 5.6
 Identities = 25/94 (26%), Positives = 33/94 (35%), Gaps = 11/94 (11%)

Query: 173 GPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIP 232
            P L+   +N    + P    N  + P  P         P  P  PQ L  P+      P
Sbjct: 9   PPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQ-------PGQPGAPQQLAIPTQQ----P 57

Query: 233 GPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
            P   S  +P V   +P+ PAP     A    PE
Sbjct: 58  QPVPTSAMTPHVVQQAPAQPAPAAPPAAGAALPE 91


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 28.0 bits (62), Expect = 6.2
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query: 224 PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINK 279
           P +S      PS  + PS +   S+  +P P     + +  P+   +   + +INK
Sbjct: 170 PDVSSTSEHIPSSNTSPSTSEPVSTLRSPTPSSKSSSSLGLPDLSSTSDWSGMINK 225


>gnl|CDD|212073 cd11504, SLC5sbd_SMVT, Na(+)/multivitamin cotransporter SMVT and
           related proteins; solute-binding domain.  This
           multivitamin transporter SMVT (product of the SLC5A6
           gene) transports biotin, pantothenic acid and lipoate,
           and is essential for mediating biotin uptake into
           mammalian cells. SMVT is expressed in the placenta,
           intestine, heart, brain, lung, liver, kidney and
           pancreas. Biotin may regulate its own cellular uptake
           through participation in holocarboxylase
           synthetase-dependent chromatin remodeling events at SMVT
           promoter loci. The cis regulatory elements, Kruppel-like
           factor 4 and activator protein-2, regulate the activity
           of the human SMVT promoter in the intestine.
           Glycosylation of the hSMVT is important for its
           transport function. This subgroup belongs to the solute
           carrier 5 (SLC5) transporter family.
          Length = 527

 Score = 28.3 bits (63), Expect = 6.4
 Identities = 12/55 (21%), Positives = 23/55 (41%), Gaps = 3/55 (5%)

Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
           G+       P     P I+  + P  +  +  +P    ++P+ P P P + AL +
Sbjct: 450 GSFVFRMYSP--TKSPPINDTIFPNVTNATAATPTTLFATPTDPRP-PGLQALYS 501


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 28.3 bits (63), Expect = 6.5
 Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 3/52 (5%)

Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP--APQPAMPAL 261
           P P +   ++  P +  P+ P P+       A  PS+   P  AP PA P  
Sbjct: 64  PLPAAAE-SIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVE 114


>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Protein of this family often fused to HTH domain; Some
           proteins could have an additional fusion with
           RecB-family nuclease domain; Core domain appears to have
           a Rossmann-like fold; loosely associated with CRISPR/Cas
           systems; also known as DxTHG family.
          Length = 400

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 1/49 (2%)

Query: 265 PEQEISYIEAELINKCQKYIKFASSALNYDDYKEAKSNLIKVLNILNTG 313
           P  EISY++ + INK +   K            E    +IK  + L  G
Sbjct: 193 PVTEISYLKEKFINKYKDNNKDLKRYFKK-ALNEGLEKIIKFASALYNG 240


>gnl|CDD|215732 pfam00121, TIM, Triosephosphate isomerase. 
          Length = 242

 Score = 27.6 bits (62), Expect = 7.1
 Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 10/35 (28%)

Query: 127 QTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPI 161
           Q FGET E +   KK        +   LK+G TP+
Sbjct: 97  QYFGETDEIV--AKK--------VKAALKNGLTPV 121


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 28.1 bits (63), Expect = 7.5
 Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 7/56 (12%)

Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
           PA  +     P+P  M       S++    P  +  +   P   P++P  P  +  
Sbjct: 272 PATAA-----PAPSRMTDTNDSKSVTSQ--PAAAAAAPAPPPNPPATPPEPPARRG 320


>gnl|CDD|221617 pfam12517, DUF3720, Protein of unknown function (DUF3720).  This
           domain family is found in eukaryotes, and is
           approximately 100 amino acids in length. There are two
           completely conserved A residues that may be functionally
           important.
          Length = 98

 Score = 26.6 bits (59), Expect = 7.9
 Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 5/64 (7%)

Query: 166 ESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPS 225
           E+G+   G     +++ +  P    +N     T    AP   GG +   P+     PGP 
Sbjct: 7   EAGQELSGVPDPLEDTKESSP----DNKLGANTEGSGAPGLGGGGVAGQPA-AALPPGPG 61

Query: 226 ISGP 229
            SG 
Sbjct: 62  GSGT 65


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 9/57 (15%), Positives = 19/57 (33%), Gaps = 1/57 (1%)

Query: 213 SPPSMPQNLPGPSISGPLIPGPS-IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
            P    + +   +    L PG +   +  + + T +  + P   P   A   +   E
Sbjct: 182 DPDFWSKQVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATE 238


>gnl|CDD|240628 cd05303, PGDH_2, Phosphoglycerate dehydrogenase (PGDH) NAD-binding
           and catalytic domains.  Phosphoglycerate dehydrogenase
           (PGDH) catalyzes the initial step in the biosynthesis of
           L-serine from D-3-phosphoglycerate. PGDH comes in 3
           distinct structural forms, with this first group being
           related to 2-hydroxy acid dehydrogenases, sharing
           structural similarity to formate and glycerate
           dehydrogenases. PGDH in E. coli and Mycobacterium
           tuberculosis form tetramers, with subunits containing a
           Rossmann-fold NAD binding domain. Formate/glycerate and
           related dehydrogenases of the D-specific 2-hydroxyacid
           dehydrogenase superfamily include groups such as formate
           dehydrogenase, glycerate dehydrogenase, L-alanine
           dehydrogenase, and S-Adenosylhomocysteine Hydrolase.
           Despite often low sequence identity, these proteins
           typically have a characteristic arrangement of 2 similar
           subdomains of the alpha/beta Rossmann fold NAD+ binding
           form. The NAD+ binding domain is inserted within the
           linear sequence of the mostly N-terminal catalytic
           domain, which has a similar domain structure to the
           internal NAD binding domain. Structurally, these domains
           are connected by extended alpha helices and create a
           cleft in which NAD is bound, primarily to the C-terminal
           portion of the 2nd (internal) domain. Some related
           proteins have similar structural subdomain but with a
           tandem arrangement of the catalytic and NAD-binding
           subdomains in the linear sequence.
          Length = 301

 Score = 27.5 bits (62), Expect = 8.7
 Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 2/39 (5%)

Query: 272 IEAELINKCQ--KYIKFASSALNYDDYKEAKSNLIKVLN 308
           +  E+I+  +  K I  A   L+  D + AK   IKV+N
Sbjct: 53  VTKEVIDAAKNLKIIARAGVGLDNIDVEYAKKKGIKVIN 91


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 27.6 bits (61), Expect = 9.0
 Identities = 27/121 (22%), Positives = 34/121 (28%), Gaps = 16/121 (13%)

Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF-----PAPTSSGGTLPSP 214
           PI            PS      + Q    P   S   V   +          S    P P
Sbjct: 100 PIHTLDNPQYPGILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHGDRAESVESVRQTPEP 159

Query: 215 PSMPQNLPGPSISGPLIPGPSIP-----------SFPSPAVTPSSPSTPAPQPAMPALVT 263
           P +P      S +    P  SIP           SF  PA+  S  +     P+ P L  
Sbjct: 160 PYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASMSTIEPKLPSTPLLTQ 219

Query: 264 Q 264
           Q
Sbjct: 220 Q 220


>gnl|CDD|202284 pfam02561, FliS, Flagellar protein FliS.  FliS is coded for by the
           FliD operon and is transcribed in conjunction with FliD
           and FliT, however this protein has no known function.
          Length = 122

 Score = 26.7 bits (59), Expect = 9.0
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 276 LINKCQKYIKFASSALNYDDYKEAKSNLIKVLNIL 310
           L +K  K +  A+  +  +D ++   N+ K  NI+
Sbjct: 25  LYDKAIKALVRAAEFIENEDIEKKNENISKAQNII 59


>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
           Members of this protein family are FHA
           (forkhead-associated) domain-containing proteins that
           are part of type VI secretion loci in a considerable
           number of bacteria, most of which are known pathogens.
           Species include Pseudomonas aeruginosa PAO1, Aeromonas
           hydrophila, Yersinia pestis, Burkholderia mallei, etc
           [Protein fate, Protein and peptide secretion and
           trafficking, Cellular processes, Pathogenesis].
          Length = 396

 Score = 27.7 bits (62), Expect = 9.3
 Identities = 13/75 (17%), Positives = 21/75 (28%), Gaps = 10/75 (13%)

Query: 157 GETPIPGP-------PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGG 209
           G  P P P       P      E  S +  +    L +P     +  +        +   
Sbjct: 125 GGPPTPDPAPLAQLDPLKALDQEPLSAADLDD---LSAPLFPPLDARLPAFAAPIDAEPT 181

Query: 210 TLPSPPSMPQNLPGP 224
            +P    +P   P P
Sbjct: 182 MVPPFVPLPAPEPAP 196


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 27.9 bits (62), Expect = 9.5
 Identities = 12/48 (25%), Positives = 17/48 (35%)

Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
           PG   +    PG   P+ P     P +   PA  P  P +      E+
Sbjct: 906 PGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAEL 953


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.128    0.375 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,659,244
Number of extensions: 1446068
Number of successful extensions: 3371
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2381
Number of HSP's successfully gapped: 507
Length of query: 316
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 219
Effective length of database: 6,635,264
Effective search space: 1453122816
Effective search space used: 1453122816
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.7 bits)