RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9257
         (530 letters)



>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
           signaling molecules, often in tandem arrangements. May
           be responsible for specific protein-protein
           interactions, as most PDZ domains bind C-terminal
           polypeptides, and binding to internal (non-C-terminal)
           polypeptides and even to lipids has been demonstrated.
           In this subfamily of PDZ domains an N-terminal
           beta-strand forms the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in proteases.
          Length = 82

 Score = 81.5 bits (202), Expect = 4e-19
 Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 4/83 (4%)

Query: 347 IRELVLCKDASGLIGLRVCAINE---GVFVCLVERGSPASLVGLRFGDQILSINGETVAG 403
           +R + L KD  G +G  +    +   G+FV  VE G PA   GLR GD+IL +NG +V G
Sbjct: 1   VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60

Query: 404 YNMKQVHKILKAAPVNNIRVVIR 426
              ++  ++LK +  + + + +R
Sbjct: 61  LTHEEAVELLKNSG-DEVTLTVR 82



 Score = 70.7 bits (174), Expect = 3e-15
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 7/82 (8%)

Query: 431 ERTVTLHRDSSGHFGFHFNKGQ-------IVSLVKDSSAARNGLLVNHNILEVNGANVVG 483
            RTVTL +D  G  GF    G+       +  +     A R GL V   ILEVNG +V G
Sbjct: 1   VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60

Query: 484 LKDKELLEAMEKGGSVLTLTLI 505
           L  +E +E ++  G  +TLT+ 
Sbjct: 61  LTHEEAVELLKNSGDEVTLTVR 82


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
           called DHR (Dlg homologous region) or GLGF (relatively
           well conserved tetrapeptide in these domains). Some PDZs
           have been shown to bind C-terminal polypeptides; others
           appear to bind internal (non-C-terminal) polypeptides.
           Different PDZs possess different binding specificities.
          Length = 85

 Score = 70.5 bits (173), Expect = 3e-15
 Identities = 28/86 (32%), Positives = 39/86 (45%), Gaps = 5/86 (5%)

Query: 347 IRELVLCKDASGLIGLRVCAINE---GVFVCLVERGSPASLVGLRFGDQILSINGETVAG 403
            R + L K   GL G  +    +   GV V  V  GSPA+  GLR GD IL +NG +V G
Sbjct: 2   PRLVELEKGGGGL-GFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60

Query: 404 YNMKQVHKILKAAPVNNIRVVIRDRP 429
               +   +LK A    + + +    
Sbjct: 61  LTHLEAVDLLKKAG-GKVTLTVLRGG 85



 Score = 61.6 bits (150), Expect = 4e-12
 Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 8/83 (9%)

Query: 431 ERTVTLHRDSSGHFGFHFNKGQ-------IVSLVKDSSAARNGLLVNHNILEVNGANVVG 483
            R V L +   G  GF    G+       + S+V  S AA+ GL V   ILEVNG +V G
Sbjct: 2   PRLVELEKGGGG-LGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60

Query: 484 LKDKELLEAMEKGGSVLTLTLIP 506
           L   E ++ ++K G  +TLT++ 
Sbjct: 61  LTHLEAVDLLKKAGGKVTLTVLR 83


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
           or GLGF (after a conserved sequence motif). Many PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. Heterodimerization through
           PDZ-PDZ domain interactions adds to the domain's
           versatility, and PDZ domain-mediated interactions may be
           modulated dynamically through target phosphorylation.
           Some PDZ domains play a role in scaffolding
           supramolecular complexes. PDZ domains are found in
           diverse signaling proteins in bacteria, archebacteria,
           and eurkayotes. This CD contains two distinct structural
           subgroups with either a N- or C-terminal beta-strand
           forming the peptide-binding groove base. The circular
           permutation placing the strand on the N-terminus appears
           to be found in Eumetazoa only, while the C-terminal
           variant is found in all three kingdoms of life, and
           seems to co-occur with protease domains. PDZ domains
           have been named after PSD95(post synaptic density
           protein), DlgA (Drosophila disc large tumor suppressor),
           and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 69.6 bits (171), Expect = 5e-15
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 367 INEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR 426
              GV V  VE GSPA   GL+ GD IL++NG  V    ++ V ++LK      + + +R
Sbjct: 11  TEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTVR 70



 Score = 45.0 bits (107), Expect = 2e-06
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTLTLI 505
           ++S+   S A R GL     IL VNG +V  L  +++ E + ++ G  +TLT+ 
Sbjct: 17  VLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTVR 70


>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
           tail-specific-, and tricorn proteases, which function in
           posttranslational protein processing, maturation, and
           disassembly or degradation, in Bacteria, Archaea, and
           plant chloroplasts. May be responsible for substrate
           recognition and/or binding, as most PDZ domains bind
           C-terminal polypeptides, and binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 85

 Score = 58.0 bits (141), Expect = 7e-11
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 358 GLIGLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAP 417
           G IGL +   + G+ +  V  GSPA+  G++ GD I++I+GE V G +++ V K+L+   
Sbjct: 2   GGIGLELKYDDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKA 61

Query: 418 VNNIRV-VIRDRPLERTVTLHRDS 440
              +R+ + R     R VTL R  
Sbjct: 62  GTKVRLTLKRGDGEPREVTLTRLK 85



 Score = 38.4 bits (90), Expect = 6e-04
 Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 1/64 (1%)

Query: 444 FGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTL 502
             +      I S++  S AA+ G+     I+ ++G  V GL  +++++ +  K G+ + L
Sbjct: 8   LKYDDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVRL 67

Query: 503 TLIP 506
           TL  
Sbjct: 68  TLKR 71


>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
           proteases, such as DegP/HtrA, which are oligomeric
           proteins involved in heat-shock response, chaperone
           function, and apoptosis. May be responsible for
           substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 90

 Score = 52.6 bits (127), Expect = 8e-09
 Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 3/70 (4%)

Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN-IRVVIR 426
            +GV V  V+ GSPA+  GL+ GD IL++NG+ V   ++  + + L      + + + + 
Sbjct: 23  TKGVLVASVDPGSPAAKAGLKPGDVILAVNGKPVK--SVADLRRALAELKPGDKVTLTVL 80

Query: 427 DRPLERTVTL 436
               E TVT+
Sbjct: 81  RGGKELTVTV 90



 Score = 34.9 bits (81), Expect = 0.011
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAME--KGGSVLTLTLI 505
           + S+   S AA+ GL     IL VNG  V  + D  L  A+   K G  +TLT++
Sbjct: 28  VASVDPGSPAAKAGLKPGDVILAVNGKPVKSVAD--LRRALAELKPGDKVTLTVL 80


>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF).  PDZ
           domains are found in diverse signaling proteins.
          Length = 80

 Score = 51.5 bits (124), Expect = 1e-08
 Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 6/82 (7%)

Query: 349 ELVLCKDASGLIGLRV---CAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGY- 404
           E+ L K   G +G  +      + G+FV  V  G  A   GL+ GD+ILSING+ +    
Sbjct: 1   EVTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLS 60

Query: 405 NMKQVHKILKAAPVNNIRVVIR 426
           + + V  +        + + I 
Sbjct: 61  HDEAVLALKG--SGGEVTLTIL 80



 Score = 47.2 bits (113), Expect = 4e-07
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 7/80 (8%)

Query: 433 TVTLHRDSSGHFGF------HFNKGQIVS-LVKDSSAARNGLLVNHNILEVNGANVVGLK 485
            VTL +   G  GF        + G  VS ++   +A   GL     IL +NG ++  L 
Sbjct: 1   EVTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLS 60

Query: 486 DKELLEAMEKGGSVLTLTLI 505
             E + A++  G  +TLT++
Sbjct: 61  HDEAVLALKGSGGEVTLTIL 80


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 56.9 bits (137), Expect = 2e-08
 Identities = 57/253 (22%), Positives = 78/253 (30%), Gaps = 28/253 (11%)

Query: 36   PPPPVYQTSSHTSPNAYNNPPSPY------------HSANPPPYYQHPSVPSHPSVPTFP 83
            PPPP+   +   +P+    PP P                + PP    P  P        P
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD--P 2608

Query: 84   SAPIYPSAPPQESHS---PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYP---S 137
              P  PS  P ++H+   P       AN P    P    P      +P   + S P    
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR 2668

Query: 138  VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
               +    ++ P  P +  A      LTS         P P  P  +        P    
Sbjct: 2669 RLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP----PPPPTPEPAPHALVSATPLPPG 2724

Query: 198  QPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPS-VPPAIHDAT--RST 254
              +   A P  P  P+  A  P+ P      A  +  PTTA  P+  PPA   A   R  
Sbjct: 2725 PAAARQASPALPAAPAPPAV-PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL 2783

Query: 255  PSPIIAPSVPPTN 267
              P +A       
Sbjct: 2784 TRPAVASLSESRE 2796



 Score = 56.5 bits (136), Expect = 3e-08
 Identities = 57/247 (23%), Positives = 79/247 (31%), Gaps = 17/247 (6%)

Query: 36   PPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVP---TFPSAPIYPSAP 92
            P PP        SP A    P P  +  PP   +    P   S P            S+P
Sbjct: 2624 PDPP----PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679

Query: 93   PQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMP 152
            PQ          + +       P P       P     A    P      QA    P+ P
Sbjct: 2680 PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739

Query: 153  NQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQ--PPAYSVQPSHNTAYPTAPL 210
                 P G  P T    A  A  P    P A   P+AP   PP    +P+  +   +   
Sbjct: 2740 APPAVPAG--PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797

Query: 211  VPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDT 270
            +PS     P+ PPA +     +  P  + A  +PP     T + P+    P  PP     
Sbjct: 2798 LPS--PWDPADPPAAVLAPAAALPPAASPAGPLPP----PTSAQPTAPPPPPGPPPPSLP 2851

Query: 271  IRSALVP 277
            +  ++ P
Sbjct: 2852 LGGSVAP 2858



 Score = 55.3 bits (133), Expect = 6e-08
 Identities = 50/275 (18%), Positives = 81/275 (29%), Gaps = 35/275 (12%)

Query: 29   TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSV-----PTFP 83
             +     PP          SP+   N P P+     PP  +    P+   V         
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670

Query: 84   SAPIYPSAPPQ------------------ESHSPYYYSDIRANVPTGGSPVPSYPQGSYP 125
                  S+PPQ                  +   P    +   +     +P+P  P  +  
Sbjct: 2671 GRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730

Query: 126  SNPYQ----AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINP 181
            ++P      A  + P+ P  P  P      P     P    P   +    +      +  
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790

Query: 182  LASNKPSAPQPPAYSVQP--------SHNTAYPTAPLVPSTIATAPSAPPANIEDATRST 233
            L+ ++ S P P   +  P        +   A   A  +P   +  P+APP        S 
Sbjct: 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSL 2850

Query: 234  HPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
                + AP         +RS  +   AP+ PP  R
Sbjct: 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR 2885



 Score = 53.0 bits (127), Expect = 3e-07
 Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 23/238 (9%)

Query: 29   TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
            TS +   PPPP  + + H   +A   PP P  +    P    P+ P+ P+VP  P+ P  
Sbjct: 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP--ALPAAPAPPAVPAGPATPGG 2753

Query: 89   PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
            P+ P +                T G P P+ P    P+     + + P+V +  ++  + 
Sbjct: 2754 PARPARPP-------------TTAGPPAPAPPAA--PAAGPPRRLTRPAVASLSESRESL 2798

Query: 149  PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
            PS  +    P       ++   + +    P+ P  S +P+AP PP     PS       A
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857

Query: 209  PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
            P           +P A          P         PA+  +T S   P   P  PP 
Sbjct: 2858 PGGDVRRRPPSRSPAAK---PAAPARPPVRRLAR--PAVSRSTESFALPPDQPERPPQ 2910



 Score = 50.7 bits (121), Expect = 2e-06
 Identities = 53/257 (20%), Positives = 75/257 (29%), Gaps = 10/257 (3%)

Query: 29   TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
              S+   PP P     +  +  A   PP+       P     P+ P   + P  P+ P  
Sbjct: 2715 LVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774

Query: 89   PSAPPQESHSPYYYSDIRANVPTGGSP-----VPSYPQGSYPSNPYQAQNSYPSVP---N 140
            P+A P    +    + +  +  +  SP      P+       + P  A  + P  P    
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834

Query: 141  QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPS 200
            QP AP   P  P       G        +    S      P A  +P   +    +V  S
Sbjct: 2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRS 2894

Query: 201  HNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIA 260
               ++   P  P      P APP           P     P  PP        T  P  A
Sbjct: 2895 -TESFALPPDQPERPPQ-PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGA 2952

Query: 261  PSVPPTNRDTIRSALVP 277
                         ALVP
Sbjct: 2953 GEPSGAVPQPWLGALVP 2969



 Score = 49.2 bits (117), Expect = 6e-06
 Identities = 49/232 (21%), Positives = 71/232 (30%), Gaps = 15/232 (6%)

Query: 25   SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
            S   +  S  SP  P    ++  +P A   P +      PPP    P+ P  P  P  PS
Sbjct: 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849

Query: 85   APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
             P+  S  P          D+R   P+         + + P+ P   + + P+V    ++
Sbjct: 2850 LPLGGSVAPG--------GDVRRRPPSRSPA----AKPAAPARPPVRRLARPAVSRSTES 2897

Query: 145  PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
                P  P +   P    P     Q      P P  P     P    P A +  P+    
Sbjct: 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP---PPPRPQPPLAPTTDPAGAGE 2954

Query: 205  YPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
               A   P   A  P           +      A A S PP    +     S
Sbjct: 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006



 Score = 47.6 bits (113), Expect = 2e-05
 Identities = 52/275 (18%), Positives = 84/275 (30%), Gaps = 48/275 (17%)

Query: 34   SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPP----------PYYQHPSVPSHPSVPTFP 83
            + PP  + + +  +   +  + PSP+  A+PP          P    P+ P  P     P
Sbjct: 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836

Query: 84   SAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
            +AP  P  PP  S            +P GGS  P       P  P ++  + P+ P +P 
Sbjct: 2837 TAPPPPPGPPPPS------------LPLGGSVAPGGDVRRRP--PSRSPAAKPAAPARPP 2882

Query: 144  A-----PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ 198
                  P    S  +    P+        Q            P    +P  P PP     
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR---- 2938

Query: 199  PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
                   P  PL P+T       P   +        P       VP  +       P P 
Sbjct: 2939 -------PQPPLAPTTDPAGAGEPSGAV--------PQPWLGALVPGRVAVPRFRVPQPA 2983

Query: 259  IAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASE 293
             +   P ++   +    +    S  + + L   ++
Sbjct: 2984 PSREAPASSTPPLTGHSLSRVSSWASSLALHEETD 3018



 Score = 47.6 bits (113), Expect = 2e-05
 Identities = 51/244 (20%), Positives = 73/244 (29%), Gaps = 24/244 (9%)

Query: 36   PPPPVYQTSSHT-SPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
            P  PVY+  +    P A    P P     P P          P  P+  +  I P  P  
Sbjct: 2475 PGAPVYRRPAEARFPFAAGAAPDPGGGGPPDP--------DAPPAPSRLAPAILPDEPVG 2526

Query: 95   ESHSPYYYSDIR-----ANVPTGGSP-------VPSYPQGSYPSNPYQAQNSYPSVPNQP 142
            E   P   + IR     A+   G  P        P+ P  S P      + S P+V ++ 
Sbjct: 2527 EPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRA 2586

Query: 143  QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHN 202
            + P+  P    + +AP         +  +  S   P        P +P P A    P   
Sbjct: 2587 RRPDA-PPQSARPRAPVD--DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643

Query: 203  TAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS 262
               P         A    + P       R+   ++        A      S  S    P 
Sbjct: 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703

Query: 263  VPPT 266
             PPT
Sbjct: 2704 PPPT 2707



 Score = 37.2 bits (86), Expect = 0.028
 Identities = 46/255 (18%), Positives = 68/255 (26%), Gaps = 30/255 (11%)

Query: 17  TIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSH 76
           T  Q+E F ++    S                    +  P     A  PP     + P+ 
Sbjct: 230 TYLQDEPFVERRVVISHPLRGDIAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNG 289

Query: 77  PSVP-----------TFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYP 125
            + P              + P  P  PP          D         SP+P  P+  YP
Sbjct: 290 AAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLP-RPRQHYP 348

Query: 126 SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
                        P + +   T PS    L A        S     + S  +   P A  
Sbjct: 349 L----------GFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARG 398

Query: 186 KPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT-------RSTHPTTA 238
                Q    +  P+ +   P    VP++    P+ P  + E  +           P   
Sbjct: 399 PGGDDQTRPAAPVPA-SVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457

Query: 239 TAPSVPPAIHDATRS 253
                P    DATR 
Sbjct: 458 ATEPAPDDPDDATRK 472



 Score = 36.1 bits (83), Expect = 0.053
 Identities = 47/194 (24%), Positives = 55/194 (28%), Gaps = 29/194 (14%)

Query: 75   SHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNS 134
            S      FP AP+Y               + R     G +P P       P  P      
Sbjct: 2467 SLLLGELFPGAPVYRRPA-----------EARFPFAAGAAPDPGGGGPPDPDAPPAPSRL 2515

Query: 135  YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
             P++   P  P   P  P  L    G   L S           P  PL    P  P  P 
Sbjct: 2516 APAI--LPDEPVGEPVHPRMLTWIRGLEELASDDAGD------PPPPLPPAAP--PAAPD 2565

Query: 195  YSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRST 254
             SV P      P+ P V S  A  P APP       +S  P         P         
Sbjct: 2566 RSVPPPRPAPRPSEPAVTSR-ARRPDAPP-------QSARPRAPVDDRGDPRGPAPPSPL 2617

Query: 255  PSPIIAPSVPPTNR 268
            P    AP  PP + 
Sbjct: 2618 PPDTHAPDPPPPSP 2631



 Score = 33.4 bits (76), Expect = 0.41
 Identities = 51/262 (19%), Positives = 70/262 (26%), Gaps = 23/262 (8%)

Query: 25   SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
            S+   +   +   PPV + +      +  +   P      PP  Q P  P     P  P 
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927

Query: 85   APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
             P  P  PP     P          PT        P G+ P     A         + + 
Sbjct: 2928 QPQPPPPPPPRPQPPL--------APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979

Query: 145  PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
            P   PS      +       + S+ +S AS       LA ++ + P PP    Q      
Sbjct: 2980 PQPAPSREAPASSTPPLTGHSLSRVSSWAS------SLALHEETDP-PPVSLKQTLWPPD 3032

Query: 205  YPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
                    S   +          D               P       R +PS    P   
Sbjct: 3033 DTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPL 3092

Query: 265  PTN--------RDTIRSALVPL 278
              N        R T RSAL  L
Sbjct: 3093 SANAALSRRYVRSTGRSALAVL 3114


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 56.3 bits (136), Expect = 3e-08
 Identities = 49/264 (18%), Positives = 67/264 (25%), Gaps = 29/264 (10%)

Query: 19  AQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSA-NPPPYYQHPSVPSHP 77
                 S  +  +S  SP    Y+ S+      Y     P  S     P  +     +  
Sbjct: 85  PSVGPDSDLSQKTSTFSPCQSGYEASTDPE---YIPDLQPDPSLWGTAPKPEPQPPQAPE 141

Query: 78  SVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS 137
           S P  P  P       +E  +          +P    P    PQG  P      Q   P 
Sbjct: 142 SQP-QPQTPAQKMLSLEEVEAQLQQRQQAPQLPQP--PQQVLPQGMPPRQAAFPQQGPPE 198

Query: 138 VP-NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
            P   PQ P  +P      Q              + A  P P        P  P P    
Sbjct: 199 QPPGYPQPPQGHPEQVQPQQFLPA-------PSQAPAQPPLPPQL-----PQQPPPLQQP 246

Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
             P  +   P  P  P      P  P A          P     P+  P +     +   
Sbjct: 247 QFPGLSQQMPPPPPQPPQQQQQPPQPQA---------QPPPQNQPTPHPGLPQGQNAPLP 297

Query: 257 PIIAPSVPPTNRDTIRSALVPLYP 280
           P   P + P  +        P + 
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFR 321



 Score = 42.8 bits (101), Expect = 4e-04
 Identities = 30/175 (17%), Positives = 36/175 (20%), Gaps = 22/175 (12%)

Query: 35  SPPPPVYQTSSHTSPNAYNNPPS--PYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAP 92
             P  +    +          P   P      P   Q       P     P+ P  P   
Sbjct: 179 VLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAP--SQAPAQPPLPPQL 236

Query: 93  PQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP--NTYPS 150
           PQ               P      P   Q   P  P   Q        Q Q P  N    
Sbjct: 237 PQ------------QPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTP 284

Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
            P   Q  N   P     Q     LP    P    +    +     +      A 
Sbjct: 285 HPGLPQGQNAPLPPPQQPQ----LLPLVQQPQGQQRGPQFREQLVQLSQQQREAL 335


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 52.7 bits (127), Expect = 4e-07
 Identities = 37/215 (17%), Positives = 53/215 (24%), Gaps = 10/215 (4%)

Query: 30  SSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYP 89
           ++    PP P    +S   P     P +P   A  P         + P+  +   AP   
Sbjct: 594 AAGGEGPPAP----ASSGPPEEAARPAAPAAPA-APAAPAPAGAAAAPAEASAAPAPGVA 648

Query: 90  SAPPQESH--SPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
           +      H   P             G   P+ P  +       A          P    T
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708

Query: 148 YPSMPNQLQAPNGFYP-LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYP 206
            P+      A         +S  +  A  P P+ P   + P     PA    P       
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768

Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP 241
                P    + PS      ED   S         
Sbjct: 769 APAAAPP--PSPPSEEEEMAEDDAPSMDDEDRRDA 801



 Score = 37.3 bits (87), Expect = 0.020
 Identities = 35/215 (16%), Positives = 52/215 (24%), Gaps = 26/215 (12%)

Query: 51  AYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVP 110
           A    P    S+ PP     P+ P+ P+ P  P+     +AP + S +P           
Sbjct: 595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAP----------- 643

Query: 111 TGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQA 170
                        + + P  +            A    P       AP        +Q A
Sbjct: 644 -APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA 702

Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT 230
                           P+A  P   +  P+             + A     P     D  
Sbjct: 703 P--------------APAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP 748

Query: 231 RSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
                  A  P  P     A  +   P   PS   
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783



 Score = 35.7 bits (83), Expect = 0.060
 Identities = 26/199 (13%), Positives = 43/199 (21%), Gaps = 13/199 (6%)

Query: 71  PSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
              P  P+    P     P+AP   +          A  P G +  P+    +       
Sbjct: 597 GEGPPAPASSGPPEEAARPAAPAAPAAPA-------APAPAGAAAAPAEASAAPAPGV-- 647

Query: 131 AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAP 190
                    +         S               ++   + A         A+    AP
Sbjct: 648 ----AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703

Query: 191 QPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDA 250
            P A       +      P      +    A    +        P         P    A
Sbjct: 704 APAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763

Query: 251 TRSTPSPIIAPSVPPTNRD 269
                +P  AP   P + +
Sbjct: 764 PAPAAAPAAAPPPSPPSEE 782



 Score = 32.3 bits (74), Expect = 0.68
 Identities = 21/139 (15%), Positives = 35/139 (25%), Gaps = 13/139 (9%)

Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
            +      AP+   + P    AP    P  ++  A  A+      P  +  P +P   A 
Sbjct: 390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAP 449

Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTP 255
           +               P+    A   P A    A  +     A   +             
Sbjct: 450 AGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAP--------- 500

Query: 256 SPIIAPSVPPTNRDTIRSA 274
               A      +  T+R  
Sbjct: 501 ----AAPAGADDAATLRER 515



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 34/145 (23%), Positives = 44/145 (30%), Gaps = 18/145 (12%)

Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
           R     GG+  P+    S  +    A  +  +      A     + P    AP       
Sbjct: 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP------- 435

Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
                + A      N  A   PS P   A S QP      P A     T A AP+ P A 
Sbjct: 436 ---APAPAPPSPAGNAPAGGAPSPPPAAAPSAQP---APAPAAA-PEPTAAPAPAPPAA- 487

Query: 226 IEDATRSTHPTTATAPSVPPAIHDA 250
                 +  P    AP+ P    DA
Sbjct: 488 ---PAPAAAPAAPAAPAAPAGADDA 509



 Score = 30.3 bits (69), Expect = 2.9
 Identities = 16/100 (16%), Positives = 27/100 (27%), Gaps = 1/100 (1%)

Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
           ++      +      P  + +P+AP  PA    P+   A   AP   S       A P +
Sbjct: 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA-AAAPAEASAAPAPGVAAPEH 652

Query: 226 IEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
                     +         A   A  + P      +   
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692



 Score = 30.0 bits (68), Expect = 3.6
 Identities = 22/127 (17%), Positives = 34/127 (26%), Gaps = 14/127 (11%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
           +         ++  +        +P  +A P P       P+    P  PS      A  
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
             S  P              +  PS      P+   +   +    P    AP   P+ P 
Sbjct: 453 APSPPP--------------AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498

Query: 154 QLQAPNG 160
              AP G
Sbjct: 499 APAAPAG 505



 Score = 29.6 bits (67), Expect = 4.8
 Identities = 25/138 (18%), Positives = 42/138 (30%), Gaps = 19/138 (13%)

Query: 89  PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQP-QAPNT 147
            +  P  +                 +P  +    +    P  A    P+   QP  AP  
Sbjct: 390 GAGAPAAAAPS----------AAAAAPAAAPAPAAAA--PAAAAAPAPAAAPQPAPAPAP 437

Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
            P+ P+           +    A+ ++ P P  P A+ +P+A   PA    P+     P 
Sbjct: 438 APAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPEPTAAPAPAPPAAPA-----PA 491

Query: 208 APLVPSTIATAPSAPPAN 225
           A         AP+     
Sbjct: 492 AAPAAPAAPAAPAGADDA 509


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 50.6 bits (121), Expect = 1e-06
 Identities = 18/115 (15%), Positives = 23/115 (20%), Gaps = 14/115 (12%)

Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
                     P G     P  +    P  P  P  PN    M                Q+
Sbjct: 403 QGPQQQFNGQPLG----WPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAA--QK 456

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
                + YP        P+    P     P   +                SA P 
Sbjct: 457 PPMQPVMYP--------PNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQ 503



 Score = 33.2 bits (76), Expect = 0.38
 Identities = 28/158 (17%), Positives = 46/158 (29%), Gaps = 20/158 (12%)

Query: 17  TIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP-S 75
            +AQ +   +      F    P + Q     SP        PY+   P    Q    P  
Sbjct: 359 ALAQRKEQRRAHLQDQFMQLQPRMRQLPM-GSPMGGAMGQPPYYGQGP--QQQFNGQPLG 415

Query: 76  HPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY 135
            P +   P+ P+ P  P +    P   + + A      +   +  +   P  P     +Y
Sbjct: 416 WPRMSMMPT-PMGPGGPLR----PNGLAPMNAVRAPSRNAQNAAQK--PPMQPVMYPPNY 468

Query: 136 PS--VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
            S  +      P +  S   Q           +   AS
Sbjct: 469 QSLPLSQDLPQPQSTASQGGQN-------KKLAQVLAS 499



 Score = 31.3 bits (71), Expect = 1.6
 Identities = 19/140 (13%), Positives = 34/140 (24%), Gaps = 21/140 (15%)

Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQAS-----------LPYPINPLASNKPS 188
            Q Q       M + +    G  P        Q +           +P P+ P    +P+
Sbjct: 376 MQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPN 435

Query: 189 APQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIH 248
              P      PS N            +   P+            + P +   P       
Sbjct: 436 GLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNY----------QSLPLSQDLPQPQSTAS 485

Query: 249 DATRSTPSPIIAPSVPPTNR 268
              ++     +  S  P  +
Sbjct: 486 QGGQNKKLAQVLASATPQMQ 505


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
           zinc metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 44.9 bits (107), Expect = 2e-06
 Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 4/62 (6%)

Query: 376 VERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR--DRPLERT 433
           V  GSPA+  GL+ GD+IL+ING+ +   + + +   ++  P   + + +      +  T
Sbjct: 19  VVPGSPAAKAGLKAGDRILAINGQKIK--SWEDLVDAVQENPGKPLTLTVERNGETITLT 76

Query: 434 VT 435
           +T
Sbjct: 77  LT 78



 Score = 33.0 bits (76), Expect = 0.052
 Identities = 22/73 (30%), Positives = 29/73 (39%), Gaps = 10/73 (13%)

Query: 444 FGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNG----------ANVVGLKDKELLEAM 493
            G    +  I  +V  S AA+ GL     IL +NG            V     K L   +
Sbjct: 7   PGGPPIEPVIGEVVPGSPAAKAGLKAGDRILAINGQKIKSWEDLVDAVQENPGKPLTLTV 66

Query: 494 EKGGSVLTLTLIP 506
           E+ G  +TLTL P
Sbjct: 67  ERNGETITLTLTP 79


>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
           membrane].
          Length = 406

 Score = 48.9 bits (117), Expect = 4e-06
 Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 7/85 (8%)

Query: 360 IGLRVCAINEG-VFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPV 418
           IG+ +   + G V V     GSPA+  G++ GD I+ I+G++V G ++ +  K+++  P 
Sbjct: 102 IGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPG 161

Query: 419 NNIRVVIR----DRPLERTVTLHRD 439
             + + I      +P   TVTL R+
Sbjct: 162 TKVTLTILRAGGGKPF--TVTLTRE 184



 Score = 44.6 bits (106), Expect = 9e-05
 Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTLTLIP 506
           +VS +  S AA+ G+     I++++G +V G+   E ++ +  K G+ +TLT++ 
Sbjct: 116 VVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTILR 170


>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
           family.  This family consists of a set proteins various
           designated DegP, heat shock protein HtrA, and protease
           DO. The ortholog in Pseudomonas aeruginosa is designated
           MucD and is found in an operon that controls mucoid
           phenotype. This family also includes the DegQ (HhoA)
           paralog in E. coli which can rescue a DegP mutant, but
           not the smaller DegS paralog, which cannot. Members of
           this family are located in the periplasm and have
           separable functions as both protease and chaperone.
           Members have a trypsin domain and two copies of a PDZ
           domain. This protein protects bacteria from thermal and
           other stresses and may be important for the survival of
           bacterial pathogens.// The chaperone function is
           dominant at low temperatures, whereas the proteolytic
           activity is turned on at elevated temperatures [Protein
           fate, Protein folding and stabilization, Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 428

 Score = 46.8 bits (112), Expect = 2e-05
 Identities = 43/163 (26%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETV----AGYNMKQVHKILKAAPVNNIRV 423
             G  V  V  GSPA   GL+ GD I S+NG+ +             K  K   +     
Sbjct: 256 QRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADLRRAIGTLKPGKKVTLG---- 311

Query: 424 VIRD-RPLERTVTLHRD-------SSGHFGFHF------------NKGQ-----IVSLVK 458
           ++R  +    TVTL          S+   G                KG      +  +V 
Sbjct: 312 ILRKGKEKTITVTLGASPEEQASSSNPFLGLTVANLSPEIRKELRLKGDVKGVVVTKVVS 371

Query: 459 DSSAARNGLLVNHNILEVNGANVVGLKD-KELLEAMEKGGSVL 500
            S AAR GL     IL VN   V  + + +++L   +KGG V 
Sbjct: 372 GSPAARAGLQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVA 414



 Score = 29.5 bits (67), Expect = 4.9
 Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 3/64 (4%)

Query: 444 FGFHFNKGQIVS-LVKDSSAARNGLLVNHNILEVNGANVVGLKD-KELLEAMEKGGSVLT 501
            G    +G +V+ ++  S A + GL     I  VNG  +    D +  +  ++ G  V T
Sbjct: 251 LGLEKQRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADLRRAIGTLKPGKKV-T 309

Query: 502 LTLI 505
           L ++
Sbjct: 310 LGIL 313


>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain. 
          Length = 81

 Score = 42.2 bits (100), Expect = 2e-05
 Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 6/83 (7%)

Query: 358 GLIGLRVCAI-NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQ-VHKILKA 415
           G IG+RV      GV V  V+ GSPA+  GL+ GD ILSI+G+ V   ++ + +  IL  
Sbjct: 1   GDIGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVN--SLTELIEVILNG 58

Query: 416 APVNNIRV-VIRD-RPLERTVTL 436
            P + +++ V RD +     V L
Sbjct: 59  KPGDTVKLTVYRDGKKKTVEVKL 81



 Score = 29.1 bits (66), Expect = 0.98
 Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 5/58 (8%)

Query: 449 NKGQIVSLVKDSS-AARNGLLVNHNILEVNGANVVGLKDKELLEAMEKG--GSVLTLT 503
             G  V  VK+ S AA+ GL     IL ++G  V  L   EL+E +  G  G  + LT
Sbjct: 12  GTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSL--TELIEVILNGKPGDTVKLT 67


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 46.2 bits (109), Expect = 4e-05
 Identities = 49/245 (20%), Positives = 74/245 (30%), Gaps = 48/245 (19%)

Query: 39  PVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHS 98
           PV     + +P A      PY++  P    Q P     P  P   +A          +  
Sbjct: 400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ 459

Query: 99  PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY---PSVPNQPQAPNTYPSMPNQL 155
             Y                +Y Q +     YQ        P V  +P    T P+ P   
Sbjct: 460 STY------------QTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP--- 504

Query: 156 QAPNGFYPLTSSQQASQAS--------LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
             P  ++     ++A +          +P P+      K S   P   +V P        
Sbjct: 505 --PLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPP-------- 554

Query: 208 APLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI----IAPSV 263
              V +  A +P A  + ++ AT +   T A A    P    A    P P     I P +
Sbjct: 555 ---VEAAAAVSPLA--SGVKKATLA---TGAAATVAAPVFSLANSGGPRPQVKEGIGPQL 606

Query: 264 PPTNR 268
           P   R
Sbjct: 607 PRPKR 611



 Score = 45.5 bits (107), Expect = 8e-05
 Identities = 44/245 (17%), Positives = 69/245 (28%), Gaps = 25/245 (10%)

Query: 28  TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
           T + S+++P  PV QT    S +     P+      P P    P +   P    +P    
Sbjct: 328 TATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPE--GYPQQSQ 385

Query: 88  YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
           Y  A P   ++      ++   P          Q  Y +   +     P     P+ P  
Sbjct: 386 Y--AQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP-- 441

Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
                 Q +     +   S+ Q  Q     P       +   P      V+P        
Sbjct: 442 VAGNAWQAEEQQSTFAPQSTYQTEQT-YQQPAAQEPLYQQPQPVEQQPVVEP-------- 492

Query: 208 APLVPSTIATAPSAPPANI-----EDATRSTHPTTATAPSVPPAIHDATRSTPS--PIIA 260
               P    T P+ PP        E   R      A    +P  + +      S      
Sbjct: 493 ---EPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSV 549

Query: 261 PSVPP 265
            +VPP
Sbjct: 550 AAVPP 554



 Score = 40.5 bits (94), Expect = 0.003
 Identities = 35/136 (25%), Positives = 45/136 (33%), Gaps = 8/136 (5%)

Query: 57  SPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPV 116
            P     P   YQ P  P  P        P  P AP  +   P      +        PV
Sbjct: 754 QPQQPVAPQQQYQQPQQPVAP--QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811

Query: 117 PSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLP 176
              PQ   P  P   Q  Y   P QP AP    ++ + L   NG      S+   + + P
Sbjct: 812 APQPQYQQPQQPVAPQPQYQQ-PQQPVAPQPQDTLLHPLLMRNG-----DSRPLHKPTTP 865

Query: 177 YPINPLASNKPSAPQP 192
            P   L +  PS  +P
Sbjct: 866 LPSLDLLTPPPSEVEP 881



 Score = 38.5 bits (89), Expect = 0.010
 Identities = 40/133 (30%), Positives = 45/133 (33%), Gaps = 18/133 (13%)

Query: 109 VPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ 168
           V     PV    Q   P  P   Q  Y   P QP AP      P Q  AP   Y      
Sbjct: 752 VQQPQQPVAPQQQYQQPQQPVAPQPQYQQ-PQQPVAPQPQYQQPQQPVAPQPQYQQPQQP 810

Query: 169 QASQASLPYPINPLAS-------NKPSAPQPPAYSVQP-------SHNTAYPTAPLVPST 214
            A Q     P  P+A         +P APQP    + P       S     PT PL PS 
Sbjct: 811 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPL-PSL 869

Query: 215 IATAPSAPPANIE 227
               P  PP+ +E
Sbjct: 870 DLLTP--PPSEVE 880



 Score = 35.8 bits (82), Expect = 0.066
 Identities = 36/196 (18%), Positives = 57/196 (29%), Gaps = 24/196 (12%)

Query: 6   PSLEDMTFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHS--AN 63
                        AQ  Y++         +      Q  S  +P +       Y    A 
Sbjct: 416 AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQ-QSTFAPQSTYQTEQTYQQPAAQ 474

Query: 64  PPPYYQHPSVPSHPSVPTFPSA-PIYPSAPPQESHSPYYYSDI-------RANVPTGGSP 115
            P Y Q   V   P V   P      P+ PP      YY+ ++       R  +     P
Sbjct: 475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPL-----YYFEEVEEKRAREREQLAAWYQP 529

Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
           +P   +   P        S  +VP    A    P      +A        +    + A++
Sbjct: 530 IPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKA--------TLATGAAATV 581

Query: 176 PYPINPLASNKPSAPQ 191
             P+  LA++    PQ
Sbjct: 582 AAPVFSLANSGGPRPQ 597



 Score = 30.4 bits (68), Expect = 2.8
 Identities = 31/136 (22%), Positives = 40/136 (29%), Gaps = 13/136 (9%)

Query: 89  PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
           P+     S   + +S ++A +  G    P +     P    Q   +      QPQ P   
Sbjct: 716 PAGANPFSLDDFEFSPMKALLDDGPHE-PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAP 774

Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
                Q Q P    P    QQ  Q   P P            Q P   V P      P  
Sbjct: 775 QPQYQQPQQP--VAPQPQYQQPQQPVAPQP----------QYQQPQQPVAPQPQYQQPQQ 822

Query: 209 PLVPSTIATAPSAPPA 224
           P+ P      P  P A
Sbjct: 823 PVAPQPQYQQPQQPVA 838



 Score = 28.9 bits (64), Expect = 7.9
 Identities = 21/84 (25%), Positives = 24/84 (28%), Gaps = 5/84 (5%)

Query: 20  QNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHP-- 77
           Q     +Q         P P YQ               P     P P YQ P  P  P  
Sbjct: 757 QPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ-QPQQPVAPQPQYQQPQQPVAPQP 815

Query: 78  --SVPTFPSAPIYPSAPPQESHSP 99
               P  P AP      PQ+  +P
Sbjct: 816 QYQQPQQPVAPQPQYQQPQQPVAP 839


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 45.9 bits (109), Expect = 4e-05
 Identities = 36/172 (20%), Positives = 53/172 (30%), Gaps = 20/172 (11%)

Query: 132 QNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQ 191
             +  +     +     P       AP          QA+ A  P      A++ P+AP 
Sbjct: 367 AAAAEAAAPAEKKTPARPEAAAPAAAP--------VAQAAAAPAPAAAPAAAASAPAAPP 418

Query: 192 PPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
             A     +   A   A    +  A    AP    + A  +       AP   PA+  A 
Sbjct: 419 AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP--EPAVASAA 476

Query: 252 RSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENA 303
            +  +   A  + PT    +  A V              A+E I A ARE A
Sbjct: 477 PAPAAAPAAARLTPTEEGDVWHATVQQL----------AAAEAITALARELA 518



 Score = 33.1 bits (76), Expect = 0.40
 Identities = 27/149 (18%), Positives = 42/149 (28%), Gaps = 23/149 (15%)

Query: 65  PPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSY 124
           P      + P+    P  P A    +AP  ++ +        A  P   +  P+ P  + 
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAA----PAPAAAPAAAASAPAAPPAAA 421

Query: 125 PSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS 184
           P  P  A  +         AP                    +   A           +A 
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAP-------------------AAVALAPAPPAQAAPETVAI 462

Query: 185 NKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
               AP+P   S  P+   A   A L P+
Sbjct: 463 PVRVAPEPAVASAAPAPAAAPAAARLTPT 491



 Score = 33.1 bits (76), Expect = 0.44
 Identities = 21/124 (16%), Positives = 35/124 (28%)

Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
           P   +   +  +   P+ P  A  +   V     AP    +      AP           
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
            +  +   P    A+   +    PA   Q +  T      + P     + +  PA    A
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485

Query: 230 TRST 233
            R T
Sbjct: 486 ARLT 489


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 44.3 bits (105), Expect = 9e-05
 Identities = 20/118 (16%), Positives = 27/118 (22%), Gaps = 6/118 (5%)

Query: 77  PSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYP 136
             V T  S   +P      +      S       + G P    P     S    +    P
Sbjct: 161 ADVATTNSDNSFPGEDADPAS----ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP 216

Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
           S       P +  S  N    P G              L     P   +  + P P  
Sbjct: 217 SSFQSDTPPPSPESPTNP-SPPPGPAAPPPPPVQQVPPLSTA-KPTPPSASATPAPIG 272



 Score = 43.1 bits (102), Expect = 2e-04
 Identities = 27/110 (24%), Positives = 36/110 (32%), Gaps = 11/110 (10%)

Query: 21  NEYFSKQTTSSSFSSPPPPVYQTS--SHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPS 78
           N +  +    +S S   PP       S  SP    + PS       P  +Q  + P  P 
Sbjct: 170 NSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPE 229

Query: 79  VPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP 128
            PT PS P  P+APP                    +  P+ P  S    P
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPL---------STAKPTPPSASATPAP 270



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 27/139 (19%), Positives = 38/139 (27%), Gaps = 20/139 (14%)

Query: 104 DIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYP 163
           D         +  P        ++P    +S P VP+ P  P    S P+    P     
Sbjct: 160 DADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS-PSDSSLP----- 213

Query: 164 LTSSQQASQASLPYPINPLASNKPSAP-QPPAYSVQPSHNTAYPTAP-LVPSTIATAPSA 221
                       P P +  +   P +P  P   S  P      P     VP      P+ 
Sbjct: 214 ------------PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTP 261

Query: 222 PPANIEDATRSTHPTTATA 240
           P A+   A          A
Sbjct: 262 PSASATPAPIGGITLDDDA 280



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 23/127 (18%), Positives = 33/127 (25%)

Query: 49  PNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRAN 108
           PN           A+        S P   + P   S    PS+ P     P    D  + 
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207

Query: 109 VPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ 168
             +   P PS  Q   P    ++  +    P     P         L       P  S+ 
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267

Query: 169 QASQASL 175
            A    +
Sbjct: 268 PAPIGGI 274



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 35/130 (26%), Positives = 41/130 (31%), Gaps = 21/130 (16%)

Query: 22  EYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPT 81
           E     TT+S  S P       S+  S      P S     + P   + PS PS  S+P 
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDP----PSSSPGVPSFPSPPEDPSSPSDSSLPP 214

Query: 82  FPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS--VP 139
            PS+    S  P  S             PT  SP P       P  P   Q    S   P
Sbjct: 215 APSSF--QSDTPPPSPES----------PTNPSPPPGPAA---PPPPPVQQVPPLSTAKP 259

Query: 140 NQPQAPNTYP 149
             P A  T  
Sbjct: 260 TPPSASATPA 269



 Score = 37.0 bits (86), Expect = 0.019
 Identities = 22/131 (16%), Positives = 35/131 (26%), Gaps = 18/131 (13%)

Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
           +      + N++P       + +     +    +S     +P  P   + PS    P   
Sbjct: 161 ADVATTNSDNSFPGEDADPASASP----SDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA- 215

Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
             PS   +    P   S   T PS PP                + + P         TP 
Sbjct: 216 --PSSFQSDTPPPSPESP--TNPSPPPGPAAPPPPPVQQVPPLSTAKP---------TPP 262

Query: 257 PIIAPSVPPTN 267
              A   P   
Sbjct: 263 SASATPAPIGG 273



 Score = 35.8 bits (83), Expect = 0.048
 Identities = 22/67 (32%), Positives = 25/67 (37%), Gaps = 2/67 (2%)

Query: 30  SSSFSSPPPPVYQTSSH--TSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
            S  S PP P    S     SP +  NP  P   A PPP       P   + PT PSA  
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASA 266

Query: 88  YPSAPPQ 94
            P+    
Sbjct: 267 TPAPIGG 273



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 24/114 (21%), Positives = 37/114 (32%), Gaps = 3/114 (2%)

Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT 230
           + +   +P         S   PP+ S       + P  P  PS  ++ P AP  + +  T
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSD-SSLPPAPS-SFQSDT 223

Query: 231 RSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP-PTNRDTIRSALVPLYPSLD 283
               P + T PS PP            +   S   PT      +       +LD
Sbjct: 224 PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLD 277



 Score = 30.0 bits (68), Expect = 2.8
 Identities = 15/59 (25%), Positives = 20/59 (33%), Gaps = 2/59 (3%)

Query: 34  SSPPPPVY-QTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSA 91
           S  PPP     ++ + P     PP P     PP     P+ PS  +    P   I    
Sbjct: 221 SDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASA-TPAPIGGITLDD 278


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 44.5 bits (105), Expect = 1e-04
 Identities = 29/190 (15%), Positives = 51/190 (26%), Gaps = 2/190 (1%)

Query: 90  SAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYP 149
           + P   + +P       A  P   +P P+ P  +  + P  A  +        +      
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431

Query: 150 SMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAP 209
           ++    QA         +   + A+ P      A+  P      A +       A   AP
Sbjct: 432 ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491

Query: 210 --LVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTN 267
               P      P    +       +            PA  D   +  +   AP+  P  
Sbjct: 492 ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAP 551

Query: 268 RDTIRSALVP 277
           R    +  V 
Sbjct: 552 RAAAATEPVV 561



 Score = 44.1 bits (104), Expect = 2e-04
 Identities = 33/194 (17%), Positives = 51/194 (26%), Gaps = 3/194 (1%)

Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
                  AP   P+      A     P       + A          +  P+   P   +
Sbjct: 373 GPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA 432

Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
           +  +   +       P+      +AP A    A     P  A A + P     A    P+
Sbjct: 433 LAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPA 492

Query: 257 PIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTV 316
                  PP        A            G    S    A A  + A E  + A     
Sbjct: 493 D---DDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAP 549

Query: 317 ARQAPRDSCTMIAP 330
           A +A   +  ++AP
Sbjct: 550 APRAAAATEPVVAP 563



 Score = 41.0 bits (96), Expect = 0.001
 Identities = 39/231 (16%), Positives = 55/231 (23%), Gaps = 20/231 (8%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
           + P        +  +P A     +    A PP     P+     +      A       P
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPA---APAAAPAAAAAARAVAAAPARRSP 428

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
                               +P P+       +    A    P       AP        
Sbjct: 429 APEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAA 488

Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
              A +   P         +  P   +        AP        P   TA P       
Sbjct: 489 PAPADDDPPPWEELPPEFASPAPAQPDA-------APAGWVAESIPDPATADPDDAF--E 539

Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
           T+A AP+A PA             AT P V P    A+ S    +     P
Sbjct: 540 TLAPAPAAAPA--------PRAAAATEPVVAPRPPRASASGLPDMFDGDWP 582



 Score = 36.0 bits (83), Expect = 0.054
 Identities = 32/153 (20%), Positives = 42/153 (27%), Gaps = 6/153 (3%)

Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
           GG+   +            A     + P    AP   P+ P    AP       +   A 
Sbjct: 370 GGAGPATAAAAPVAQPAPAAAAPAAAAP----APAAPPAAPAA--APAAAAAARAVAAAP 423

Query: 172 QASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATR 231
               P P    A+ + SA  P           A P A   P+     P A  A    A  
Sbjct: 424 ARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA 483

Query: 232 STHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
           +     A A   PP   +      SP  A    
Sbjct: 484 APAAAPAPADDDPPPWEELPPEFASPAPAQPDA 516



 Score = 35.2 bits (81), Expect = 0.089
 Identities = 28/186 (15%), Positives = 48/186 (25%), Gaps = 2/186 (1%)

Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
           P        P        +Q A  A+ P    P  +  P+AP     +   +   A   A
Sbjct: 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA 424

Query: 209 PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
              P+  A A +   +               A     A   A    P    A + P    
Sbjct: 425 RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA 484

Query: 269 DTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTVARQAPRDSCTMI 328
                A     P     +  E AS        + A     +++  +           T+ 
Sbjct: 485 PAAAPAPADDDPPPWEELPPEFAS--PAPAQPDAAPAGWVAESIPDPATADPDDAFETLA 542

Query: 329 APLSSS 334
              +++
Sbjct: 543 PAPAAA 548



 Score = 33.3 bits (76), Expect = 0.34
 Identities = 27/105 (25%), Positives = 37/105 (35%), Gaps = 3/105 (2%)

Query: 161 FYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS 220
           F P  S   A  A+      P+A   P+A  P A +  P+   A P A    +  A A +
Sbjct: 363 FRPGQSGGGAGPATAAAA--PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVA 420

Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
           A PA    A          A +  P    A    P+   A +  P
Sbjct: 421 AAPARRSPA-PEALAAARQASARGPGGAPAPAPAPAAAPAAAARP 464


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 43.9 bits (103), Expect = 2e-04
 Identities = 61/245 (24%), Positives = 80/245 (32%), Gaps = 23/245 (9%)

Query: 38  PPVYQTSSHTSPNAYNNPPSPYHSANPPPY-YQHPSVPSHPSVPTFPSAP-IYPSAPPQE 95
           P    T+S  + +A    PS   +  P P+  Q P  P+  S     SAP  +P      
Sbjct: 576 PLTSPTTSQLASSA----PSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPI 631

Query: 96  SHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQL 155
              P     I  NV     P P  P     +           +P QP        +P Q 
Sbjct: 632 PMRPLRMQPITFNVL--VFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQ- 688

Query: 156 QAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTI 215
            AP    P            P P+ P A+  P   Q PA +   +   A       P   
Sbjct: 689 WAPGTMQP--------PPRAPTPMRPPAA-PPGRAQRPAAATGRARPPAAAPGRARPPAA 739

Query: 216 ATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSAL 275
           A   + PPA      R+  P  A   + PPA   A    P+P   P  PP  +   R A 
Sbjct: 740 APGRARPPAAA--PGRARPPAAAPGRARPPA---AAPGAPTPQPPPQAPPAPQQRPRGAP 794

Query: 276 VPLYP 280
            P  P
Sbjct: 795 TPQPP 799



 Score = 29.6 bits (66), Expect = 5.6
 Identities = 41/190 (21%), Positives = 52/190 (27%), Gaps = 29/190 (15%)

Query: 56  PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSP 115
           P  ++    P  +Q P V   P  PT+      P  P     +           P    P
Sbjct: 640 PITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANT---MLPIQWAPGTMQP 696

Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
            P  P    P  P  A    P    Q  A  T  + P          P      A   + 
Sbjct: 697 PPRAPT---PMRPPAA----PPGRAQRPAAATGRARPPAAAPGRARPP----AAAPGRAR 745

Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP 235
           P    P  +  P+A    A     +     P  P         P APPA  +       P
Sbjct: 746 PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP---------PQAPPAPQQ------RP 790

Query: 236 TTATAPSVPP 245
             A  P  PP
Sbjct: 791 RGAPTPQPPP 800


>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc).  A C-terminal peptidase
           with different substrates in different species including
           processing of D1 protein of the photosystem II reaction
           center in higher plants and cleavage of a peptide of 11
           residues from the precursor form of penicillin-binding
           protein in E.coli E.coli and H influenza have the most
           distal branch of the tree and their proteins have an
           N-terminal 200 amino acids that show no homology to
           other proteins in the database [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Protein fate, Protein modification and repair].
          Length = 334

 Score = 43.1 bits (102), Expect = 2e-04
 Identities = 19/80 (23%), Positives = 39/80 (48%)

Query: 360 IGLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVN 419
           IG++V   +  + +     GSPA   G++ GD+I+ ING++VAG ++     +++     
Sbjct: 53  IGIQVGMDDGEIVIVSPFEGSPAEKAGIKPGDKIIKINGKSVAGMSLDDAVALIRGKKGT 112

Query: 420 NIRVVIRDRPLERTVTLHRD 439
            + + I      + +T    
Sbjct: 113 KVSLEILRAGKSKPLTFTLK 132



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAME-KGGSVLTLTLIP 506
           IVS  + S A + G+     I+++NG +V G+   + +  +  K G+ ++L ++ 
Sbjct: 66  IVSPFEGSPAEKAGIKPGDKIIKINGKSVAGMSLDDAVALIRGKKGTKVSLEILR 120


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 43.3 bits (103), Expect = 3e-04
 Identities = 30/164 (18%), Positives = 47/164 (28%), Gaps = 9/164 (5%)

Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPL-TSSQ 168
           P    P P  P  S          + P+    P      P  P          PL  ++ 
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420

Query: 169 QASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED 228
           Q   A          + K    +P A S     N+A      V    +    AP      
Sbjct: 421 QLLAARQQLQ-RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAY 479

Query: 229 ATRSTHPTTATAPSV--PPAI-----HDATRSTPSPIIAPSVPP 265
             ++T+P       V  P A+     H+ T    + + A ++  
Sbjct: 480 RWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLAAEAIER 523



 Score = 34.5 bits (80), Expect = 0.16
 Identities = 32/169 (18%), Positives = 48/169 (28%), Gaps = 21/169 (12%)

Query: 55  PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGS 114
           P  P  SA P    Q     + P+    P        PP    +P          P    
Sbjct: 368 PEVPPQSAAPAASAQ---ATAAPTAAVAPPQAPAVPPPPAS--AP-------QQAPA-VP 414

Query: 115 PVPSYPQGSYPSNPYQ-AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQA 173
              +  Q        Q AQ +  +  ++P A +    + + L+      P  S+ + + A
Sbjct: 415 LPETTSQLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPA 474

Query: 174 SLPY----PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATA 218
                     NP+   K     P A      H     T  L     A A
Sbjct: 475 KKEAYRWKATNPVEVKKEPVATPKALKKALEHE---KTPELAAKLAAEA 520



 Score = 33.7 bits (78), Expect = 0.24
 Identities = 33/147 (22%), Positives = 47/147 (31%), Gaps = 6/147 (4%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP-PANIEDATRSTHPT 236
           P  PL    P  P   A     +  TA PTA + P      P  P  A  +        T
Sbjct: 361 PAAPLPE--PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418

Query: 237 TATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELIN 296
           T+   +    +  A  +T +    P+     R    SAL  L         LE A     
Sbjct: 419 TSQLLAARQQLQRAQGATKAKKSEPAAASRARPV-NSALERLASVRPAPSALEKAPAKKE 477

Query: 297 AYARE--NAANERESDAAGNTVARQAP 321
           AY  +  N    ++   A     ++A 
Sbjct: 478 AYRWKATNPVEVKKEPVATPKALKKAL 504


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 42.5 bits (100), Expect = 5e-04
 Identities = 41/202 (20%), Positives = 53/202 (26%), Gaps = 22/202 (10%)

Query: 28  TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
           T   S   P PP       + P        PY       Y    +    P +P     P 
Sbjct: 87  TVDPSAGEPAPPPPHARRTSEPELPRPGRRPY-----EGYGGPRADDRPPGLPRQDQLPT 141

Query: 88  YPSAPPQESHSP-----------YYYSDIRANVPTG---GSPVPSYPQGSYPSNPYQAQN 133
              A P     P           Y +   R   P      SP    P+      PY A  
Sbjct: 142 ARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGR 201

Query: 134 S-YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ--QASQASLPYPINPLASNKPSAP 190
             Y         P      P + +      P  +    +           P+   +PSAP
Sbjct: 202 PEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAP 261

Query: 191 QPPAYSVQPSHNTAYPTAPLVP 212
            P A    P+     PTA L P
Sbjct: 262 GPLAAQPAPAPGPGEPTARLNP 283


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 42.1 bits (99), Expect = 6e-04
 Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 6/96 (6%)

Query: 178 PINPLASNKPSAPQPPAY---SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH 234
            I P+ +   +APQP A    S  PS ++A        S    A + P  +  D   +  
Sbjct: 378 HIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVS-VDPPAAVP 436

Query: 235 --PTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
             P +    +V PA     +  P   ++   P T R
Sbjct: 437 VNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLR 472



 Score = 37.4 bits (87), Expect = 0.017
 Identities = 23/119 (19%), Positives = 36/119 (30%), Gaps = 2/119 (1%)

Query: 26  KQTTSSSFSSP-PPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
           KQ     F+ P   P    ++  SP+   +  +   SA         + P+    P    
Sbjct: 376 KQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAV 435

Query: 85  APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
               PS  PQ      +  + +  V    S  PS  +        QA  +    P   Q
Sbjct: 436 PVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQE-KAEQATGNIKEAPTGTQ 493



 Score = 32.8 bits (75), Expect = 0.46
 Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 6/68 (8%)

Query: 206 PTAPLVPSTIATAPSAPPANIEDATRS------THPTTATAPSVPPAIHDATRSTPSPII 259
           P   + P     A +  P+    A+ S          +A   +  PA    T S   P  
Sbjct: 375 PKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA 434

Query: 260 APSVPPTN 267
            P  PP+ 
Sbjct: 435 VPVNPPST 442



 Score = 31.3 bits (71), Expect = 1.5
 Identities = 15/92 (16%), Positives = 27/92 (29%), Gaps = 7/92 (7%)

Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAY------SVQPSHNTAYPTAPLVPSTIATAP 219
           ++ Q S A+   P    +S       P +           S +      P+ P + A   
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA-VPVNPPSTAPQA 446

Query: 220 SAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
             P    E+        ++  PS    I +  
Sbjct: 447 VRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 41.5 bits (97), Expect = 9e-04
 Identities = 54/215 (25%), Positives = 79/215 (36%), Gaps = 30/215 (13%)

Query: 55  PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYY-YSDIRANVPTGG 113
           P     +  P P    P  P  PS P     P  P A      SPY  Y D++   PT  
Sbjct: 330 PKESDAADGPKPVPTKPVTPEAPSPPI-EEEPPQPKAVVPRPLSPYTAYEDLKP--PT-- 384

Query: 114 SPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQA 173
           SP+P+ P  S  S+      + P+ P+   +P +  ++P    A              +A
Sbjct: 385 SPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQ------------VEA 432

Query: 174 SLPYPINPLAS----NKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
               P++P A       P++P P A +      ++  + P VP T      AP     DA
Sbjct: 433 KKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDT------APATAATDA 486

Query: 230 TRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
                P     P  P A++D  +   SP  A  V 
Sbjct: 487 A--APPPANMRPLSPYAVYDDLKPPTSPSPAAPVG 519



 Score = 31.8 bits (72), Expect = 0.99
 Identities = 54/235 (22%), Positives = 75/235 (31%), Gaps = 31/235 (13%)

Query: 26  KQTTSSSFSSPPPPVYQTSSHTSP-NAYNN-----PPSPYHSANPPPYYQHPSVPSHPSV 79
           +  +      PP P        SP  AY +      P P   ++ P   +     + P+ 
Sbjct: 350 EAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAE 409

Query: 80  PTFPSAPIYPSAPPQESH-----------SPYY-YSDIRANVPTGGSPVPSYPQGSYPSN 127
           P    +P   S  P+              SPY  Y D++   PT  SP P+ P G  PS 
Sbjct: 410 PDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKP--PT--SPSPTAPTGVSPS- 464

Query: 128 PYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKP 187
                +S  SVP  P       +       P    PL+          P   +P A+   
Sbjct: 465 ----VSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSP-AAPVG 519

Query: 188 SAPQPPAYSVQPSHNTAYPTAPLVPSTIATA---PSAPPANIEDATRSTHPTTAT 239
                    V    N+A PTA       A     P +P    ED    T PT + 
Sbjct: 520 KVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTMYEDLKPPTSPTPSP 574



 Score = 28.7 bits (64), Expect = 8.4
 Identities = 24/87 (27%), Positives = 30/87 (34%), Gaps = 1/87 (1%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS-APPANIEDATRSTHPT 236
           P+  L +  PS   PP  S         PT P+ P   +      PP       R   P 
Sbjct: 315 PMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPY 374

Query: 237 TATAPSVPPAIHDATRSTPSPIIAPSV 263
           TA     PP     T  + SP  + SV
Sbjct: 375 TAYEDLKPPTSPIPTPPSSSPASSKSV 401


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 40.9 bits (95), Expect = 0.001
 Identities = 43/254 (16%), Positives = 70/254 (27%), Gaps = 10/254 (3%)

Query: 28  TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
           T+ +   + P     T+  TS       P+P            P+  +     T P    
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTE 559

Query: 88  YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
                   +        +  +  T G         S       + +S P   +    P  
Sbjct: 560 ESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLL 619

Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS------NKPSAPQPPAYSVQ--- 198
             + P   +      P   S        P P     S      N  ++  P    V    
Sbjct: 620 TSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGM 679

Query: 199 PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
           P+ N   P+AP    T     ++        T+ T  +T  A + P     A R+TP   
Sbjct: 680 PNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNA 739

Query: 259 IAPSVPPTNRDTIR 272
                PP+    +R
Sbjct: 740 TTYL-PPSTSSKLR 752



 Score = 30.1 bits (67), Expect = 3.7
 Identities = 35/186 (18%), Positives = 53/186 (28%), Gaps = 9/186 (4%)

Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
           +A   T       Y       +   A  + PS+P     P    + P     P G    T
Sbjct: 424 KAPDTTKSVIF-VYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPT-SGTPTGTTSST 481

Query: 166 SSQQASQASLPYPINPLA------SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP 219
             +  S  S      P A         P+A  P       + N   PT P+V     TA 
Sbjct: 482 LPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPT-PIVIGVTTTAT 540

Query: 220 SAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLY 279
           S P         ++   T  +P         T +      A +       +  ++  P  
Sbjct: 541 SPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGI 600

Query: 280 PSLDNY 285
           PS  + 
Sbjct: 601 PSSSHS 606


>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
           Provisional.
          Length = 389

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 379 GSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVTLHR 438
           G PA+  G+R GD IL+I+G +  G ++ +    L+    +++ + +R  P  R VTL R
Sbjct: 112 GGPAARAGIRPGDVILAIDGTSTEGLSLYEAADRLQGPEGSSVELTLRRGPETRLVTLTR 171

Query: 439 DS 440
           + 
Sbjct: 172 EK 173


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 40.4 bits (94), Expect = 0.002
 Identities = 57/245 (23%), Positives = 85/245 (34%), Gaps = 21/245 (8%)

Query: 39  PVYQTSSHTSPNAYNNP-PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESH 97
           P     S   P+++  P  S      PP     P  PS P +   P+ PI         H
Sbjct: 305 PPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364

Query: 98  -----SPYYYSDIRANVPTGGS--PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPS 150
                 P  +  + +N+P   +  P+ S P    PS         P    Q Q   + P+
Sbjct: 365 PPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMP----QSQPLQSVPA 420

Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPL 210
            P  L       P  S+   S      P +P A +  ++   PA    PS  T+ P AP 
Sbjct: 421 QPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAP- 479

Query: 211 VPSTIATAPSAPPANIEDATRSTHPTTATAPSV-----PPAIHDATRSTPSPIIAPSVPP 265
                A++ S PP +   ++          P +     P    +   S P P  +PS  P
Sbjct: 480 ---PRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEP 536

Query: 266 TNRDT 270
           T  +T
Sbjct: 537 TVVNT 541



 Score = 39.7 bits (92), Expect = 0.004
 Identities = 47/225 (20%), Positives = 63/225 (28%), Gaps = 13/225 (5%)

Query: 20  QNEYFSKQTTSSSFSSP-PPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPS 78
            NE  S  +       P  PP  Q     +      PP+P   A PP      + P+   
Sbjct: 159 DNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQP 218

Query: 79  VPTFPSAPIY-PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS 137
               P + I  PS  PQ   SP+    ++    +  SP P  P   +P + +        
Sbjct: 219 QQPSPLSLISAPSLHPQRLPSPH--PPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276

Query: 138 VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
              Q           N  Q            Q+    LP P      +     Q      
Sbjct: 277 HALQQGPVFLQHPSSNPPQP-------FGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQ 329

Query: 198 QPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPS 242
           QP      P AP +P      P   P          HP     PS
Sbjct: 330 QPPREQPLPPAPSMPHI--KPPPTTPIPQLPNQSHKHPPHLQGPS 372



 Score = 38.9 bits (90), Expect = 0.007
 Identities = 34/161 (21%), Positives = 51/161 (31%), Gaps = 5/161 (3%)

Query: 31  SSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSA--PIY 88
            S  + PP + Q+ S     + +     +      P+ QHP           P +     
Sbjct: 416 QSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTST 475

Query: 89  PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
           P+APP+ S           +      P P  P       P        S P  P++P+  
Sbjct: 476 PAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPE 535

Query: 149 PS---MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNK 186
           P+    P+       FY        S A       PLAS+K
Sbjct: 536 PTVVNTPSHASQSARFYKHLDRGYNSCARTDLYFTPLASSK 576



 Score = 32.7 bits (74), Expect = 0.64
 Identities = 43/205 (20%), Positives = 68/205 (33%), Gaps = 11/205 (5%)

Query: 132 QNSYPSVPN-QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAP 190
           ++S PS+P+ Q    ++  S   QL  P G   +     A+ A    P  P A   P  P
Sbjct: 148 RSSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVP--P 205

Query: 191 QPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDA 250
           Q    + QP    A       P ++ +APS  P  +        P TA+  S  P     
Sbjct: 206 QGSPIAAQP----APQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPA--P 259

Query: 251 TRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESD 310
           +   P        PP      +  +   +PS +      +A   +      + A      
Sbjct: 260 SSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHT 319

Query: 311 AAGNTVA--RQAPRDSCTMIAPLSS 333
               +    +Q PR+     AP   
Sbjct: 320 PPSQSALQPQQPPREQPLPPAPSMP 344


>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
           proteases 1 [Cell envelope biogenesis, outer membrane].
          Length = 375

 Score = 39.8 bits (93), Expect = 0.003
 Identities = 32/150 (21%), Positives = 59/150 (39%), Gaps = 7/150 (4%)

Query: 376 VERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVT 435
           V   S A+L GLR GD+I++++GE VA ++  +   +  A  V N+  ++  R       
Sbjct: 136 VAPKSAAALAGLRPGDRIVAVDGEKVASWDDVRRLLVAAAGDVFNLLTILVIRL--DGEA 193

Query: 436 LHRDSSGHFGFHFNKGQIVSLVKDSSAARNGLLVNH--NILEVNGANVVGLKD-KELLEA 492
               +            I     D   A +   +    N     G  +V     + +  A
Sbjct: 194 HAVAAEIIKSLGLTPVVIPLKPGDKIVAVDVGAIGLSPNGEPDVGKVLVKYGPLEAVGLA 253

Query: 493 MEKGGSVLTLTLIPTSMYQHMMKKMSFNLV 522
           +EK G ++ LTL    + + +   +S   +
Sbjct: 254 VEKTGRLVKLTL--KMLKKLITGDLSLKNL 281


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 34/188 (18%), Positives = 58/188 (30%), Gaps = 20/188 (10%)

Query: 23  YFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF 82
           YFS    +++F+   PP    S+  +       P  Y    PPP+ Q  +  S       
Sbjct: 33  YFSN---ANTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESE---- 85

Query: 83  PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQP 142
               +      ++S       +I+  +P            S P  P Q Q   P+  N  
Sbjct: 86  --NEVQIQQEVEQS-----VDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVA 138

Query: 143 QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQP-SH 201
                    PN          + SS +  +  L      + S+     +     ++P + 
Sbjct: 139 SMTIEETQSPNV-----PIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAE 193

Query: 202 NTAYPTAP 209
             A P A 
Sbjct: 194 TEAQPEAT 201


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 39.4 bits (92), Expect = 0.004
 Identities = 26/155 (16%), Positives = 45/155 (29%), Gaps = 6/155 (3%)

Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
                 +   G+       A          P+A     +   + +AP       ++    
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKA--AAAAAATRAEAPPAAPAPPATADRG 439

Query: 172 QASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATR 231
             +        A     A    +   +     A P A    ++   + + P A  E A R
Sbjct: 440 DDAADGD----APVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPR 495

Query: 232 STHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
           +  P+ AT  +VP A   A  S      A + P  
Sbjct: 496 AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAP 530



 Score = 37.5 bits (87), Expect = 0.019
 Identities = 26/177 (14%), Positives = 43/177 (24%), Gaps = 3/177 (1%)

Query: 64  PPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGS 123
            P       V      P    A     A    + +    +   A  P   +   +    +
Sbjct: 367 APGGGVPARVAGAVPAPG-ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEA 425

Query: 124 YPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLA 183
            P+ P     +                   +  A +      +   A   S   P +   
Sbjct: 426 PPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAP 485

Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA 240
            +    P P A +   +  T            A+   AP A    A  +  PT A A
Sbjct: 486 PDAAFEPAPRAAAPSAA--TPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA 540



 Score = 31.4 bits (71), Expect = 1.3
 Identities = 24/156 (15%), Positives = 38/156 (24%), Gaps = 10/156 (6%)

Query: 89  PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
            +     + +P      +A      +   + P    P       +          A    
Sbjct: 402 VTGAAGAALAP------KAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANA 455

Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
            +  +         P   S  AS  +   P +      P A  P A +     +   P A
Sbjct: 456 RASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAA 515

Query: 209 PLVPSTIAT----APSAPPANIEDATRSTHPTTATA 240
                  A     AP A P     A  +     A A
Sbjct: 516 ASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551



 Score = 31.0 bits (70), Expect = 1.9
 Identities = 41/180 (22%), Positives = 55/180 (30%), Gaps = 12/180 (6%)

Query: 144 APNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT 203
           AP+ Y      L     F P  +   A    +P      A   P A    A         
Sbjct: 341 APDEYAGFTMTLLRMLAFEPAVTGGGAPGGGVP-ARVAGAVPAPGARAAAAVGAS----- 394

Query: 204 AYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSV 263
           A P    V      A +   A    ATR+  P  A AP       D      +P+ A + 
Sbjct: 395 AVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKAN 454

Query: 264 PPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTVARQAPRD 323
              + D+ R       P  D+      AS+     A  +AA E    AA  + A  A   
Sbjct: 455 ARASADS-RCDERDAQPPADSGSASAPASD-----APPDAAFEPAPRAAAPSAATPAAVP 508


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 38.9 bits (91), Expect = 0.005
 Identities = 20/87 (22%), Positives = 30/87 (34%), Gaps = 3/87 (3%)

Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
           PL +S     A+ P P  P+ +   ++  P   +           A + PS      +A 
Sbjct: 163 PLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAAT 222

Query: 223 PANIEDA---TRSTHPTTATAPSVPPA 246
           PA    A     +  PT     S P A
Sbjct: 223 PAPAAPATPDGAAPLPTDQAGVSTPAA 249



 Score = 35.0 bits (81), Expect = 0.082
 Identities = 25/106 (23%), Positives = 34/106 (32%), Gaps = 12/106 (11%)

Query: 126 SNPYQAQNSYPSVP----------NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
           S+   +QNS  SVP            P       + P   Q P        +    Q ++
Sbjct: 150 SSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAV 209

Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSA 221
             P    A+   +A   PA    P      PT     ST A  P+A
Sbjct: 210 VAPSQ--ANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNA 253



 Score = 33.1 bits (76), Expect = 0.31
 Identities = 20/113 (17%), Positives = 34/113 (30%), Gaps = 10/113 (8%)

Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
           S++ +  +    P++   +  P+    PA            T P    T A A +  PA 
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPA--------APVDTTPTNSQTPAVATAPAPAV 202

Query: 226 IEDATRSTHP--TTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALV 276
                    P        + P     AT    +P+       +      +ALV
Sbjct: 203 DPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNALV 255


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 38.8 bits (90), Expect = 0.005
 Identities = 40/195 (20%), Positives = 55/195 (28%), Gaps = 26/195 (13%)

Query: 27  QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
           Q+     +    P Y   S   P     P +      P P  Q P    + S P  P   
Sbjct: 108 QSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQ----QPQPPPQVPQQQQYQSPPQQPQYQ 163

Query: 87  IYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPN 146
             P    Q              V         Y   SYP N     +     P     P+
Sbjct: 164 QNPPPQAQS----------APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPS 213

Query: 147 TYPSMPNQLQA---------PNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
                P Q            PN  +P       SQ    Y  +    +K +     +Y+ 
Sbjct: 214 QQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYAP 273

Query: 198 Q---PSHNTAYPTAP 209
           Q    S++TAYP+ P
Sbjct: 274 QGSSQSYSTAYPSLP 288



 Score = 37.2 bits (86), Expect = 0.015
 Identities = 34/163 (20%), Positives = 46/163 (28%), Gaps = 7/163 (4%)

Query: 35  SPPPPVYQTSSHTSPNA----YNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPS 90
             PPP                Y   P P   + P     +P    +      P+ P+  S
Sbjct: 141 PQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSS 200

Query: 91  APPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPS 150
              Q  +S    S      P     +   P G   S     Q   PS   +    +  P 
Sbjct: 201 MAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPP 260

Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
                 +   + P  SSQ  S A   YP  P A+  P A    
Sbjct: 261 SKGNHGSVASYAPQGSSQSYSTA---YPSLPAATVLPQALPMS 300



 Score = 36.1 bits (83), Expect = 0.034
 Identities = 45/191 (23%), Positives = 53/191 (27%), Gaps = 11/191 (5%)

Query: 12  TFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPP--YYQ 69
           +FH  +  Q     +        +P   V              P     S  PP   Y  
Sbjct: 41  SFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPS 100

Query: 70  HPS---VPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPS 126
                 V S P  PT    P YP  PP +   P                       S P 
Sbjct: 101 QLPPQQVQSVPQQPTPQQEPYYP--PPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ 158

Query: 127 NPYQAQNSYPSVPNQPQAPNTYPS-MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
            P   QN  P   + PQ    YP   P Q Q+     PL SS        PY   P +  
Sbjct: 159 QPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP---PYSGAPPSQQ 215

Query: 186 KPSAPQPPAYS 196
               PQP  Y 
Sbjct: 216 FYGPPQPSPYM 226



 Score = 34.5 bits (79), Expect = 0.12
 Identities = 34/151 (22%), Positives = 44/151 (29%), Gaps = 17/151 (11%)

Query: 27  QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
           Q         PP   Q   +  P A + P          PY      P+ P   +    P
Sbjct: 146 QVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP 205

Query: 87  IYPSAPPQE------SHSPYYYSDI----RANVPTGGSPVPSYPQGSYPSNPYQAQN--- 133
            Y  APP +        SPY Y        +  P+G  P PS  Q  Y  +         
Sbjct: 206 PYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNH 265

Query: 134 ----SYPSVPNQPQAPNTYPSMPNQLQAPNG 160
               SY    +       YPS+P     P  
Sbjct: 266 GSVASYAPQGSSQSYSTAYPSLPAATVLPQA 296



 Score = 29.9 bits (67), Expect = 3.8
 Identities = 35/170 (20%), Positives = 48/170 (28%), Gaps = 5/170 (2%)

Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
           P    P       + PS+ Y +Q     V + PQ P   P        P+   P  + Q 
Sbjct: 79  PQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPT--PQQEPYYPPPSQPQPPPAQQP 136

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPS--HNTAYPTAPLVPSTIATAPSAPPANIE 227
            +Q   P P  P      S PQ P Y   P     +A   + L P      P + P N E
Sbjct: 137 QAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPN-E 195

Query: 228 DATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVP 277
               S       + + P             +        N         P
Sbjct: 196 PLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245


>gnl|CDD|238490 cd00990, PDZ_glycyl_aminopeptidase, PDZ domain associated with
           archaeal and bacterial M61 glycyl-aminopeptidases. May
           be responsible for substrate recognition and/or binding,
           as most PDZ domains bind C-terminal polypeptides, and
           binding to internal (non-C-terminal) polypeptides and
           even to lipids has been demonstrated. In this subfamily
           of protease-associated PDZ domains a C-terminal
           beta-strand is presumed to form the peptide-binding
           groove base, a circular permutation with respect to PDZ
           domains found in Eumetazoan signaling proteins.
          Length = 80

 Score = 35.5 bits (82), Expect = 0.006
 Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 6/78 (7%)

Query: 361 GLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN 420
           GL +        V  V   SPA   GL  GD+++++NG  V         ++ +    + 
Sbjct: 4   GLTLDKEEGLGKVTFVRDDSPADKAGLVAGDELVAVNGWRVDALQD----RLKEYQAGDP 59

Query: 421 IRVVI--RDRPLERTVTL 436
           + + +   DR +E  +TL
Sbjct: 60  VELTVFRDDRLIEVPLTL 77


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 39.0 bits (91), Expect = 0.007
 Identities = 39/237 (16%), Positives = 64/237 (27%), Gaps = 24/237 (10%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
           ++        +          PP P   A        P+       P  P+    P+ P 
Sbjct: 51  AAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG 110

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
             S  P          P   SP PS      P      +      P    +P    + P 
Sbjct: 111 PSSPDP------PPPTPPPASPPPSPA----PDLSEMLRPVGSPGPPPAASPPAAGASPA 160

Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
            + +         +  + QA+LP       +  PS+P        P    +         
Sbjct: 161 AVASD--------AASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212

Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHD------ATRSTPSPIIAPSVP 264
             A+A S  PA    A      +++ + S   +              P+PI  P+  
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRI 269



 Score = 36.3 bits (84), Expect = 0.053
 Identities = 39/244 (15%), Positives = 62/244 (25%), Gaps = 14/244 (5%)

Query: 28  TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
               S   PPPP         P +    P+P  S    P       P+        S   
Sbjct: 108 PPGPSSPDPPPPT------PPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161

Query: 88  YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
             S       +    S          SP    P  S P      +    S P    A + 
Sbjct: 162 VASDAASSRQAALPLSSPEETARAPSSPPAEPP-PSTPPAAASPRPPRRSSPISASASSP 220

Query: 148 YPSMPNQLQAPNGF--YPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
            P+         G      +SS+ +     P    PL    P       +     +  + 
Sbjct: 221 APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS- 279

Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP----SVPPAIHDATRSTPSPIIAP 261
                  S+ +    +P  +         P++  A     S   +   +T S+       
Sbjct: 280 SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGA 339

Query: 262 SVPP 265
           +V P
Sbjct: 340 AVSP 343



 Score = 35.9 bits (83), Expect = 0.056
 Identities = 37/203 (18%), Positives = 51/203 (25%), Gaps = 10/203 (4%)

Query: 81  TFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPN 140
             P A    +A    S S        A        V +         P       P    
Sbjct: 22  PRPPATPGDAADDLLSGSQGQLVSDSA--ELAAVTVVAGAAACDRFEPPTGPPPGPGTEA 79

Query: 141 QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPS 200
                 + P+      AP       S      +S   P  P      S P  PA  +   
Sbjct: 80  PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--PPTPPPASPPPSPAPDLSEM 137

Query: 201 HNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP------TTATAPSVPPAIHDATRST 254
                   P   ++   A ++P A   DA  S            TA +      +   ST
Sbjct: 138 LRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197

Query: 255 PSPIIAPSVPPTNRDTIRSALVP 277
           P    +P  P  +     SA  P
Sbjct: 198 PPAAASPRPPRRSSPISASASSP 220



 Score = 35.1 bits (81), Expect = 0.098
 Identities = 44/250 (17%), Positives = 66/250 (26%), Gaps = 18/250 (7%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
           S  PP      S ++ +    P     +A+        S  S  S   +      P   P
Sbjct: 203 SPRPPRRSSPISASASSP--APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP 260

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
                P   + I       G      P  S  S   ++ +  PS P    AP++  +  +
Sbjct: 261 APITLP---TRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSS 317

Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
              +       TSS   S         P  S  PS  +PP  +   S       +    S
Sbjct: 318 SSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377

Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRS 273
             A+A                       +V         +   P   P   P +      
Sbjct: 378 PAASAGRPTRR-------------RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASG 424

Query: 274 ALVPLYPSLD 283
           A    YP L 
Sbjct: 425 AFYARYPLLT 434



 Score = 29.8 bits (67), Expect = 5.4
 Identities = 29/166 (17%), Positives = 48/166 (28%), Gaps = 8/166 (4%)

Query: 29  TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF-PSAPI 87
            +S ++ P       SS +SP   +  PSP    + P      +  S  S      S+  
Sbjct: 271 EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTS 330

Query: 88  YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
             S   + +      S  R+  P+   P         PS+P +      +  +   +   
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPAD------PSSPRKRPRPSRAPSSPAASAGR 384

Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
            P+      A  G      +     A  P P    A     A    
Sbjct: 385 -PTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYAR 429


>gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional.
          Length = 455

 Score = 37.6 bits (87), Expect = 0.014
 Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 24/160 (15%)

Query: 367 INEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRV-VI 425
           +  G FV  V   S ++  G++ GD I S+NG+ +  +  +   +I    P   +++ ++
Sbjct: 288 VQRGAFVSEVLPNSGSAKAGVKAGDIITSLNGKPLNSF-AELRSRIATTEPGTKVKLGLL 346

Query: 426 RD-RPLERTVTLHRDSSGHF----------GFHFNKGQ---------IVSLVKDSSAARN 465
           R+ +PLE  VTL   +S             G   + GQ         I  +VK S AA+ 
Sbjct: 347 RNGKPLEVEVTLDTSTSSSASAEMITPALQGATLSDGQLKDGTKGIKIDEVVKGSPAAQA 406

Query: 466 GLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTLI 505
           GL  +  I+ VN   V  +   E+ + +    +++ L ++
Sbjct: 407 GLQKDDVIIGVNRDRVNSI--AEMRKVLAAKPAIIALQIV 444


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 37.9 bits (88), Expect = 0.014
 Identities = 36/156 (23%), Positives = 48/156 (30%), Gaps = 26/156 (16%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
           P+      KP+A  P    V+P+       AP      A A + PP             T
Sbjct: 362 PVPAPQPAKPTAAAPSP--VRPT------PAPSTRPKAAAAANIPPK-------EPVRET 406

Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINA 297
           AT P VPP       + P P    S P   R  I     P Y             +    
Sbjct: 407 ATPPPVPPR----PVAPPVPHTPESAPKLTRAAIPVDEKPKYTP-------PAPPKEEEK 455

Query: 298 YARENAANERESDAAGNTVARQAPRDSCTMIAPLSS 333
               +     + +A    + R  P  S  + A LSS
Sbjct: 456 ALIADGDVLEQLEAIWKQILRDVPPRSPAVQALLSS 491



 Score = 37.5 bits (87), Expect = 0.017
 Identities = 20/96 (20%), Positives = 22/96 (22%), Gaps = 6/96 (6%)

Query: 136 PSVPNQPQAPNTYPSMPN--QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
              P +P A    P  P       P          +        P        P AP  P
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR--PVAPPVP 422

Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
                 S       A  V       P APP   E A
Sbjct: 423 H--TPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKA 456



 Score = 34.0 bits (78), Expect = 0.19
 Identities = 18/90 (20%), Positives = 25/90 (27%)

Query: 31  SSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPS 90
           +     P P     +  +P+     P+P            P         T P  P  P 
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPV 417

Query: 91  APPQESHSPYYYSDIRANVPTGGSPVPSYP 120
           APP            RA +P    P  + P
Sbjct: 418 APPVPHTPESAPKLTRAAIPVDEKPKYTPP 447



 Score = 30.2 bits (68), Expect = 3.5
 Identities = 20/84 (23%), Positives = 26/84 (30%), Gaps = 9/84 (10%)

Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
              S  + + A    P    A+N P    P   +  P      P AP VP         P
Sbjct: 374 AAPSPVRPTPAPSTRPKAAAAANIPPKE-PVRETATPPPVPPRPVAPPVP-------HTP 425

Query: 223 PANIEDATRSTHPTTATAPSVPPA 246
            +     TR+  P        PPA
Sbjct: 426 ES-APKLTRAAIPVDEKPKYTPPA 448


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 37.5 bits (87), Expect = 0.015
 Identities = 39/193 (20%), Positives = 57/193 (29%), Gaps = 22/193 (11%)

Query: 84  SAPIYPSAPPQESHSPYYYSDI------RANVPTG--GSPVPSYPQGSYPSNPYQAQNSY 135
           + P  P   PQ    P +  D+      RA+V  G   S     P G   + P  + +  
Sbjct: 159 NKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRTPPPGSHSKS 218

Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
           PS    P  P++                 +   + S A  P       S + + P     
Sbjct: 219 PSKSGIPDLPSS-RDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTE 277

Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVP-PAIHDATRST 254
               S     P A    S  A+A    P+          P+       P  +   + R  
Sbjct: 278 EELKS-----PEASPKESEEASARKRSPSL-------LSPSPKAESPKPLASPGKSPRDP 325

Query: 255 PSPIIAPSVPPTN 267
            SP   P  PP N
Sbjct: 326 LSPRPKPQSPPVN 338


>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
           [General function prediction only].
          Length = 558

 Score = 37.8 bits (88), Expect = 0.015
 Identities = 24/90 (26%), Positives = 34/90 (37%), Gaps = 17/90 (18%)

Query: 361 GLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMK-----QVH----- 410
           GL+V +      +  V  G PA   GL  GD+I++ING +      K     QVH     
Sbjct: 454 GLKVKSEGGHEKITFVFPGGPAYKAGLSPGDKIVAINGISDQLDRYKVNDKIQVHVFREG 513

Query: 411 -------KILKAAPVNNIRVVIRDRPLERT 433
                  K+        I + I DR   + 
Sbjct: 514 RLREFLVKLGGDPTAQYIILPIGDRNPAQL 543


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 37.3 bits (87), Expect = 0.015
 Identities = 23/115 (20%), Positives = 32/115 (27%), Gaps = 4/115 (3%)

Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
                     PQ  Y   PY +      V   P+A       P   Q         + Q 
Sbjct: 78  AQEHEAARPSPQHQYQP-PYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQP 136

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
             +  L  P++P       APQP   + QP+     P  P+            P 
Sbjct: 137 QPEQPLQQPVSPQV---APAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPV 188



 Score = 35.8 bits (83), Expect = 0.053
 Identities = 27/136 (19%), Positives = 39/136 (28%), Gaps = 11/136 (8%)

Query: 60  HSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSY 119
           H  N  P        + PS P     P Y SA P++              P    P    
Sbjct: 69  HRVNHAPANAQEHEAARPS-PQHQYQPPYASAQPRQPVQQ---------PPEAQVPPQHA 118

Query: 120 PQGSYP-SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYP 178
           P+ + P   P Q     P      Q P +    P      +   P   + Q ++      
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQ 178

Query: 179 INPLASNKPSAPQPPA 194
             P+A   P   +P  
Sbjct: 179 PEPVAEPAPVMDKPKR 194



 Score = 33.5 bits (77), Expect = 0.28
 Identities = 22/112 (19%), Positives = 33/112 (29%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
            +  P   Q      P+  +    PY SA P    Q P     P       A   P    
Sbjct: 71  VNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQ 130

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP 145
           Q ++ P     ++  V    +P P     +        Q + P    QP+  
Sbjct: 131 QPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPV 182


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 37.7 bits (87), Expect = 0.019
 Identities = 40/178 (22%), Positives = 62/178 (34%), Gaps = 49/178 (27%)

Query: 23  YFSKQTTSSSFSSPPPPVYQTSSHTSP----NAYNNPP---SPYHSANPPPYYQHPSVPS 75
             +   +++ +S+P         + +       YNNPP   +PY  +NPP    +P   +
Sbjct: 404 QSNAGFSNAGYSNPG---NSNPGYNNAPNSNTPYNNPPNSNTPY--SNPP--NSNPPYSN 456

Query: 76  HPSVPTFPSAPIYPSAPP---QESHSPYY--YSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
            P   T  S     +APP   ++ HS Y+  Y    AN P    P               
Sbjct: 457 LPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLP--------------- 501

Query: 131 AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPS 188
                    NQP A N + +  N +  P    P  S+        PY  N   +  P+
Sbjct: 502 -------TANQPAANNFHGAAGNSVGNPFASRPFGSA--------PYGGNAATTADPN 544



 Score = 28.9 bits (64), Expect = 8.4
 Identities = 36/148 (24%), Positives = 53/148 (35%), Gaps = 23/148 (15%)

Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
            G P   +   SY    Y       +  +     N   S P      N      ++  ++
Sbjct: 380 AGGP---HSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNP-----GNSNPGYNNAPNSN 431

Query: 172 QASLPYPINPLASNKPSAPQP---PAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED 228
               PY  NP  SN P +  P   P YS  P  NT Y  APL         +APP++ +D
Sbjct: 432 T---PYN-NPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPL--------SNAPPSSAKD 479

Query: 229 ATRSTHPTTATAPSVPPAIHDATRSTPS 256
              + H       +  PA +  T + P+
Sbjct: 480 HHSAYHAAYQHRAANQPAANLPTANQPA 507


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
           periplasmic, contain C-terminal PDZ domain
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 347

 Score = 36.8 bits (85), Expect = 0.026
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRD 427
             G  V  V  GSPA+  G++ GD I ++NG+ VA      V  +    P + + + +  
Sbjct: 269 AAGAVVLGVLPGSPAAKAGIKAGDIITAVNGKPVA-SLSDLVAAVASNRPGDEVALKLLR 327

Query: 428 RPLERTVTL 436
              ER + +
Sbjct: 328 GGKERELAV 336



 Score = 30.6 bits (69), Expect = 2.2
 Identities = 26/102 (25%), Positives = 38/102 (37%), Gaps = 13/102 (12%)

Query: 415 AAPVNNIRVVIRDRPLERTVTLHRDSSG----------HFGFHFNKGQIV-SLVKDSSAA 463
           A PVN +  V+ +  L     + R   G            G     G +V  ++  S AA
Sbjct: 227 AIPVNLVAPVLDE--LISKGKVVRGYLGVIGEPLTADIALGLPVAAGAVVLGVLPGSPAA 284

Query: 464 RNGLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTLI 505
           + G+     I  VNG  V  L D     A  + G  + L L+
Sbjct: 285 KAGIKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVALKLL 326


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 37.2 bits (87), Expect = 0.028
 Identities = 16/83 (19%), Positives = 24/83 (28%)

Query: 182 LASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP 241
            A   P +   P  +   +   A   A    +    AP+  P           P  A A 
Sbjct: 33  FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92

Query: 242 SVPPAIHDATRSTPSPIIAPSVP 264
           +  PA   A  +  +P  A    
Sbjct: 93  AAAPAAPPAAAAAAAPAAAAVED 115



 Score = 33.7 bits (78), Expect = 0.28
 Identities = 20/74 (27%), Positives = 26/74 (35%), Gaps = 3/74 (4%)

Query: 196 SVQPS-HN--TAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATR 252
           SV PS       Y        T A A +A  A+   A  +     A AP+ P A   A  
Sbjct: 24  SVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAP 83

Query: 253 STPSPIIAPSVPPT 266
             P+   A +  P 
Sbjct: 84  PKPAAAAAAAAAPA 97



 Score = 33.3 bits (77), Expect = 0.36
 Identities = 20/87 (22%), Positives = 28/87 (32%), Gaps = 5/87 (5%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
           P +  A    +A    A S   +   A   A   P+  A A  A P        +     
Sbjct: 38  PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-----KPAAAAAA 92

Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVP 264
           A AP+ PPA   A     + +     P
Sbjct: 93  AAAPAAPPAAAAAAAPAAAAVEDEVTP 119



 Score = 32.9 bits (76), Expect = 0.47
 Identities = 17/86 (19%), Positives = 23/86 (26%)

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
           A            A+   +A      +   +   A P      +    AP  P A    A
Sbjct: 34  ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93

Query: 230 TRSTHPTTATAPSVPPAIHDATRSTP 255
                P  A A + P A       TP
Sbjct: 94  AAPAAPPAAAAAAAPAAAAVEDEVTP 119



 Score = 32.2 bits (74), Expect = 0.97
 Identities = 16/95 (16%), Positives = 27/95 (28%), Gaps = 8/95 (8%)

Query: 180 NPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTAT 239
            P ++  P+A    A +   +   A           A   +A PA          P  A 
Sbjct: 37  GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA--------APPKPAA 88

Query: 240 APSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
           A +   A      +  +   A +        +R A
Sbjct: 89  AAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGA 123



 Score = 29.5 bits (67), Expect = 6.6
 Identities = 22/83 (26%), Positives = 29/83 (34%), Gaps = 3/83 (3%)

Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
              +   A+ A+      P A+    AP  PA         A P AP  P+  A A +AP
Sbjct: 40  STAAPTAAAAAAAAAASAPAAAPAAKAPAAPA---PAPPAAAAPAAPPKPAAAAAAAAAP 96

Query: 223 PANIEDATRSTHPTTATAPSVPP 245
            A    A  +     A    V P
Sbjct: 97  AAPPAAAAAAAPAAAAVEDEVTP 119


>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
          Length = 473

 Score = 36.3 bits (84), Expect = 0.041
 Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 369 EGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAP 417
           +GV V  V+ G+PA+ +GL+ GD I+  N + V   N+ ++ KIL + P
Sbjct: 408 KGVVVDNVKPGTPAAQIGLKKGDVIIGANQQPVK--NIAELRKILDSKP 454



 Score = 32.4 bits (74), Expect = 0.65
 Identities = 42/159 (26%), Positives = 68/159 (42%), Gaps = 33/159 (20%)

Query: 370 GVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRV------ 423
           G FV  V   S A+  G++ GD I S+NG+ ++ +   +       A V  + V      
Sbjct: 312 GAFVSQVLPNSSAAKAGIKAGDVITSLNGKPISSFAALR-------AQVGTMPVGSKLTL 364

Query: 424 -VIRD-RPLERTVTLHRDS----------SGHFGFHFN-----KGQIVSLVK-DSSAARN 465
            ++RD +P+   V L + S          +G  G   +     KG +V  VK  + AA+ 
Sbjct: 365 GLLRDGKPVNVNVELQQSSQNQVDSSNIFNGIEGAELSNKGGDKGVVVDNVKPGTPAAQI 424

Query: 466 GLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTL 504
           GL     I+  N   V  +   EL + ++   SVL L +
Sbjct: 425 GLKKGDVIIGANQQPVKNI--AELRKILDSKPSVLALNI 461


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 35.9 bits (83), Expect = 0.047
 Identities = 32/203 (15%), Positives = 47/203 (23%), Gaps = 25/203 (12%)

Query: 35  SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
           S  P    + S   P+          S    P     S   +    +      +P+  P 
Sbjct: 260 SAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQS---YVPQSSDSRPSCFPA--PS 314

Query: 95  ESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQ 154
            +        +  N           PQ   P     A  S        +AP   PS   Q
Sbjct: 315 TTQPT----FLPPNTNKKAKRD-RRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQ 369

Query: 155 LQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPST 214
               N     + +   S A+              A   P   VQ     A    P  P +
Sbjct: 370 SPPSNSPLLSSLADTPSGAAH-----------QPASLLPPAVVQQQLEDASDKQPPTPGS 418

Query: 215 ----IATAPSAPPANIEDATRST 233
                       P+ +    R  
Sbjct: 419 SLVPQPDEQELGPSVMALLDRDQ 441


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 36.2 bits (83), Expect = 0.047
 Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 8/101 (7%)

Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH 234
           +PY     A N  S    P  S   S+ T    +   PST+    ++PPA    +  +T 
Sbjct: 555 VPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDS---PSTVVAPSTSPPAGHLGSPPATP 611

Query: 235 PTTATAPSVPPAIH-----DATRSTPSPIIAPSVPPTNRDT 270
               +  + PPA H         S  S I   S    + ++
Sbjct: 612 SKIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPES 652


>gnl|CDD|238491 cd00991, PDZ_archaeal_metalloprotease, PDZ domain of archaeal zinc
           metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 32.7 bits (75), Expect = 0.059
 Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 4/57 (7%)

Query: 452 QIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAMEKG--GSVLTLTLIP 506
            IV ++  S A    L     I  +NG  +  L+D   +EA++    G V+T+T++P
Sbjct: 13  VIVGVIVGSPAENAVLHTGDVIYSINGTPITTLED--FMEALKPTKPGEVITVTVLP 67



 Score = 30.8 bits (70), Expect = 0.28
 Identities = 15/35 (42%), Positives = 17/35 (48%)

Query: 370 GVFVCLVERGSPASLVGLRFGDQILSINGETVAGY 404
           GV +  V  GSPA    L  GD I SING  +   
Sbjct: 11  GVVIVGVIVGSPAENAVLHTGDVIYSINGTPITTL 45


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 35.1 bits (81), Expect = 0.083
 Identities = 25/148 (16%), Positives = 40/148 (27%), Gaps = 21/148 (14%)

Query: 62  ANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQ 121
           +   P          P  P   +    P+   Q+ ++P          P     +P+ P 
Sbjct: 52  SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPA---ANPPATFALPAGPA 108

Query: 122 GSY----PSNPYQAQNSYPSVPNQPQAPNTYPS-----MPNQLQAPNGFYPLTSSQQASQ 172
           G      P   Y  Q   P +  Q  A +            QLQ     Y   +S Q   
Sbjct: 109 GPTIQTEPGQLYPVQ--VPVMVTQNPANSPLDQPAQQRALQQLQQR---YGAPASGQLPS 163

Query: 173 ASLPYPINPLASNKPSAPQPPAYSVQPS 200
                  +   +++    Q P     P 
Sbjct: 164 Q----QQSAQKNDESQLQQQPNGETPPQ 187



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 27/134 (20%), Positives = 38/134 (28%), Gaps = 7/134 (5%)

Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ- 198
             PQAP     +P  L  P     L +     Q     P    A+N P+    PA     
Sbjct: 51  PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGP 110

Query: 199 -----PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSVPPAIHDATR 252
                P          +V    A +P   PA      +      A A   +P     A +
Sbjct: 111 TIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQK 170

Query: 253 STPSPIIAPSVPPT 266
           +  S +       T
Sbjct: 171 NDESQLQQQPNGET 184



 Score = 33.6 bits (77), Expect = 0.23
 Identities = 30/150 (20%), Positives = 41/150 (27%), Gaps = 12/150 (8%)

Query: 83  PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP--------YQAQNS 134
           P AP  P A   +       +     +P G     + P GS  +NP          A  +
Sbjct: 53  PQAP-PPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111

Query: 135 YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
             + P Q   P   P M  Q  A +        +   Q    Y     AS +  + Q  A
Sbjct: 112 IQTEPGQL-YPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY--GAPASGQLPSQQQSA 168

Query: 195 YSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
                S     P     P     A      
Sbjct: 169 QKNDESQLQQQPNGETPPQQTDGAGDDESE 198



 Score = 32.8 bits (75), Expect = 0.38
 Identities = 26/144 (18%), Positives = 39/144 (27%), Gaps = 6/144 (4%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
           S   PP         P            A     +  P+     + P   + P  P+ P 
Sbjct: 52  SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111

Query: 94  QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY-----PSVPNQPQAPNTY 148
            ++     Y      + T         Q +      Q Q  Y       +P+Q Q+    
Sbjct: 112 IQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKN 171

Query: 149 PSMPNQLQAPNGFYPLTSSQQASQ 172
                Q Q PNG  P   +  A  
Sbjct: 172 DESQLQQQ-PNGETPPQQTDGAGD 194



 Score = 30.9 bits (70), Expect = 1.8
 Identities = 14/73 (19%), Positives = 18/73 (24%), Gaps = 2/73 (2%)

Query: 153 NQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVP 212
           +Q       +    S QA       P          A Q      Q  HNT   +    P
Sbjct: 40  SQSGVAEFPW--DPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANP 97

Query: 213 STIATAPSAPPAN 225
                 P+ P   
Sbjct: 98  PATFALPAGPAGP 110



 Score = 30.1 bits (68), Expect = 2.7
 Identities = 20/148 (13%), Positives = 35/148 (23%), Gaps = 16/148 (10%)

Query: 27  QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
            T + S ++ PP  +   +  +       P                    P + T   A 
Sbjct: 87  NTPTGSPAANPPATFALPAGPAGPTIQTEPG------------QLYPVQVPVMVTQNPAN 134

Query: 87  IYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPN 146
                P Q+         ++       S      Q S   N        P+    PQ  +
Sbjct: 135 SPLDQPAQQRALQ----QLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTD 190

Query: 147 TYPSMPNQLQAPNGFYPLTSSQQASQAS 174
                 ++          T  Q+   A 
Sbjct: 191 GAGDDESEALVRLREADGTLEQRIKGAE 218


>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B.  SpoIVB, the
           stage IV sporulation protein B of endospore-forming
           bacteria such as Bacillus subtilis, is a serine
           proteinase, expressed in the spore (rather than mother
           cell) compartment, that participates in a proteolytic
           activation cascade for Sigma-K. It appears to be
           universal among endospore-forming bacteria and occurs
           nowhere else [Cellular processes, Sporulation and
           germination].
          Length = 402

 Score = 35.0 bits (81), Expect = 0.10
 Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 380 SPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR--DRPLERTVTLH 437
           SP    G++ GD+IL INGE +   NM  +  ++  A    + + I    + +E  +   
Sbjct: 124 SPGEEAGIQIGDRILKINGEKIK--NMDDLANLINKAGGEKLTLTIERGGKIIETVIKPV 181

Query: 438 RDSS 441
           +D  
Sbjct: 182 KDKE 185


>gnl|CDD|163521 TIGR03809, TIGR03809, TIGR03809 family protein.  This protein
           family contains proteins with a median length of about
           175, including a strongly conserved N-terminal region of
           about 55 amino acids, a conserved extreme C-terminal
           region of about 15 amino acids, and highly variable
           sequence in between the two. Members are found
           invariably with a member of family TIGR03808.
          Length = 168

 Score = 33.3 bits (76), Expect = 0.16
 Identities = 21/90 (23%), Positives = 34/90 (37%), Gaps = 7/90 (7%)

Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP 235
           P  ++ L  NK   P+ P Y+            P   +  A  P++ PA +         
Sbjct: 65  PVDLSWLGRNKKLPPRAPFYAADE------TVEPRRFAPPAALPASAPAIVA-LPEEAAA 117

Query: 236 TTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
               AP VP   +DA+ +  + +  P V P
Sbjct: 118 EPRAAPDVPAEPNDASDADVAEVAEPDVVP 147


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 33.7 bits (77), Expect = 0.20
 Identities = 27/152 (17%), Positives = 34/152 (22%), Gaps = 10/152 (6%)

Query: 53  NNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTG 112
                PY             V      P F          PQ  + P Y S         
Sbjct: 46  KRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASA-------- 97

Query: 113 GSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQ 172
                  PQ    S+P           +Q Q P      P   Q      P+ S+     
Sbjct: 98  -QIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEE-ARQPVLPQEAPAPQPVHSAAPQPA 155

Query: 173 ASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
                P  P    +P     PA  V+      
Sbjct: 156 VQTVQPAVPEQQVQPEEVVEPAPEVKRPPRKD 187


>gnl|CDD|232801 TIGR00054, TIGR00054, RIP metalloprotease RseP.  Members of this
           nearly universal bacterial protein family are regulated
           intramembrane proteolysis (RIP) proteases. Older and
           synonymous gene symbols include yaeL in E. coli, mmpA in
           Caulobacter crescentus, etc. This family includes a
           region that hits the PDZ domain, found in a number of
           proteins targeted to the membrane by binding to a
           peptide ligand. The N-terminal region of this family
           contains a perfectly conserved motif HEXGH as found in a
           number of metalloproteinases, where the Glu is the
           active site and the His residues coordinate the metal
           cation [Protein fate, Degradation of proteins, peptides,
           and glycopeptides].
          Length = 419

 Score = 34.0 bits (78), Expect = 0.20
 Identities = 32/147 (21%), Positives = 58/147 (39%), Gaps = 21/147 (14%)

Query: 375 LVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAA---PVNNIRVVIRDRPLE 431
           L+++ S A   G+  GD+ILS+NG  + G+  K V + +      P+  I     +   E
Sbjct: 133 LLDKNSIALEAGIEPGDEILSVNGNKIPGF--KDVRQQIADIAGEPMVEILAERENWTFE 190

Query: 432 RTVTLHRDSSGHFGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKD----- 486
               L             +  +  +  +S A + GL     I  +NG  +    D     
Sbjct: 191 VMKELIP------RGPKIEPVLSDVTPNSPAEKAGLKEGDYIQSINGEKLRSWTDFVSAV 244

Query: 487 -----KELLEAMEKGGSVLTLTLIPTS 508
                K +   +E+ G  L+++L P +
Sbjct: 245 KENPGKSMDIKVERNGETLSISLTPEA 271



 Score = 29.8 bits (67), Expect = 3.7
 Identities = 20/72 (27%), Positives = 27/72 (37%), Gaps = 10/72 (13%)

Query: 380 SPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN-----IRVVIRDRPLERTV 434
           SPA   GL+ GD I SINGE      ++     + A   N      I+V      L  ++
Sbjct: 213 SPAEKAGLKEGDYIQSINGEK-----LRSWTDFVSAVKENPGKSMDIKVERNGETLSISL 267

Query: 435 TLHRDSSGHFGF 446
           T         G 
Sbjct: 268 TPEAKGKIGIGI 279


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 33.5 bits (76), Expect = 0.21
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 7/86 (8%)

Query: 164 LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP-SAP 222
           +T++ QA+ A+ P      +S K  APQ    S +     + P A   PST   AP   P
Sbjct: 21  ITTTTQAANATTP------SSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTP 74

Query: 223 PANIEDATRSTHPTTATAPSVPPAIH 248
            A    +T+   P + T   VP  I+
Sbjct: 75  NATTPSSTKVETPQSPTTKQVPTEIN 100



 Score = 31.1 bits (70), Expect = 1.2
 Identities = 20/67 (29%), Positives = 30/67 (44%), Gaps = 2/67 (2%)

Query: 156 QAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTI 215
           QA N   P ++  +A Q++ P        +KP+A  PP+  V+    T   T P   ST 
Sbjct: 26  QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP--SSTK 83

Query: 216 ATAPSAP 222
              P +P
Sbjct: 84  VETPQSP 90



 Score = 28.8 bits (64), Expect = 7.4
 Identities = 15/71 (21%), Positives = 29/71 (40%)

Query: 188 SAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAI 247
           +A  P +  V+   +T   T    P +   A + P   +E   ++ + TT ++  V    
Sbjct: 29  NATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQ 88

Query: 248 HDATRSTPSPI 258
              T+  P+ I
Sbjct: 89  SPTTKQVPTEI 99


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 33.8 bits (77), Expect = 0.23
 Identities = 25/132 (18%), Positives = 38/132 (28%), Gaps = 18/132 (13%)

Query: 43  TSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYY 102
           T+S T+P+      +       P   Q  + P+    P  P     P + P  S      
Sbjct: 350 TASLTAPSRVLAAAAKVAVIAAP---QTHTGPADRQRPQRP--DGIPYSVPARSP----- 399

Query: 103 SDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFY 162
                   T   PVP +       +PY  Q+   S   +P  P   P   N    P    
Sbjct: 400 -------MTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGP-VPPQPTNPYVMPISMA 451

Query: 163 PLTSSQQASQAS 174
            +       +  
Sbjct: 452 NMVYPGHPQEHG 463



 Score = 29.6 bits (66), Expect = 4.8
 Identities = 19/62 (30%), Positives = 24/62 (38%)

Query: 35  SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
           +  PPV Q        +  NP SP  S  P P    P  P++P V     A +     PQ
Sbjct: 401 TAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQ 460

Query: 95  ES 96
           E 
Sbjct: 461 EH 462



 Score = 29.2 bits (65), Expect = 6.4
 Identities = 17/92 (18%), Positives = 25/92 (27%), Gaps = 2/92 (2%)

Query: 27  QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
           QT +       P       ++ P     P + Y              P +P  P     P
Sbjct: 373 QTHTGPADRQRPQRPDGIPYSVPARS--PMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGP 430

Query: 87  IYPSAPPQESHSPYYYSDIRANVPTGGSPVPS 118
                 P +  +PY      AN+   G P   
Sbjct: 431 EPVGPVPPQPTNPYVMPISMANMVYPGHPQEH 462



 Score = 28.8 bits (64), Expect = 9.0
 Identities = 17/102 (16%), Positives = 24/102 (23%), Gaps = 16/102 (15%)

Query: 115 PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQAS 174
                   + P          P+   +PQ P+  P           + P+          
Sbjct: 362 AAAKVAVIAAPQTHTG-----PADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV 416

Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIA 216
            PY            PQ P  S  P      P  P  P  + 
Sbjct: 417 SPYN-----------PQSPGTSYGPEPVGPVPPQPTNPYVMP 447


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 33.9 bits (78), Expect = 0.24
 Identities = 23/140 (16%), Positives = 37/140 (26%), Gaps = 1/140 (0%)

Query: 65  PPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSY 124
           PP    P+  S P+     S          +     + S   A + +G +P P      Y
Sbjct: 648 PPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPPVPDPY 707

Query: 125 PSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS 184
              P++      S  + P          +   A N        Q   Q  +       AS
Sbjct: 708 DRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPAS 767

Query: 185 NKPSAPQPPAYSVQPSHNTA 204
              +  Q  +       N  
Sbjct: 768 -TTALTQTSSEVQDTELNLV 786


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 33.3 bits (76), Expect = 0.27
 Identities = 29/122 (23%), Positives = 38/122 (31%), Gaps = 11/122 (9%)

Query: 23  YFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF 82
           Y  K  +     +PP       +         P  P  +++PP     P  P     P  
Sbjct: 122 YDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPP----SPHTPLPNQSPLP 177

Query: 83  PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP-YQAQNSYPSVPNQ 141
           P      S+ P E       S+     P      P   +GS    P Y  Q S P VP  
Sbjct: 178 PPMSSPDSSYPSEHRFQRQLSEPCLPFP------PPPGRGSRDGRPPYHRQMSEPLVPYP 231

Query: 142 PQ 143
           PQ
Sbjct: 232 PQ 233


>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
           proteases. Most PDZ domains bind C-terminal
           polypeptides, though binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this bacterial subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
            is thought to form the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in Eumetazoan signaling proteins.
          Length = 79

 Score = 30.9 bits (70), Expect = 0.27
 Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETV 401
             GV+V  V  G PA+   L+ GD I++++G+  
Sbjct: 7   YHGVYVTSVVEGMPAAGK-LKAGDHIIAVDGKPF 39


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 33.4 bits (77), Expect = 0.31
 Identities = 19/92 (20%), Positives = 29/92 (31%), Gaps = 8/92 (8%)

Query: 180 NPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTAT 239
           +  ASN    P PP  S  P   T     P   +   T P  PP   + ++ +     A 
Sbjct: 514 SGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPAD 573

Query: 240 APSVPPAIHDATRSTPSPIIAPSVPPTNRDTI 271
           +   PP         P          ++ + I
Sbjct: 574 SSPPPP--------IPEEPTPSPTKDSSPEEI 597



 Score = 32.2 bits (74), Expect = 0.84
 Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
            I+ +A+    A   PA +  P      P+AP      AT PS PPA            T
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAP-KTKQAATTPSPPPAKASPPIPVPAEPT 422

Query: 238 ATAPSVPPA 246
             +P+ P  
Sbjct: 423 EPSPTPPAN 431



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 13/64 (20%), Positives = 16/64 (25%)

Query: 165 TSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
           +    A    LP P        P  P P A     +     P     P  I   P+  P 
Sbjct: 530 SPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589

Query: 225 NIED 228
               
Sbjct: 590 KDSS 593



 Score = 30.3 bits (69), Expect = 3.1
 Identities = 14/78 (17%), Positives = 23/78 (29%), Gaps = 3/78 (3%)

Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
                  +     P+  P+      + + P T  A + P            PI  P+ P 
Sbjct: 366 SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPP---PAKASPPIPVPAEPT 422

Query: 266 TNRDTIRSALVPLYPSLD 283
               T  +      PSL+
Sbjct: 423 EPSPTPPANAANAPPSLN 440



 Score = 29.5 bits (67), Expect = 5.8
 Identities = 22/81 (27%), Positives = 25/81 (30%), Gaps = 1/81 (1%)

Query: 187 PSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPA 246
           PSA      +     N      P  P      PSAP       T S  P  A+ P   PA
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQ-PSAPKTKQAATTPSPPPAKASPPIPVPA 419

Query: 247 IHDATRSTPSPIIAPSVPPTN 267
                  TP    A + P  N
Sbjct: 420 EPTEPSPTPPANAANAPPSLN 440


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members
           of this family are found in the N terminal region of
           cytochrome B561, as well as in various other putative
           uncharacterized proteins.
          Length = 559

 Score = 33.2 bits (76), Expect = 0.33
 Identities = 49/273 (17%), Positives = 77/273 (28%), Gaps = 41/273 (15%)

Query: 15  QLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP 74
           QL +   +    Q T  S +   PP  +TS+                 N       P VP
Sbjct: 97  QLGLLGVKAKDSQFTVVSQAKKSPPASKTSTP---------------MNTSE----PLVP 137

Query: 75  SHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNS 134
            H S    PS    PS     S +      +  +     SP  SY   SY S+     +S
Sbjct: 138 GHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNK-PASPSSSYQSPSYSSSLGPVNSS 196

Query: 135 YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
                N+    ++ P                   +   A +    + + S+      PP 
Sbjct: 197 ----GNRSNLRSS-PWALRSSGDKKDITTDEKYLETFLAEVDEEQHMITSSAGKNATPPE 251

Query: 195 YSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRST 254
                   T        PS    + +A      DA RS     +   S  P       ST
Sbjct: 252 --------TINSFGSSSPSFWNYSRNAS-----DAARSL-KKRSYQLSPSPVPSKQKAST 297

Query: 255 PSPIIAPSVPPTNRDTIRSAL--VPLYPSLDNY 285
                    P  + ++       V + P ++ +
Sbjct: 298 SPKKGEGEPPNMSLESASEVFKRVGVLPQIEQW 330


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 33.4 bits (76), Expect = 0.34
 Identities = 24/122 (19%), Positives = 38/122 (31%), Gaps = 9/122 (7%)

Query: 206 PTAPLVPSTIATAPS---APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAP- 261
               + PS   +      AP  ++    R   P    AP   PA   +  S P+P   P 
Sbjct: 376 NDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPP 435

Query: 262 -----SVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTV 316
                 +PP+ + +    +    P +D               +R+       SDAA   V
Sbjct: 436 VARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTRNGSRKQPVQASSSDAAQTGV 495

Query: 317 AR 318
             
Sbjct: 496 FE 497


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 33.4 bits (76), Expect = 0.35
 Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 5/92 (5%)

Query: 35  SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
           S P PV Q     +  +   PP P   +   P  Q P   S           + PS  PQ
Sbjct: 412 SSPSPVPQV---QTNQSMPQPPQPSVPSPGGPGSQPPQ--SVSGGMIPSPPALMPSPSPQ 466

Query: 95  ESHSPYYYSDIRANVPTGGSPVPSYPQGSYPS 126
            S SP     I+ ++ + G P+ +  Q S  S
Sbjct: 467 MSQSPASQRTIQQDMVSPGGPLNTPGQSSVNS 498


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 33.1 bits (76), Expect = 0.37
 Identities = 29/111 (26%), Positives = 49/111 (44%), Gaps = 2/111 (1%)

Query: 134 SYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
           S P+  +Q  A ++  S  +Q   P+   P    ++ S +SL +  + L   K S+P PP
Sbjct: 4   SNPTSRSQSHASSS-SSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPP 62

Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSV 243
           + ++ P+  T  P    +    A A  A  + +       HP   TA P+V
Sbjct: 63  SVTLPPAATTQTPQLNPLQRAAAAALDAVESALVSHLERQHPLPKTADPAV 113



 Score = 29.3 bits (66), Expect = 5.7
 Identities = 21/70 (30%), Positives = 26/70 (37%), Gaps = 3/70 (4%)

Query: 25 SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
          S+   SSS SS         S TSP      PS     + P     P + S PS P+   
Sbjct: 10 SQSHASSSSSSSSQS--SPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSS-PSPPSVTL 66

Query: 85 APIYPSAPPQ 94
           P   +  PQ
Sbjct: 67 PPAATTQTPQ 76


>gnl|CDD|215602 PLN03143, PLN03143, nudix hydrolase; Provisional.
          Length = 291

 Score = 32.9 bits (75), Expect = 0.39
 Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 10/74 (13%)

Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP 219
            F  L S   A    LP  +    ++  S+  P  +S+        P  P  P  +  AP
Sbjct: 1   RFTLLPSRPLA----LPSRMAHKEASSSSSSSPLTHSI------TLPGQPGQPVLVVAAP 50

Query: 220 SAPPANIEDATRST 233
               ++   A  S+
Sbjct: 51  GISSSDFRKAIDSS 64


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 33.1 bits (75), Expect = 0.39
 Identities = 56/275 (20%), Positives = 68/275 (24%), Gaps = 42/275 (15%)

Query: 32  SFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANP---PPYYQHPSVPSHPSVPTFPSAPIY 88
           S  S  P        T        P P     P   P   + P  P  P  P  P  P  
Sbjct: 537 SKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKK 596

Query: 89  PSAP-----PQESHSPYY--YSDIRANVPTGGSPVP--SYPQGSYPSNPYQAQNSYPSVP 139
           P  P     P    SP      DI  +     SP      P    PS+P + +   P + 
Sbjct: 597 PKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEG--PKII 654

Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQA----------------SQASLPYPINPLA 183
             P+ P + P  P   +    FY       A                S      P  P  
Sbjct: 655 KSPKPPKS-PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGT 713

Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSV 243
                 P PP               P            PP   E+ T   H T A  P  
Sbjct: 714 PFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPP--EEERTF-FHETPADTPLP 770

Query: 244 --------PPAIHDATRSTPSPIIAPSVPPTNRDT 270
                      IH  T      +  P  P  + D 
Sbjct: 771 DILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDK 805



 Score = 30.4 bits (68), Expect = 2.8
 Identities = 36/218 (16%), Positives = 57/218 (26%), Gaps = 20/218 (9%)

Query: 25  SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQ--HPSVPSHPSVPTF 82
            K+  S+   + P           P +   P SP     PPP  +   P  P  P +   
Sbjct: 597 PKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKS 656

Query: 83  PSAPIYPSAPPQESHSPYYYSD------------IRANVPTGGSPVPSYPQGSYPSNPYQ 130
           P  P  P  P        +Y D                +      +        P  P+ 
Sbjct: 657 PKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFT 716

Query: 131 AQNSYPSV-PNQPQAPNTYPSMPNQLQ-APNGFY--PLTSSQQASQASLPYPINPLASNK 186
                P   P   + P      P+  Q     F+  P        +     P+  + + +
Sbjct: 717 TPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEE 776

Query: 187 PSAPQPPAYSVQPSHNTAYPTAPL--VPSTIATAPSAP 222
                  A + +P      P +P           PS P
Sbjct: 777 FKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLP 814


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 31.2 bits (71), Expect = 0.42
 Identities = 16/79 (20%), Positives = 23/79 (29%), Gaps = 3/79 (3%)

Query: 84  SAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
                   P  +   PY        +P    P  S PQ             YP    Q Q
Sbjct: 4   QQNTNQYPPQNQQQQPYQQQPYHQQMP---PPPYSPPQQQQGHFMPPQPQPYPKQSPQQQ 60

Query: 144 APNTYPSMPNQLQAPNGFY 162
            P  + S  +Q +  +G +
Sbjct: 61  QPPQFSSFLSQFKNSDGTF 79


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 32.1 bits (73), Expect = 0.43
 Identities = 21/75 (28%), Positives = 28/75 (37%), Gaps = 6/75 (8%)

Query: 74  PSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQN 133
           P  P +    S  +  +  PQ   +   Y         G  P+P  PQ   P+ PY  Q 
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYP---GPQYQGYHPMP--PQPGMPAPPYSLQY 163

Query: 134 SYPSVPNQPQAPNTY 148
             P +  QPQ P  Y
Sbjct: 164 PPPGLL-QPQGPPPY 177



 Score = 29.8 bits (67), Expect = 2.5
 Identities = 15/46 (32%), Positives = 17/46 (36%)

Query: 115 PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNG 160
           P P     SYP   YQ  +  P  P  P  P +    P  L  P G
Sbjct: 128 PQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQG 173



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 2/69 (2%)

Query: 35  SPPPPVYQTSSHTSPNA-YNNPPSPYHSANPPPYYQHPSVPSHPSVPTFP-SAPIYPSAP 92
              P + + +S T        PPS   S   P Y  +  +P  P +P  P S    P   
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGL 168

Query: 93  PQESHSPYY 101
            Q    P Y
Sbjct: 169 LQPQGPPPY 177



 Score = 27.9 bits (62), Expect = 8.9
 Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 29  TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP 74
             S+  S P P YQ   H  P     P  PY    PPP    P  P
Sbjct: 130 PPSTAPSYPGPQYQGY-HPMPPQPGMPAPPYSLQYPPPGLLQPQGP 174


>gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional.
          Length = 449

 Score = 32.7 bits (75), Expect = 0.51
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 10/67 (14%)

Query: 452 QIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKD----------KELLEAMEKGGSVLT 501
            +  +  +S+A++ GL     I++V+G  +   +           K L   +E+ GS L+
Sbjct: 224 VLAEVQPNSAASKAGLQAGDRIVKVDGQPLTQWQTFVTLVRDNPGKPLALEIERQGSPLS 283

Query: 502 LTLIPTS 508
           LTL P S
Sbjct: 284 LTLTPDS 290


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 32.9 bits (75), Expect = 0.54
 Identities = 17/79 (21%), Positives = 23/79 (29%), Gaps = 10/79 (12%)

Query: 197 VQPSHNTAYPTAPLVP-STIATAP----SAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
           V  +  T+       P  + AT        P A    A   +   TA          D T
Sbjct: 32  VTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTT 91

Query: 252 RSTPSPIIAPSVPPTNRDT 270
            S      A +V P   +T
Sbjct: 92  TSE-----ARTVTPAATET 105


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 32.6 bits (74), Expect = 0.54
 Identities = 13/72 (18%), Positives = 22/72 (30%), Gaps = 1/72 (1%)

Query: 187 PSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPA 246
           P+ P+P   +      T  P+    P +      +       A  +       AP  P  
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264

Query: 247 IHDA-TRSTPSP 257
                  +TP+P
Sbjct: 265 GEAPPANATPAP 276


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 31.7 bits (72), Expect = 0.64
 Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 5/50 (10%)

Query: 114 SPVPSYPQG---SYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNG 160
           SP  S  QG   + PS+     N+  S P  P  P   P  P +    NG
Sbjct: 143 SPKHSSSQGTSTTRPSDGSATPNT--SAPPTPGNPAAQPEKPAETPKGNG 190


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 32.0 bits (72), Expect = 0.77
 Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 4/85 (4%)

Query: 181 PLASNKPSAPQPPAYSVQPSHNTAYPTAPLVP--STIATAPSAPPANIEDATRSTHPTTA 238
           P+  + P A  P      P ++ A P  P VP  +  A A + P  +  D  ++      
Sbjct: 97  PVELSTPVARAPQVIDTAPPNDVATPV-PSVPEATAEAAAKAGPQDDEADGEQADEAPAH 155

Query: 239 TAPSVP-PAIHDATRSTPSPIIAPS 262
              SVP  A  + T +       P 
Sbjct: 156 NPESVPTRAARETTEANRPTATPPQ 180


>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain.  This
           domain is found at the C-terminus of the Sox family of
           transcription factors. It is found associated with
           pfam00505. It binds to the Armadillo repeats (pfam00514)
           in Catenin beta-1 (CTNNB1), which is involved in
           transcriptional regulation. It functions as a
           transactivating domain (TAD).
          Length = 197

 Score = 31.2 bits (71), Expect = 0.79
 Identities = 23/116 (19%), Positives = 35/116 (30%), Gaps = 14/116 (12%)

Query: 29  TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
           +  +F SPP    Q      P  YN+  +P ++  P    Q     + P     PS    
Sbjct: 35  SEPAFFSPPC---QEDCQMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGC 91

Query: 89  PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
           P+ P       Y                    Q    S P ++Q    ++    QA
Sbjct: 92  PTPPQMYYGQMY----------VPECAKHHPVQLGQLSPPPESQ-HLDTLDQLQQA 136


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 31.9 bits (73), Expect = 0.82
 Identities = 23/100 (23%), Positives = 32/100 (32%), Gaps = 4/100 (4%)

Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLV-PSTIATAPSAPPANIEDATRSTH 234
           P    P A+  P A   PA   +P+   A P   LV  +   TA          A  +  
Sbjct: 54  PAAAAPAAAQPPPAA-APAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPE 112

Query: 235 PTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
                AP+   A  +A R       A + P        +A
Sbjct: 113 AQAPAAPAERAAAENAARRLARA--AAAAPRPRVPADAAA 150


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 32.1 bits (73), Expect = 0.86
 Identities = 26/126 (20%), Positives = 34/126 (26%), Gaps = 8/126 (6%)

Query: 107 ANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVP--NQPQAP--NTYPSMPNQLQAPNGFY 162
             +   G P         P      +   P VP       P     PS   +   P    
Sbjct: 628 GEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGD-PVRPT 686

Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ---PSHNTAYPTAPLVPSTIATAP 219
              ++ +A QA  P          P  P  PA +      S   A PTAP  P       
Sbjct: 687 AHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQ 746

Query: 220 SAPPAN 225
           +    N
Sbjct: 747 ADGAEN 752


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 30.6 bits (69), Expect = 0.87
 Identities = 15/47 (31%), Positives = 19/47 (40%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
           P  P     P   Q PA +V     T+ P  P  P+    AP+A P 
Sbjct: 84  PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130


>gnl|CDD|238322 cd00577, PCNA, Proliferating Cell Nuclear Antigen (PCNA) domain
           found in eukaryotes and archaea.  These polymerase
           processivity factors play a role in DNA replication and
           repair.  PCNA encircles duplex DNA in its central
           cavity, providing a DNA-bound platform for the
           attachment of the polymerase. The trimeric PCNA ring is
           structurally similar to the dimeric ring formed by the
           DNA polymerase processivity factors in bacteria (beta
           subunit DNA polymerase III holoenzyme) and in
           bacteriophages (catalytic subunits in T4 and RB69). This
           structural correspondence further substantiates the
           mechanistic connection between eukaryotic and
           prokaryotic DNA replication that has been suggested on
           biochemical grounds.   PCNA is also involved with
           proteins involved in cell cycle processes such as DNA
           repair and apoptosis. Many of these proteins contain a
           highly conserved motif known as the PIP-box (PCNA
           interacting protein box) which contains the sequence
           Qxx[LIM]xxF[FY]. .
          Length = 248

 Score = 31.4 bits (72), Expect = 0.90
 Identities = 26/139 (18%), Positives = 42/139 (30%), Gaps = 27/139 (19%)

Query: 321 PRDSCTMIAPLSSSGSGLKKAYVTNGIRELVLCKDASGLIGLRVCAINEGVFVCLVERGS 380
           P         L S    LK       +R+L    D+     + + A  +G          
Sbjct: 125 PELEYDATVTLPSDE--LKDI-----VRDLESISDS-----VTISASKDGFKFSAEGELG 172

Query: 381 PASLVGLRFG---DQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVTLH 437
            AS+  L         +  +    + Y++K +    KAAP+++             VTL 
Sbjct: 173 GASVTLLPKDSDLLVTIECSEPVSSTYSLKYLKDFTKAAPLSD------------KVTLS 220

Query: 438 RDSSGHFGFHFNKGQIVSL 456
             S G     F       L
Sbjct: 221 FGSDGPLSLEFKIADGGHL 239


>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
          Length = 336

 Score = 31.5 bits (71), Expect = 1.1
 Identities = 17/68 (25%), Positives = 26/68 (38%)

Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
           ++P    A    P VP+  A A SA    +  A  +      + P VP           +
Sbjct: 255 IKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVA 314

Query: 257 PIIAPSVP 264
           P+ AP+ P
Sbjct: 315 PVAAPAAP 322



 Score = 29.6 bits (66), Expect = 3.9
 Identities = 15/66 (22%), Positives = 27/66 (40%), Gaps = 2/66 (3%)

Query: 27  QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
              + + S+  P V   ++  +  A + P  P  +  P      P     P+ P  P+ P
Sbjct: 271 NVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVAPVAA--PAAPEVPAVP 328

Query: 87  IYPSAP 92
           + P+AP
Sbjct: 329 VVPAAP 334


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 22/89 (24%), Positives = 26/89 (29%), Gaps = 1/89 (1%)

Query: 143 QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHN 202
           Q P     +P   Q P         QQ +Q   P P         + P+     VQP   
Sbjct: 97  QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQP-PATTAQPQPVTPPRQTTAPVQPQTP 155

Query: 203 TAYPTAPLVPSTIATAPSAPPANIEDATR 231
               T P  P T A       A  E   R
Sbjct: 156 APVRTQPAAPVTQAVEAPKVEAEKEKEQR 184


>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 559

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 23/107 (21%), Positives = 30/107 (28%), Gaps = 3/107 (2%)

Query: 168 QQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIE 227
           Q+A + +        A+  P APQ P     P H     +A    S      SA    I 
Sbjct: 52  QRALETARSDTPATAAAPAP-APQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMI- 109

Query: 228 DATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
            A  +     +     P A      S PSP            T    
Sbjct: 110 -AAMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQ 155



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 27/142 (19%), Positives = 41/142 (28%), Gaps = 20/142 (14%)

Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
            + P       T P+ P             S +Q   ++    I  +A  +P +    A 
Sbjct: 66  AAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALRQPVSVPRQAP 125

Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH-------------PTTATAPS 242
           +  P    + P+        A    A  A +  A R  H              TTA  P 
Sbjct: 126 AAAPVRAASIPSP-------AAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPVPR 178

Query: 243 VPPAIHDATRSTPSPIIAPSVP 264
            P          P P IA ++ 
Sbjct: 179 APVQAPVVAAPAPVPAIAAALA 200



 Score = 31.1 bits (70), Expect = 1.6
 Identities = 40/224 (17%), Positives = 64/224 (28%), Gaps = 48/224 (21%)

Query: 55  PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIR-------- 106
             +P  +A P P          P  PT P+AP++       + +      +         
Sbjct: 60  SDTPATAAAPAPA---------PQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIA 110

Query: 107 -------ANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPN 159
                   +VP            S PS   QA     +V   P+  +   ++P QL A  
Sbjct: 111 AMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFA-- 168

Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPSAPQP------PAYSVQPSHNTAYPTAPLVPS 213
                      + A +P            AP P       A++     +           
Sbjct: 169 --------DFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDL 220

Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSP 257
             A     PPA +        P    AP+ P A+     + P+P
Sbjct: 221 DDALPQILPPAAL--------PPIVVAPAAPAALAAVAAAAPAP 256


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 17/71 (23%), Positives = 24/71 (33%), Gaps = 12/71 (16%)

Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRS 273
            +ATAP    A        + P     P  PP         P     P+ PP   D   +
Sbjct: 901 RVATAPGGDAA--------SAPPPGAGPPAPPQ----AVPPPRTTQPPAAPPRGPDVPPA 948

Query: 274 ALVPLYPSLDN 284
           A+  L  +L +
Sbjct: 949 AVAELRETLAD 959


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 29/110 (26%), Positives = 41/110 (37%), Gaps = 9/110 (8%)

Query: 143 QAPNTYPS-MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSH 201
           QA NT     P   +APN     TS+ +  + S P      + N  + P P     Q + 
Sbjct: 64  QAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTT 123

Query: 202 NTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
            +  P       T  T P  P  N     +ST   T  +P++  A  D T
Sbjct: 124 ESTTPK------TKVTTP--PSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165



 Score = 29.3 bits (65), Expect = 5.4
 Identities = 19/90 (21%), Positives = 32/90 (35%), Gaps = 14/90 (15%)

Query: 139 PNQPQAPNT-----YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQP- 192
           P   +APNT       S   ++  P         +  + ++ P P    +     +  P 
Sbjct: 74  PKLEKAPNTNEEKTSASKIEKISQP----KQEEQKSLNISATPAPKQEQSQTTTESTTPK 129

Query: 193 PAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
              +  PS NT  P    + ST +  P +P
Sbjct: 130 TKVTTPPSTNTPQP----MQSTKSDTPQSP 155


>gnl|CDD|173162 PRK14699, PRK14699, replication factor A; Provisional.
          Length = 484

 Score = 31.5 bits (71), Expect = 1.2
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 293 ELINAYARENAANERESDAAGN 314
           ELINAYARENA  ++     GN
Sbjct: 238 ELINAYARENAFTQKVELQVGN 259


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 31.6 bits (71), Expect = 1.3
 Identities = 48/222 (21%), Positives = 67/222 (30%), Gaps = 40/222 (18%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPY----YQHPSVPSHPSVPTFPSAPIYP 89
           + PP      S H  P A   P   Y    PP      YQ P             A  + 
Sbjct: 739 APPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWG 798

Query: 90  SAP--PQESHSPYYYSDIRANVPTGG--SPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP 145
                P+  HS  Y+S    +    G  +P P +    +  +    Q+     P      
Sbjct: 799 LRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFP--HLQS 856

Query: 146 NTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
            T P      Q P   Y                            Q    S  PS ++  
Sbjct: 857 ETGPPRLQLSQVPQLPYS---------------------------QTLVSSSAPSWSSPQ 889

Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSVPPA 246
           P AP+ P  I T    PP  ++D+      ++ TA PS+P A
Sbjct: 890 PRAPIRP--IPTRFPPPPMPLQDSMAVGCDSSGTACPSMPFA 929


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 31.7 bits (72), Expect = 1.3
 Identities = 23/112 (20%), Positives = 32/112 (28%), Gaps = 17/112 (15%)

Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPS---------APQPPAYSVQPSHNTAYPTAPL 210
            F  +TS+Q +    +   +  L S             A Q  A    P    A P    
Sbjct: 13  AFEVITSAQLSRIGRIERELRELRSLVAEGAAGPVAKAAEQMAA----PEAAEAAPLPAA 68

Query: 211 VPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS 262
             S  +     P        +      A  PS  PA   A    P+  + PS
Sbjct: 69  AESIASPEVPPPVP--PAPAQEGEAPAAEQPSAVPAPSAA--PAPAEPVEPS 116


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 24/90 (26%), Positives = 30/90 (33%), Gaps = 7/90 (7%)

Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSH---NTAYPTAPLVPSTIATAPSAPPANIE--DA 229
           +P P  P  +       P      P       A   APL  +  A  P APPA     DA
Sbjct: 78  VPTPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADA 137

Query: 230 TRSTHPTT--ATAPSVPPAIHDATRSTPSP 257
             +    T  A   + PPA   A  S  + 
Sbjct: 138 GDAAPDATPPAAGDASPPAAAQAAASAAAA 167


>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
           ATPase component [Inorganic ion transport and
           metabolism].
          Length = 258

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 24/87 (27%)

Query: 446 FHFNKGQIVSLVKDSSAARNGLL--VNHNI------LEVNGANVVGLKDKELLEAMEKGG 497
              N+G++V+++  S A ++ LL  +N  +      +  NG  +  LK KEL +     G
Sbjct: 25  LEINQGEMVAIIGPSGAGKSTLLRSLNGLVDPTSGEILFNGVQITKLKGKELRKLRRDIG 84

Query: 498 SVLTLTLIPTSMYQHMMKKMSFNLVKD 524
            +          +Q       FNLV  
Sbjct: 85  MI----------FQQ------FNLVPR 95


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
           activation/proliferation-associated protein-1 C term.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 343 and 708 amino
           acids in length. This family is the C terminal region of
           caprin-1. Caprin-1 is a protein involved in regulating
           cellular proliferation. In mutated phenotypes, the G1
           phase of the cell cycle is greatly lengthened, impairing
           normal proliferation. The C terminal region of caprin-1
           contains RGG motifs which are characteristic of RNA
           binding domains. It is possible that caprin-1 functions
           through an RNA binding mechanism.
          Length = 319

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 37/158 (23%), Positives = 52/158 (32%), Gaps = 19/158 (12%)

Query: 28  TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
           + +S   +  PP+YQ S    P    +P  P  ++      Q P+  S P+         
Sbjct: 70  SPTSEGYTSSPPLYQPSHTAEPRPQTDPIDPIQASMSLNSEQTPTSSSLPAAS------- 122

Query: 88  YPSAPPQESHSPYYYSDIRAN---------VPTGGSPVPSYPQGSYPSNPYQAQNSY-PS 137
                 Q    P + S I  N         V    +PVP   +        Q Q SY   
Sbjct: 123 -QPQVFQTGSKPLHSSGINVNAAPFQSMQTVFNMNAPVPPVNEPESLKQQNQYQASYNQG 181

Query: 138 VPNQPQAPNTYPSM-PNQLQAPNGFYPLTSSQQASQAS 174
             +QPQ P     +   QLQ   G +     Q    AS
Sbjct: 182 FSSQPQHPVEQTELQQEQLQTVVGSFHSQDQQAHQAAS 219


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.0 bits (70), Expect = 1.5
 Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 11/84 (13%)

Query: 181 PLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS-APPANIEDATRSTHPTTAT 239
           P       AP     +  P+   + P +P      A+  S AP + +E+ + S    TAT
Sbjct: 77  PKTGTGKVAPPAATPTSAPTPTPSPPASP------ASGMSAAPASAVEEKSPSEESATAT 130

Query: 240 APSVPPAIHDATRSTPSPIIAPSV 263
           AP  P      +  +     A ++
Sbjct: 131 APESP----STSVPSSGSDAASTL 150


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 31.2 bits (70), Expect = 1.5
 Identities = 28/123 (22%), Positives = 50/123 (40%), Gaps = 2/123 (1%)

Query: 150 SMPNQLQAPNGFYPLTSSQQASQASLPYP-INPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
           ++P     P+     +S+   S A++P P + P +    SA    A +V  S     P+ 
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSD 357

Query: 209 PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS-PIIAPSVPPTN 267
             +P T+A   + P         +T    ++  ++    +  T S P+ P     V PT+
Sbjct: 358 VTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417

Query: 268 RDT 270
           RD 
Sbjct: 418 RDA 420


>gnl|CDD|227395 COG5063, CTH1, CCCH-type Zn-finger protein [General function
           prediction only].
          Length = 351

 Score = 31.2 bits (70), Expect = 1.5
 Identities = 26/143 (18%), Positives = 40/143 (27%), Gaps = 4/143 (2%)

Query: 16  LTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPS 75
            TI+           SS          ++S  SP  Y   P   ++  P   Y+   + S
Sbjct: 61  ETISHLSASLDNKEQSSSEKRQKAEDSSASSNSPQNYPWSPRGSNANKPYGLYKTEMLRS 120

Query: 76  HPSVPTFPSAPIYPSAPPQESHS----PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQA 131
              +P        P A  +   S       Y      V      VP   +  + S P   
Sbjct: 121 STEIPYCRYPDKNPFAHSKAISSLAQTHPKYKTESLEVFINPGYVPYSKRCCFISLPLTD 180

Query: 132 QNSYPSVPNQPQAPNTYPSMPNQ 154
            N  P    +P++     S    
Sbjct: 181 INLQPLSQRKPKSGKNCTSYTLG 203


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 30.4 bits (68), Expect = 1.6
 Identities = 11/82 (13%), Positives = 22/82 (26%), Gaps = 1/82 (1%)

Query: 193 PAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATR 252
           P      S       +         + S           +  PT   +P+     H  + 
Sbjct: 82  PEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHH-HVTSE 140

Query: 253 STPSPIIAPSVPPTNRDTIRSA 274
              +  + P+  P  + T  +A
Sbjct: 141 PKTNTTVTPTSQPDRKSTFDAA 162


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 21/83 (25%), Positives = 26/83 (31%), Gaps = 4/83 (4%)

Query: 13  FHQ--LTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQH 70
           F Q    +A+    ++  +  S   PPP  Y              P PY    PPPYY  
Sbjct: 23  FGQAMERLAERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPY--GPPPPYYPA 80

Query: 71  PSVPSHPSVPTFPSAPIYPSAPP 93
           P        P        P  PP
Sbjct: 81  PPGVYPTPPPPNSGYMADPQEPP 103


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 24/109 (22%)

Query: 71  PSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
           P   +   VP  P++P  P   PQ  ++P  +       P  GS     P   YP N   
Sbjct: 127 PVTATGEPVPQMPASPPGPEGEPQPGNTPVSF-------PPQGSVAVPPPTAFYPGNGVT 179

Query: 131 --AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
              Q +Y SVP           +PN++Q     Y           SLPY
Sbjct: 180 PPPQVTYQSVP-----------VPNRIQRKTFTYNE----GKKGPSLPY 213



 Score = 30.6 bits (69), Expect = 2.5
 Identities = 24/98 (24%), Positives = 33/98 (33%), Gaps = 5/98 (5%)

Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPI---NPLASNKPSAPQPPAYSVQPSHNTA-YP 206
           +  Q++A        + +   Q     P     P   N P +  PP  SV     TA YP
Sbjct: 116 LAEQVKALGANPVTATGEPVPQMPASPPGPEGEPQPGNTPVS-FPPQGSVAVPPPTAFYP 174

Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVP 244
              + P    T  S P  N       T+      PS+P
Sbjct: 175 GNGVTPPPQVTYQSVPVPNRIQRKTFTYNEGKKGPSLP 212


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 31.2 bits (71), Expect = 1.8
 Identities = 19/132 (14%), Positives = 34/132 (25%), Gaps = 7/132 (5%)

Query: 134 SYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
            YP V  +PQ          +        P+ +    + A  P    P+        + P
Sbjct: 844 RYPVV--RPQDVQVEEQREAEEVQVQ---PVVAEVPVAAAVEPVVSAPVVEAVAEVVEEP 898

Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRS 253
               +P            P  IA   +  P  I ++  +     A      P +     +
Sbjct: 899 VVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAE--PVVEPQDET 956

Query: 254 TPSPIIAPSVPP 265
                 A +   
Sbjct: 957 ADIEEAAETAEV 968


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 30.8 bits (69), Expect = 1.8
 Identities = 35/157 (22%), Positives = 46/157 (29%), Gaps = 7/157 (4%)

Query: 8   LEDMTFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPP-- 65
           LE+    +    Q +  S+Q    +    P     T   T P   N   +P     P   
Sbjct: 103 LEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQPQVQPQFG 162

Query: 66  PYYQHPSVPSHPSVPTFPSA---PIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQG 122
           P  Q    P     P  P+    P +   PP     P        N   G  P P + Q 
Sbjct: 163 PNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQN 222

Query: 123 SYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPN 159
                P   +  +   P     P T P  PN  Q P 
Sbjct: 223 P-NMMPNMNRPGFRPQPGGFNHPGT-PMGPNMQQRPG 257


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 29.5 bits (66), Expect = 1.9
 Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 2/68 (2%)

Query: 34  SSPPPPVYQTSSHTSPNAYNNPP--SPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSA 91
             P PPV     + + N +  PP  +P     P P Y     P  P     P    YP  
Sbjct: 59  YPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPG 118

Query: 92  PPQESHSP 99
           P Q+ ++P
Sbjct: 119 PGQDPYAP 126


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 2.0
 Identities = 12/47 (25%), Positives = 16/47 (34%), Gaps = 7/47 (14%)

Query: 49  PNAYNNPPSPYHSANPP--PYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
             +   PP PY +A     P+Y  P   + P     P       AP 
Sbjct: 210 GQSMYQPPGPYPNAMVGRQPFYPQPGAVAGP-----PKRRGGHKAPR 251


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 30.8 bits (69), Expect = 2.0
 Identities = 27/103 (26%), Positives = 37/103 (35%), Gaps = 7/103 (6%)

Query: 165 TSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
           T   Q  Q + P      +S  P+ P P   S  PS    +  AP  P      P   P 
Sbjct: 399 TPGTQGPQGTAPAAGMTPSSAAPATPAP---SAAPSPRVPWDDAPPAPPRSGIPPRPAPR 455

Query: 225 NIEDATRSTHP-TTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
             E +     P + A+A   PP + D + +       PS P T
Sbjct: 456 MPEASPVPGAPDSVASASDAPPTLGDPSDTAEH---TPSGPRT 495



 Score = 30.4 bits (68), Expect = 3.0
 Identities = 31/133 (23%), Positives = 42/133 (31%), Gaps = 17/133 (12%)

Query: 30  SSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYP 89
           S   ++ P P  Q    T+P A   P S   +   P     P VP   + P  P + I P
Sbjct: 391 SGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPP 450

Query: 90  SAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS--VPNQPQAPNT 147
              P+                   SPVP  P     ++        PS    + P  P T
Sbjct: 451 RPAPRMP---------------EASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPRT 495

Query: 148 YPSMPNQLQAPNG 160
           +       Q  NG
Sbjct: 496 WDGFLEFCQGRNG 508


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 30.8 bits (69), Expect = 2.0
 Identities = 27/156 (17%), Positives = 49/156 (31%), Gaps = 15/156 (9%)

Query: 126 SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
           +  +  + S+P+ P+Q    +   +  +Q          T++     +S   P+    + 
Sbjct: 177 TTSWPTEVSHPTYPSQVTPQSQPATQGHQ----------TATANQRLSSTE-PVGTQGTT 225

Query: 186 KPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPP 245
             S P  P     PS      +    PST +   S      +    +    T  A S   
Sbjct: 226 TSSNP-EPQTEPPPSQRGPSGSPQHPPSTTSQDQST---TGDGQEHTQRRKTPPATSNRR 281

Query: 246 AIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPS 281
           + H      P+     +  PT R T  +      P 
Sbjct: 282 SPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH 317


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 30.3 bits (68), Expect = 2.0
 Identities = 28/142 (19%), Positives = 40/142 (28%), Gaps = 14/142 (9%)

Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
           S    P +      +    QA N   P      A   S    + P++S      +P A  
Sbjct: 37  SSSEAPASAEKSIDLNGATQA-NAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95

Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTA--TAPSVPPAIHDATRST 254
            Q              +  A  P         A  ST PT     AP    ++  A R  
Sbjct: 96  GQQRVEV-----QGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPA 150

Query: 255 PS------PIIAPSVPPTNRDT 270
            +       +  P+V P    T
Sbjct: 151 ITRPVRAQAVSEPAVEPKAAKT 172


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 30.9 bits (69), Expect = 2.2
 Identities = 28/137 (20%), Positives = 48/137 (35%), Gaps = 5/137 (3%)

Query: 108 NVPTGGSPVPSYPQGSYPS-NPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTS 166
           N+P       +  Q   P+  P+QA +  P     P +  +      Q   P        
Sbjct: 25  NIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDP 84

Query: 167 SQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANI 226
           +    QA+   P   +A    +AP+P A     S   A+       ++ A+    P A+ 
Sbjct: 85  APAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAH- 143

Query: 227 EDATRSTHPTTATAPSV 243
              T+ + P  A   S+
Sbjct: 144 ---TQHSPPPFAYTRSM 157



 Score = 29.3 bits (65), Expect = 5.3
 Identities = 21/106 (19%), Positives = 32/106 (30%), Gaps = 8/106 (7%)

Query: 169 QASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPST--------IATAPS 220
               A+L   I     +  +A Q P  +  P    +    P V  T        +A AP+
Sbjct: 16  NLIIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT 75

Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
              +   D   + H   + AP    A   A    P    A +    
Sbjct: 76  PAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQ 121


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score = 30.9 bits (70), Expect = 2.3
 Identities = 28/168 (16%), Positives = 50/168 (29%), Gaps = 8/168 (4%)

Query: 118 SYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
            YPQ     +   A +   + P++  A  +   M              S +Q  QA+   
Sbjct: 8   VYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMP-PYTTPPLQQQSRRQIDQAATAM 66

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED-ATRSTHPT 236
                 +  PS   P   S Q   +    +     +   T P  P    ED     +   
Sbjct: 67  HNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPPPISDLF 126

Query: 237 TATAPSVPP----AIHDATRSTPSPIIAPSVP--PTNRDTIRSALVPL 278
               P VPP     +     +     +  ++   P   D ++ + +P 
Sbjct: 127 LPPPPIVPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPF 174


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 30.6 bits (68), Expect = 2.4
 Identities = 18/111 (16%), Positives = 29/111 (26%), Gaps = 7/111 (6%)

Query: 117 PSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLP 176
           P          P     S P++          P  P  +          +  +      P
Sbjct: 242 PQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDE-------ETVAETKAEEEP 294

Query: 177 YPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIE 227
            P   + +   SAP+       P  N           TI+T P  P   ++
Sbjct: 295 QPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLK 345


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
           subunit [Transcription].
          Length = 548

 Score = 30.4 bits (68), Expect = 2.5
 Identities = 27/113 (23%), Positives = 37/113 (32%), Gaps = 11/113 (9%)

Query: 146 NTYPSMPNQLQ--APNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT 203
             Y  M  ++Q  + N   P   + Q S +S+        S K  APQ     V  S   
Sbjct: 191 TIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQ---RDVSISDRA 247

Query: 204 AYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
             P AP V S   +  S P          T     T         ++T  TP+
Sbjct: 248 TTPIAPGVESASQSISSTPTPVS------TDTPLHTVKDDSIKFDNSTLGTPT 294


>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
          Length = 353

 Score = 30.4 bits (69), Expect = 2.6
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 12/51 (23%)

Query: 369 EGVFVCLVERGSPASLVGLRFGDQILSING-------ETVAGYNMKQVHKI 412
           +G+ V  V    PA+  G++  D I+S+N        ET     M QV +I
Sbjct: 279 QGIVVNEVSPDGPAAKAGIQVNDLIISVNNKPAISALET-----MDQVAEI 324


>gnl|CDD|203523 pfam06815, RVT_connect, Reverse transcriptase connection domain.
           This domain is known as the connection domain. This
           domain lies between the thumb and palm domains.
          Length = 102

 Score = 28.3 bits (64), Expect = 2.7
 Identities = 6/29 (20%), Positives = 10/29 (34%)

Query: 394 LSINGETVAGYNMKQVHKILKAAPVNNIR 422
           +   G+    Y + Q  K LK       +
Sbjct: 13  IQKLGDGQWTYQIYQEKKPLKTGKYAKQK 41


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score = 30.3 bits (69), Expect = 3.0
 Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 1/70 (1%)

Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS-PIIAPSVPP 265
           +A    +  A A  A  A    A  +     A AP+ P A      + P  P     V  
Sbjct: 83  SASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIPAGTEMVTM 142

Query: 266 TNRDTIRSAL 275
           T R+ +R A+
Sbjct: 143 TVREALRDAM 152


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 30.3 bits (68), Expect = 3.0
 Identities = 26/139 (18%), Positives = 41/139 (29%), Gaps = 15/139 (10%)

Query: 49  PNAYNNP-PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRA 107
           P+   +P P P    +  P  ++   P    VP   + P  P   P  S+S +       
Sbjct: 323 PDNPQDPVPPPNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENP 382

Query: 108 NVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSS 167
             P              P  P Q  N      +  + P   P  P   +  N   P    
Sbjct: 383 PNPPN------------PDIPEQEPN--IPEDSNKEVPEDVPMEPEDDRDNNFNEPKKPE 428

Query: 168 QQASQASLPYPINPLASNK 186
            +    + P    PL + +
Sbjct: 429 NKGDGQNEPVIPKPLDNER 447


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 18/122 (14%), Positives = 27/122 (22%), Gaps = 5/122 (4%)

Query: 164 LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSH--NTAYPTAPLVPSTIATAPSA 221
               ++    +   P N      P  P P    + P    +   P  P      A     
Sbjct: 45  ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQPR 104

Query: 222 P-PANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRD--TIRSALVPL 278
              A    AT  +   +  A     A         +       P           A+ P 
Sbjct: 105 RLRAAELAATSGSRVESDRAVGRVRAALANAAPAAAAPPPAGQPSGQSAKEDFAGAVNPT 164

Query: 279 YP 280
            P
Sbjct: 165 QP 166


>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
           Provisional.
          Length = 319

 Score = 30.0 bits (67), Expect = 3.4
 Identities = 22/101 (21%), Positives = 30/101 (29%), Gaps = 6/101 (5%)

Query: 128 PYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKP 187
           P Q Q +        Q                 +   T + QA+    P    P  S   
Sbjct: 146 PEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAA----PVQAQPRQSKPA 201

Query: 188 SAPQPPAYSVQ-PSHNTAYPTAPLVPSTIATAPSAPPANIE 227
           S  QP    +Q P+H TA        + +  A  AP    E
Sbjct: 202 STQQPYQDLLQTPAHTTAQSKPQQA-APVTRAADAPKPTAE 241


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 16/86 (18%), Positives = 23/86 (26%), Gaps = 3/86 (3%)

Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSV 243
           S+  S P+       P      P     P  ++     PP +   A           P+ 
Sbjct: 26  SSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAG 85

Query: 244 PPAIHDATRSTP-SPIIAPSVPPTNR 268
           PP+        P      P   P  R
Sbjct: 86  PPS--PLAPPAPARKPPLPPPRPQRR 109


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 29.9 bits (67), Expect = 4.3
 Identities = 15/65 (23%), Positives = 23/65 (35%), Gaps = 4/65 (6%)

Query: 17  TIAQNEYFSKQTTSSSFSSPPPPVYQTSS--HTSPNAYNNPPSPYHSAN--PPPYYQHPS 72
           +       ++Q    +  +PP PVY+        P+   NP  P  S         +  S
Sbjct: 566 SDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRSNS 625

Query: 73  VPSHP 77
            PS P
Sbjct: 626 KPSAP 630


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 29.5 bits (66), Expect = 4.4
 Identities = 14/47 (29%), Positives = 19/47 (40%), Gaps = 2/47 (4%)

Query: 53  NNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSP 99
             PPS   + N  P+  HP+    P  P      + P+APP     P
Sbjct: 59  PAPPSTMIATNTKPF--HPAPIDVPPDPPAAQEAVQPTAPPSAQSEP 103


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 29.8 bits (67), Expect = 4.4
 Identities = 19/110 (17%), Positives = 33/110 (30%), Gaps = 7/110 (6%)

Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT-AYPTAPLVPSTIATAPSAPPANIED 228
           A  A    P   +A+  P AP P   + + +       +    P+  ATA  AP  + + 
Sbjct: 338 ALLALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDP 397

Query: 229 ATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPL 278
             R          +  P                 +    RD + +  + L
Sbjct: 398 RPRGPPAPEPARSAEAPP------LVAPAAAPAGLALRWRDVLAALKMQL 441


>gnl|CDD|237524 PRK13826, PRK13826, Dtr system oriT relaxase; Provisional.
          Length = 1102

 Score = 29.7 bits (67), Expect = 4.5
 Identities = 39/121 (32%), Positives = 51/121 (42%), Gaps = 24/121 (19%)

Query: 335 GSGLKKAYVTNGI----RELVLCKDASGLIGLR--VCAINEGVFVCLVERGSPASLVG-- 386
           GS LK   V + I    R+    K    L  LR  V  +NE     LVERG    +VG  
Sbjct: 567 GSRLKAEAVESLIADWNRDYDPTKTTLILAHLRRDVRMLNEMARAKLVERG----IVGEG 622

Query: 387 ---------LRF--GDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVT 435
                     RF  GDQI+ +  E   G     + K+++AAP N I   I +    R VT
Sbjct: 623 HAFRTADGERRFAAGDQIVFLKNEGSLGVKNGMIGKVVEAAP-NRIVAEIGEGEHRRQVT 681

Query: 436 L 436
           +
Sbjct: 682 V 682


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 29.5 bits (66), Expect = 4.6
 Identities = 34/169 (20%), Positives = 48/169 (28%), Gaps = 25/169 (14%)

Query: 120 PQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPI 179
           P  S P  P  + +   ++ N PQ P   PS                    +   L    
Sbjct: 86  PMDSQPPQPVVSAHPIHTLDN-PQYPGILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHG 144

Query: 180 NPLAS-----NKPSAPQPPAYSVQPSHNTAYPTAPLVP-----------STIATAPSAPP 223
           +   S       P  P  PA   + S+      +  +P           S    A  A  
Sbjct: 145 DRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASM 204

Query: 224 ANIEDATRSTHPTTATAPSVPP--------AIHDATRSTPSPIIAPSVP 264
           + IE    ST   T   P++P               RS   P+  PS P
Sbjct: 205 STIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPSAP 253


>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
           This family consists of a series of primate specific
           nuclear pore complex interacting protein (NPIP)
           sequences. The function of this family is unknown but is
           well conserved from African apes to humans.
          Length = 350

 Score = 29.4 bits (65), Expect = 5.1
 Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 2/109 (1%)

Query: 167 SQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP-PAN 225
           ++   Q   P   + +  N  S   PP   + P   +A P+A     T A     P P +
Sbjct: 241 NRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSADDNLKTPAECLLRPLPPS 300

Query: 226 IEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS-VPPTNRDTIRS 273
            +D  +         P +PP++ D  +      + PS +PP+  D +++
Sbjct: 301 ADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPSVDDNLKT 349


>gnl|CDD|217931 pfam04153, NOT2_3_5, NOT2 / NOT3 / NOT5 family.  NOT1, NOT2,
          NOT3, NOT4 and NOT5 form a nuclear complex that
          negatively regulates the basal and activated
          transcription of many genes. This family includes NOT2,
          NOT3 and NOT5.
          Length = 131

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 14/49 (28%), Positives = 19/49 (38%), Gaps = 7/49 (14%)

Query: 50 NAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHS 98
           ++ +PPSP  +   PP Y  P    +           YP  PP  S S
Sbjct: 3  PSFASPPSPSDARERPPQYDIPECYLNTP-------SSYPQNPPPLSPS 44


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 29.5 bits (66), Expect = 5.5
 Identities = 30/191 (15%), Positives = 49/191 (25%), Gaps = 18/191 (9%)

Query: 1   MSSIYPSLE-DMTFHQLTIAQNEYFSKQTTSS--SFSSPPPPVYQTSSHTSPNAYNNPPS 57
             +I P+L          I+  E       SS      PP    + S H   N + N   
Sbjct: 440 RMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLK 499

Query: 58  PYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVP 117
              S+ PP   +   +               PS P     S      +R    +G    P
Sbjct: 500 LTLSSIPPLSPRQSIITLPT-----------PSRPASRISS----LSLRLGSYSGSIVSP 544

Query: 118 SYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
                          +   SV +         ++         F   +++     +SLP 
Sbjct: 545 PPYPTLVSRKGAAGLSFNRSVSDIEGERIGRYNLLPTRIPALPFKAESTTSSRRSSSLPS 604

Query: 178 PINPLASNKPS 188
           P   +      
Sbjct: 605 PTGVIGFPGSV 615


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 29.4 bits (66), Expect = 5.9
 Identities = 22/142 (15%), Positives = 36/142 (25%), Gaps = 9/142 (6%)

Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
           +P   Q S      QA       P Q   P       ++         L      +QA  
Sbjct: 54  MPKVFQKSSAPRQIQAA------PPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPE 107

Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIAT---APSAPPANIEDATRS 232
             P     +      +     +             +   I     +   PP + E+ T S
Sbjct: 108 SSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEEGTSS 167

Query: 233 THPTTATAPSVPPAIHDATRST 254
             P TA A +      +  +  
Sbjct: 168 KMPATALASAALFKDDEIRQEV 189


>gnl|CDD|236863 PRK11153, metN, DL-methionine transporter ATP-binding subunit;
           Provisional.
          Length = 343

 Score = 29.0 bits (66), Expect = 5.9
 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 24/84 (28%)

Query: 446 FHFNKGQIVSLVKDSSAARNGLLVNHNILE--------VNGANVVGLKDKELLEAMEKGG 497
            H   G+I  ++  S A ++ L+   N+LE        V+G ++  L +KEL +A  + G
Sbjct: 26  LHIPAGEIFGVIGASGAGKSTLIRCINLLERPTSGRVLVDGQDLTALSEKELRKARRQIG 85

Query: 498 SVLTLTLIPTSMYQHMMKKMSFNL 521
            +          +QH      FNL
Sbjct: 86  MI----------FQH------FNL 93


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 29.1 bits (65), Expect = 6.1
 Identities = 20/94 (21%), Positives = 24/94 (25%), Gaps = 10/94 (10%)

Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
             N+P    P P       P        S  S P       ++          N F P  
Sbjct: 179 GGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSF----------NPFLPGP 228

Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQP 199
           S  Q S      P  P+         PP    QP
Sbjct: 229 SPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQP 262


>gnl|CDD|215141 PLN02252, PLN02252, nitrate reductase [NADPH].
          Length = 888

 Score = 29.3 bits (66), Expect = 6.3
 Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 8/83 (9%)

Query: 210 LVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRD 269
           LV +  A + SA    +   + ++    A+     P   +     P  ++       + D
Sbjct: 591 LVTTGAAASSSASSHPLSAISTASALAAASPAPGRPVALNPREKIPCRLVEKI--SLSHD 648

Query: 270 TI--RSALVPLYPSLDNYMGLEV 290
               R AL    PS D+ +GL V
Sbjct: 649 VRLFRFAL----PSEDHVLGLPV 667


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 29.3 bits (66), Expect = 6.7
 Identities = 19/66 (28%), Positives = 22/66 (33%), Gaps = 1/66 (1%)

Query: 216 ATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSAL 275
           A AP +             P  A AP VPPA      +   P  AP+ P       RSA 
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAAR-PAPAPAPPAAAAPPARSAD 439

Query: 276 VPLYPS 281
                S
Sbjct: 440 PAAAAS 445


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 27.9 bits (62), Expect = 6.8
 Identities = 23/77 (29%), Positives = 27/77 (35%), Gaps = 8/77 (10%)

Query: 49  PNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRAN 108
           P AY+    P +   P       S P        P A +     PQ SH P  Y  I   
Sbjct: 5   PPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMA-VGPQSSHPPMAYYPI--- 60

Query: 109 VPTGGSPVPSYPQGSYP 125
               G+P P YP GS  
Sbjct: 61  ----GAPPPVYPPGSTV 73


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 29.3 bits (65), Expect = 6.9
 Identities = 27/174 (15%), Positives = 45/174 (25%), Gaps = 39/174 (22%)

Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTA----------------PLVPSTIATAPSA 221
           P + L      +  P A +V  +  TA   A                P  PS+       
Sbjct: 284 PPDALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPSSSLPTDIG 343

Query: 222 PPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI----------------IAPSVP- 264
             A +  A +          + P  +       P+P+                +AP+   
Sbjct: 344 NKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEIN 403

Query: 265 ------PTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAA 312
                 P  +       V      D     E + E   +   +  A   E+ AA
Sbjct: 404 PSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKEDQTSEDGKMLAIMGEAAAA 457


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 28.9 bits (65), Expect = 7.7
 Identities = 13/58 (22%), Positives = 17/58 (29%), Gaps = 2/58 (3%)

Query: 208 APLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
           APL  S I T  + P A    A  +    T        A      +   P    +  P
Sbjct: 113 APL--SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKP 168



 Score = 28.9 bits (65), Expect = 8.6
 Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 8/83 (9%)

Query: 161 FYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS 220
              + +      A+        A          A         + PT P        AP+
Sbjct: 115 LSEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPA 174

Query: 221 APPANIEDATRSTHPTTATAPSV 243
           A P           P     P  
Sbjct: 175 AKP--------PPTPVARADPRE 189


>gnl|CDD|173288 PRK14827, PRK14827, undecaprenyl pyrophosphate synthase;
           Provisional.
          Length = 296

 Score = 28.8 bits (64), Expect = 8.0
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 139 PNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ 198
           P  P AP+ YP+ P++   P  F  L  +        P        +K +AP+ PA  + 
Sbjct: 14  PQLPPAPDDYPTFPDKSTWPVVFPELPPAPYGGPCRPPQHT-----SKAAAPRIPADRL- 67

Query: 199 PSH 201
           P+H
Sbjct: 68  PNH 70


>gnl|CDD|183690 PRK12698, flgH, flagellar basal body L-ring protein; Reviewed.
          Length = 224

 Score = 28.5 bits (64), Expect = 8.2
 Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 11/43 (25%)

Query: 76  HPSVPTFPS-APIYPSAPP----------QESHSPYYYSDIRA 107
              +   P  AP+YP APP          Q+S +   YSDI+A
Sbjct: 21  EKPIADDPYYAPVYPEAPPTKIAATGSIFQDSQANSLYSDIKA 63


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.9 bits (65), Expect = 8.2
 Identities = 12/53 (22%), Positives = 17/53 (32%)

Query: 47  TSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSP 99
              +   N   P  + +     +H S  S PS P    +      PP  S S 
Sbjct: 289 ADSSVEANGVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSH 341


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 28.7 bits (64), Expect = 8.3
 Identities = 26/118 (22%), Positives = 40/118 (33%), Gaps = 15/118 (12%)

Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA------ 224
           ++  +  P N L +   +AP P A + +P H       P        A   P A      
Sbjct: 381 TRLDMSIPANLLHNAPQAAPAPSAAAPEPKH--QPAPEPRPVLAPTPASGEPNAAAVRSM 438

Query: 225 --NIEDATRSTHPTTAT--APSVPPAIHDATR--STPSPIIAPSVPPT-NRDTIRSAL 275
              + D  R    TT    A +   A+   T   +  S  +A  +    N D +  AL
Sbjct: 439 WSTVRDKVRQRSRTTEVMLAGATVRALEGNTLVLTHESAPLARRLSEQRNADVLAEAL 496


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 709

 Score = 28.9 bits (64), Expect = 8.3
 Identities = 26/100 (26%), Positives = 35/100 (35%), Gaps = 12/100 (12%)

Query: 165 TSSQQASQASLPYPINPL--ASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATA--PS 220
           T  Q AS A++P         SN+ +   PP         TA  TA     +I TA    
Sbjct: 405 TPVQTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAE 464

Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIA 260
            PP N     ++    T    S  P        + +PI A
Sbjct: 465 TPPENQVSKNKAADNETDAPLSEVP--------SENPIQA 496


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 28.7 bits (64), Expect = 8.4
 Identities = 19/67 (28%), Positives = 23/67 (34%), Gaps = 8/67 (11%)

Query: 199 PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
           P+   A P  P  P + A A  AP A          P    A    P   D   + P P 
Sbjct: 34  PATANADPAPPPPPPSTAAAAPAPAAPP--------PPPPPAAPPAPQPDDPNAAPPPPP 85

Query: 259 IAPSVPP 265
             P+ PP
Sbjct: 86  ADPNAPP 92


>gnl|CDD|178125 PLN02509, PLN02509, cystathionine beta-lyase.
          Length = 464

 Score = 28.8 bits (64), Expect = 9.0
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 33  FSSPPPPVYQTSSHTSPNAYNNPPSPY-HSANP 64
           F +   P+YQT++   P+A  N P  Y  S NP
Sbjct: 101 FDAMSTPLYQTATFKQPSAIENGPYDYTRSGNP 133


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 27.8 bits (62), Expect = 9.1
 Identities = 18/84 (21%), Positives = 26/84 (30%), Gaps = 10/84 (11%)

Query: 185 NKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA---NIEDATRSTHPTTATAP 241
            + S  +P A S+       + T P+   T  T   + PA     E    +  P    A 
Sbjct: 50  EQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAR 109

Query: 242 SVPPAIHDATRSTPSPIIAPSVPP 265
            V           P   + P  PP
Sbjct: 110 EVKTV-------VPVSSVTPVTPP 126


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 28.4 bits (63), Expect = 9.3
 Identities = 20/87 (22%), Positives = 28/87 (32%), Gaps = 8/87 (9%)

Query: 33  FSSPPPPVYQTSSHTSPNAYNNPPSPYHSA---NPPPYYQHPSVPSHPSVPTFPSAPIYP 89
             SPPPP    S      + +   SP+      N  P    P +    S+P  P+     
Sbjct: 295 IRSPPPPGRLRSPPPLHASDSPVASPHVGELGVNFSPLAFRPGLTRTKSLPRTPNPF--- 351

Query: 90  SAPPQESHSPYYYSDIRANVPTGGSPV 116
                E   P+   D R +  T    V
Sbjct: 352 --HRLEYILPFNERDERESSTTRDVHV 376


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 155

 Score = 27.8 bits (63), Expect = 9.7
 Identities = 8/39 (20%), Positives = 14/39 (35%)

Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALV 276
           A   +  PA   A  + P+   A +        + S +V
Sbjct: 49  AAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMV 87


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.128    0.375 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,525,205
Number of extensions: 2537514
Number of successful extensions: 5181
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3504
Number of HSP's successfully gapped: 650
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.4 bits)