RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4168
         (454 letters)



>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score =  252 bits (645), Expect = 5e-80
 Identities = 107/394 (27%), Positives = 179/394 (45%), Gaps = 44/394 (11%)

Query: 66  TPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPI---NNLEEFTELFS 122
            P P   +         PL+W ++        D   +W EL+E       +L E  ELFS
Sbjct: 4   EPKPKKKLK--------PLHWDKV----NPAQDRGTVWDELDEESFEKDLDLSELEELFS 51

Query: 123 RQVTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNMDASV 182
            +   ++    +++  S K +    ++D KRSQN+ IL + L +   EI  AI  MD SV
Sbjct: 52  AKAKKKKSKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMDESV 111

Query: 183 VSLEALQQIYDVRATDEEINLIRAHLASNSDLQLDKPEQFLADLADIPHFSERIACFMFE 242
           + LE L+ +  +  T EE+  ++ +       +L + EQFL +L+ IP   ER+   +F+
Sbjct: 112 LGLELLENLLKMAPTKEELKKLKEY--KGDPSKLGRAEQFLLELSKIPRLEERLEALLFK 169

Query: 243 ADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADGFGLE 302
           + F + +  +   L  +++  E L +S   KK++ +IL+LGN+MN G R RG A GF L 
Sbjct: 170 STFEEEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTR-RGNAKGFKLS 228

Query: 303 ILPKLKDVRSKDNSVTLLHFIVRTYLRNSENPLNESLPVPEPGDVDRAASVVFDDIHSQL 362
            L KL D +S DN  TLLH++V+        P        E   V++AA V  + +   +
Sbjct: 229 SLLKLSDTKSTDNKTTLLHYLVKIIREK--LPDLLDFS-SELSHVEKAAKVDLEQLEKDV 285

Query: 363 STLAKKLDAVTISMNKVVQESKPDHMEPFRTKMESCVKTGK----------------FKS 406
             L K L  +   +            + F  KM+  ++  +                FK 
Sbjct: 286 KELEKGLKKLERELEL--SALDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKE 343

Query: 407 VLKFYQYIPKGIPESDVTPKDLFMLWAPFCHDFK 440
           + +++   PK     + +P++ F +   F   FK
Sbjct: 344 LTEYFGEDPK-----ETSPEEFFKILRDFLRMFK 372


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score =  234 bits (598), Expect = 7e-73
 Identities = 111/414 (26%), Positives = 194/414 (46%), Gaps = 52/414 (12%)

Query: 64  MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSR 123
              P P   +         PL+W +  ++P   S T  +W +++E    +L+E  ELFS 
Sbjct: 1   KKEPKPKKKLK--------PLHWDK--LNPSDLSGT--VWDKIDEESEGDLDELEELFSA 48

Query: 124 QVTAQRP---VTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNMDA 180
           +   +     V+ K+     K     +++D KRSQN+ IL + LH+ + EI+ AI   D 
Sbjct: 49  KEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRKLHMSYEEIKEAILEGDE 108

Query: 181 SVVSLEALQQIYDVRATDEEINLIRAHLASNSDL-QLDKPEQFLADLADIPHFSERIACF 239
            V+S++ L+Q+     T EE+  +R +     D  +L + EQFL  +++IP+  ER+   
Sbjct: 109 DVLSVDLLEQLLKYAPTKEELKKLREY--KEEDPEELARAEQFLLLISNIPYLEERLNAL 166

Query: 240 MFEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADGF 299
           +F+A+F + +  +  ++  +++ CE L +S+  +K++ +IL++GN+MNGG+R RGQA GF
Sbjct: 167 LFKANFEEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSR-RGQAYGF 225

Query: 300 GLEILPKLKDVRSKDNSVTLLHFIVRT----YLRNSENPLNES---LPVPEPGDVDRAAS 352
            L  L KL DV+S DN  TLLHF+V+     YL    +P N     + V +P    +AA 
Sbjct: 226 KLSSLLKLSDVKSADNKTTLLHFLVKIIRKKYLGGLSDPENLDDKFIEVMKP--FLKAAK 283

Query: 353 VVFDDIHSQLSTLAKKLDA---VTISMNKVVQESKPDHMEPFRTKMESCVKTGK------ 403
             +D +   LS L  + +          K    S  +  + F   ++   K  +      
Sbjct: 284 EKYDKLQKDLSDLKTRFEKLVEYYGEDPKDT--SPEEFFKDFNEFLKEFSKAAEENIKKE 341

Query: 404 ----------FKSVLKFYQYIPKGIPESDVTPKDLFMLWAPFCHDFKDFWKKEQ 447
                      K   ++ Q   +   + +  P   F +   F        ++  
Sbjct: 342 EEEEERRKKLVKETTEYEQSSSR---QKERNPSMDFEVERDFLGVLDSLLEELG 392


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This
          family contains bacterial fibronectin-attachment
          proteins (FAP). Family members are rich in alanine and
          proline, are approximately 300 long, and seem to be
          restricted to mycobacteria. These proteins contain a
          fibronectin-binding motif that allows mycobacteria to
          bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 53.4 bits (128), Expect = 8e-08
 Identities = 21/64 (32%), Positives = 23/64 (35%)

Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
           +LP    A P PPPPPP           PPP      PP P    P   PP     P+ 
Sbjct: 31 IALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNA 90

Query: 81 LGPL 84
            P 
Sbjct: 91 PPPP 94



 Score = 44.9 bits (106), Expect = 4e-05
 Identities = 26/59 (44%), Positives = 28/59 (47%), Gaps = 4/59 (6%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGT----PPPPPPMMGTPPPPPPMMGTTPHVLG 82
           AAPPPPPPP   P P P      PP        PPPPP     PPPP P  G   + +G
Sbjct: 58  AAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGRIDNAVG 116



 Score = 33.7 bits (77), Expect = 0.16
 Identities = 18/39 (46%), Positives = 20/39 (51%), Gaps = 4/39 (10%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           P+   A PPPPP  P  PPPP +    PP    PP P P
Sbjct: 74  PDDPNAAPPPPPADPNAPPPPPVDPNAPP----PPAPEP 108



 Score = 32.2 bits (73), Expect = 0.55
 Identities = 17/49 (34%), Positives = 17/49 (34%), Gaps = 12/49 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P        PPPPP  P  PP            PPP  P    PP P P
Sbjct: 72  PQPDDPNAAPPPPPADPNAPP------------PPPVDPNAPPPPAPEP 108



 Score = 28.3 bits (63), Expect = 7.4
 Identities = 24/83 (28%), Positives = 26/83 (31%), Gaps = 30/83 (36%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMG--------------------------TPPPPP 61
           A PPPP  P   PPP P  G     + G                          T   PP
Sbjct: 90  APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYVVPAGWVESDATHLDYGSALLSKTTGEPP 149

Query: 62  PMMGTPPPPPPMMGTTPHVLGPL 84
                P  PPP+   T  VLG L
Sbjct: 150 M----PGQPPPVANDTRVVLGRL 168


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 54.2 bits (130), Expect = 1e-07
 Identities = 33/95 (34%), Positives = 34/95 (35%), Gaps = 17/95 (17%)

Query: 21   FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTT 77
            F+LP  Q   PP P  PP P P P     PPP    PPPPPP    PP  P   P     
Sbjct: 2898 FALPPDQPERPPQPQAPPPPQPQP---QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE 2954

Query: 78   PHVLGPLYW-----------TRLIVSPEIPSDTTP 101
            P    P  W            R  V    PS   P
Sbjct: 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989



 Score = 49.9 bits (119), Expect = 2e-06
 Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
             + ++   PP  P   P P       PPP    P PPPP    PPPPPP
Sbjct: 2893 RSTESFALPPDQPERPPQPQA-----PPPPQPQPQPPPPPQPQPPPPPP 2936



 Score = 48.4 bits (115), Expect = 7e-06
 Identities = 21/49 (42%), Positives = 22/49 (44%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            P V  +      PP  P  PP     PPP     PPPPP    PPPPPP
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 46.5 bits (110), Expect = 3e-05
 Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 2/69 (2%)

Query: 6   KRAKR--RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
            R +R  R     ++      +      P P   P P P P+  + PPP     P   P 
Sbjct: 382 TRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPG 441

Query: 64  MGTPPPPPP 72
               P PPP
Sbjct: 442 SDDGPAPPP 450



 Score = 43.4 bits (102), Expect = 3e-04
 Identities = 26/85 (30%), Positives = 34/85 (40%), Gaps = 17/85 (20%)

Query: 6    KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTP------------PPM 53
            +R +RR  + T  ++ SL     A PPPPPP P P P  ++  TP            P +
Sbjct: 2681 QRPRRRAARPTVGSLTSL-----ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735

Query: 54   MGTPPPPPPMMGTPPPPPPMMGTTP 78
               P PP    G   P  P     P
Sbjct: 2736 PAAPAPPAVPAGPATPGGPARPARP 2760



 Score = 43.0 bits (101), Expect = 4e-04
 Identities = 14/51 (27%), Positives = 19/51 (37%)

Query: 22   SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            S  ++ +   P  PP  +  P   +     P    PPP       PPPPP 
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844



 Score = 42.2 bits (99), Expect = 6e-04
 Identities = 21/81 (25%), Positives = 24/81 (29%), Gaps = 4/81 (4%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            P   A P P PP    P PPP      P      P P P    PPP  P     P  +  
Sbjct: 2608 PRGPAPPSPLPPDTHAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663

Query: 84   LYWTRLIVSPEIPSDTTPLWK 104
                R +      S      +
Sbjct: 2664 PRRARRLGRAAQASSPPQRPR 2684



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 22/90 (24%), Positives = 25/90 (27%), Gaps = 10/90 (11%)

Query: 22   SLPNVQAAPPPPPPP----------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            S+P  + AP P  P           PP    P           G  PP P    T  P P
Sbjct: 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP 2626

Query: 72   PMMGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
            P    +P    P       V P       P
Sbjct: 2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDP 2656



 Score = 39.9 bits (93), Expect = 0.003
 Identities = 24/92 (26%), Positives = 27/92 (29%), Gaps = 17/92 (18%)

Query: 23   LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG------------TPPPP 70
            L    A PP   P  PLPPP     T PPP  G PPP  P+ G                P
Sbjct: 2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871

Query: 71   PPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPL 102
                             R +  P +   T   
Sbjct: 2872 AAKPAAPARPP-----VRRLARPAVSRSTESF 2898



 Score = 39.5 bits (92), Expect = 0.004
 Identities = 27/83 (32%), Positives = 29/83 (34%), Gaps = 11/83 (13%)

Query: 22  SLPNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPP---PPMMGTPPPPPPM 73
               V A+ P P P P     P PP  P+    P    G  PPP   PP   T P P   
Sbjct: 407 PAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDP 466

Query: 74  MGTTPHVLGPLYWTRLIVSPEIP 96
              T   L  L   R    PE P
Sbjct: 467 DDATRKALDALRERR---PPEPP 486



 Score = 39.2 bits (91), Expect = 0.005
 Identities = 17/60 (28%), Positives = 20/60 (33%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
             + ++ P P  P  P           PP      P PPP    P  PPP  G  P  L  
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852



 Score = 38.4 bits (89), Expect = 0.010
 Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 5/49 (10%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPP-----PMMGTPPPPPPMMGTPPPPP 71
           A     PPPPP    P      P       + G P   P     PPP P
Sbjct: 272 ARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320



 Score = 38.0 bits (88), Expect = 0.011
 Identities = 22/76 (28%), Positives = 24/76 (31%), Gaps = 24/76 (31%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPM-----------MGTTPPPMMG-------------TPPP 59
            P     P PPPPPPP P PP              G  P P +G              P P
Sbjct: 2923 PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982

Query: 60   PPPMMGTPPPPPPMMG 75
             P         PP+ G
Sbjct: 2983 APSREAPASSTPPLTG 2998



 Score = 38.0 bits (88), Expect = 0.011
 Identities = 25/80 (31%), Positives = 29/80 (36%), Gaps = 7/80 (8%)

Query: 27   QAAPPPP----PPPPPLPPPPPMMGTTP--PPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
            QA+P  P    PP  P  P  P     P  PP    PP P P       PP  + T P V
Sbjct: 2730 QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL-TRPAV 2788

Query: 81   LGPLYWTRLIVSPEIPSDTT 100
                     + SP  P+D  
Sbjct: 2789 ASLSESRESLPSPWDPADPP 2808



 Score = 37.2 bits (86), Expect = 0.018
 Identities = 24/98 (24%), Positives = 34/98 (34%), Gaps = 11/98 (11%)

Query: 5    DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPP-PPMMGTTPPPMMGTPPPPPPM 63
            D  A  R  +   +         ++PP  P      P    +     PP    PPPP P 
Sbjct: 2654 DDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP----PPPPTP- 2708

Query: 64   MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
                P P  ++  TP   GP    +   SP +P+   P
Sbjct: 2709 ---EPAPHALVSATPLPPGPAAARQ--ASPALPAAPAP 2741



 Score = 37.2 bits (86), Expect = 0.020
 Identities = 20/64 (31%), Positives = 22/64 (34%), Gaps = 18/64 (28%)

Query: 27   QAAPPPPPPP-PPLP-----------PPPPMMGTTP------PPMMGTPPPPPPMMGTPP 68
            ++ PPP P P P  P           PP       P      P     P P PP    P 
Sbjct: 2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625

Query: 69   PPPP 72
            PPPP
Sbjct: 2626 PPPP 2629



 Score = 35.7 bits (82), Expect = 0.059
 Identities = 15/49 (30%), Positives = 15/49 (30%)

Query: 22   SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
              P     P    P PP P P P      P    T PPP      P P 
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPG 2659



 Score = 34.9 bits (80), Expect = 0.094
 Identities = 14/64 (21%), Positives = 17/64 (26%), Gaps = 6/64 (9%)

Query: 27   QAAPPPPPPPPPLPPPPPMMGTTPPPM------MGTPPPPPPMMGTPPPPPPMMGTTPHV 80
            +     P   P  P  PP+     P +         PP  P     P  PPP        
Sbjct: 2865 RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924

Query: 81   LGPL 84
              P 
Sbjct: 2925 PPPQ 2928



 Score = 34.9 bits (80), Expect = 0.12
 Identities = 19/82 (23%), Positives = 23/82 (28%), Gaps = 19/82 (23%)

Query: 21   FSLPNVQAAPPPPPPPPPLPPPPPMM-------------GTTPPPMMGTPPPPPPMMGT- 66
                       PPP  P   P P  +               + PP         P +G+ 
Sbjct: 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695

Query: 67   -----PPPPPPMMGTTPHVLGP 83
                 PPPPPP     PH L  
Sbjct: 2696 TSLADPPPPPPTPEPAPHALVS 2717



 Score = 34.1 bits (78), Expect = 0.20
 Identities = 19/80 (23%), Positives = 24/80 (30%), Gaps = 16/80 (20%)

Query: 6   KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP---PP 62
           +R+ R          F+          P  P P   P P     P P+  + PPP   P 
Sbjct: 385 RRSARHAATP-----FARGPGGDDQTRPAAPVPASVPTP----APTPVPASAPPPPATPL 435

Query: 63  MMGTPP----PPPPMMGTTP 78
               P     P PP     P
Sbjct: 436 PSAEPGSDDGPAPPPERQPP 455



 Score = 33.4 bits (76), Expect = 0.30
 Identities = 20/71 (28%), Positives = 25/71 (35%), Gaps = 9/71 (12%)

Query: 22   SLPNVQAAPPPPPPP-----PPLPPPPPMMGTTPPPMMGTPP----PPPPMMGTPPPPPP 72
            + P + AAP PP  P     P  P  P    TT  P    PP      PP   T P    
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790

Query: 73   MMGTTPHVLGP 83
            +  +   +  P
Sbjct: 2791 LSESRESLPSP 2801



 Score = 33.4 bits (76), Expect = 0.34
 Identities = 18/84 (21%), Positives = 25/84 (29%), Gaps = 4/84 (4%)

Query: 22   SLPNVQAAP--PPPPPPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPPPPPPMMGTT 77
            + P   A P  P   PP  L  P     +     + +P  P  PP     P        +
Sbjct: 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823

Query: 78   PHVLGPLYWTRLIVSPEIPSDTTP 101
            P    P   +    +P  P    P
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPP 2847



 Score = 32.2 bits (73), Expect = 0.67
 Identities = 18/63 (28%), Positives = 22/63 (34%), Gaps = 18/63 (28%)

Query: 24   PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMM--------------GTPPPPPPMMGTPPP 69
            P+  A P P    P + P  P+     P M+              G PPPP P    P  
Sbjct: 2504 PDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLP----PAA 2559

Query: 70   PPP 72
            PP 
Sbjct: 2560 PPA 2562



 Score = 32.2 bits (73), Expect = 0.74
 Identities = 21/85 (24%), Positives = 23/85 (27%), Gaps = 7/85 (8%)

Query: 24   PNVQAAPPPPPPPP----PLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPPMMGT 76
            P     P  P  PP    P  P PP      PP   T P    +     + P P      
Sbjct: 2748 PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807

Query: 77   TPHVLGPLYWTRLIVSPEIPSDTTP 101
               VL P        SP  P     
Sbjct: 2808 PAAVLAPAAALPPAASPAGPLPPPT 2832



 Score = 31.8 bits (72), Expect = 1.1
 Identities = 14/71 (19%), Positives = 17/71 (23%), Gaps = 20/71 (28%)

Query: 22   SLPNVQAAPPPPPP------------------PPPLPPPPPMMGTTPPPM--MGTPPPPP 61
            + P     PP PPP                  PP   P         PP+  +  P    
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR 2893

Query: 62   PMMGTPPPPPP 72
                   PP  
Sbjct: 2894 STESFALPPDQ 2904



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 21/78 (26%), Positives = 23/78 (29%), Gaps = 30/78 (38%)

Query: 24   PNVQAAPPPPPPPPPLP---------------PPPPMM---------------GTTPPPM 53
            P+     PP P  PP P               P  P M               G  PPP+
Sbjct: 2496 PDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPL 2555

Query: 54   MGTPPPPPPMMGTPPPPP 71
                PP  P    PPP P
Sbjct: 2556 PPAAPPAAPDRSVPPPRP 2573



 Score = 29.9 bits (67), Expect = 4.2
 Identities = 15/57 (26%), Positives = 17/57 (29%), Gaps = 7/57 (12%)

Query: 23   LPNVQAAPPPPPPPPPLP---PPPPMMGTTPPPMMGTP----PPPPPMMGTPPPPPP 72
             P    AP   P     P    P P +G   P  +  P    P P P    P    P
Sbjct: 2938 RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994



 Score = 29.5 bits (66), Expect = 4.4
 Identities = 19/63 (30%), Positives = 22/63 (34%), Gaps = 18/63 (28%)

Query: 28   AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP----PPPMM--------------GTPPP 69
            AAP P    PP P  PP      P ++   P      P M+              G PPP
Sbjct: 2494 AAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPP 2553

Query: 70   PPP 72
            P P
Sbjct: 2554 PLP 2556


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 48.6 bits (116), Expect = 5e-06
 Identities = 34/130 (26%), Positives = 44/130 (33%), Gaps = 10/130 (7%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP-----MMGTPPPPPPMMGTPPPP---PPM 73
            LP    AP  PP PP LP  PP +     P     M   PP PP     PP P   PP 
Sbjct: 219 FLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPP 278

Query: 74  MGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNL--EEFTELFSRQVTAQRPV 131
                   G        + P       PL ++ +         E+  +L  +Q  A    
Sbjct: 279 QNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQE 338

Query: 132 TRKRQQKSSK 141
             KR ++  K
Sbjct: 339 EAKRAKRRHK 348



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 15/71 (21%), Positives = 17/71 (23%), Gaps = 6/71 (8%)

Query: 19  NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT------PPPPPPMMGTPPPPPP 72
               LP          P    P  PP     P             P P      PP PP 
Sbjct: 176 PQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQ 235

Query: 73  MMGTTPHVLGP 83
           +    P +  P
Sbjct: 236 LPQQPPPLQQP 246



 Score = 38.2 bits (89), Expect = 0.010
 Identities = 18/61 (29%), Positives = 19/61 (31%), Gaps = 1/61 (1%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
           L   Q AP  P PP  + P          P  G PP  PP    PP   P        L 
Sbjct: 163 LQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQG-PPEQPPGYPQPPQGHPEQVQPQQFLP 221

Query: 83  P 83
            
Sbjct: 222 A 222



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 15/65 (23%), Positives = 16/65 (24%), Gaps = 7/65 (10%)

Query: 22  SLPNVQAAPPPPPPPPPLPPP----PPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMM 74
                   PP    P  +PP     P       PP    PP   P    P    P P   
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226

Query: 75  GTTPH 79
              P 
Sbjct: 227 PAQPP 231



 Score = 35.9 bits (83), Expect = 0.043
 Identities = 12/49 (24%), Positives = 13/49 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P      P    P    P P      PP     P  PPP+     P   
Sbjct: 204 PQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLS 252



 Score = 35.9 bits (83), Expect = 0.046
 Identities = 16/66 (24%), Positives = 17/66 (25%), Gaps = 11/66 (16%)

Query: 24  PNVQAAPPPPPPPPPL------PPPPPMMGTTPPPMMGTP-----PPPPPMMGTPPPPPP 72
           P  Q  P   PP          P  PP     P             P P      PP PP
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234

Query: 73  MMGTTP 78
            +   P
Sbjct: 235 QLPQQP 240



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 16/60 (26%), Positives = 19/60 (31%), Gaps = 3/60 (5%)

Query: 20  IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           + SL  V+A        P LP PP  +    P  M       P  G P  PP        
Sbjct: 153 MLSLEEVEAQLQQRQQAPQLPQPPQQV---LPQGMPPRQAAFPQQGPPEQPPGYPQPPQG 209



 Score = 34.4 bits (79), Expect = 0.15
 Identities = 18/69 (26%), Positives = 19/69 (27%), Gaps = 5/69 (7%)

Query: 21  FSLPNVQAAPPPPP--PPPPLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPPMMG 75
              P  Q   PP P   PPP   P P  G         PPP  P    +   P       
Sbjct: 259 PPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGP 318

Query: 76  TTPHVLGPL 84
                L  L
Sbjct: 319 QFREQLVQL 327



 Score = 32.4 bits (74), Expect = 0.58
 Identities = 14/81 (17%), Positives = 18/81 (22%), Gaps = 7/81 (8%)

Query: 20  IFSLPNVQAAPPPP-PPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
           ++         PP  P   P P  P   M+               P      P PP    
Sbjct: 125 LWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQ----LPQPPQQVL 180

Query: 77  TPHVLGPLYWTRLIVSPEIPS 97
              +            PE P 
Sbjct: 181 PQGMPPRQAAFPQQGPPEQPP 201



 Score = 32.4 bits (74), Expect = 0.64
 Identities = 14/77 (18%), Positives = 15/77 (19%), Gaps = 13/77 (16%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPM-------------MGTTPPPMMGTPPPPPPMMGTPP 68
           SL      P P PP  P   P P                          P PP  +    
Sbjct: 124 SLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQG 183

Query: 69  PPPPMMGTTPHVLGPLY 85
            PP              
Sbjct: 184 MPPRQAAFPQQGPPEQP 200


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 46.9 bits (111), Expect = 7e-06
 Identities = 22/53 (41%), Positives = 24/53 (45%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
            S  N +  P PPPPPP            PP + G PPP P      PPPPPM
Sbjct: 165 PSPWNPRFPPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217



 Score = 33.8 bits (77), Expect = 0.14
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMG 47
           P +   PPP P  PP+ PPPP M 
Sbjct: 195 PFLSGWPPPFPLGPPMIPPPPPMS 218


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 45.6 bits (108), Expect = 1e-05
 Identities = 27/85 (31%), Positives = 28/85 (32%), Gaps = 24/85 (28%)

Query: 24  PNVQAAPPPPPPPPPL-------PPPPPMMGTTPPPMM---GTPPP-----PP------- 61
             +QA   P PPPP         P P P M     PMM   GTPPP     PP       
Sbjct: 32  SAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHP 91

Query: 62  --PMMGTPPPPPPMMGTTPHVLGPL 84
             P    P  P  M         PL
Sbjct: 92  SMPPGSHPFSPYAMPSAEVPGSTPL 116



 Score = 37.9 bits (88), Expect = 0.005
 Identities = 13/67 (19%), Positives = 17/67 (25%), Gaps = 8/67 (11%)

Query: 29 APPPPPPPPPLPPPPPMMGTTP--------PPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
           P    P   +    P     P              PP     + + P P P M      
Sbjct: 8  KPSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQP 67

Query: 81 LGPLYWT 87
          + P Y T
Sbjct: 68 MMPPYGT 74



 Score = 37.1 bits (86), Expect = 0.008
 Identities = 13/63 (20%), Positives = 19/63 (30%), Gaps = 3/63 (4%)

Query: 24 PNVQAAPPPPPPPPPLPPPPP---MMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
          P+  +         P P   P    M     P    P     +  +P P P M G    +
Sbjct: 9  PSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPM 68

Query: 81 LGP 83
          + P
Sbjct: 69 MPP 71


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 45.7 bits (108), Expect = 3e-05
 Identities = 17/52 (32%), Positives = 17/52 (32%), Gaps = 3/52 (5%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
               P     PP    P    T   P  GT P      GTP PP P  G  P
Sbjct: 220 TTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAE---GTPAPPTPGGGEAP 268



 Score = 41.9 bits (98), Expect = 5e-04
 Identities = 16/48 (33%), Positives = 18/48 (37%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P+   +PP    P P          T P   GTP PP P  G  PP  
Sbjct: 224 PSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPAN 271



 Score = 41.1 bits (96), Expect = 9e-04
 Identities = 15/52 (28%), Positives = 17/52 (32%), Gaps = 1/52 (1%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
           P  P P P     P   T  P    +PP       +     P  GTTP   G
Sbjct: 205 PATPRPTPRTTASPET-TPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG 255



 Score = 33.8 bits (77), Expect = 0.20
 Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 8/76 (10%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP----PPMMGTPPPPPPMMGTTPHVLGPLY 85
           P  P   P L P    +  TP P   T   P     P   T PP   +   +  +  P  
Sbjct: 188 PALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247

Query: 86  WTRLIVSPEIPSDTTP 101
            T    +PE      P
Sbjct: 248 GT----TPEAEGTPAP 259



 Score = 33.4 bits (76), Expect = 0.24
 Identities = 15/64 (23%), Positives = 17/64 (26%), Gaps = 6/64 (9%)

Query: 38  PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
           P  P P    T  P    T P P      P    P   TT              +   P+
Sbjct: 205 PATPRPTPRTTASPE---TTPTPSTTTSPPSTTIPAPSTTIAAPQA---GTTPEAEGTPA 258

Query: 98  DTTP 101
             TP
Sbjct: 259 PPTP 262


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 43.5 bits (103), Expect = 3e-05
 Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 5/60 (8%)

Query: 18  SNIFSLPNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           ++I  L +V     PPPP         PPPPP       P     P  PP    P  P  
Sbjct: 45  ASIPGLEDVTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAE 104



 Score = 33.1 bits (76), Expect = 0.13
 Identities = 20/62 (32%), Positives = 20/62 (32%), Gaps = 13/62 (20%)

Query: 24  PNVQAAPPPPP-------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
             VQ  PPPP        PPPP P          PP    P  PP     P P  P   T
Sbjct: 53  VTVQTTPPPPASAITNGGPPPPPP---ARAEAASPPPPEAPAEPPAE---PEPEAPAENT 106

Query: 77  TP 78
             
Sbjct: 107 VT 108



 Score = 29.3 bits (66), Expect = 2.1
 Identities = 19/69 (27%), Positives = 21/69 (30%), Gaps = 13/69 (18%)

Query: 34  PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
                 PPPP    T      G PPPPPP       PPP                    P
Sbjct: 53  VTVQTTPPPPASAITN-----GGPPPPPPARAEAASPPP-PEAPAEPPAE-------PEP 99

Query: 94  EIPSDTTPL 102
           E P++ T  
Sbjct: 100 EAPAENTVT 108


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 43.4 bits (102), Expect = 4e-05
 Identities = 39/81 (48%), Positives = 42/81 (51%), Gaps = 24/81 (29%)

Query: 22  SLPNVQAAPPPPP--------PPPPLP-------PPPPMMGTT----PPPM---MGTPPP 59
            LP   A PPPPP        PPPPLP       PPPP+ G      PPP+    G PPP
Sbjct: 65  PLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPP 124

Query: 60  PPPMMGTP--PPPPPMMGTTP 78
           PPP  G P  PPPPP MG+ P
Sbjct: 125 PPPFPGAPGIPPPPPGMGSPP 145



 Score = 42.2 bits (99), Expect = 9e-05
 Identities = 25/57 (43%), Positives = 26/57 (45%), Gaps = 6/57 (10%)

Query: 23  LPNVQAAPPPPPPP------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           LP   A PPPPP P      PP PP P   G  PPP     PPPPP     P  P +
Sbjct: 103 LPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVL 159



 Score = 40.7 bits (95), Expect = 4e-04
 Identities = 32/67 (47%), Positives = 34/67 (50%), Gaps = 12/67 (17%)

Query: 23  LPNVQAAPPPPP---------PPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPPPPP 71
           LP     PPPPP         PPPPLP  P +    PPP  G P  PPPPP MG+PPPPP
Sbjct: 90  LPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVP-PPPPPFPGAPGIPPPPPGMGSPPPPP 148

Query: 72  PMMGTTP 78
              G   
Sbjct: 149 FGFGVPA 155



 Score = 38.4 bits (89), Expect = 0.002
 Identities = 35/77 (45%), Positives = 38/77 (49%), Gaps = 23/77 (29%)

Query: 22  SLPNVQAAPPPP--------PPPPPL------PPPPPMMGTT----PPPMMGT-----PP 58
            LP     PPPP        PPPPPL      PPPPP+ G T    PPP+ G      PP
Sbjct: 29  PLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPP 88

Query: 59  PPPPMMGTPPPPPPMMG 75
           P P   G PPPPPP+ G
Sbjct: 89  PLPGGAGIPPPPPPLPG 105



 Score = 35.3 bits (81), Expect = 0.023
 Identities = 32/71 (45%), Positives = 36/71 (50%), Gaps = 10/71 (14%)

Query: 23  LPNVQAAPPPPPPPPP--LPPPPPMMGTT----PPPMMG----TPPPPPPMMGTPPPPPP 72
           LP V   PPPPP P    +PPPPP+ G      PPP+ G     PPPPP   G   PPPP
Sbjct: 54  LPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPP 113

Query: 73  MMGTTPHVLGP 83
            +   P V  P
Sbjct: 114 PLPGGPGVPPP 124



 Score = 34.5 bits (79), Expect = 0.039
 Identities = 25/56 (44%), Positives = 28/56 (50%), Gaps = 10/56 (17%)

Query: 30 PPPPPPP-----PPLPPPPPMMGTTPPPMMG-----TPPPPPPMMGTPPPPPPMMG 75
          PPPPP P     PP PP P  +   PPP +       PPPP P     PPPPP+ G
Sbjct: 1  PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPG 56



 Score = 34.5 bits (79), Expect = 0.050
 Identities = 33/69 (47%), Positives = 37/69 (53%), Gaps = 21/69 (30%)

Query: 30 PPPP-------PPPPPL------PPPPPMMGTT----PPPMMGT----PPPPPPMMGTPP 68
          PPPP       PPPPPL      PPPPP+ G T    PPP+ G     PPPP P +   P
Sbjct: 2  PPPPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIP 61

Query: 69 PPPPMMGTT 77
          PPPP+ G T
Sbjct: 62 PPPPLPGAT 70


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 45.3 bits (108), Expect = 4e-05
 Identities = 23/99 (23%), Positives = 34/99 (34%), Gaps = 4/99 (4%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
           A+     PPPP   PPP   T P P    P    P    PPPPP     + +    +   
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLP---QPTATAPPPTPPPPPPTATQASSNAPAQIPAD 573

Query: 88  RLIVSPEIPSDTTPLWKELEEVPINNL-EEFTELFSRQV 125
                P     T    K+     I+   +   + F+ ++
Sbjct: 574 SSPPPPIPEEPTPSPTKDSSPEEIDKAAKNLADFFNGEI 612



 Score = 35.3 bits (82), Expect = 0.071
 Identities = 19/49 (38%), Positives = 20/49 (40%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   A PP PPPPPP             P   +PPPP P   TP P   
Sbjct: 543 PTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKD 591



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 17/55 (30%), Positives = 19/55 (34%), Gaps = 4/55 (7%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP----PPPPPMMGTPPPPPP 72
              +     P PP P P    PP     PPP         P   P   +PPPP P
Sbjct: 527 PQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIP 581



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 15/49 (30%), Positives = 15/49 (30%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P      P P    P P PPP   T        P   P     PPP P 
Sbjct: 534 PAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPE 582



 Score = 33.4 bits (77), Expect = 0.24
 Identities = 22/91 (24%), Positives = 29/91 (31%), Gaps = 10/91 (10%)

Query: 12  KKKETYSNIFSLPNVQAAPPPPPPPPPL-----PPPPPMMGTTPPPMMGTPPPPPPMMGT 66
                       P+   AP  P  P        P PPP   + P P+   P  P P    
Sbjct: 371 ASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSP---- 426

Query: 67  PPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
            PP           L  L W +++   E+PS
Sbjct: 427 TPPANAANAPPSLNLEEL-WQQILAKLELPS 456



 Score = 33.4 bits (77), Expect = 0.24
 Identities = 28/117 (23%), Positives = 38/117 (32%), Gaps = 14/117 (11%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
           LP+   +       P  P P P     P P+  + P       TP PPP        V  
Sbjct: 360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPA 419

Query: 83  PLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRK--RQQ 137
               T    +P   +   P        P  NLEE  +    ++  + P TR    QQ
Sbjct: 420 EP--TEPSPTPPANAANAP--------PSLNLEELWQQILAKL--ELPSTRMLLSQQ 464



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 14/50 (28%), Positives = 16/50 (32%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           P +       PPP P PPPP     +       P    P    P  P P 
Sbjct: 538 PPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587



 Score = 32.6 bits (75), Expect = 0.42
 Identities = 19/50 (38%), Positives = 21/50 (42%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           LP   A  PPP PPPP P         P  +     PPPP+   P P P 
Sbjct: 540 LPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589



 Score = 32.2 bits (74), Expect = 0.63
 Identities = 11/49 (22%), Positives = 12/49 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                  PPPPPP             P      PP P     +P     
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSS 593



 Score = 31.9 bits (73), Expect = 0.76
 Identities = 17/61 (27%), Positives = 20/61 (32%)

Query: 12  KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            +    S   +        PPP P  P  P      TTP P      PP P+   P  P 
Sbjct: 366 SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPS 425

Query: 72  P 72
           P
Sbjct: 426 P 426



 Score = 31.5 bits (72), Expect = 1.2
 Identities = 17/62 (27%), Positives = 21/62 (33%)

Query: 12  KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           +K               A  PPP PPP PP      +  P  +     PPP +   P P 
Sbjct: 528 QKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587

Query: 72  PM 73
           P 
Sbjct: 588 PT 589



 Score = 29.5 bits (67), Expect = 4.7
 Identities = 21/69 (30%), Positives = 27/69 (39%), Gaps = 1/69 (1%)

Query: 41  PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP-EIPSDT 99
                   TPPP   +PPPP P    P P       TP    P        +P +IP+D+
Sbjct: 515 GSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADS 574

Query: 100 TPLWKELEE 108
           +P     EE
Sbjct: 575 SPPPPIPEE 583


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 40.5 bits (95), Expect = 6e-05
 Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGT--PPPPPPMMGTPPPPPPMMGTTP 78
           PPPP     PPPP    + PPP+       PPPP+  +PPPP  +  + P
Sbjct: 5  SPPPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPP 55



 Score = 40.1 bits (94), Expect = 6e-05
 Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 3/50 (6%)

Query: 24 PNVQAAPPPPPP---PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
          P V+   PPPP     PP P   P+  + PPP+  +PPPP  +  +PPPP
Sbjct: 8  PPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPPP 57


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 42.5 bits (100), Expect = 8e-05
 Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 2/59 (3%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMG--TPPPPPPMMGTPPPPPPMMGTTP 78
           +LP     P  P P  P  P  PM    P   +    P PP P M    P PP++   P
Sbjct: 91  NLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLP 149



 Score = 35.9 bits (83), Expect = 0.016
 Identities = 18/74 (24%), Positives = 24/74 (32%), Gaps = 3/74 (4%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
           +P   +  P     P LP P         P    PP P   M   PP  P+    P    
Sbjct: 76  VPGQHSMTPTQHHQPNLPQPAQQP---FQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPL 132

Query: 83  PLYWTRLIVSPEIP 96
           P  +    + P +P
Sbjct: 133 PPMFPMQPLPPLLP 146



 Score = 33.2 bits (76), Expect = 0.12
 Identities = 18/60 (30%), Positives = 20/60 (33%), Gaps = 6/60 (10%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
                   +Q  PP  P PP  P PP       PPM    P PP +   P    P    T
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPP------LPPMFPMQPLPPLLPDLPLEAWPATDKT 159



 Score = 29.4 bits (66), Expect = 2.8
 Identities = 17/61 (27%), Positives = 20/61 (32%), Gaps = 5/61 (8%)

Query: 23  LPNVQAAPPPPPPPP--PLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
           +P + A  P  P  P  P+P    M  T    P        P  P    PP P   M   
Sbjct: 58  IPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQ 117

Query: 78  P 78
           P
Sbjct: 118 P 118


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 41.6 bits (98), Expect = 8e-05
 Identities = 21/46 (45%), Positives = 22/46 (47%), Gaps = 3/46 (6%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            AAPPP  PPPP  P PP +  TPPP              PPPP P
Sbjct: 65  YAAPPPYGPPPPYYPAPPGVYPTPPPPN---SGYMADPQEPPPPYP 107



 Score = 38.5 bits (90), Expect = 8e-04
 Identities = 28/80 (35%), Positives = 34/80 (42%), Gaps = 9/80 (11%)

Query: 1   MEYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPP------PPPLPPPPPMMGTTPPPMM 54
           ME + +RA+R +     S  +  P       P P       PPP  PPPP     PP + 
Sbjct: 27  MERLAERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYP-APPGVY 85

Query: 55  GTPPPP--PPMMGTPPPPPP 72
            TPPPP    M     PPPP
Sbjct: 86  PTPPPPNSGYMADPQEPPPP 105



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 22/72 (30%), Positives = 23/72 (31%), Gaps = 2/72 (2%)

Query: 14  KETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP--PPPMMGTPPPPP 71
            E       +       PPP     L P P       PP  G PPP  P P    P PPP
Sbjct: 31  AERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPP 90

Query: 72  PMMGTTPHVLGP 83
           P  G       P
Sbjct: 91  PNSGYMADPQEP 102



 Score = 34.3 bits (79), Expect = 0.026
 Identities = 15/40 (37%), Positives = 16/40 (40%), Gaps = 1/40 (2%)

Query: 24  PNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPP 62
           P     PPP  P PP   P PP   +        PPPP P
Sbjct: 68  PPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYP 107


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 42.1 bits (99), Expect = 9e-05
 Identities = 18/56 (32%), Positives = 19/56 (33%), Gaps = 4/56 (7%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           P    APP P  PP    P P + T PP      PP P        P P     P 
Sbjct: 81  PLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAP----AAAVPAPAAAPPPS 132



 Score = 38.7 bits (90), Expect = 0.002
 Identities = 15/45 (33%), Positives = 15/45 (33%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
             PP    P P  P PP   T   P       P P    PP  PP
Sbjct: 91  VTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135



 Score = 36.7 bits (85), Expect = 0.007
 Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 2/49 (4%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P     P    P P +P PPP      PP      P P     PPP  P
Sbjct: 88  PEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPA--AAPPPSDP 134



 Score = 34.4 bits (79), Expect = 0.040
 Identities = 16/48 (33%), Positives = 16/48 (33%), Gaps = 4/48 (8%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
                AP  P PPP   P  P       P     PPP      PP PP
Sbjct: 95  TAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSD----PPQPP 138



 Score = 27.5 bits (61), Expect = 9.6
 Identities = 12/36 (33%), Positives = 12/36 (33%), Gaps = 1/36 (2%)

Query: 28  AAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPP 62
             P   P  PP P    P     PPP     PP  P
Sbjct: 106 PPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 44.8 bits (106), Expect = 9e-05
 Identities = 14/56 (25%), Positives = 16/56 (28%), Gaps = 7/56 (12%)

Query: 24  PNVQAAPPPPPPPPPLP-------PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            +  A   P  P P  P       PPPP +     P              P P PP
Sbjct: 687 AHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPP 742



 Score = 44.4 bits (105), Expect = 1e-04
 Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 1/42 (2%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           PPPP  P    P P   +        P P PP          
Sbjct: 711 PPPPDLPAAAGPAPCGSSLIASPTAPPEPEPP-GAEQADGAE 751



 Score = 42.9 bits (101), Expect = 3e-04
 Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 1/59 (1%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT-TPHVLGP 83
             AA   P  P P  PP    G  PPP +     P P   +    P       P     
Sbjct: 688 HHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQ 746



 Score = 42.1 bits (99), Expect = 5e-04
 Identities = 14/55 (25%), Positives = 15/55 (27%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P    A    P  P    PP   G  PPP        P   G+     P     P
Sbjct: 685 PTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEP 739



 Score = 34.4 bits (79), Expect = 0.15
 Identities = 19/75 (25%), Positives = 20/75 (26%), Gaps = 9/75 (12%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
           A P       P+ P         P       PP    G PPPP       P   G     
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCG----- 726

Query: 88  RLIVSPEIPSDTTPL 102
               S  I S T P 
Sbjct: 727 ----SSLIASPTAPP 737



 Score = 28.6 bits (64), Expect = 7.9
 Identities = 12/62 (19%), Positives = 13/62 (20%), Gaps = 17/62 (27%)

Query: 30  PPPPPPPPPLP---------------PPPPMM--GTTPPPMMGTPPPPPPMMGTPPPPPP 72
           PP      P P               P P  M   T  P  +    P  P         P
Sbjct: 636 PPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAP 695

Query: 73  MM 74
             
Sbjct: 696 QA 697


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 40.4 bits (95), Expect = 2e-04
 Identities = 21/52 (40%), Positives = 22/52 (42%), Gaps = 4/52 (7%)

Query: 24  PNVQAAPPPPPPP----PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P V A    PPP     PP  P PP+ G   PP    PP P      PPP P
Sbjct: 55  PPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106



 Score = 37.3 bits (87), Expect = 0.003
 Identities = 18/47 (38%), Positives = 22/47 (46%), Gaps = 3/47 (6%)

Query: 29 APPPPP---PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          +PP       PPP+  P P +  TPPP+      PPP     PPP P
Sbjct: 30 SPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDP 76



 Score = 35.8 bits (83), Expect = 0.008
 Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 2/44 (4%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
            PP    P  PPP   G   PP++ TPPP   +   PPP  P 
Sbjct: 29 FSPPESAHPDDPPPV--GDPRPPVVDTPPPVSAVWVLPPPSEPA 70



 Score = 32.0 bits (73), Expect = 0.17
 Identities = 16/51 (31%), Positives = 19/51 (37%), Gaps = 4/51 (7%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            PP   P PP+   PP +          PPP  P    P P PP+ G   
Sbjct: 39 DPPPVGDPRPPVVDTPPPVSAVWVL----PPPSEPAAPPPDPEPPVPGPAG 85



 Score = 31.2 bits (71), Expect = 0.39
 Identities = 17/52 (32%), Positives = 17/52 (32%), Gaps = 3/52 (5%)

Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPP 71
           P       PPP       PPP     PPP    P P    PP    PP P 
Sbjct: 45 DPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPA 96



 Score = 29.3 bits (66), Expect = 1.8
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPP 44
           + P    APP P   PPLPPP P
Sbjct: 84  AGPPSPLAPPAPARKPPLPPPRP 106



 Score = 28.5 bits (64), Expect = 3.0
 Identities = 17/50 (34%), Positives = 18/50 (36%)

Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             P PP    PPP   +   PPP     PPP P    P P  P     P
Sbjct: 43 VGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAP 92



 Score = 28.5 bits (64), Expect = 3.3
 Identities = 16/51 (31%), Positives = 17/51 (33%), Gaps = 2/51 (3%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT--PPPPPPMMGTPPPPPP 72
          P+       P PP    PPP       PP      PPP P      P  PP
Sbjct: 37 PDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPP 87


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 41.7 bits (98), Expect = 2e-04
 Identities = 19/55 (34%), Positives = 20/55 (36%), Gaps = 2/55 (3%)

Query: 26  VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           VQ  P P PP   P  P P        PP  G P PP  +   PP      G  P
Sbjct: 122 VQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176



 Score = 32.5 bits (74), Expect = 0.25
 Identities = 19/60 (31%), Positives = 23/60 (38%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P +  A        PLP PP    + P P      P PP  G P PP  +    P +L P
Sbjct: 112 PVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQP 171



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 17/41 (41%), Positives = 18/41 (43%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           S P  Q     P PP P  P PP     PPP +  P  PPP
Sbjct: 136 SYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 42.5 bits (100), Expect = 4e-04
 Identities = 22/116 (18%), Positives = 32/116 (27%), Gaps = 8/116 (6%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
             P    +  P        PP   +  T  P    P PP P+    P  P          
Sbjct: 379 VRPTPAPSTRPKAAAAANIPPKEPVRETATP---PPVPPRPVAPPVPHTPESAPKLTRAA 435

Query: 82  GPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRKRQQ 137
            P+        P  P +     K L     + LE+   ++  Q+    P      Q
Sbjct: 436 IPVDEKPKYTPPAPPKEE---EKALIA-DGDVLEQLEAIWK-QILRDVPPRSPAVQ 486



 Score = 39.8 bits (93), Expect = 0.003
 Identities = 22/89 (24%), Positives = 28/89 (31%), Gaps = 12/89 (13%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP--PPMMGTTPHVL 81
           P  Q A P    P P+ P P              PP  P+  T  PP  PP     P   
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP--- 420

Query: 82  GPLYWTRLIVSPEIPSDTTPLWKELEEVP 110
                  +  +PE     T     ++E P
Sbjct: 421 -------VPHTPESAPKLTRAAIPVDEKP 442


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 42.5 bits (100), Expect = 4e-04
 Identities = 17/40 (42%), Positives = 17/40 (42%), Gaps = 6/40 (15%)

Query: 24  PNVQAAPPPPP-PPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           P   AA  PPP   PP PP        PPP    PP  PP
Sbjct: 906 PGGDAASAPPPGAGPPAPPQA-----VPPPRTTQPPAAPP 940



 Score = 40.6 bits (95), Expect = 0.002
 Identities = 14/40 (35%), Positives = 14/40 (35%), Gaps = 4/40 (10%)

Query: 33  PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            P       PPP  G  PP        PPP    PP  PP
Sbjct: 905 APGGDAASAPPP--GAGPP--APPQAVPPPRTTQPPAAPP 940



 Score = 28.7 bits (64), Expect = 8.8
 Identities = 13/44 (29%), Positives = 13/44 (29%), Gaps = 4/44 (9%)

Query: 44  PMMGTTPPPMMGTPPPPPPMMGTPPPP----PPMMGTTPHVLGP 83
           P  G       G     PP    PP P    PP   T P    P
Sbjct: 897 PGTGRVATAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPP 940


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 42.2 bits (99), Expect = 5e-04
 Identities = 16/70 (22%), Positives = 17/70 (24%), Gaps = 1/70 (1%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
             P   P P P P   P      P   G  P        P    P     P    P  W 
Sbjct: 442 RGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE 500

Query: 88  RLIVSPEIPS 97
            L      P+
Sbjct: 501 ELPPEFASPA 510



 Score = 39.9 bits (93), Expect = 0.003
 Identities = 20/107 (18%), Positives = 26/107 (24%), Gaps = 4/107 (3%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
               A P P        P        P P    PPP   +      P P          P
Sbjct: 463 RPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP----AQPDAAP 518

Query: 84  LYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRP 130
             W    +     +D    ++ L   P           +  V A RP
Sbjct: 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565



 Score = 36.8 bits (85), Expect = 0.024
 Identities = 26/135 (19%), Positives = 32/135 (23%), Gaps = 31/135 (22%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---------------GTPPPPP 71
            AA P P  PP  P   P        +   P    P                 G P P P
Sbjct: 393 AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAP 452

Query: 72  -----PMMGTTPHVLGPLYWTRLIVSPEIPS----------DTTPLWKEL-EEVPINNLE 115
                P     P   GP        +    +          D  P W+EL  E       
Sbjct: 453 APAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPA 512

Query: 116 EFTELFSRQVTAQRP 130
           +     +  V    P
Sbjct: 513 QPDAAPAGWVAESIP 527



 Score = 34.9 bits (80), Expect = 0.085
 Identities = 8/50 (16%), Positives = 8/50 (16%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           P  Q AP    P    P P                            P  
Sbjct: 381 PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430



 Score = 33.7 bits (77), Expect = 0.24
 Identities = 8/51 (15%), Positives = 9/51 (17%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
             V    P    P    P P      P            +   P    P  
Sbjct: 380 APVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430



 Score = 32.5 bits (74), Expect = 0.46
 Identities = 11/57 (19%), Positives = 11/57 (19%), Gaps = 4/57 (7%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            A P      P   P P        P    P P  P       P            P
Sbjct: 371 GAGPATAAAAPVAQPAPA----AAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAP 423



 Score = 31.4 bits (71), Expect = 1.1
 Identities = 10/49 (20%), Positives = 10/49 (20%), Gaps = 3/49 (6%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   AA P   P P    P       P P      P             
Sbjct: 374 PATAAAAPVAQPAPAAAAPAA---AAPAPAAPPAAPAAAPAAAAAARAV 419



 Score = 28.7 bits (64), Expect = 7.7
 Identities = 12/60 (20%), Positives = 15/60 (25%), Gaps = 3/60 (5%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP---MMGTPPPPPPMMGTPPPPPPMMGTTP 78
               V AA    P        P      PPP   +      P P      P   +  + P
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIP 527


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 41.6 bits (98), Expect = 5e-04
 Identities = 13/49 (26%), Positives = 17/49 (34%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   +   PP   P +P  P        P   + PP P    +  PPP 
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPS 227



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 18/52 (34%), Positives = 19/52 (36%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           F  P    + P     PP P         P P   T P PPP    PPPPP 
Sbjct: 197 FPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 16/59 (27%), Positives = 18/59 (30%), Gaps = 13/59 (22%)

Query: 24  PNVQAAPPPPPPPPP----------LPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   +   P  P PP          LPP P    +  PP     P  P     PP P  
Sbjct: 187 PPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPP---PSPESPTNPSPPPGPAA 242



 Score = 39.3 bits (92), Expect = 0.003
 Identities = 14/51 (27%), Positives = 17/51 (33%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           A+P  PP   P  P  P     P     +  PP P       PPP   +  
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 19/57 (33%), Positives = 22/57 (38%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           S P+    P  P      P P      TPPP   +P  P P  G   PPPP +   P
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVP 252



 Score = 37.7 bits (88), Expect = 0.008
 Identities = 21/76 (27%), Positives = 27/76 (35%), Gaps = 8/76 (10%)

Query: 17  YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGT----TPPPMMGTPPPPPPMM----GTPP 68
                      ++ PP P       PPP   +    +PPP    PPPPP        T  
Sbjct: 199 SPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258

Query: 69  PPPPMMGTTPHVLGPL 84
           P PP    TP  +G +
Sbjct: 259 PTPPSASATPAPIGGI 274



 Score = 35.4 bits (82), Expect = 0.046
 Identities = 17/59 (28%), Positives = 22/59 (37%)

Query: 16  TYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
           +     + P    +P  P PPP    PPP      PP+    P PP    TP P   + 
Sbjct: 217 SSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275



 Score = 34.7 bits (80), Expect = 0.078
 Identities = 16/56 (28%), Positives = 18/56 (32%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           P      P  P PPP P  PP       P + T  P PP     P P   +     
Sbjct: 224 PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDD 279



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 4/54 (7%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLP----PPPPMMGTTPPPMMGTPPPPPPMMGTP 67
                 P   + PP P  PPP P    PP      TPP    TP P   +    
Sbjct: 225 PPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 42.0 bits (99), Expect = 5e-04
 Identities = 15/69 (21%), Positives = 19/69 (27%), Gaps = 5/69 (7%)

Query: 24  PNVQAAPPPP-PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
              Q A   P P           + + P      P PPP     PP  PP          
Sbjct: 268 AAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPN----PPATPPEPPARRGRGS 323

Query: 83  PLYWTRLIV 91
              W  ++V
Sbjct: 324 AALWFVVVV 332



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 7/40 (17%), Positives = 9/40 (22%), Gaps = 2/40 (5%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
           +  N   +    P      P PP     P      P    
Sbjct: 283 TDTNDSKSVTSQPAAAAAAPAPP--PNPPATPPEPPARRG 320


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 41.2 bits (97), Expect = 6e-04
 Identities = 13/45 (28%), Positives = 16/45 (35%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           S +F     +  P  P   PP PP  P  G+  P   G    P  
Sbjct: 84  SGMFGGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGY 128



 Score = 36.2 bits (84), Expect = 0.023
 Identities = 11/45 (24%), Positives = 11/45 (24%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
             P PPP  P   P      P      P        P    P  G
Sbjct: 90  GAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQPGPG 134


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 41.7 bits (98), Expect = 7e-04
 Identities = 16/53 (30%), Positives = 17/53 (32%), Gaps = 3/53 (5%)

Query: 22  SLPNVQAAPPPPPPPPPLPPP---PPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
             P   A   P P  P  PPP   PP+    P       P P P       PP
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPP 434



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 15/48 (31%), Positives = 15/48 (31%), Gaps = 7/48 (14%)

Query: 29  APPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           AP PP      P P  P  P     PP     P  P       P P P
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPV---PPAAPARPAAARPAPAP 425



 Score = 30.9 bits (70), Expect = 1.8
 Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 2/48 (4%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
               AA PP PP    P  P      P P       PP     P    
Sbjct: 399 APPPAAAPPVPPAA--PARPAAARPAPAPAPPAAAAPPARSADPAAAA 444



 Score = 30.1 bits (68), Expect = 3.1
 Identities = 15/58 (25%), Positives = 16/58 (27%), Gaps = 18/58 (31%)

Query: 22  SLPNVQAAPPPPPPPP-------PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           + P   AAPP PP  P       P P P P                PP     P    
Sbjct: 398 AAPPPAAAPPVPPAAPARPAAARPAPAPAP-----------PAAAAPPARSADPAAAA 444


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 40.2 bits (94), Expect = 0.001
 Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 10/78 (12%)

Query: 5   DKRAKRRKKKETYSNIF------SLPNVQAAPPPPPPPPPLPPPPPMMG-TTPPPMMGTP 57
           DK  + RK+ E ++N        +LP ++   PP P P  L P  P     TP  +M   
Sbjct: 103 DKVEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYG 162

Query: 58  PPPPPMMGTP---PPPPP 72
            P PPMMG     PP PP
Sbjct: 163 QPRPPMMGYGRGGPPFPP 180


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 40.6 bits (95), Expect = 0.001
 Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 7/50 (14%)

Query: 27  QAAPPPPPPPPPL----PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +A P PPPP P +    PPP  M   +PP     P  P     +PPP P 
Sbjct: 139 EALPQPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPAS---SPPPTPA 185



 Score = 38.7 bits (90), Expect = 0.004
 Identities = 25/72 (34%), Positives = 26/72 (36%), Gaps = 6/72 (8%)

Query: 11  RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGT-- 66
           RKK+         P V   PPPP   PP  PP   P           TP  PPP      
Sbjct: 136 RKKEALPQPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKS 195

Query: 67  --PPPPPPMMGT 76
             PP   PM GT
Sbjct: 196 SHPPLKSPMAGT 207


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 40.9 bits (96), Expect = 0.002
 Identities = 16/44 (36%), Positives = 17/44 (38%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            PPPP PPP  PPP P    +        P PPP    P     
Sbjct: 115 DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158



 Score = 39.0 bits (91), Expect = 0.006
 Identities = 16/48 (33%), Positives = 19/48 (39%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P   +  PPPP PPP  PPP         +     P PP   +PP   
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156



 Score = 38.2 bits (89), Expect = 0.011
 Identities = 14/55 (25%), Positives = 18/55 (32%), Gaps = 4/55 (7%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P    A PPP P P L      +G+  PP   +PP         P        + 
Sbjct: 119 PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG----ASPAAVASDAASS 169



 Score = 37.1 bits (86), Expect = 0.021
 Identities = 14/44 (31%), Positives = 15/44 (34%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                    P  P      T P P    PPPP P   +PPP P 
Sbjct: 88  PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPA 131



 Score = 37.1 bits (86), Expect = 0.021
 Identities = 15/46 (32%), Positives = 18/46 (39%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            ++P PPPP PP   PPP        M+     P P     PP   
Sbjct: 111 PSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156



 Score = 36.7 bits (85), Expect = 0.032
 Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 3/54 (5%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM---MGTPPPPPP 72
           +  N   + P        P  P   G+  PP   +P PPPP       PP P P
Sbjct: 79  APANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAP 132



 Score = 36.3 bits (84), Expect = 0.043
 Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 6/55 (10%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP------PPPPMMGTPPPPPP 72
           P   AA PP   P P P P P +   P P  G  P       PP  + TP P   
Sbjct: 889 PKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRGGFRRVPPGDLHTPAPSAA 943



 Score = 35.9 bits (83), Expect = 0.052
 Identities = 15/52 (28%), Positives = 17/52 (32%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
              P  P PPP  PPP     +P P +     P    G PP   P       
Sbjct: 108 PPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159



 Score = 35.1 bits (81), Expect = 0.083
 Identities = 15/53 (28%), Positives = 19/53 (35%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
             PPP PPP   PP P    +     +G+P PPP          P    +   
Sbjct: 115 DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAA 167



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 16/44 (36%), Positives = 18/44 (40%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +P PP P  P PPPP     +PPP               P PPP
Sbjct: 105 SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPP 148



 Score = 34.4 bits (79), Expect = 0.14
 Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 4/49 (8%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
           + A  P  PP   PP  P    +P P    P    P+  +   P P  G
Sbjct: 181 ETARAPSSPPAEPPPSTPPAAASPRP----PRRSSPISASASSPAPAPG 225



 Score = 34.0 bits (78), Expect = 0.19
 Identities = 12/49 (24%), Positives = 14/49 (28%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P  +A        P          +       TPP P      PP PPP
Sbjct: 75  PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123



 Score = 34.0 bits (78), Expect = 0.20
 Identities = 14/72 (19%), Positives = 18/72 (25%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
              R  +        +    +       P PP P   P     P      PP   P    
Sbjct: 846 PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905

Query: 67  PPPPPPMMGTTP 78
            P P   +G  P
Sbjct: 906 RPAPRVKLGPMP 917



 Score = 34.0 bits (78), Expect = 0.22
 Identities = 12/68 (17%), Positives = 17/68 (25%)

Query: 6   KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
           + ++  K K   +   +        P PP P   P                 P P P   
Sbjct: 849 RSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPA 908

Query: 66  TPPPPPPM 73
                 PM
Sbjct: 909 PRVKLGPM 916



 Score = 33.6 bits (77), Expect = 0.29
 Identities = 15/52 (28%), Positives = 18/52 (34%), Gaps = 3/52 (5%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            S     +  PP P  P  PPP P   + PP      P    M+     P P
Sbjct: 98  ASPAREGSPTPPGPSSPDPPPPTPPPASPPPS---PAPDLSEMLRPVGSPGP 146



 Score = 33.2 bits (76), Expect = 0.34
 Identities = 10/54 (18%), Positives = 13/54 (24%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
                 ++P      P  PP  P   T P      PP     +      P    
Sbjct: 171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224



 Score = 32.8 bits (75), Expect = 0.45
 Identities = 10/34 (29%), Positives = 12/34 (35%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           +P P  PPPP  P  P     P     +P     
Sbjct: 350 SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAG 383



 Score = 32.8 bits (75), Expect = 0.48
 Identities = 13/56 (23%), Positives = 15/56 (26%), Gaps = 1/56 (1%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
              PPP  PPP  P P +     P     PPP          P  +          
Sbjct: 118 PPTPPPASPPP-SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172



 Score = 32.1 bits (73), Expect = 0.67
 Identities = 13/57 (22%), Positives = 14/57 (24%), Gaps = 5/57 (8%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           S           P P P   P P     PPP     P  P     P   P     + 
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSP--SRPPPP---ADPSSPRKRPRPSRAPSSPAASA 382



 Score = 32.1 bits (73), Expect = 0.78
 Identities = 9/49 (18%), Positives = 10/49 (20%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            +    P   PP     P               P   P      P  PP
Sbjct: 142 GSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190



 Score = 31.7 bits (72), Expect = 0.90
 Identities = 13/51 (25%), Positives = 15/51 (29%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            AP       P      +   +P       PP P     PPP PP     P
Sbjct: 78  EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP 128



 Score = 31.7 bits (72), Expect = 1.2
 Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 6/51 (11%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP--PPP 72
           P  +A P    PP      PP     P P     P P   +G  PP  P P
Sbjct: 877 PEPRARPGAAAPPKAAAAAPPAGAPAPRP----RPAPRVKLGPMPPGGPDP 923



 Score = 30.9 bits (70), Expect = 1.6
 Identities = 8/50 (16%), Positives = 10/50 (20%), Gaps = 3/50 (6%)

Query: 28  AAPPPPPP---PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
           A+PPP P       L P                  P  +           
Sbjct: 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAA 173



 Score = 30.9 bits (70), Expect = 1.8
 Identities = 23/73 (31%), Positives = 26/73 (35%), Gaps = 3/73 (4%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
            A   +   T+S     P   A    P PP P  P PP    TPPP    PP P P +  
Sbjct: 80  PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--PPTPPP-ASPPPSPAPDLSE 136

Query: 67  PPPPPPMMGTTPH 79
              P    G  P 
Sbjct: 137 MLRPVGSPGPPPA 149



 Score = 30.5 bits (69), Expect = 2.0
 Identities = 11/45 (24%), Positives = 13/45 (28%), Gaps = 2/45 (4%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            +P P P   P P  PP       P     P P     +P     
Sbjct: 341 VSPGPSPSRSPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAG 383



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 10/49 (20%), Positives = 11/49 (22%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
             V +              P      P      PPP  P     P PP 
Sbjct: 160 AAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208



 Score = 30.1 bits (68), Expect = 2.9
 Identities = 13/55 (23%), Positives = 14/55 (25%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             V  A       PP  PPP      P     + P        P  P      TP
Sbjct: 54  TVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTP 108



 Score = 29.8 bits (67), Expect = 3.6
 Identities = 12/45 (26%), Positives = 13/45 (28%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            A        PL  P         P    PP  PP   +P PP  
Sbjct: 165 DAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRR 209



 Score = 29.4 bits (66), Expect = 4.8
 Identities = 10/46 (21%), Positives = 12/46 (26%)

Query: 25  NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
             +   P   P    PP               P P P +   P PP
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPP 918



 Score = 29.4 bits (66), Expect = 5.0
 Identities = 11/52 (21%), Positives = 15/52 (28%), Gaps = 3/52 (5%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            +LP           P   P P     +       +P P P   G+ P P  
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRER---SPSPSPSSPGSGPAPSS 311



 Score = 29.0 bits (65), Expect = 6.1
 Identities = 21/81 (25%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP--PPPMMGTPPPPPPMMGTTPH 79
           S P    +   P  P     P P   ++P P   TPPP  PPP   +P P    M     
Sbjct: 86  STPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP---SPAPDLSEMLRPVG 142

Query: 80  VLGPLYWTRLIVSPEIPSDTT 100
             GP        +   P+   
Sbjct: 143 SPGPPPAASPPAAGASPAAVA 163



 Score = 28.6 bits (64), Expect = 9.5
 Identities = 13/79 (16%), Positives = 15/79 (18%), Gaps = 5/79 (6%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
                K+K                 P  PP     PPP   +                  
Sbjct: 813 SRTASKRKSRSHTPDGGSESSG---PARPPGAAARPPPARSSESSKSKPAAAGGRARGKN 869

Query: 67  P--PPPPPMMGTTPHVLGP 83
               P PP     P    P
Sbjct: 870 GRRRPRPPEPRARPGAAAP 888


>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal. 
          Length = 313

 Score = 39.8 bits (93), Expect = 0.002
 Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 12  KKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPP 44
           KK    +   +LP V ++ P  PPPPP PPPP 
Sbjct: 211 KKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPS 243



 Score = 32.5 bits (74), Expect = 0.41
 Identities = 11/33 (33%), Positives = 11/33 (33%), Gaps = 4/33 (12%)

Query: 40  PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P                  PPPP    PPPPPP
Sbjct: 214 PVAAAKSALPAVSSSAPSAPPPP----PPPPPP 242



 Score = 32.1 bits (73), Expect = 0.59
 Identities = 14/48 (29%), Positives = 16/48 (33%), Gaps = 9/48 (18%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
               A P      P  PPPPP     PPP    PP  P +  +     
Sbjct: 217 AAKSALPAVSSSAPSAPPPPP-----PPP----PPSVPTISNSVESAS 255


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 17/78 (21%), Positives = 19/78 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P     P P P P P PP P             P   P     P P      T      P
Sbjct: 425 PAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAP 484

Query: 84  LYWTRLIVSPEIPSDTTP 101
                   +P  P+    
Sbjct: 485 PAAPAPAAAPAAPAAPAA 502



 Score = 37.3 bits (87), Expect = 0.017
 Identities = 13/51 (25%), Positives = 13/51 (25%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           AAP P    P     P       P     P P PP      P        P
Sbjct: 408 AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458



 Score = 37.3 bits (87), Expect = 0.018
 Identities = 14/46 (30%), Positives = 14/46 (30%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            AAP   P P P   P P     P P     P   P     P  P 
Sbjct: 459 AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504



 Score = 37.3 bits (87), Expect = 0.019
 Identities = 13/49 (26%), Positives = 13/49 (26%), Gaps = 1/49 (2%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGT-TPPPMMGTPPPPPPMMGTPPPPPPMM 74
               PP P  PP P   P      P P     P   P    P     M 
Sbjct: 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786



 Score = 37.3 bits (87), Expect = 0.019
 Identities = 14/57 (24%), Positives = 14/57 (24%), Gaps = 4/57 (7%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            AA  P   P P    P       P       P P     P P PP         G 
Sbjct: 401 AAAAAPAAAPAPAAAAPAAAAAPAPA----AAPQPAPAPAPAPAPPSPAGNAPAGGA 453



 Score = 36.9 bits (86), Expect = 0.023
 Identities = 14/55 (25%), Positives = 15/55 (27%), Gaps = 2/55 (3%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P+  AA P   P P    P       P       P P P     PP P       
Sbjct: 399 PSAAAAAPAAAPAPAAAAPAAAAAPAPAA--APQPAPAPAPAPAPPSPAGNAPAG 451



 Score = 36.5 bits (85), Expect = 0.031
 Identities = 14/58 (24%), Positives = 15/58 (25%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
             AA P P    P     P     P P     P P P       P     + P    P
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462



 Score = 36.5 bits (85), Expect = 0.033
 Identities = 17/51 (33%), Positives = 18/51 (35%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +       P P PP P  PP P      PPP     P   P    PP PP 
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780



 Score = 35.7 bits (83), Expect = 0.049
 Identities = 15/57 (26%), Positives = 16/57 (28%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           + P   A  PPP   P   P P       P     P PP        P  P     P
Sbjct: 447 NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAP 503



 Score = 35.7 bits (83), Expect = 0.055
 Identities = 16/53 (30%), Positives = 16/53 (30%), Gaps = 2/53 (3%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
            P   A  P  PPPPP P P       PPP    P     M     P      
Sbjct: 747 DPPDPAGAPAQPPPPPAPAPAAAPAAAPPP--SPPSEEEEMAEDDAPSMDDED 797



 Score = 35.7 bits (83), Expect = 0.056
 Identities = 15/52 (28%), Positives = 16/52 (30%), Gaps = 5/52 (9%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
           P+ Q AP P   P P   P P      PP    P   P     P  P     
Sbjct: 462 PSAQPAPAPAAAPEPTAAPAP-----APPAAPAPAAAPAAPAAPAAPAGADD 508



 Score = 35.3 bits (82), Expect = 0.073
 Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 3/55 (5%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
               AAP     P P   P P     P P    PP P         P P     P
Sbjct: 411 APAAAAPAAAAAPAPAAAPQPAPAPAPAP---APPSPAGNAPAGGAPSPPPAAAP 462



 Score = 34.6 bits (80), Expect = 0.12
 Identities = 17/78 (21%), Positives = 18/78 (23%), Gaps = 5/78 (6%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPPMMGTTPHVLGP 83
            AA P      P   P P      P     P P   P P     P P P         G 
Sbjct: 395 AAAAPSAAAAAPAAAPAP--AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452

Query: 84  LYWTRLIVSPEIPSDTTP 101
                   +P       P
Sbjct: 453 APSPPPAAAPSAQPAPAP 470



 Score = 34.6 bits (80), Expect = 0.14
 Identities = 14/50 (28%), Positives = 14/50 (28%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
               AA  P P   P P P P     PP   G  P        P   P  
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464



 Score = 34.2 bits (79), Expect = 0.16
 Identities = 13/46 (28%), Positives = 14/46 (30%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           Q AP P   PP      P           + P P      P PP P
Sbjct: 700 QPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEP 745



 Score = 34.2 bits (79), Expect = 0.17
 Identities = 10/52 (19%), Positives = 11/52 (21%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             +  P P P   P P       PP        P        P       T 
Sbjct: 461 APSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512



 Score = 33.4 bits (77), Expect = 0.29
 Identities = 11/57 (19%), Positives = 13/57 (22%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             P   +     P      PPP    +  P       P P     P PP        
Sbjct: 437 PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA 493



 Score = 32.7 bits (75), Expect = 0.44
 Identities = 6/43 (13%), Positives = 6/43 (13%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
                    P    P      P          P     P P  
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA 427



 Score = 32.7 bits (75), Expect = 0.53
 Identities = 16/80 (20%), Positives = 21/80 (26%), Gaps = 1/80 (1%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           S P  +AA P  P  P  P  P   G    P   +  P P +      P  +        
Sbjct: 606 SGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDA-SD 664

Query: 82  GPLYWTRLIVSPEIPSDTTP 101
           G   W          +    
Sbjct: 665 GGDGWPAKAGGAAPAAPPPA 684



 Score = 32.3 bits (74), Expect = 0.62
 Identities = 15/60 (25%), Positives = 16/60 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P           P   PP      + P P    P P PP    PP P       P    P
Sbjct: 705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764



 Score = 32.3 bits (74), Expect = 0.65
 Identities = 15/51 (29%), Positives = 15/51 (29%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           A  P    P PLPP P             PPPP P     P   P      
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780



 Score = 32.3 bits (74), Expect = 0.66
 Identities = 13/60 (21%), Positives = 15/60 (25%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P   ++ PP     P  P  P     P P      P        P          HV  P
Sbjct: 601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP 660



 Score = 31.9 bits (73), Expect = 0.88
 Identities = 11/47 (23%), Positives = 11/47 (23%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               P       P  PPP      P    G  P  P       PP  
Sbjct: 666 GDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712



 Score = 31.5 bits (72), Expect = 1.0
 Identities = 15/74 (20%), Positives = 19/74 (25%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
           A    PP P    PP        P     P  P P      P        P V  P +  
Sbjct: 595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP 654

Query: 88  RLIVSPEIPSDTTP 101
           + +  P+       
Sbjct: 655 KHVAVPDASDGGDG 668



 Score = 31.1 bits (71), Expect = 1.4
 Identities = 16/53 (30%), Positives = 17/53 (32%), Gaps = 4/53 (7%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
              A   P P PP P  PP     P      PPPPP       P      + P
Sbjct: 731 PSPAADDPVPLPPEPDDPPDPAGAPAQ----PPPPPAPAPAAAPAAAPPPSPP 779



 Score = 30.7 bits (70), Expect = 1.7
 Identities = 14/60 (23%), Positives = 15/60 (25%), Gaps = 3/60 (5%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPP---MMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
              P      P   PP    G + P        P PP P     P   P     P    P
Sbjct: 707 ATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 12/55 (21%), Positives = 13/55 (23%), Gaps = 2/55 (3%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
                 P P       PP P    + PP     P  P        P P      P
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPA--SSGPPEEAARPAAPAAPAAPAAPAPAGAAAAP 636



 Score = 30.7 bits (70), Expect = 2.1
 Identities = 10/51 (19%), Positives = 11/51 (21%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           S         P   P  +  P    G    P       P      P P  P
Sbjct: 640 SAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAP 690



 Score = 30.7 bits (70), Expect = 2.1
 Identities = 13/49 (26%), Positives = 13/49 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                A P  PPP P P  P       P      P   P  G    P  
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719



 Score = 30.0 bits (68), Expect = 3.3
 Identities = 14/51 (27%), Positives = 15/51 (29%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           A PP P P  P  P         P    TPP          PP    G + 
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA 730



 Score = 30.0 bits (68), Expect = 3.4
 Identities = 14/49 (28%), Positives = 14/49 (28%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               AAPPP P P     P       P P     PP          PP 
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723



 Score = 30.0 bits (68), Expect = 3.7
 Identities = 10/47 (21%), Positives = 10/47 (21%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               P P P  P  P         P P    P          PP   
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725



 Score = 29.6 bits (67), Expect = 3.9
 Identities = 11/48 (22%), Positives = 11/48 (22%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P   A P         P          PP    P  P    G  P  P
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701



 Score = 29.6 bits (67), Expect = 4.0
 Identities = 14/49 (28%), Positives = 14/49 (28%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
            AAP  PPP P    P    G  P      P   PP      P      
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 15/80 (18%), Positives = 17/80 (21%), Gaps = 2/80 (2%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P       PP P    PP        P        P P   G    P          +  
Sbjct: 592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAAAAPAEASAAPAPGVAA 649

Query: 84  LYWTRLIVSPEIPSDTTPLW 103
                  V+    SD    W
Sbjct: 650 PEHHPKHVAVPDASDGGDGW 669



 Score = 29.6 bits (67), Expect = 4.2
 Identities = 12/49 (24%), Positives = 13/49 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               AAP   P    +P         P    G  P  PP    P  P  
Sbjct: 644 APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692



 Score = 28.8 bits (65), Expect = 7.4
 Identities = 6/50 (12%), Positives = 6/50 (12%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
                      P                     PP P P     P     
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698



 Score = 28.4 bits (64), Expect = 8.9
 Identities = 12/49 (24%), Positives = 12/49 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P    A P   P     P     G   P   G   P  P     P  P 
Sbjct: 643 PAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691



 Score = 28.4 bits (64), Expect = 9.6
 Identities = 10/46 (21%), Positives = 10/46 (21%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
               P  PPP   P  P       P    P P           P  
Sbjct: 674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719



 Score = 28.4 bits (64), Expect = 9.9
 Identities = 8/50 (16%), Positives = 8/50 (16%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
               A     P   P     P                 P    P P P  
Sbjct: 640 SAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAA 689


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
          modification].
          Length = 2365

 Score = 40.3 bits (94), Expect = 0.002
 Identities = 20/37 (54%), Positives = 20/37 (54%), Gaps = 11/37 (29%)

Query: 36 PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          PP  PPPPP     PPP  G  PP  P    PPPPPP
Sbjct: 5  PPGNPPPPP-----PPP--GFEPPSQP----PPPPPP 30



 Score = 39.6 bits (92), Expect = 0.004
 Identities = 18/30 (60%), Positives = 18/30 (60%), Gaps = 5/30 (16%)

Query: 33 PPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
          PP  PP PPPPP  G  PP     PPPPPP
Sbjct: 5  PPGNPPPPPPPP--GFEPP---SQPPPPPP 29



 Score = 39.2 bits (91), Expect = 0.005
 Identities = 17/31 (54%), Positives = 18/31 (58%), Gaps = 9/31 (29%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
          PPPPPPPP   PP     + PPP    PPPP
Sbjct: 9  PPPPPPPPGFEPP-----SQPPP----PPPP 30



 Score = 36.5 bits (84), Expect = 0.030
 Identities = 16/26 (61%), Positives = 16/26 (61%), Gaps = 3/26 (11%)

Query: 22 SLPNVQAAPPPPPP---PPPLPPPPP 44
          SLP     PPPPPP   PP  PPPPP
Sbjct: 3  SLPPGNPPPPPPPPGFEPPSQPPPPP 28



 Score = 36.1 bits (83), Expect = 0.052
 Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 46 MGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
          M + PP   G PPPPPP  G  PP  P     P V
Sbjct: 1  MASLPP---GNPPPPPPPPGFEPPSQPPPPPPPGV 32


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 39.9 bits (94), Expect = 0.003
 Identities = 11/54 (20%), Positives = 11/54 (20%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
             AP  P P PP    P               P  P        P        V
Sbjct: 64  AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117



 Score = 39.1 bits (92), Expect = 0.005
 Identities = 11/58 (18%), Positives = 12/58 (20%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
            A  P  P P P     P     P         P           P        + PL
Sbjct: 63  AAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120



 Score = 37.9 bits (89), Expect = 0.013
 Identities = 12/49 (24%), Positives = 12/49 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               AAP    P  P P PP       PP              PP    
Sbjct: 56  SAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAA 104



 Score = 36.8 bits (86), Expect = 0.023
 Identities = 10/55 (18%), Positives = 10/55 (18%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
               A    P    P  P P       P     P         P  PP       
Sbjct: 53  AAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107



 Score = 35.6 bits (83), Expect = 0.053
 Identities = 9/49 (18%), Positives = 9/49 (18%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
              AA     P        P      PP    P  PP           
Sbjct: 47 AAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95



 Score = 35.6 bits (83), Expect = 0.057
 Identities = 11/45 (24%), Positives = 11/45 (24%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          AA    P   P    P      PP       PP P         P
Sbjct: 52 AAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96



 Score = 34.9 bits (81), Expect = 0.11
 Identities = 8/49 (16%), Positives = 9/49 (18%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
          P     P           + P        P  P    P    P     P
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKP 86



 Score = 34.5 bits (80), Expect = 0.14
 Identities = 9/48 (18%), Positives = 10/48 (20%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           +  A              P        P    P PP       PP P
Sbjct: 39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKP 86



 Score = 34.1 bits (79), Expect = 0.18
 Identities = 7/46 (15%), Positives = 7/46 (15%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
           P         P  PP P         P           P       
Sbjct: 70  PAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115



 Score = 33.7 bits (78), Expect = 0.23
 Identities = 7/49 (14%), Positives = 7/49 (14%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                            P        P      PP       PP P  
Sbjct: 40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAA 88



 Score = 32.6 bits (75), Expect = 0.54
 Identities = 9/51 (17%), Positives = 9/51 (17%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
          AA          P   P       P    P    P     P         P
Sbjct: 46 AAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96



 Score = 32.2 bits (74), Expect = 0.63
 Identities = 9/58 (15%), Positives = 13/58 (22%)

Query: 15 ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           ++   F+     +   P                 P       P P P     P  PP
Sbjct: 27 PSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP 84



 Score = 32.2 bits (74), Expect = 0.65
 Identities = 9/49 (18%), Positives = 9/49 (18%), Gaps = 1/49 (2%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          P   AA        P   P       P P        P     P     
Sbjct: 44 PTAAAAAAAAAASAPAAAPAAKAPAAPAP-APPAAAAPAAPPKPAAAAA 91


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 37.6 bits (87), Expect = 0.003
 Identities = 18/42 (42%), Positives = 19/42 (45%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           PPPP   P+PPP P  G  P      P   PP    P PP P
Sbjct: 78  PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGP 119



 Score = 36.0 bits (83), Expect = 0.010
 Identities = 23/90 (25%), Positives = 32/90 (35%), Gaps = 12/90 (13%)

Query: 2   EYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTT--------PPPM 53
           +Y  +R +++ ++E             +   P  P P PP PP+            PPP 
Sbjct: 22  KYSQRRERKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPP 81

Query: 54  MGTPPPPPPMMGT----PPPPPPMMGTTPH 79
             TP PPP           PPPP     P 
Sbjct: 82  GSTPVPPPGPQPGYNPADYPPPPGAVPPPQ 111



 Score = 34.9 bits (80), Expect = 0.023
 Identities = 27/80 (33%), Positives = 32/80 (40%), Gaps = 18/80 (22%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLP---------PPPPMMGTTPPPMMGTP 57
           R + R   + YS+ +  P     P PP PP   P         PPPP  G+TP P  G  
Sbjct: 36  RERYRHDHDAYSDSYEEP-YDPTPYPPSPPVSDPRYYPNSNYFPPPP--GSTPVPPPGPQ 92

Query: 58  P------PPPPMMGTPPPPP 71
           P       PPP    PPP  
Sbjct: 93  PGYNPADYPPPPGAVPPPQN 112



 Score = 33.3 bits (76), Expect = 0.082
 Identities = 21/52 (40%), Positives = 21/52 (40%), Gaps = 5/52 (9%)

Query: 24  PNVQAAPPPP---PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           PN    PPPP   P PPP P P       PPP    PPP       PP P  
Sbjct: 72  PNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQN--YPYPPGPGQ 121



 Score = 32.9 bits (75), Expect = 0.11
 Identities = 20/56 (35%), Positives = 20/56 (35%), Gaps = 3/56 (5%)

Query: 17  YSNIFSLPNVQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
            SN F  P      PPP P P   P   PPP     PP     PP P      P P
Sbjct: 73  NSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRP 128


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 38.7 bits (90), Expect = 0.004
 Identities = 16/52 (30%), Positives = 20/52 (38%), Gaps = 3/52 (5%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP---PPMMGTPPPPPP 72
             ++  PP  P PP P PP     +P     +PP P   PP      P  P
Sbjct: 4  ARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAP 55



 Score = 37.6 bits (87), Expect = 0.011
 Identities = 16/44 (36%), Positives = 18/44 (40%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
          +A    P PP    PP P    +P       PP PP  G  PPP
Sbjct: 3  RARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46



 Score = 32.6 bits (74), Expect = 0.35
 Identities = 24/87 (27%), Positives = 27/87 (31%), Gaps = 21/87 (24%)

Query: 22 SLPNVQAAPPPPPPPPPLP--------PPPPMMGTTPPPMMGTPP----------PPPPM 63
          S    +   PP P PP  P        P PP  G  PPP     P          P    
Sbjct: 8  SPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVT 67

Query: 64 MGTPPPPPPMMGTTPHVLG---PLYWT 87
            +  PP P +G          PL WT
Sbjct: 68 FSSSAPPRPPLGLDDAPAATPPPLDWT 94


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
          genes is always found in type III secretion operons,
          althought its function in the processes of secretion
          and virulence is unclear. Hpa stands for Hrp-associated
          gene, where Hrp stands for hypersensitivity response
          and virulence.
          Length = 201

 Score = 37.9 bits (88), Expect = 0.005
 Identities = 17/53 (32%), Positives = 18/53 (33%), Gaps = 2/53 (3%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           A PP P PPPP     P            PPP  P    P  PP     + H
Sbjct: 36 YAPPPRPEPPPPCDEDRPEPRADTR--ASDPPPEAPTDADPAQPPDDPDASAH 86



 Score = 29.1 bits (65), Expect = 4.0
 Identities = 18/86 (20%), Positives = 19/86 (22%), Gaps = 15/86 (17%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPP-------PMMG 75
           +Q A    P P   P  P    T              PPP     P PP       P   
Sbjct: 2   IQGARLARPAPAD-PARPARRRTPLAQLRRRDALAYAPPP----RPEPPPPCDEDRPEPR 56

Query: 76  TTPHVLGPLYWTRLIVSPEIPSDTTP 101
                  P         P  P D   
Sbjct: 57  ADTRASDPPPEAPTDADPAQPPDDPD 82


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 39.0 bits (91), Expect = 0.005
 Identities = 15/48 (31%), Positives = 17/48 (35%), Gaps = 2/48 (4%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
           +    +  P PPPPPPP  PP P     P     T P         P 
Sbjct: 263 ATRLPEPEPQPPPPPPPPEPPEP--EEEPDEPDQTDPDDGEETDQIPE 308



 Score = 36.3 bits (84), Expect = 0.036
 Identities = 17/52 (32%), Positives = 18/52 (34%), Gaps = 6/52 (11%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           +A   P P P P PPPPP     P P      P  P   T P         P
Sbjct: 262 RATRLPEPEPQPPPPPPP-----PEPPEPEEEPDEPDQ-TDPDDGEETDQIP 307



 Score = 32.1 bits (73), Expect = 0.62
 Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 4/51 (7%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM----GTPPPP 70
           P     PPPP PP P   P     T P     T   P  +M        P 
Sbjct: 271 PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEADLPD 321



 Score = 30.2 bits (68), Expect = 2.5
 Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 9/43 (20%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            P     P   P PP     PPP    PPP PP     P  P 
Sbjct: 260 LPRATRLPEPEPQPP-----PPP----PPPEPPEPEEEPDEPD 293



 Score = 29.0 bits (65), Expect = 6.5
 Identities = 16/43 (37%), Positives = 16/43 (37%), Gaps = 9/43 (20%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            P     P P P PPP     PPP    PP PP     P  P 
Sbjct: 260 LPRATRLPEPEPQPPP-----PPP----PPEPPEPEEEPDEPD 293


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 38.7 bits (90), Expect = 0.006
 Identities = 18/68 (26%), Positives = 20/68 (29%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYW 86
          Q AP PP P     PPPP     P      P    P    P    P     P        
Sbjct: 16 QPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKL 75

Query: 87 TRLIVSPE 94
             +V P+
Sbjct: 76 EDFVVEPQ 83



 Score = 31.4 bits (71), Expect = 0.91
 Identities = 12/50 (24%), Positives = 12/50 (24%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
          P   AA P PPPP     P                P          P P 
Sbjct: 21 PPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPA 70



 Score = 31.4 bits (71), Expect = 0.92
 Identities = 14/60 (23%), Positives = 16/60 (26%), Gaps = 1/60 (1%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
          P    APPPP      P           P    P    P     P P  +      V+ P
Sbjct: 24 PAAAPAPPPPAKTAA-PATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKLEDFVVEP 82



 Score = 31.4 bits (71), Expect = 0.98
 Identities = 15/76 (19%), Positives = 15/76 (19%), Gaps = 13/76 (17%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTR 88
                P P P  P         PP       P      P    P     P    P     
Sbjct: 12  GEAEQPAPAPPSPAA----APAPPPPAKTAAPATKAAAPAAAAPRAE-KPKKDKP----- 61

Query: 89  LIVSPEIPSDTTPLWK 104
                E       LWK
Sbjct: 62  ---RRERKPKPASLWK 74


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 38.9 bits (91), Expect = 0.006
 Identities = 11/52 (21%), Positives = 11/52 (21%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
           P    A  P   P       P     P      P  PP      P   P   
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAA 432



 Score = 37.8 bits (88), Expect = 0.013
 Identities = 12/54 (22%), Positives = 14/54 (25%), Gaps = 3/54 (5%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTP 78
           AAP       P P   P    + P       PP P+       P   P      
Sbjct: 391 AAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444



 Score = 37.4 bits (87), Expect = 0.016
 Identities = 11/51 (21%), Positives = 11/51 (21%), Gaps = 2/51 (3%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP--PPMMGTPPPPPP 72
           P           P P   P         P    PP P   P    P   P 
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439



 Score = 37.0 bits (86), Expect = 0.018
 Identities = 17/76 (22%), Positives = 22/76 (28%), Gaps = 4/76 (5%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT-TPHVLG 82
               AA P      P PP      T   P+   P P        P   P     TP   G
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494

Query: 83  PLY---WTRLIVSPEI 95
            ++     +L  +  I
Sbjct: 495 DVWHATVQQLAAAEAI 510



 Score = 36.6 bits (85), Expect = 0.028
 Identities = 12/45 (26%), Positives = 12/45 (26%), Gaps = 3/45 (6%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           AA  P  PP   PP P      P        P          P P
Sbjct: 410 AASAPAAPPAAAPPAPV---AAPAAAAPAAAPAAAPAAVALAPAP 451



 Score = 35.1 bits (81), Expect = 0.075
 Identities = 10/55 (18%), Positives = 11/55 (20%), Gaps = 4/55 (7%)

Query: 22  SLPNVQAAPPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +      AP   P      P  PP          P    P   P          P
Sbjct: 395 AQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAP 449



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 13/63 (20%), Positives = 15/63 (23%), Gaps = 8/63 (12%)

Query: 24  PNVQAAPPPPPPPPPL-PPPPPMMGTTPPPMMGTPPPPP-------PMMGTPPPPPPMMG 75
           P   A P P   P    P   P        +   PP          P+   P P      
Sbjct: 417 PPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAA 476

Query: 76  TTP 78
             P
Sbjct: 477 PAP 479



 Score = 34.3 bits (79), Expect = 0.14
 Identities = 12/57 (21%), Positives = 13/57 (22%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            A   PP   PP P   P            P         P    P     P  + P
Sbjct: 412 SAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP 468



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 13/55 (23%), Positives = 15/55 (27%), Gaps = 4/55 (7%)

Query: 22  SLPNVQAAPPP----PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +     AA P      P  PP   PP  +           P   P      P PP
Sbjct: 398 AAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPP 452



 Score = 32.8 bits (75), Expect = 0.40
 Identities = 11/60 (18%), Positives = 11/60 (18%), Gaps = 2/60 (3%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPM--MGTPPPPPPMMGTPPPPPPMMGTTP 78
           F       A  P     P  P        P         P   P      P  P     P
Sbjct: 364 FKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 38.7 bits (90), Expect = 0.006
 Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)

Query: 19  NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM------MGTPPPPP 71
           N F      A P  PP   P PP PP++    PP +  P P  P+       G  P  P
Sbjct: 222 NPFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANP 280



 Score = 34.9 bits (80), Expect = 0.087
 Identities = 19/51 (37%), Positives = 21/51 (41%), Gaps = 7/51 (13%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            LP V   P  P P P  P  PP    + P     PP PP +    PPP P
Sbjct: 215 RLPRVSFNPFLPGPSPAQPSAPP---ASIP----APPIPPVIQYVAPPPVP 258



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 18/72 (25%), Positives = 21/72 (29%), Gaps = 30/72 (41%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTP------------------------------PPP 60
             PPPP  LP  PP   +  P    TP                              PP 
Sbjct: 186 IQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPI 245

Query: 61  PPMMGTPPPPPP 72
           PP++    PPP 
Sbjct: 246 PPVIQYVAPPPV 257


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 35.5 bits (81), Expect = 0.007
 Identities = 15/43 (34%), Positives = 16/43 (37%)

Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
          IF  P     P P PP P  PPPP      P P +       P
Sbjct: 3  IFRAPKPPEPPAPLPPAPVPPPPPAPPAPVPEPTVKPVNAEAP 45



 Score = 32.4 bits (73), Expect = 0.092
 Identities = 13/35 (37%), Positives = 13/35 (37%)

Query: 35 PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
          P PP PP P      PPP    P P P     P  
Sbjct: 7  PKPPEPPAPLPPAPVPPPPPAPPAPVPEPTVKPVN 41



 Score = 32.0 bits (72), Expect = 0.13
 Identities = 21/52 (40%), Positives = 23/52 (44%), Gaps = 10/52 (19%)

Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
          AP PP PP PLPP P      PPP     PP PP     P   P+    P +
Sbjct: 6  APKPPEPPAPLPPAP-----VPPP-----PPAPPAPVPEPTVKPVNAEAPKI 47


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 38.1 bits (89), Expect = 0.009
 Identities = 16/45 (35%), Positives = 16/45 (35%), Gaps = 12/45 (26%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           A   P PP PP   PPP            PPPPP     PP    
Sbjct: 255 ATRLPAPPQPPEEEPPP------------PPPPPEDDDDPPEDEE 287



 Score = 36.5 bits (85), Expect = 0.027
 Identities = 10/40 (25%), Positives = 10/40 (25%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           LP     P   PPPPP PP                     
Sbjct: 258 LPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRAL 297


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 37.7 bits (87), Expect = 0.010
 Identities = 11/48 (22%), Positives = 11/48 (22%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P VQ     P    P   P       P      P P  P    P    
Sbjct: 207 PTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254



 Score = 37.7 bits (87), Expect = 0.010
 Identities = 14/57 (24%), Positives = 18/57 (31%)

Query: 6   KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           + A +  ++ T  N    P VQ     P   P   P  P    TP      PP    
Sbjct: 198 RHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQ 254



 Score = 37.0 bits (85), Expect = 0.017
 Identities = 12/55 (21%), Positives = 13/55 (23%), Gaps = 4/55 (7%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P  Q     P   P +  P       P       P  P    TP  P      T 
Sbjct: 203 PTQQPTVQNPAQQPTVQNPAQ----QPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253



 Score = 34.6 bits (79), Expect = 0.10
 Identities = 11/46 (23%), Positives = 12/46 (26%), Gaps = 2/46 (4%)

Query: 29  APPPPPPPPPLPPP--PPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           AP P   P    P   P +      P       P      P P  P
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNP 245



 Score = 31.2 bits (70), Expect = 1.2
 Identities = 9/46 (19%), Positives = 10/46 (21%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            A  P   P    P   P +          P   P      P P  
Sbjct: 199 HAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQN 244



 Score = 30.4 bits (68), Expect = 1.9
 Identities = 11/51 (21%), Positives = 13/51 (25%), Gaps = 5/51 (9%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           ++ N    P    P    P   P      P    TP  P       PP   
Sbjct: 208 TVQNPAQQPTVQNPAQQ-PQQQPQQQPVQPAQQPTPQNPAQ----QPPQTE 253



 Score = 30.4 bits (68), Expect = 1.9
 Identities = 10/48 (20%), Positives = 10/48 (20%), Gaps = 4/48 (8%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            P P   P    P    T   P       P       P  P    T  
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNP----AQQPQQQPQQQPVQPAQQPTPQ 243


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 37.1 bits (86), Expect = 0.010
 Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 1/49 (2%)

Query: 24  PNVQAAPPPPPPPPPLPPPP-PMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
             V AAP PP     + P P P+        +  P    P+    PP P
Sbjct: 140 VLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAP 188



 Score = 32.5 bits (74), Expect = 0.31
 Identities = 10/51 (19%), Positives = 11/51 (21%), Gaps = 5/51 (9%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                    P P     L P  P     P       P P P+    P    
Sbjct: 124 DPIQALQPRPRPDVEEVLVPAAPE---PPSYEETIKPGPAPV--EEPVDSM 169



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 13/72 (18%), Positives = 18/72 (25%), Gaps = 6/72 (8%)

Query: 27  QAAPPPPPPPPPLPP-PPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLY 85
           +       P P       P+    P P          ++   P PP    T     GP  
Sbjct: 107 EKVLGEEEPAPQEETVADPIQALQPRPRP---DVEEVLVPAAPEPPSYEETIK--PGPAP 161

Query: 86  WTRLIVSPEIPS 97
               + S  I  
Sbjct: 162 VEEPVDSMAIAV 173



 Score = 30.5 bits (69), Expect = 1.5
 Identities = 12/57 (21%), Positives = 15/57 (26%), Gaps = 6/57 (10%)

Query: 23  LPNVQAAPPPPPPPPPLPPPP------PMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           L  V     P P    +  P       P        +   P PP       P P P+
Sbjct: 106 LEKVLGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPV 162



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 10/55 (18%), Positives = 13/55 (23%), Gaps = 2/55 (3%)

Query: 26  VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            +  P       P    PP       P P     P     +  P    P+    P
Sbjct: 131 PRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELP 185


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 38.0 bits (89), Expect = 0.010
 Identities = 8/45 (17%), Positives = 9/45 (20%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           A   P     P           P   +   P        PP P  
Sbjct: 407 APGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAA 451



 Score = 37.2 bits (87), Expect = 0.019
 Identities = 6/57 (10%), Positives = 7/57 (12%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
                     PP        P                 P   +   P         P
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPP 447



 Score = 36.0 bits (84), Expect = 0.042
 Identities = 10/45 (22%), Positives = 11/45 (24%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           AAP      P    P   +   P        PP P     P    
Sbjct: 415 AAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVRL 459



 Score = 36.0 bits (84), Expect = 0.045
 Identities = 15/93 (16%), Positives = 22/93 (23%), Gaps = 11/93 (11%)

Query: 32  PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIV 91
           PP P      PP   G    P     P           P   +   P             
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPD-----------A 439

Query: 92  SPEIPSDTTPLWKELEEVPINNLEEFTELFSRQ 124
           +    +   P       V +N+ E+   L   +
Sbjct: 440 AAAAAAPPAPAAAPQPAVRLNSFEDIVALAEEK 472



 Score = 34.1 bits (79), Expect = 0.15
 Identities = 7/57 (12%), Positives = 8/57 (14%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             P      PP     P  P                 P   +   P           
Sbjct: 392 PSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPA 448



 Score = 33.7 bits (78), Expect = 0.23
 Identities = 10/51 (19%), Positives = 10/51 (19%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
                 AA P      P    P       P        PP     P P   
Sbjct: 408 PGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVR 458


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF;
          Provisional.
          Length = 767

 Score = 37.9 bits (88), Expect = 0.010
 Identities = 10/36 (27%), Positives = 11/36 (30%)

Query: 25 NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
          + QA  PP P P P     P             P P
Sbjct: 54 SAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89



 Score = 35.5 bits (82), Expect = 0.065
 Identities = 10/39 (25%), Positives = 12/39 (30%), Gaps = 2/39 (5%)

Query: 26 VQAAPPP--PPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
          V A+     PP P PLP            +      P P
Sbjct: 51 VAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89



 Score = 34.8 bits (80), Expect = 0.11
 Identities = 11/57 (19%), Positives = 12/57 (21%), Gaps = 8/57 (14%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
           A       PP P P P                       P P    G    + G L
Sbjct: 51 VAASAQAYAPPAPAPLP----AALVAPAPAAASIAAPAAVPAP----GAIGDLRGEL 99



 Score = 31.7 bits (72), Expect = 0.86
 Identities = 6/30 (20%), Positives = 6/30 (20%)

Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPP 52
           P     P     P P           P P
Sbjct: 60 PPAPAPLPAALVAPAPAAASIAAPAAVPAP 89


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score = 38.0 bits (89), Expect = 0.010
 Identities = 10/55 (18%), Positives = 11/55 (20%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
           S  +  AAP          P                 P  P       P    GT
Sbjct: 83  SASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIPAGT 137



 Score = 34.1 bits (79), Expect = 0.16
 Identities = 12/66 (18%), Positives = 17/66 (25%), Gaps = 7/66 (10%)

Query: 15  ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP--- 71
           E+ S+  + P   A      P              P      P  P   +   P  P   
Sbjct: 82  ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAA----PAAPAAEVAADPDIPAGT 137

Query: 72  PMMGTT 77
            M+  T
Sbjct: 138 EMVTMT 143


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 36.9 bits (86), Expect = 0.011
 Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 6/57 (10%)

Query: 24  PNVQAAPPPPPPPPPLPPP--PPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           PN    P P P  PP P P   P     P   +  PP P P     P P       P
Sbjct: 93  PNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKP----EPKPVVEEKAAP 145



 Score = 36.1 bits (84), Expect = 0.021
 Identities = 17/55 (30%), Positives = 21/55 (38%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P++  A   PP  P  P P P+    P P+    P P P      PP P     P
Sbjct: 83  PSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKP 137



 Score = 35.4 bits (82), Expect = 0.043
 Identities = 17/64 (26%), Positives = 20/64 (31%), Gaps = 2/64 (3%)

Query: 24  PNVQAAPPPP--PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           P   A P  P  P P P+ PP P     P P               P P P++       
Sbjct: 87  PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146

Query: 82  GPLY 85
           G  Y
Sbjct: 147 GKAY 150



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 6/49 (12%)

Query: 26  VQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           V+A     P   P  + PP   +   P P+   PP P P+    P P P
Sbjct: 75  VRAGDAAAPSLDPATVAPPNTPVEPEPAPV--EPPKPKPV--EKPKPKP 119


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 37.7 bits (87), Expect = 0.012
 Identities = 21/59 (35%), Positives = 22/59 (37%), Gaps = 6/59 (10%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP--PPPPMMGTPPPPPPMMGTTPHVLGPL 84
             P PPP  PP P   P    TP P    PP   P  M   P   P   G T  +L  L
Sbjct: 774 PTPQPPPQAPPAPQQRPRGAPTPQP----PPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828



 Score = 36.6 bits (84), Expect = 0.026
 Identities = 20/59 (33%), Positives = 20/59 (33%), Gaps = 3/59 (5%)

Query: 28  AAPPPPPPPPPLPPPPPMMGT---TPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            A PP   P    PP    G     PPP     P   P     P PPP  G T   L P
Sbjct: 753 RARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811



 Score = 34.7 bits (79), Expect = 0.13
 Identities = 17/55 (30%), Positives = 17/55 (30%), Gaps = 5/55 (9%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP----PPPPPMMGTTP 78
            A PP   P    PP    G   PP    P    P    P    P PPP     P
Sbjct: 733 RARPPAAAPGRARPPAAAPGRARPPA-AAPGRARPPAAAPGAPTPQPPPQAPPAP 786



 Score = 33.5 bits (76), Expect = 0.24
 Identities = 17/57 (29%), Positives = 19/57 (33%), Gaps = 4/57 (7%)

Query: 26  VQAAP----PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           +Q AP    PPP  P P+ PP    G    P   T    PP        PP      
Sbjct: 687 IQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGR 743



 Score = 29.6 bits (66), Expect = 4.7
 Identities = 16/49 (32%), Positives = 16/49 (32%), Gaps = 6/49 (12%)

Query: 28  AAPPPPPPP--PPLPPPPPMMGTTP-PPMMGTPP---PPPPMMGTPPPP 70
            AP P  PP  PP     P   T    P    P    PP    G   PP
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP 747



 Score = 29.3 bits (65), Expect = 5.9
 Identities = 11/51 (21%), Positives = 11/51 (21%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            A  P        PP    G   PP        PP        PP      
Sbjct: 713 RAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGR 763


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
          includes the B. subtilis YppG protein, which is
          functionally uncharacterized. This family of proteins
          is found in bacteria. Proteins in this family are
          typically between 115 and 181 amino acids in length.
          There are two completely conserved residues (F and G)
          that may be functionally important.
          Length = 110

 Score = 35.1 bits (81), Expect = 0.014
 Identities = 14/53 (26%), Positives = 15/53 (28%), Gaps = 4/53 (7%)

Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
             P  Q  PPPP  PP       M    PP     P   P     P     +
Sbjct: 21 QQQPYHQQMPPPPYSPPQQQQGHFM----PPQPQPYPKQSPQQQQPPQFSSFL 69



 Score = 33.9 bits (78), Expect = 0.039
 Identities = 11/50 (22%), Positives = 11/50 (22%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
                PP      P         M  PP  PP        PP     P 
Sbjct: 5  QNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPK 54



 Score = 33.2 bits (76), Expect = 0.072
 Identities = 11/54 (20%), Positives = 11/54 (20%), Gaps = 5/54 (9%)

Query: 24 PNVQAAPPPPPPPPPL--PPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPP 72
           N    PP      P    P    M   P            PP     P   P 
Sbjct: 5  QNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQ 58


>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668).  Members
           in this family of proteins are annotated as Cysteine and
           tyrosine-rich protein 1, however currently no function
           is known.
          Length = 154

 Score = 35.6 bits (82), Expect = 0.019
 Identities = 16/52 (30%), Positives = 18/52 (34%), Gaps = 13/52 (25%)

Query: 25  NVQAAPPPPPP---------PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
           N  + P  PPP         P  LPPP     +  P       PPPP  G  
Sbjct: 105 NAISYPMAPPPYTYDHEMEYPTDLPPPY----SPAPQASAQRSPPPPYPGNS 152



 Score = 29.5 bits (66), Expect = 2.2
 Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 11/62 (17%)

Query: 14  KETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           +  + N  S P    APPP      +  P  +    PPP        P       PPPP 
Sbjct: 100 RAAHINAISYP---MAPPPYTYDHEMEYPTDL----PPPYSPA----PQASAQRSPPPPY 148

Query: 74  MG 75
            G
Sbjct: 149 PG 150


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 130

 Score = 35.2 bits (81), Expect = 0.019
 Identities = 18/56 (32%), Positives = 19/56 (33%), Gaps = 4/56 (7%)

Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           QAA P  P   P+P P               P P    G    P PM GT   VL
Sbjct: 25 AQAAAPAQPASTPVPVPTEASPQVEAQ----APQPAAAAGADAMPSPMPGTILKVL 76


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 37.3 bits (86), Expect = 0.020
 Identities = 16/44 (36%), Positives = 20/44 (45%), Gaps = 3/44 (6%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
           +N + L + +  PP PP P P P P P     P P    P P P
Sbjct: 908 NNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTP---DPEPTP 948



 Score = 32.0 bits (72), Expect = 0.79
 Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 11/38 (28%)

Query: 34  PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           PP PP P P P     P P   TP P P    TP P P
Sbjct: 920 PPSPPDPDPTP----DPDP---TPDPDP----TPDPEP 946


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 36.1 bits (83), Expect = 0.022
 Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 34  PPPPPLPPPPPMMGTTPPPMMGTPPPPPPM--MGTPPP 69
            P  P P  PPM+   PPP    PP  PP+  + + PP
Sbjct: 151 APGVPPPYNPPMLMMKPPPPSPGPPGAPPVRELASVPP 188



 Score = 29.5 bits (66), Expect = 2.4
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 20  IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
            ++ P +   PPPP P PP  PP   + + PP +  +
Sbjct: 157 PYNPPMLMMKPPPPSPGPPGAPPVRELASVPPELTAS 193



 Score = 28.0 bits (62), Expect = 8.6
 Identities = 18/54 (33%), Positives = 21/54 (38%), Gaps = 5/54 (9%)

Query: 24  PNVQAAPPP--PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
           P+ ++ PPP        L  P        P   G PPP  P M    PPPP  G
Sbjct: 123 PSERSGPPPSEGSVRSSLSHPSSHSSYGAP---GVPPPYNPPMLMMKPPPPSPG 173


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
          represents proteins encoded by genes which are always
          found in type III secretion operons, although their
          function in the processes of secretion and virulence is
          unclear. Hpa stands for Hrp-associated gene, where Hrp
          stands for hypersensitivity response and virulence. see
          also PMID:18584024.
          Length = 185

 Score = 35.9 bits (83), Expect = 0.022
 Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 1/51 (1%)

Query: 23 LPNVQAAPPP-PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
              +A PP  P PP P     P     P      P PP    G   P P 
Sbjct: 21 RRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPL 71



 Score = 32.8 bits (75), Expect = 0.18
 Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 4/49 (8%)

Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
              ++ A    PP  P PP P        P      P        PP 
Sbjct: 16 YARLMRRAARAGPPGTPAPPGPAEDAHPEFP----ERPRDAPAPPAPPR 60



 Score = 28.2 bits (63), Expect = 6.5
 Identities = 11/57 (19%), Positives = 11/57 (19%), Gaps = 4/57 (7%)

Query: 26 VQAAPPPP-PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTTP 78
            AAP                    PP     P P        P  P   P     P
Sbjct: 3  AGAAPRAARRSFDYARLMRRAARAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 36.7 bits (85), Expect = 0.023
 Identities = 16/58 (27%), Positives = 17/58 (29%), Gaps = 3/58 (5%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP---PMMGTTP 78
                      P P +P  PP     P P       PP      PPP    P  G TP
Sbjct: 123 LGANPVTATGEPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTP 180



 Score = 31.7 bits (72), Expect = 0.76
 Identities = 19/59 (32%), Positives = 22/59 (37%), Gaps = 6/59 (10%)

Query: 24  PNVQAAPPPPPPP-PPLPPPPPMMGTTPPPMMGTPPPPP---PMMGTPPPPPPMMGTTP 78
           P  Q    PP P   P P   P+  + PP      PPP    P  G  PPP     + P
Sbjct: 134 PVPQMPASPPGPEGEPQPGNTPV--SFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVP 190



 Score = 29.0 bits (65), Expect = 5.2
 Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 3/43 (6%)

Query: 23  LPNVQAAPPPPPPPPPLPPPP---PMMGTTPPPMMGTPPPPPP 62
            P       PP     +PPP    P  G TPPP +     P P
Sbjct: 150 QPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVTYQSVPVP 192


>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 484

 Score = 36.5 bits (85), Expect = 0.027
 Identities = 8/46 (17%), Positives = 9/46 (19%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            AAP  P    P  P            +       P    P     
Sbjct: 105 AAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAA 150



 Score = 31.5 bits (72), Expect = 1.1
 Identities = 9/44 (20%), Positives = 11/44 (25%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
          A P    P    PPP         P      P P ++       
Sbjct: 52 APPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLT 95



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 8/61 (13%), Positives = 9/61 (14%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
             V  A  P  P    P  P                         P          V   
Sbjct: 100 QLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKAR 159

Query: 84  L 84
           +
Sbjct: 160 I 160



 Score = 29.2 bits (66), Expect = 5.9
 Identities = 17/99 (17%), Positives = 21/99 (21%), Gaps = 21/99 (21%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP---------------------PP 60
           +L    AA P    PPP   P  +     P     P                      P 
Sbjct: 50  ALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPA 109

Query: 61  PPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDT 99
            P    P  P                      P +P+D 
Sbjct: 110 APEAQAPAAPAERAAAENAARRLARAAAAAPRPRVPADA 148


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
          associated protein 2 has a highly conserved sequence
          throughout evolution including a conserved polyproline
          region and several SH2/SH3 binding sites. It occurs as
          a single copy gene with a four-exon organisation and is
          located on chromosome 12. It encodes a ubiquitously
          expressed protein and binds to DAZ and DAZL1 through
          DAZ repeats.
          Length = 136

 Score = 34.4 bits (79), Expect = 0.032
 Identities = 19/70 (27%), Positives = 24/70 (34%), Gaps = 4/70 (5%)

Query: 15 ETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP--PMMGTPPPPPP 72
          E Y   + +P     P    P P      PM          + PP    P+   PP  PP
Sbjct: 10 ELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69

Query: 73 MMGTTPHVLG 82
            G+T  V G
Sbjct: 70 --GSTVLVQG 77



 Score = 32.1 bits (73), Expect = 0.24
 Identities = 10/41 (24%), Positives = 11/41 (26%)

Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
            PP       P  +  P    P    P P P M      V
Sbjct: 3  DAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQV 43



 Score = 31.7 bits (72), Expect = 0.35
 Identities = 17/64 (26%), Positives = 19/64 (29%), Gaps = 4/64 (6%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM----MGTTPH 79
          P       P    PP    P      P P M  P      +G     PPM    +G  P 
Sbjct: 6  PAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPP 65

Query: 80 VLGP 83
          V  P
Sbjct: 66 VYPP 69



 Score = 29.4 bits (66), Expect = 1.7
 Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 3/52 (5%)

Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGT---PPPPPPMMGTPPPPPPMMGTTPHV 80
          P  PP       P     P   M     P P P M         +   + H 
Sbjct: 2  PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHP 53



 Score = 29.4 bits (66), Expect = 1.8
 Identities = 16/73 (21%), Positives = 22/73 (30%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRL 89
           P  PP    L  P  ++         + P P P M  P      +G         Y+   
Sbjct: 2   PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIG 61

Query: 90  IVSPEIPSDTTPL 102
              P  P  +T L
Sbjct: 62  APPPVYPPGSTVL 74



 Score = 27.9 bits (62), Expect = 6.1
 Identities = 16/70 (22%), Positives = 18/70 (25%), Gaps = 24/70 (34%)

Query: 27 QAAPPPPPP-PPPLPPPPPMMGTTPPPM---MGTPPPPPPMMGTP--------------- 67
             P P    P           + PP     +G PPP  P   T                
Sbjct: 29 APYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLVQGGFDAGARFGPG 88

Query: 68 -----PPPPP 72
               PPPPP
Sbjct: 89 TGSSIPPPPP 98


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 36.0 bits (83), Expect = 0.035
 Identities = 12/50 (24%), Positives = 14/50 (28%), Gaps = 8/50 (16%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           S+P       P   P P    P      P      P P P  +  P P  
Sbjct: 386 SIPANLLHNAPQAAPAPSAAAP-----EPKH---QPAPEPRPVLAPTPAS 427



 Score = 34.5 bits (79), Expect = 0.12
 Identities = 15/68 (22%), Positives = 18/68 (26%), Gaps = 16/68 (23%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
            L N   A P P    P P   P             P P P+      P P  G      
Sbjct: 391 LLHNAPQAAPAPSAAAPEPKHQP------------APEPRPV----LAPTPASGEPNAAA 434

Query: 82  GPLYWTRL 89
               W+ +
Sbjct: 435 VRSMWSTV 442


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 36.2 bits (83), Expect = 0.036
 Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 5/79 (6%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
           ++I +   V +A       PPLP   P +   P PM    P PPP +  P    P     
Sbjct: 340 TDIGNKAVVSSAKKEAEEVPPLPQAAPAV-VKPGPMEIPTPVPPPGLAIPSLVAPPGLVA 398

Query: 78  PHVLGPLYWTRLIVSPEIP 96
           P  + P      + SP   
Sbjct: 399 PTEINP----SFLASPRKK 413



 Score = 35.8 bits (82), Expect = 0.047
 Identities = 30/138 (21%), Positives = 38/138 (27%), Gaps = 12/138 (8%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           + P V    P   P P  PP   +     PP +  P    P     P         P   
Sbjct: 364 AAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVTF 423

Query: 82  GPLYWTRLIVSPEIPSDTTPLWKEL------------EEVPINNLEEFTELFSRQVTAQR 129
           G L  T     P     T+   K L            E       +E  EL  + V    
Sbjct: 424 GALDDTLAWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSE 483

Query: 130 PVTRKRQQKSSKVQNVAR 147
             +   Q+  S   N AR
Sbjct: 484 DASLASQEGMSIRGNSAR 501


>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
          Length = 472

 Score = 35.9 bits (83), Expect = 0.044
 Identities = 12/46 (26%), Positives = 15/46 (32%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           ++ A P   P  P  P    +    P     PP PP   G P    
Sbjct: 86  IEPASPLKVPEMPTGPASAPLPIARPDNPDAPPTPPANPGNPGQVN 131



 Score = 31.7 bits (72), Expect = 0.75
 Identities = 11/50 (22%), Positives = 13/50 (26%), Gaps = 4/50 (8%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +        P  P  P   P P+      P     PP PP     P    
Sbjct: 86  IEPASPLKVPEMPTGPASAPLPI----ARPDNPDAPPTPPANPGNPGQVN 131


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin
          carboxyl carrier protein subunit; Validated.
          Length = 153

 Score = 34.5 bits (79), Expect = 0.045
 Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 1/57 (1%)

Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-PPPPMMGTPPPPPPMMGTTPHVL 81
          +  +      P P P P P + + P P+    P P P   G      PM G    +L
Sbjct: 43 IDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMPGKILRIL 99


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 35.9 bits (83), Expect = 0.047
 Identities = 18/79 (22%), Positives = 25/79 (31%), Gaps = 12/79 (15%)

Query: 5   DKRAKRRKKKETYSNIFS------LPNVQAAPPPPP------PPPPLPPPPPMMGTTPPP 52
           D  +  R  K+    +F+       P+  AA  P P        P  P        TPP 
Sbjct: 367 DDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPT 426

Query: 53  MMGTPPPPPPMMGTPPPPP 71
           +   PP   P+      P 
Sbjct: 427 VSVDPPAAVPVNPPSTAPQ 445



 Score = 33.6 bits (77), Expect = 0.21
 Identities = 18/81 (22%), Positives = 22/81 (27%), Gaps = 2/81 (2%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM--GTTPHVLGPL 84
           Q+A  P   PP +   PP      PP        P         P        P  L P+
Sbjct: 415 QSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPI 474

Query: 85  YWTRLIVSPEIPSDTTPLWKE 105
                  +  I    T   KE
Sbjct: 475 QEKAEQATGNIKEAPTGTQKE 495


>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 427

 Score = 35.5 bits (81), Expect = 0.050
 Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 7/68 (10%)

Query: 4   MDKRAKRRKKKET--YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
           M ++ KR     +  +S   + P  ++A   P P  P        G  P PMM  P  PP
Sbjct: 357 MSEKRKREDPNSSAVFSKATAEPAFKSAMAIPMPSMPHV-----QGFPPFPMMPLPQMPP 411

Query: 62  PMMGTPPP 69
            MM   PP
Sbjct: 412 MMMIANPP 419



 Score = 30.9 bits (69), Expect = 1.6
 Identities = 14/35 (40%), Positives = 14/35 (40%), Gaps = 2/35 (5%)

Query: 47  GTTPPPMMGTPP--PPPPMMGTPPPPPPMMGTTPH 79
              P P M      PP PMM  P  PP MM   P 
Sbjct: 385 MAIPMPSMPHVQGFPPFPMMPLPQMPPMMMIANPP 419


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 35.7 bits (82), Expect = 0.052
 Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 4/57 (7%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
           V   P P PP    PP          P    PP  PP    P P P  +    HV G
Sbjct: 160 VGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPP----PAPHPHPIAEVAHVRG 212



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 14/49 (28%), Positives = 15/49 (30%), Gaps = 9/49 (18%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P     PP   PPP                 G P  P P +  PP P P
Sbjct: 162 PRPVPTPPGGTPPP---------DDDEGDEAGAPATPAPPLHPPPAPHP 201


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 35.5 bits (81), Expect = 0.052
 Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 3/71 (4%)

Query: 5   DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTP---PPMMGTPPPPP 61
            K A   K ++  S +     +     P P PP      P + + P    P   + PPPP
Sbjct: 160 TKVAIISKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPP 219

Query: 62  PMMGTPPPPPP 72
                 P  PP
Sbjct: 220 KQSAKEPQLPP 230



 Score = 33.2 bits (75), Expect = 0.30
 Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 1/55 (1%)

Query: 27  QAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
           Q  P    P    P P PP      P +   P    P   + PPPP      P +
Sbjct: 174 QVTPSQKIPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPSSPPPPKQSAKEPQL 228


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 35.1 bits (81), Expect = 0.054
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPM 53
           V   PPPPPPPPP PPP     T+   +
Sbjct: 180 VLEVPPPPPPPPPPPPPSLQQSTSAIDL 207



 Score = 34.7 bits (80), Expect = 0.071
 Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 9/48 (18%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            P       P  PP    T  P++  PPPPPP     PPPPP      
Sbjct: 162 VPSSSTTSFPISPP----TEEPVLEVPPPPPP-----PPPPPPPSLQQ 200



 Score = 33.2 bits (76), Expect = 0.19
 Identities = 17/42 (40%), Positives = 21/42 (50%), Gaps = 9/42 (21%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
           S  +   +PP   P   +PPPPP     PPP    PPPPP +
Sbjct: 166 STTSFPISPPTEEPVLEVPPPPP-----PPP----PPPPPSL 198



 Score = 31.6 bits (72), Expect = 0.61
 Identities = 11/28 (39%), Positives = 12/28 (42%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPP 51
           P  +      PPPPP PPPPP       
Sbjct: 174 PPTEEPVLEVPPPPPPPPPPPPPSLQQS 201



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 16/58 (27%), Positives = 17/58 (29%), Gaps = 21/58 (36%)

Query: 21  FSLPNVQAAPP-----PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
               +           P  PP   P                PPPPPP    PPPPPP 
Sbjct: 155 LLSSDESVPSSSTTSFPISPPTEEPVLE------------VPPPPPP----PPPPPPP 196



 Score = 28.9 bits (65), Expect = 4.4
 Identities = 16/44 (36%), Positives = 18/44 (40%), Gaps = 9/44 (20%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
            PP   P    PPPP     PPP    PPPPPP +        +
Sbjct: 173 SPPTEEPVLEVPPPP-----PPP----PPPPPPSLQQSTSAIDL 207


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 34.4 bits (79), Expect = 0.067
 Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 3/46 (6%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           Q A     P     P      TTP   MG P  PP +    P P  
Sbjct: 97  QKATSMSEPATENKPAEV---TTPVEPMGLPETPPAVPVPAPAPAV 139



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           +        P     P  PM     PP +  P P P 
Sbjct: 102 MSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPA 138


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 35.2 bits (82), Expect = 0.067
 Identities = 10/49 (20%), Positives = 13/49 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P    A PPPP   P   P   +  T   ++          G       
Sbjct: 393 PPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKS 441



 Score = 33.7 bits (78), Expect = 0.20
 Identities = 15/62 (24%), Positives = 18/62 (29%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
             LP  +  P    P               PP     PPPP       P  P+  TT  +
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422

Query: 81  LG 82
           L 
Sbjct: 423 LA 424



 Score = 32.9 bits (76), Expect = 0.40
 Identities = 10/56 (17%), Positives = 13/56 (23%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
                A   PP  P +PPPP       P +         +            T   
Sbjct: 384 TAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439



 Score = 32.9 bits (76), Expect = 0.43
 Identities = 13/55 (23%), Positives = 14/55 (25%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
               P P  PP    P      T  P     PP  P +   P   P      P  
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLP 416



 Score = 32.1 bits (74), Expect = 0.68
 Identities = 10/57 (17%), Positives = 14/57 (24%), Gaps = 2/57 (3%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           S     +A     P   + PP       PPP    P   P +         +     
Sbjct: 374 SAAPAASAQATAAPTAAVAPPQAPAV--PPPPASAPQQAPAVPLPETTSQLLAARQQ 428



 Score = 32.1 bits (74), Expect = 0.73
 Identities = 14/57 (24%), Positives = 16/57 (28%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            A  P P  PP    P      T  P     PP  P +  PP   P       +   
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418



 Score = 31.4 bits (72), Expect = 1.1
 Identities = 12/58 (20%), Positives = 14/58 (24%), Gaps = 3/58 (5%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPMMGTPPPPPPMMGTTP 78
           P      P  PP    P         P   +  P     PPP    P   P +     
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418



 Score = 31.4 bits (72), Expect = 1.3
 Identities = 9/87 (10%), Positives = 14/87 (16%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P   A    P  P P      +                            + +    L  
Sbjct: 402 PPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLAS 461

Query: 84  LYWTRLIVSPEIPSDTTPLWKELEEVP 110
           +      +           WK    V 
Sbjct: 462 VRPAPSALEKAPAKKEAYRWKATNPVE 488



 Score = 28.7 bits (65), Expect = 8.3
 Identities = 7/50 (14%), Positives = 11/50 (22%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           L   Q A       P        + +    +    P P  +   P     
Sbjct: 429 LQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA 478


>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
          Length = 547

 Score = 35.2 bits (82), Expect = 0.069
 Identities = 13/55 (23%), Positives = 13/55 (23%), Gaps = 3/55 (5%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
           V AA P     P    P       P P       P       P      G  PH 
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAA---AAPAAAAAPGKAPHA 245



 Score = 29.8 bits (68), Expect = 3.2
 Identities = 8/38 (21%), Positives = 8/38 (21%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
               AA  P       P P P     P          P
Sbjct: 202 AAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAP 239


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 34.9 bits (80), Expect = 0.083
 Identities = 18/57 (31%), Positives = 20/57 (35%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           Q+ PP  P P  +   PP  G  P      PP P P M   P   P  G       P
Sbjct: 189 QSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245



 Score = 34.5 bits (79), Expect = 0.10
 Identities = 12/48 (25%), Positives = 13/48 (27%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P  Q      P PPP  P      + P        PPP     P    
Sbjct: 131 PPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSG 178



 Score = 34.1 bits (78), Expect = 0.13
 Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 1/59 (1%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           +  P  Q  PPP   P    P PP           +PP  P     PPP          
Sbjct: 121 YYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQ-SPPQQPQYQQNPPPQAQSAPQVSG 178



 Score = 32.2 bits (73), Expect = 0.55
 Identities = 14/48 (29%), Positives = 16/48 (33%), Gaps = 6/48 (12%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P  Q+AP      P   P  P    + PP     P P  M   PP   
Sbjct: 168 PQAQSAPQVSGLYPEESPYQPQ---SYPP---NEPLPSSMAMQPPYSG 209



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 15/59 (25%), Positives = 16/59 (27%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLY 85
            + P PPP   P    P      P       PP  P     PPP          L P  
Sbjct: 125 PSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEE 183



 Score = 30.3 bits (68), Expect = 2.0
 Identities = 16/67 (23%), Positives = 19/67 (28%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV 80
            +      A  PP P P  P P     +  PP    P   PP      P  P     P+ 
Sbjct: 63  DAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYY 122

Query: 81  LGPLYWT 87
             P    
Sbjct: 123 PPPSQPQ 129



 Score = 28.4 bits (63), Expect = 9.6
 Identities = 14/54 (25%), Positives = 16/54 (29%), Gaps = 5/54 (9%)

Query: 31  PPPPPPPPLPPPPPMMGTT-----PPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           P   PP  +   P           PPP    PPP        P PPP +     
Sbjct: 99  PSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQ 152


>gnl|CDD|147144 pfam04834, Adeno_E3_14_5, Early E3 14.5 kDa protein.  The E3B
          14.5 kDa was first identified in Human adenovirus type
          5. It is an integral membrane protein oriented with its
          C terminus in the cytoplasm. It functions to
          down-regulate the epidermal growth factor receptor and
          prevent tumour necrosis factor cytolysis. It achieves
          this through the interaction with E3 10.4 kDa protein.
          Length = 97

 Score = 32.4 bits (74), Expect = 0.088
 Identities = 10/21 (47%), Positives = 11/21 (52%)

Query: 24 PNVQAAPPPPPPPPPLPPPPP 44
          P+ Q  P   PPPPP P P  
Sbjct: 63 PDPQHIPLQQPPPPPEPQPRA 83



 Score = 27.0 bits (60), Expect = 6.5
 Identities = 10/26 (38%), Positives = 13/26 (50%), Gaps = 1/26 (3%)

Query: 16 TYSNIFSLPNVQAAPPPPPPPPPLPP 41
           Y +   +P +Q  PPPP P P  P 
Sbjct: 61 RYPDPQHIP-LQQPPPPPEPQPRAPS 85


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 35.0 bits (80), Expect = 0.099
 Identities = 14/52 (26%), Positives = 16/52 (30%), Gaps = 3/52 (5%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            +     PP  P    P   G  P      PPP  P      PP    G+ P
Sbjct: 440 HSGLHSGPPQSPFAQHPFTSGGLPAI---GPPPSLPTSTPAAPPRASSGSQP 488



 Score = 32.4 bits (73), Expect = 0.59
 Identities = 24/107 (22%), Positives = 30/107 (28%), Gaps = 24/107 (22%)

Query: 24  PNVQAAPPPPPPPPPL----------------PPPP--PMMGTTPPPMMGTPPPPPPMMG 65
             + A P P P  P                  P PP  P   +   P    P    P   
Sbjct: 208 SPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSS 267

Query: 66  TPPPPPPMMGTTPHVL--GPLYWTRLIVSPEIPSDTTPLWKELEEVP 110
              P PPM    PH L  GP++      +P  P            +P
Sbjct: 268 HHGPGPPM----PHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLP 310



 Score = 32.4 bits (73), Expect = 0.61
 Identities = 15/46 (32%), Positives = 17/46 (36%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           A  PPP  P   P  PP   +   P     P      G  PP PP+
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI 509



 Score = 31.6 bits (71), Expect = 1.1
 Identities = 20/62 (32%), Positives = 21/62 (33%), Gaps = 15/62 (24%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP---------------PPPP 72
             P  PP   PLPP P M    PPP    P  P      PP               PPPP
Sbjct: 326 LQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPP 385

Query: 73  MM 74
            +
Sbjct: 386 AL 387



 Score = 31.2 bits (70), Expect = 1.5
 Identities = 24/100 (24%), Positives = 34/100 (34%), Gaps = 3/100 (3%)

Query: 5   DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPP--PPPMMGTTPPPMMGTPPPPPP 62
           D  A+++  +        +P   A  P  PPP P     PP        P      P P 
Sbjct: 165 DSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPL 224

Query: 63  MMGTPPPPPPMMGTTPH-VLGPLYWTRLIVSPEIPSDTTP 101
            + + P   P    +PH  L P   ++    P  PS   P
Sbjct: 225 SLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHP 264



 Score = 30.4 bits (68), Expect = 2.2
 Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 7/60 (11%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPM-----MGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P++Q   P P  P  LPPPP +     + T  PP     PPP  +M    P   +    P
Sbjct: 366 PHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPP--SAHPPPLQLMPQSQPLQSVPAQPP 423



 Score = 28.5 bits (63), Expect = 8.9
 Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 12/69 (17%)

Query: 28  AAPPPP----PPPPPLPPP---PPMMG---TTPPPMMGTPPPPPPMMGTPPPPP--PMMG 75
           +A PPP    P   PL      PP++    + PP     P           P    P   
Sbjct: 400 SAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTS 459

Query: 76  TTPHVLGPL 84
                +GP 
Sbjct: 460 GGLPAIGPP 468



 Score = 28.5 bits (63), Expect = 9.4
 Identities = 15/56 (26%), Positives = 19/56 (33%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           P            P  P PP  +   P      P  PPP   +P P P ++ T  H
Sbjct: 489 PGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSH 544


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 34.8 bits (80), Expect = 0.10
 Identities = 12/57 (21%), Positives = 15/57 (26%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
                     PLP   P     P P        PP++     P  +      VL  L
Sbjct: 381 PTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLALRWRDVLAAL 437



 Score = 31.7 bits (72), Expect = 0.85
 Identities = 11/45 (24%), Positives = 12/45 (26%), Gaps = 6/45 (13%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           +AP         P P         P    PP P P      PP  
Sbjct: 379 SAPTAAATAAGAPLP------DFDPRPRGPPAPEPARSAEAPPLV 417



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 12/43 (27%), Positives = 12/43 (27%), Gaps = 5/43 (11%)

Query: 26  VQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
             AA     P P   P P  PP     P      PP   P   
Sbjct: 382 TAAATAAGAPLPDFDPRPRGPP--APEPARSAEAPPLVAPAAA 422



 Score = 29.4 bits (66), Expect = 4.7
 Identities = 16/70 (22%), Positives = 16/70 (22%), Gaps = 10/70 (14%)

Query: 19  NIFSLPNVQAAPPPPPP------PPPLPPPP---PMMGTTPPPMMGTPPPPP-PMMGTPP 68
              S      APP P P         L           T      G P P   P    PP
Sbjct: 344 GAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPP 403

Query: 69  PPPPMMGTTP 78
            P P      
Sbjct: 404 APEPARSAEA 413



 Score = 28.3 bits (63), Expect = 9.0
 Identities = 12/67 (17%), Positives = 17/67 (25%), Gaps = 4/67 (5%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
             V++    P          P+    P P     P P      PP   P           
Sbjct: 372 KEVRSLRSAPTAAAT-AAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAG---LA 427

Query: 84  LYWTRLI 90
           L W  ++
Sbjct: 428 LRWRDVL 434


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 34.5 bits (78), Expect = 0.10
 Identities = 25/84 (29%), Positives = 32/84 (38%), Gaps = 11/84 (13%)

Query: 29  APPPPPP----PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP------PPPPPMMGTTP 78
           AP  P P    P P P  PP    T PP    P P  P    P      PP PP+    P
Sbjct: 99  APAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKP 158

Query: 79  H-VLGPLYWTRLIVSPEIPSDTTP 101
                P++    +  P+ P+ + P
Sbjct: 159 APAAKPIFLHNQLPPPDYPAASCP 182



 Score = 28.3 bits (62), Expect = 8.0
 Identities = 19/74 (25%), Positives = 20/74 (27%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
           A  P    P P  P        P      P P  P    P  PPP +   P    P    
Sbjct: 83  APSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPP 142

Query: 88  RLIVSPEIPSDTTP 101
                P  P   TP
Sbjct: 143 STRQCPPAPPLPTP 156


>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain.  This
           domain is found at the C-terminus of the Sox family of
           transcription factors. It is found associated with
           pfam00505. It binds to the Armadillo repeats (pfam00514)
           in Catenin beta-1 (CTNNB1), which is involved in
           transcriptional regulation. It functions as a
           transactivating domain (TAD).
          Length = 197

 Score = 33.9 bits (78), Expect = 0.10
 Identities = 14/60 (23%), Positives = 18/60 (30%), Gaps = 10/60 (16%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMM--------GTPPPPPPMMGTPPPPP--PMMGTTPHV 80
                 P   P P +MG   PP M              P  +G   PPP    + T   +
Sbjct: 74  AVTENIPQGSPAPSIMGCPTPPQMYYGQMYVPECAKHHPVQLGQLSPPPESQHLDTLDQL 133



 Score = 29.7 bits (67), Expect = 2.5
 Identities = 11/49 (22%), Positives = 15/49 (30%), Gaps = 3/49 (6%)

Query: 26 VQAAPPP--PPPPPPLPPPPPMMGTTP-PPMMGTPPPPPPMMGTPPPPP 71
           Q  P        P     P ++        +    P P +MG P PP 
Sbjct: 48 CQMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGCPTPPQ 96


>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
          Length = 248

 Score = 34.2 bits (78), Expect = 0.11
 Identities = 14/31 (45%), Positives = 15/31 (48%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
           V   PPPPPPP    P PP+    P  M  T
Sbjct: 117 VSPPPPPPPPPARAEPAPPVARPAPGRMQRT 147



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPP 52
             +PPPPPPPPP    P      P P
Sbjct: 116 AVSPPPPPPPPPARAEPAPPVARPAP 141



 Score = 28.8 bits (64), Expect = 5.3
 Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 4/26 (15%)

Query: 47  GTTPPPMMGTPPPPPPMMGTPPPPPP 72
             +PPP    PPPPPP    P PP  
Sbjct: 116 AVSPPP----PPPPPPARAEPAPPVA 137



 Score = 28.8 bits (64), Expect = 5.6
 Identities = 16/38 (42%), Positives = 17/38 (44%), Gaps = 7/38 (18%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM 64
           +AA  PPPPP   PPPP      PP       P P  M
Sbjct: 114 RAAVSPPPPP---PPPPARAEPAPP----VARPAPGRM 144


>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
          Length = 499

 Score = 34.7 bits (79), Expect = 0.11
 Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 6/71 (8%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLI 90
           P   P       PP+  +   PM        PM      PPPM    PHV  P    R++
Sbjct: 402 PYGAPKNDHHLLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPM--QPPHVQQP----RVL 455

Query: 91  VSPEIPSDTTP 101
            S +  S+  P
Sbjct: 456 PSTDGASNEAP 466


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 16/57 (28%), Positives = 17/57 (29%), Gaps = 1/57 (1%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPP-MMGTPPPPPPMMGTPPPPPPMMGTTP 78
            P      P PP  P  PP P        P     P  P P     P P P +   P
Sbjct: 59  EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115



 Score = 30.5 bits (69), Expect = 1.6
 Identities = 16/55 (29%), Positives = 17/55 (30%), Gaps = 5/55 (9%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P  +  PP   P PP  P  P     P P    P   P     P  P P     P
Sbjct: 56  PQPEPEPPEEQPKPPTEPETP-----PEPTPPKPKEKPKPEKKPKKPKPKPKPKP 105



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 11/49 (22%), Positives = 12/49 (24%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P  +     P PP     PP      P         P      P P P 
Sbjct: 58  PEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPK 106



 Score = 28.2 bits (63), Expect = 8.9
 Identities = 16/53 (30%), Positives = 19/53 (35%), Gaps = 1/53 (1%)

Query: 21 FSLPNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           ++  + A     P   P P P PP     PP    TPP P P      P P 
Sbjct: 39 LAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 34.2 bits (78), Expect = 0.14
 Identities = 10/50 (20%), Positives = 10/50 (20%), Gaps = 2/50 (4%)

Query: 25  NVQAAPPPPPPPPPL--PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           N   A P     P        P            P PP P         P
Sbjct: 204 NAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAP 253



 Score = 32.7 bits (74), Expect = 0.47
 Identities = 11/53 (20%), Positives = 13/53 (24%), Gaps = 1/53 (1%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
              +A  P      P P P        P     P P P     P    P+   
Sbjct: 211 AEEEAPAPSEAGSEPAPDPAARAPHAAPD-PPAPAPAPAKTAAPAAAAPVSSG 262



 Score = 32.3 bits (73), Expect = 0.61
 Identities = 14/56 (25%), Positives = 15/56 (26%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           A   P  P     P P   G+ P P      P        P P P     P    P
Sbjct: 203 ANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAP 258



 Score = 29.2 bits (65), Expect = 5.1
 Identities = 12/38 (31%), Positives = 13/38 (34%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
           + P P  P P   P P      P P    P P  P  G
Sbjct: 84  SEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAG 121


>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate
          oxidase.
          Length = 544

 Score = 34.1 bits (78), Expect = 0.15
 Identities = 11/40 (27%), Positives = 14/40 (35%)

Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
              P P+ PPPP   +       T     P  G+  PP 
Sbjct: 15 QSLLPLPISPPPPHSSSLSSSPSPTQRFLTPSQGSRLPPR 54


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 34.1 bits (78), Expect = 0.16
 Identities = 14/49 (28%), Positives = 15/49 (30%), Gaps = 5/49 (10%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P+  AAP  P   P  PPP       P        P P       PP  
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKP-----ASSPEPKASKPSAPPSS 245



 Score = 29.4 bits (66), Expect = 5.0
 Identities = 12/52 (23%), Positives = 15/52 (28%)

Query: 42  PPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
           P        P    +PPPP       P   P    +     P    R+  SP
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSGDRIFASP 253


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
          transactivator with ED-rich tail, are characterized by
          a conserved 32-amino acid sequence at the C-terminus.
          CITED proteins do not bind DNA directly and are thought
          to function as transcriptional co-activators.
          Length = 206

 Score = 33.3 bits (76), Expect = 0.16
 Identities = 9/40 (22%), Positives = 10/40 (25%)

Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
          P      P+    P        M      P P M  P P 
Sbjct: 59 PRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQ 98



 Score = 33.3 bits (76), Expect = 0.18
 Identities = 16/75 (21%), Positives = 20/75 (26%), Gaps = 20/75 (26%)

Query: 24  PNVQAAPPPPPPPPPL-------------------PPPPPMMGTTPPPMMGTPPPPP-PM 63
           P+ +  P    P P L                   P   P  G       G   PP    
Sbjct: 84  PSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGMQH 143

Query: 64  MGTPPPPPPMMGTTP 78
           M  P  PP ++ T  
Sbjct: 144 MPAPALPPNVIDTDL 158



 Score = 32.9 bits (75), Expect = 0.22
 Identities = 15/76 (19%), Positives = 17/76 (22%), Gaps = 16/76 (21%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMG--------TTPPPMMGTPP-------PPPPMMGTP 67
              +   P   P P  L P P +M         T      G P        P     G  
Sbjct: 77  PAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAG 136

Query: 68  PPPP-PMMGTTPHVLG 82
            PP    M        
Sbjct: 137 QPPGMQHMPAPALPPN 152


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 33.9 bits (78), Expect = 0.17
 Identities = 15/53 (28%), Positives = 20/53 (37%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           S++ +  +     PP   P P    P    + P    GTPPPP       PP 
Sbjct: 406 SHVGASSSKHHRLPPSVLPGPRLSSPSPSPSLPTRRPGTPPPPASPPTPSPPS 458


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members
          of this family confer resistance to the metalloid
          element tellurium and its salts.
          Length = 98

 Score = 31.6 bits (72), Expect = 0.18
 Identities = 12/34 (35%), Positives = 12/34 (35%), Gaps = 11/34 (32%)

Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
             P PPP P PP P             PP  PP
Sbjct: 1  PAAPVPPPAPAPPAPA-----------PPPAAPP 23



 Score = 30.8 bits (70), Expect = 0.35
 Identities = 11/18 (61%), Positives = 11/18 (61%)

Query: 27 QAAPPPPPPPPPLPPPPP 44
           AAP PPP P P  P PP
Sbjct: 1  PAAPVPPPAPAPPAPAPP 18



 Score = 30.4 bits (69), Expect = 0.46
 Identities = 8/21 (38%), Positives = 8/21 (38%)

Query: 24 PNVQAAPPPPPPPPPLPPPPP 44
                P P PP P  PP  P
Sbjct: 2  AAPVPPPAPAPPAPAPPPAAP 22



 Score = 27.0 bits (60), Expect = 7.0
 Identities = 8/17 (47%), Positives = 8/17 (47%)

Query: 28 AAPPPPPPPPPLPPPPP 44
             P PP P P P  PP
Sbjct: 7  PPAPAPPAPAPPPAAPP 23



 Score = 27.0 bits (60), Expect = 7.0
 Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 18/41 (43%)

Query: 31 PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
          P  P PPP P PP               P PP    P  PP
Sbjct: 1  PAAPVPPPAPAPPA--------------PAPP----PAAPP 23


>gnl|CDD|147759 pfam05782, ECM1, Extracellular matrix protein 1 (ECM1).  This
           family consists of several eukaryotic extracellular
           matrix protein 1 (ECM1) sequences. ECM1 has been shown
           to regulate endochondral bone formation, stimulate the
           proliferation of endothelial cells and induce
           angiogenesis. Mutations in the ECM1 gene can cause
           lipoid proteinosis, a disorder which causes generalised
           thickening of skin, mucosae and certain viscera.
           Classical features include beaded eyelid papules and
           laryngeal infiltration leading to hoarseness.
          Length = 419

 Score = 33.8 bits (77), Expect = 0.18
 Identities = 30/91 (32%), Positives = 35/91 (38%), Gaps = 7/91 (7%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWT 87
           AAPP PP    LP   P+     PP  G     PP   +P   P      P  L P    
Sbjct: 48  AAPPSPPLSRRLPVDHPVTSQHDPPFEGQSEVQPP--PSPEDIPVYEEDWPTFLNP---N 102

Query: 88  RLIVSPEIPSDTTPLWKELE--EVPINNLEE 116
                P +P +  PL KEL   +VPI   E 
Sbjct: 103 VDKAGPAVPQEAIPLQKELPPPQVPIEQKEV 133


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 32.6 bits (73), Expect = 0.19
 Identities = 7/49 (14%), Positives = 9/49 (18%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P+ +      P P       P                       P P P
Sbjct: 60  PDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108



 Score = 31.1 bits (69), Expect = 0.54
 Identities = 13/55 (23%), Positives = 14/55 (25%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P  +  P P       P P       P P       P P     P P P     P
Sbjct: 64  PEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEP 118



 Score = 31.1 bits (69), Expect = 0.58
 Identities = 10/72 (13%), Positives = 16/72 (22%), Gaps = 4/72 (5%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGT 66
           + K  K  +  ++  +  +    P     P P                     P P    
Sbjct: 41  KGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEG----EEEPEPEETG 96

Query: 67  PPPPPPMMGTTP 78
              P P     P
Sbjct: 97  EEEPEPEPEPEP 108



 Score = 31.1 bits (69), Expect = 0.59
 Identities = 15/61 (24%), Positives = 16/61 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
              +  P P       P P       P P     P P P     P P P   T   V  P
Sbjct: 74  EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133

Query: 84  L 84
            
Sbjct: 134 F 134


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 33.8 bits (77), Expect = 0.20
 Identities = 14/64 (21%), Positives = 22/64 (34%)

Query: 8   AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
              R++ +    I      ++     PP P     P ++    P   GT   P P+   P
Sbjct: 378 PADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP 437

Query: 68  PPPP 71
           P P 
Sbjct: 438 PQPT 441



 Score = 33.4 bits (76), Expect = 0.26
 Identities = 19/117 (16%), Positives = 27/117 (23%), Gaps = 11/117 (9%)

Query: 28  AAPPPPPPPPPLPPPPPMM---GTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
           AA        P     P        P  +  + P   PM   PP P              
Sbjct: 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNP 421

Query: 85  YWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQRPVTRKRQQKSSK 141
                   PE      P       +PI+         +  V    P     ++K  +
Sbjct: 422 QSPGTSYGPEPVGPVPPQPTNPYVMPIS--------MANMVYPGHPQEHGHERKRKR 470


>gnl|CDD|177451 PHA02669, PHA02669, hypothetical protein; Provisional.
          Length = 210

 Score = 33.1 bits (75), Expect = 0.21
 Identities = 10/47 (21%), Positives = 13/47 (27%), Gaps = 4/47 (8%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP----PPPPMMGTPPPP 70
           + PP     PP+ P P  +         T         P     P P
Sbjct: 140 SGPPRQANIPPVTPYPDEVSVGVGSGPSTEHGHYEGDGPEQDLEPEP 186


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 33.5 bits (76), Expect = 0.21
 Identities = 19/45 (42%), Positives = 19/45 (42%), Gaps = 3/45 (6%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMM---GTPPPPPPMMGTPPPPP 71
            PPPPPP   P P PM  T PP      G   P  P   TP   P
Sbjct: 227 APPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTSP 271



 Score = 28.5 bits (63), Expect = 8.2
 Identities = 18/46 (39%), Positives = 19/46 (41%)

Query: 33  PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P P    PPPPPM    P PM  T PP     G    P     +TP
Sbjct: 222 PTPMLAPPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTP 267


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 33.6 bits (77), Expect = 0.22
 Identities = 14/85 (16%), Positives = 20/85 (23%), Gaps = 2/85 (2%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMM--GTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           P     P    P   +  P          PPM     PP           P   +T    
Sbjct: 428 PGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQG 487

Query: 82  GPLYWTRLIVSPEIPSDTTPLWKEL 106
           G       +++   P     +  E 
Sbjct: 488 GQNKKLAQVLASATPQMQKQVLGER 512



 Score = 29.4 bits (66), Expect = 4.8
 Identities = 12/48 (25%), Positives = 14/48 (29%), Gaps = 1/48 (2%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
                 P P  P  P  P  G  P   +  P          PP  P+M
Sbjct: 417 PRMSMMPTPMGPGGPLRPN-GLAPMNAVRAPSRNAQNAAQKPPMQPVM 463


>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
          Length = 931

 Score = 33.6 bits (77), Expect = 0.23
 Identities = 12/61 (19%), Positives = 16/61 (26%), Gaps = 2/61 (3%)

Query: 13  KKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMG--TTPPPMMGTPPPPPPMMGTPPPP 70
           +                PPP PP P      P     +  P     P P  P+  +    
Sbjct: 61  EAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVEPSLAAN 120

Query: 71  P 71
           P
Sbjct: 121 P 121



 Score = 30.6 bits (69), Expect = 1.9
 Identities = 17/94 (18%), Positives = 24/94 (25%), Gaps = 4/94 (4%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTT---PPPMMGTPPPPPPM 63
           R  R    E  +   +    Q A P      PLP     + +    PP            
Sbjct: 33  RELRSLVAEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAP 92

Query: 64  MGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPS 97
               P   P     P    P+    L  +P   +
Sbjct: 93  AAEQPSAVPAPSAAPAPAEPV-EPSLAANPFAAA 125


>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter.  The Epithelial Na+
           Channel (ENaC) Family (TC 1.A.06)The ENaC family
           consists of sodium channels from animals and has no
           recognizable homologues in other eukaryotes or bacteria.
           The vertebrate ENaC proteins from epithelial cells
           cluster tightly together on the phylogenetic tree:
           voltage-insensitive ENaC homologues are also found in
           the brain. Eleven sequenced C. elegans proteins,
           including the degenerins, are distantly related to the
           vertebrate proteins as well as to each other. At least
           some ofthese proteins form part of a mechano-transducing
           complex for touch sensitivity. Other members of the ENaC
           family, the acid-sensing ion channels, ASIC1-3,are homo-
           or hetero-oligomeric neuronal H+-gated channels that
           mediate pain sensation in response to tissue acidosis.
           The homologous Helix aspersa(FMRF-amide)-activated Na+
           channel is the first peptide neurotransmitter-gated
           ionotropic receptor to be sequenced.Mammalian ENaC is
           important for the maintenance of Na+ balance and the
           regulation of blood pressure. Three homologous ENaC
           subunits, a, b and g, havebeen shown to assemble to form
           the highly Na+-selective channel.This model is designed
           from the vertebrate members of the ENaC family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 595

 Score = 33.5 bits (77), Expect = 0.24
 Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 6/64 (9%)

Query: 6   KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMG 65
            R ++  ++         P   +A  PP      PP  P     P P          + G
Sbjct: 533 WRFRKWWQRRRGPPYAEPPEPVSADTPPSLQLDDPPTFP--SALPLP----HASGLSLPG 586

Query: 66  TPPP 69
           TPPP
Sbjct: 587 TPPP 590



 Score = 32.4 bits (74), Expect = 0.56
 Identities = 11/43 (25%), Positives = 13/43 (30%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           A PP P      P        T P  +  P      +   PPP
Sbjct: 548 AEPPEPVSADTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 12/50 (24%), Positives = 15/50 (30%), Gaps = 5/50 (10%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
            PP   PP P+    P            PP  P  +  P      +  TP
Sbjct: 544 GPPYAEPPEPVSADTPP-SLQLD----DPPTFPSALPLPHASGLSLPGTP 588



 Score = 28.9 bits (65), Expect = 6.4
 Identities = 10/25 (40%), Positives = 12/25 (48%), Gaps = 1/25 (4%)

Query: 30  PPPPPPPPPLPPPPPM-MGTTPPPM 53
           PP  P   PLP    + +  TPPP 
Sbjct: 567 PPTFPSALPLPHASGLSLPGTPPPN 591


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 33.1 bits (76), Expect = 0.27
 Identities = 13/75 (17%), Positives = 17/75 (22%)

Query: 25  NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
           +    P   P P       P    TP       P   P       P      T     P 
Sbjct: 167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPA 226

Query: 85  YWTRLIVSPEIPSDT 99
                  +  +P+D 
Sbjct: 227 APATPDGAAPLPTDQ 241



 Score = 29.6 bits (67), Expect = 3.4
 Identities = 14/60 (23%), Positives = 15/60 (25%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           P V  AP P   P       P            P  P    G  P P    G +     P
Sbjct: 192 PAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 33.3 bits (76), Expect = 0.29
 Identities = 10/46 (21%), Positives = 12/46 (26%), Gaps = 1/46 (2%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-PPPPMMGTPPPPPP 72
           AAP P  P   +  P P                      +   PPP
Sbjct: 583 AAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628



 Score = 29.8 bits (67), Expect = 3.2
 Identities = 10/45 (22%), Positives = 11/45 (24%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           A P   P   P P  P +    P P         P  G       
Sbjct: 575 AKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQA 619



 Score = 29.0 bits (65), Expect = 7.2
 Identities = 8/56 (14%), Positives = 9/56 (16%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
           AA  P   P   P P         P                        +     P
Sbjct: 573 AAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628


>gnl|CDD|219241 pfam06958, Pyocin_S, S-type Pyocin.  This family represents a
           conserved region approximately 180 residues long within
           bacterial S-type pyocins. Pyocins are polypeptide toxins
           produced by, and active against, bacteria. S-type
           pyocins cause cell death by DNA breakdown due to
           endonuclease activity.
          Length = 139

 Score = 31.9 bits (73), Expect = 0.30
 Identities = 16/60 (26%), Positives = 19/60 (31%), Gaps = 7/60 (11%)

Query: 24  PNVQAAPPPPPPPP-----PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
             V        P       P  PP  +  +T  P    PP  P + GTP  P      TP
Sbjct: 54  TGVYEFTLDAVPGRTILWTPASPPGDIPSSTSSP--APPPDTPVITGTPITPVVDPVITP 111



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 10/36 (27%), Positives = 13/36 (36%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
             P     P P P  P + GT   P++     P P 
Sbjct: 79  DIPSSTSSPAPPPDTPVITGTPITPVVDPVITPYPA 114



 Score = 27.7 bits (62), Expect = 7.0
 Identities = 14/44 (31%), Positives = 15/44 (34%), Gaps = 2/44 (4%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           A PP   P     P P      P + GTP  P       P P P
Sbjct: 74  ASPPGDIPSSTSSPAP--PPDTPVITGTPITPVVDPVITPYPAP 115


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 33.1 bits (75), Expect = 0.32
 Identities = 19/68 (27%), Positives = 24/68 (35%), Gaps = 6/68 (8%)

Query: 8   AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
           AK  +K +           +A PP   P    PP         PP+M  P   P +M TP
Sbjct: 530 AKPHRKVQDGFQRSGRRQKRATPPKVSPSDRGPP------KASPPVMAPPSTGPRVMATP 583

Query: 68  PPPPPMMG 75
              P  M 
Sbjct: 584 STGPRDMA 591



 Score = 31.9 bits (72), Expect = 0.93
 Identities = 12/42 (28%), Positives = 16/42 (38%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   P  P+ P           ++  PP  PP +   P PPP
Sbjct: 455 PSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPP 496



 Score = 30.0 bits (67), Expect = 2.9
 Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 5/55 (9%)

Query: 24  PNVQAAPPPPPPP----PPLPPPPPMMGTTPPPMMGTPPPPPPM-MGTPPPPPPM 73
           P++Q+   PP       P LP    ++ ++ P      P  P   + T  PPPPM
Sbjct: 852 PHLQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPTRFPPPPM 906



 Score = 29.6 bits (66), Expect = 4.2
 Identities = 14/50 (28%), Positives = 18/50 (36%), Gaps = 1/50 (2%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           S P  +  P   P  P  P     +  T   +   P  PP +   P PPP
Sbjct: 448 STPE-RPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPP 496


>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
           Provisional.
          Length = 445

 Score = 32.9 bits (75), Expect = 0.34
 Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 8/56 (14%)

Query: 7   RAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
            A +R+    Y+ +F+   + A  PPPPPP     PP +      P +  P    P
Sbjct: 109 EAVKRQNGSNYTVVFT---INADVPPPPPP-----PPVVAKRVETPAVVAPRVSEP 156


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 33.1 bits (75), Expect = 0.35
 Identities = 14/47 (29%), Positives = 16/47 (34%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
              Q + P P  P   PP     G  P P    P P P M  +P   
Sbjct: 428 QPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQ 474



 Score = 33.1 bits (75), Expect = 0.39
 Identities = 20/58 (34%), Positives = 22/58 (37%), Gaps = 5/58 (8%)

Query: 24  PNVQAAPPP--PPPPPPLPPPPPMMGTTPPPM-MGTPPPPPPMMGTPPPPPPMMGTTP 78
           P  Q       P PP P  P P   G+ PP    G   P PP     P P P M  +P
Sbjct: 416 PVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPP--ALMPSPSPQMSQSP 471



 Score = 29.6 bits (66), Expect = 4.2
 Identities = 11/42 (26%), Positives = 15/42 (35%), Gaps = 1/42 (2%)

Query: 30  PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           PP       +P PP +M  +P P M   P     +      P
Sbjct: 444 PPQSVSGGMIPSPPALM-PSPSPQMSQSPASQRTIQQDMVSP 484


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 32.8 bits (74), Expect = 0.36
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMG 55
           P   AAPP  P P    P  P+     P ++G
Sbjct: 114 PAPAAAPPAAPAPAADTPAAPIPDAVQPAILG 145



 Score = 31.7 bits (71), Expect = 0.85
 Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 13/61 (21%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL 81
           S+P +  A  P P P P P   P            P  P P   TP  P P     P +L
Sbjct: 98  SMPGLLGAAAPVPAPAPAPAAAP------------PAAPAPAADTPAAPIP-DAVQPAIL 144

Query: 82  G 82
           G
Sbjct: 145 G 145



 Score = 30.2 bits (67), Expect = 2.4
 Identities = 13/42 (30%), Positives = 16/42 (38%), Gaps = 4/42 (9%)

Query: 37  PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P +P  P ++G   P     P P P     PP  P     TP
Sbjct: 94  PAMPSMPGLLGAAAP----VPAPAPAPAAAPPAAPAPAADTP 131


>gnl|CDD|221170 pfam11696, DUF3292, Protein of unknown function (DUF3292).  This
           eukaryotic family of proteins has no known function.
          Length = 641

 Score = 32.8 bits (75), Expect = 0.37
 Identities = 11/44 (25%), Positives = 14/44 (31%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           AAP PPPP   L   P    +     +        +     P P
Sbjct: 402 AAPLPPPPSSSLRKAPSSPASIDHKQLNLGASEEEIDQAAAPEP 445


>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus.  This family includes
          the NUT protein. The gene encoding for NUT protein
          (Nuclear Testis protein) is found fused to BRD3 or BRD4
          genes, in some aggressive types of carcinoma, due to
          chromosomal translocations. Proteins of the BRD family
          contain two bromodomains that bind transcriptionally
          active chromatin through associations with acetylated
          histones H3 and H4. Such proteins are crucial for the
          regulation of cell cycle progression. On the other
          hand, little is known about NUT protein. NUT is known
          to have a Nuclear Export Sequence (NES) as well as a
          Nuclear Localization Signal (NLS), both located towards
          the C-terminal end of the protein. A fused NUT-GFP
          protein showed either cytoplasmic or nuclear
          localization, suggesting that it is subject to
          nuclear/cytoplasmic shuttling. Consistent with this
          possibility, treatment with leptomycin B an inhibitor
          of CRM1-dependent nuclear export resulted in
          re-distribution of NUT-GFP to the nucleus. Inspection
          of NUT revealed a C-terminal sequence similar to known
          nuclear export sequences (NES) which are often
          regulated by phosphorylation.
          Length = 328

 Score = 32.6 bits (74), Expect = 0.37
 Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 4/57 (7%)

Query: 32 PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM----GTTPHVLGPL 84
          P  PP P PP  P     PP +     P  P++ +  P  P++    G      G  
Sbjct: 22 PFLPPAPQPPDQPFWEPPPPLVTAGFSPGNPLVLSALPSMPLVTEGGGPGLSGAGAG 78



 Score = 28.4 bits (63), Expect = 8.0
 Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 10/60 (16%)

Query: 24 PNVQAAP----PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM-MGTPPPPPPMMGTTP 78
          P++   P     P    P LPP P      PP      PPPP +  G  P  P ++   P
Sbjct: 5  PDMTMNPGASLSPFAALPFLPPAP-----QPPDQPFWEPPPPLVTAGFSPGNPLVLSALP 59



 Score = 28.0 bits (62), Expect = 9.6
 Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)

Query: 26  VQAAPPPPPPPP--PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           +   P PPPPPP   L P   +    P P   T    P  +  P P 
Sbjct: 150 ILGLPLPPPPPPVAQLVPIVSLENAWPGPQGATGEGGPAAIQKPSPG 196


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 32.2 bits (73), Expect = 0.41
 Identities = 14/49 (28%), Positives = 18/49 (36%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            +V   P   PP    P   P+   TP P+     PP   +   P P P
Sbjct: 87  ASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKP 135



 Score = 31.8 bits (72), Expect = 0.61
 Identities = 15/53 (28%), Positives = 17/53 (32%), Gaps = 5/53 (9%)

Query: 24  PNVQAAPPPPP-----PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P V++ P   P     P P  PP  P  G  P      P P P       P  
Sbjct: 97  PPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKPVAEPAAAPTG 149



 Score = 31.0 bits (70), Expect = 0.92
 Identities = 12/45 (26%), Positives = 13/45 (28%), Gaps = 3/45 (6%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
               P   P     P P+     PP    P  P P    P P P 
Sbjct: 99  VESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTP---KPEPKPV 140


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 32.4 bits (74), Expect = 0.43
 Identities = 13/69 (18%), Positives = 17/69 (24%), Gaps = 1/69 (1%)

Query: 3   YMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
            +    KR    +  SN+          P  PP PP P P  +     P          P
Sbjct: 45  ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPP-PLPVDLGAPVLPDQQVEEAKDQP 103

Query: 63  MMGTPPPPP 71
                    
Sbjct: 104 RRLRAAELA 112


>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720).  This
          domain is found in different combinations with cortical
          patch components EF hand, SH3 and ENTH and is therefore
          likely to be involved in cytoskeletal processes. This
          family contains many hypothetical proteins.
          Length = 73

 Score = 29.7 bits (67), Expect = 0.47
 Identities = 13/60 (21%), Positives = 13/60 (21%), Gaps = 2/60 (3%)

Query: 26 VQAAPPPPPPPPPLPPPP--PMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
                 P    PL P P   M               P   G  P     M T    L P
Sbjct: 9  QPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQTGFQPQAGQQMPTGTGPLQP 68


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
          domain forms a compact bundle of four antiparallel
          alpha-helices, which are arranged in a left-handed
          topology. Binding of F-actin to the F-actin binding
          domain may result in cytoplasmic retention and
          subcellular distribution of the protein, as well as
          possible inhibition of protein function.
          Length = 179

 Score = 31.6 bits (71), Expect = 0.48
 Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 38 PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
          P+PP  P   +T  P+ GTPP P P+  T P P  M   T
Sbjct: 6  PVPPAVPKPQSTAKPV-GTPPSPVPLPSTSPSPSKMANGT 44


>gnl|CDD|221598 pfam12484, PE_PPE_C, Polymorphic PE/PPE proteins C terminal.
          This domain family is found in bacteria, and is
          approximately 90 amino acids in length. The family is
          found in association with pfam00823. There is a
          conserved SVP sequence motif. There is a single
          completely conserved residue W that may be functionally
          important. The proteins in this family are PE/PPE
          proteins implicated in immunostimulation and virulence.
          Length = 81

 Score = 30.0 bits (68), Expect = 0.51
 Identities = 10/57 (17%), Positives = 11/57 (19%)

Query: 40 PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIP 96
          P         P   +            P  P            P Y  R  V P  P
Sbjct: 25 PAVSAAAAALPGAGVAAAAEAAAGAVLPGMPGAAAAGRGSAGFPRYGFRPTVMPRPP 81



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 11/49 (22%), Positives = 14/49 (28%), Gaps = 2/49 (4%)

Query: 22 SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP--PPPPPMMGTPP 68
          +LP    A         + P  P          G P     P +M  PP
Sbjct: 33 ALPGAGVAAAAEAAAGAVLPGMPGAAAAGRGSAGFPRYGFRPTVMPRPP 81


>gnl|CDD|219053 pfam06482, Endostatin, Collagenase NC10 and Endostatin.  NC10
           stands for Non-helical region 10 and is taken from human
           COL15A1. A mutation in this region in human COL18A1 is
           associated with an increased risk of prostrate cancer.
           This domain is cleaved from the precursor and forms
           endostatin. Endostatin is a key tumour suppressor and
           has been used highly successfully to treat cancer. It is
           a potent angiogenesis inhibitor. Endostatin also binds a
           zinc ion near the N-terminus; this is likely to be of
           structural rather than functional importance according
           to.
          Length = 291

 Score = 32.0 bits (73), Expect = 0.59
 Identities = 12/70 (17%), Positives = 15/70 (21%), Gaps = 5/70 (7%)

Query: 15  ETYSNIFSLPNVQAAPPP-PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
                +      Q  P    P   P  P PP +             P P    P   P  
Sbjct: 50  GELVPLPGTTATQPPPVVLTPWSDPRLPDPPHLPDPQTHSATAHRNPHP----PLNSPAR 105

Query: 74  MGTTPHVLGP 83
           +        P
Sbjct: 106 IAHVHQDFQP 115


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
          virus readthrough protein. This is generated via a
          readthrough of open reading frame 3 a coat protein
          allowing transcription of open reading frame 5 to give
          an extended coat protein with a large c-terminal
          addition or read through domain. The readthrough
          protein is thought to play a role in the circulative
          aphid transmission of potato leaf roll virus. Also in
          the family is open reading frame 6 from beet western
          yellows virus and potato leaf roll virus both
          luteovirus and an unknown protein from cucurbit
          aphid-borne yellows virus a closterovirus.
          Length = 460

 Score = 32.0 bits (73), Expect = 0.67
 Identities = 14/37 (37%), Positives = 14/37 (37%), Gaps = 8/37 (21%)

Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
          V   P P P P P PPPP      P P    P P   
Sbjct: 1  VDGEPGPKPGPTPTPPPP-----APTP---EPTPAKH 29


>gnl|CDD|217439 pfam03229, Alpha_GJ, Alphavirus glycoprotein J. 
          Length = 126

 Score = 30.5 bits (68), Expect = 0.69
 Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 44 PMMGTTPPPMMGTPPP-PPPMMGTPPPPPPMMGTTPHVLGPL 84
             G+T     G PP    P   TP P PP     P V+G L
Sbjct: 52 AAAGSTIAATAGAPPTDSTPAGETPGPFPPTDFALPLVIGGL 93


>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding.  The Creb binding domain
          assumes a structure comprising of three alpha-helices
          which pack in a bundle, exposing a hydrophobic groove
          between alpha-1 and alpha-3 within which complimentary
          domains found in the protein 'activator for thyroid
          hormone and retinoid receptors' (ACTR) can dock.
          Docking of these domains is required for the
          recruitment of RNA polymerase II and the basal
          transcription machinery.
          Length = 104

 Score = 30.0 bits (67), Expect = 0.71
 Identities = 19/70 (27%), Positives = 23/70 (32%), Gaps = 5/70 (7%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMM---GTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYW 86
           P   P  PLP    M    P P+M          P  G   P       +P+ L  L  
Sbjct: 2  QPGQWPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLVQPGISRGIVSPNALQDLL- 60

Query: 87 TRLIVSPEIP 96
           R + SP  P
Sbjct: 61 -RTLKSPSSP 69


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 31.8 bits (72), Expect = 0.72
 Identities = 24/101 (23%), Positives = 31/101 (30%), Gaps = 3/101 (2%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTPHVLGPLYWT 87
           PP  P P  P       ++   +   P   P    T    PPP  M   +P+ +      
Sbjct: 449 PPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKP 508

Query: 88  RLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQ 128
               SP  P          E V + N    T L   Q  AQ
Sbjct: 509 PTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQ 549


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 32.0 bits (73), Expect = 0.73
 Identities = 13/51 (25%), Positives = 14/51 (27%), Gaps = 4/51 (7%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
                P  P    P P P   + P P     PP P     P   PP     
Sbjct: 137 AEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPA----PAAKPPPTPVA 183



 Score = 30.0 bits (68), Expect = 2.5
 Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 2/55 (3%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTP 78
           P   AA       P  P       T  PP    P PP       P P      TP
Sbjct: 129 PAAAAAAKAEKTTPEKPKAAAP--TPEPPAASKPTPPAAAKPPEPAPAAKPPPTP 181



 Score = 29.7 bits (67), Expect = 3.9
 Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 1/63 (1%)

Query: 8   AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLP-PPPPMMGTTPPPMMGTPPPPPPMMGT 66
           A             + P    A  P P PP    P PP     P P     PPP P+   
Sbjct: 126 AAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARA 185

Query: 67  PPP 69
            P 
Sbjct: 186 DPR 188



 Score = 28.9 bits (65), Expect = 7.0
 Identities = 14/48 (29%), Positives = 15/48 (31%), Gaps = 1/48 (2%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPP-PMMGTPPPPPPMMGTPPPPPP 72
           +     PP   P          TTP  P    P P PP    P PP  
Sbjct: 118 IDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAA 165


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
          family of proteins is found in bacteria, eukaryotes and
          viruses. Proteins in this family are typically between
          126 and 281 amino acids in length. The function of this
          domain is unknown. The structure of this domain has
          been found to contain five helices with a long flexible
          loop between helices one and two.
          Length = 124

 Score = 30.4 bits (69), Expect = 0.73
 Identities = 9/37 (24%), Positives = 10/37 (27%), Gaps = 4/37 (10%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPM 63
           AA P P   P   P              TPP    +
Sbjct: 14 AAAAPAPAAAPATAPAAA----AAAAPAATPPAAASV 46



 Score = 28.5 bits (64), Expect = 3.8
 Identities = 9/46 (19%), Positives = 10/46 (21%), Gaps = 13/46 (28%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
          AA  P P   P   P                       TPP    +
Sbjct: 14 AAAAPAPAAAPATAPAA-------------AAAAAPAATPPAAASV 46



 Score = 27.7 bits (62), Expect = 6.7
 Identities = 7/35 (20%), Positives = 7/35 (20%), Gaps = 3/35 (8%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
           A    P P   P   P              PP  
Sbjct: 12 EAAAAAPAPAAAPATAP---AAAAAAAPAATPPAA 43


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 31.6 bits (72), Expect = 0.75
 Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 3/51 (5%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
             P    A P P  PP     P    T  P       P  P+      P  
Sbjct: 128 QQPPATTAQPQPVTPPRQTTAPVQPQTPAPV---RTQPAAPVTQAVEAPKV 175



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 14/57 (24%), Positives = 15/57 (26%), Gaps = 2/57 (3%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP--PPPPMMGTPPPPPP 72
           S +      Q   PP     P P  PP   T P       P    P    T     P
Sbjct: 117 STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 31.9 bits (72), Expect = 0.82
 Identities = 12/42 (28%), Positives = 13/42 (30%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P     PP PP  P    +        PP PP    P   P 
Sbjct: 355 PDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPN 396


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
          (LMP2) protein.  This family consists of several
          Gammaherpesvirus latent membrane protein (LMP2)
          proteins. Epstein-Barr virus is a human
          Gammaherpesvirus that infects and establishes latency
          in B lymphocytes in vivo. The latent membrane protein 2
          (LMP2) gene is expressed in latently infected B cells
          and encodes two protein isoforms, LMP2A and LMP2B, that
          are identical except for an additional N-terminal 119
          aa cytoplasmic domain which is present in the LMP2A
          isoform. LMP2A is thought to play a key role in either
          the establishment or the maintenance of latency and/or
          the reactivation of productive infection from the
          latent state. The significance of LMP2B and its role in
          pathogenesis remain unclear.
          Length = 489

 Score = 31.4 bits (71), Expect = 0.96
 Identities = 16/74 (21%), Positives = 19/74 (25%), Gaps = 19/74 (25%)

Query: 14 KETYSNIFSLPNVQAAPP-PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP---- 68
             Y + F     +  PP P     P   PPP  G              P+    P    
Sbjct: 28 NPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGG----SNGDRHGGYQPLGQQDPSLYA 83

Query: 69 ----------PPPP 72
                    PPPP
Sbjct: 84 GLGQNGGGGLPPPP 97



 Score = 29.1 bits (65), Expect = 5.1
 Identities = 10/49 (20%), Positives = 12/49 (24%), Gaps = 8/49 (16%)

Query: 31 PPPPPPPPLP--------PPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
             P     P        P  P    +     G P P      +  PPP
Sbjct: 11 AGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59


>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
          Length = 537

 Score = 31.4 bits (71), Expect = 0.96
 Identities = 10/42 (23%), Positives = 16/42 (38%), Gaps = 2/42 (4%)

Query: 6   KRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMG 47
           K  +       ++  +  P V   P PPPP  P  P   ++ 
Sbjct: 61  KDWEGSPDTLLWNKRYHHPIV--TPLPPPPSSPSLPSSLLLD 100


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 31.6 bits (71), Expect = 0.99
 Identities = 10/50 (20%), Positives = 15/50 (30%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
                 A P      P  P      G  PP ++ + P    M+ +  P  
Sbjct: 250 LPRQNSAGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQS 299


>gnl|CDD|152353 pfam11918, DUF3436, Domain of unknown function (DUF3436).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is about 50
          amino acids in length. This domain is found associated
          with pfam03572. This domain has two conserved sequence
          motifs: DPRL and SYEP.
          Length = 55

 Score = 28.3 bits (64), Expect = 0.99
 Identities = 7/18 (38%), Positives = 8/18 (44%)

Query: 24 PNVQAAPPPPPPPPPLPP 41
          P+   AP   P  P L P
Sbjct: 23 PSYLEAPQQAPALPNLTP 40


>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
           protease mycosin.  Members of this family are
           subtilisin-related serine proteases, found strictly in
           the Actinobacteria and associated with type VII
           secretion operons. The designation mycosin is used for
           members from Mycobacterium [Protein fate, Protein and
           peptide secretion and trafficking, Protein fate, Protein
           modification and repair].
          Length = 350

 Score = 31.1 bits (71), Expect = 1.0
 Identities = 12/39 (30%), Positives = 13/39 (33%), Gaps = 11/39 (28%)

Query: 34  PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
            PP    P  P            P P  P+    PPPPP
Sbjct: 286 LPPEDGRPLRP-----------APAPARPVAAPAPPPPP 313



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 6/42 (14%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP 67
           +    PP    P  P P P       P+   P PPPP   TP
Sbjct: 282 LTGELPPEDGRPLRPAPAPA-----RPVAA-PAPPPPPDDTP 317


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 12/65 (18%), Positives = 15/65 (23%), Gaps = 6/65 (9%)

Query: 11  RKKKETYSNIFSLPNVQAAPPPPPPPPPLPPP---PPMMGTTPPPMMGTPPPPPPMMGTP 67
           + K  T           +AP P P PP  P                  +P         P
Sbjct: 76  KPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEE---KSPSEESATATAP 132

Query: 68  PPPPP 72
             P  
Sbjct: 133 ESPST 137


>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 6/36 (16%), Positives = 7/36 (19%)

Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
                    P    TP P+      P    G    
Sbjct: 52 AEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDST 87



 Score = 27.8 bits (62), Expect = 3.8
 Identities = 5/36 (13%), Positives = 9/36 (25%), Gaps = 3/36 (8%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
          +A+      P  +  P P+      P          
Sbjct: 53 EASAAAAQAPYQVATPTPVQSQRVDP---AAASGQD 85


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 14/47 (29%), Positives = 18/47 (38%)

Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          V  A   PP     PP P +     P  +  PP   P++   P P P
Sbjct: 53 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKP 99



 Score = 29.7 bits (67), Expect = 3.1
 Identities = 13/45 (28%), Positives = 15/45 (33%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           +      PPP P+  P P     P P    P   P     P P P
Sbjct: 59  EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKP 103


>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
           Reviewed.
          Length = 275

 Score = 31.0 bits (71), Expect = 1.2
 Identities = 18/60 (30%), Positives = 19/60 (31%), Gaps = 25/60 (41%)

Query: 24  PNVQAAPPPPP-----------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P  +   PPPP              P  PP P           T P P P    PPP PP
Sbjct: 221 PECEDQAPPPPGDGCGAELASWSDAPFEPPKP----------TTKPKPKP----PPPLPP 266


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 31.1 bits (70), Expect = 1.3
 Identities = 19/88 (21%), Positives = 21/88 (23%), Gaps = 18/88 (20%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPP----MMGTPPPPPPMM------GTPPPPPPM 73
           P      P PP        P               G    P P        G P P PP 
Sbjct: 268 PAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPP- 326

Query: 74  MGTTPHVLGPLYWTRLIVSPEIPSDTTP 101
               P    P  W      P + + T P
Sbjct: 327 -RPAPDADRPEGW------PSLEAITFP 347



 Score = 29.2 bits (65), Expect = 4.4
 Identities = 13/42 (30%), Positives = 14/42 (33%)

Query: 29  APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           A    P P P  P P        P +     PPP   TP  P
Sbjct: 316 AAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVP 357


>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
          inhibitor.
          Length = 553

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 10/32 (31%), Positives = 14/32 (43%)

Query: 20 IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPP 51
          +F  P+  +        P   PPPP   +TPP
Sbjct: 14 LFHSPSSSSNRHHHHHTPSPSPPPPSSPSTPP 45


>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
          Length = 333

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 10/38 (26%), Positives = 12/38 (31%)

Query: 32  PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
           P P  P   P  P +   P P+   P P         P
Sbjct: 136 PQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEP 173



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 12/49 (24%), Positives = 14/49 (28%), Gaps = 3/49 (6%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P   A  P   P     P  P+     P      P P P+   P P   
Sbjct: 121 PAQPAPQPVQQPAYQPQPEQPLQQPVSPQ---VAPAPQPVHSAPQPAQQ 166



 Score = 29.2 bits (66), Expect = 4.5
 Identities = 11/49 (22%), Positives = 13/49 (26%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           P  +     P  P   P P P+     P      P  P     P P   
Sbjct: 136 PQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184



 Score = 29.2 bits (66), Expect = 4.9
 Identities = 14/53 (26%), Positives = 15/53 (28%), Gaps = 2/53 (3%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
            A P  P   PP    PP     P P    P P       P P  P+      
Sbjct: 98  SAQPRQPVQQPPEAQVPP--QHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSP 148



 Score = 28.8 bits (65), Expect = 5.3
 Identities = 11/47 (23%), Positives = 15/47 (31%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
              P   P  P + P P  + + P P      P  P+    P P   
Sbjct: 138 PEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAE 184


>gnl|CDD|233055 TIGR00617, rpa1, replication factor-a protein 1 (rpa1).  All
           proteins in this family for which functions are known
           are part of a multiprotein complex made up of homologs
           of RPA1, RPA2 and RPA3 that bind ssDNA and function in
           the recognition of DNA damage for nucleotide excision
           repairThis family is based on the phylogenomic analysis
           of JA Eisen (1999, Ph.D. Thesis, Stanford University)
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 608

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 8/69 (11%)

Query: 19  NIFSLPNVQAAPPPPPPPPPLPPPP-------PMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            + S    Q     P   P  PP                 P  G   PP P  G+     
Sbjct: 123 YLDSWHEEQVLASKPATNPANPPNAKAPKNEVASYNNAANPERGN-APPAPNSGSTRRVM 181

Query: 72  PMMGTTPHV 80
           P+   +P+ 
Sbjct: 182 PIASLSPYQ 190


>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518).  This
          domain, which is usually found tandemly repeated, is
          found various receptor co-activating proteins.
          Length = 56

 Score = 27.9 bits (62), Expect = 1.4
 Identities = 15/53 (28%), Positives = 16/53 (30%), Gaps = 3/53 (5%)

Query: 22 SLPNVQAAPPPPPPPP---PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           +P     P  P  PP   P PP   M     P       P  PMM     P 
Sbjct: 1  GMPGQMGNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSPQSPMMSPRMGPS 53


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 12/27 (44%), Positives = 13/27 (48%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPP 44
           S +FS P     PPP P   P PP  P
Sbjct: 632 SPLFSPPPAARFPPPDPALSPEPPALP 658



 Score = 28.8 bits (64), Expect = 8.4
 Identities = 24/106 (22%), Positives = 25/106 (23%), Gaps = 29/106 (27%)

Query: 23  LPNVQAAPPPPPPPPPLPPP--------------------------PPMMGTTPPPMMGT 56
            P     PPPPP   P   P                              G  PPP    
Sbjct: 435 APRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPAA 494

Query: 57  PPPPP---PMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDT 99
            P P      MG  PP  P        L P +       PE   D 
Sbjct: 495 APSPATYYTRMGGGPPRLPPRNRATETLRPDWGPPAAAPPEQMEDP 540


>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region.  The region
           described in this family is found towards the N-terminus
           of various eukaryotic fork head/HNF-3-related
           transcription factors (which contain the pfam00250
           domain). These proteins play key roles in embryogenesis,
           maintenance of differentiated cell states, and
           tumorigenesis.
          Length = 137

 Score = 29.6 bits (66), Expect = 1.5
 Identities = 12/44 (27%), Positives = 12/44 (27%)

Query: 33  PPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
               P        MG      MGT   P   MG   P P   G 
Sbjct: 61  NGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGG 104



 Score = 29.6 bits (66), Expect = 1.5
 Identities = 11/61 (18%), Positives = 15/61 (24%), Gaps = 7/61 (11%)

Query: 25  NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPP-------PPPPMMGTPPPPPPMMGTT 77
           +  +     P       P  MMG +      +P        P P   G    P   M   
Sbjct: 56  SSSSMNGMSPGYGGAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRA 115

Query: 78  P 78
            
Sbjct: 116 S 116


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
          Papillomavirus proteins, E4, coded for by ORF4. A
          splice variant, E1--E4, exists but neither the function
          of E4 or E1--E4 is known.
          Length = 95

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 9/15 (60%), Positives = 9/15 (60%)

Query: 30 PPPPPPPPPLPPPPP 44
           PP PPPPP P  P 
Sbjct: 21 TPPRPPPPPPPWAPK 35



 Score = 27.2 bits (61), Expect = 5.3
 Identities = 9/15 (60%), Positives = 9/15 (60%)

Query: 30 PPPPPPPPPLPPPPP 44
            PP PPPP PP  P
Sbjct: 20 TTPPRPPPPPPPWAP 34



 Score = 26.9 bits (60), Expect = 7.0
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query: 29 APPPPPPPPPLPPPPP 44
               PP PP PPPP 
Sbjct: 17 HTYTTPPRPPPPPPPW 32



 Score = 26.9 bits (60), Expect = 9.1
 Identities = 10/20 (50%), Positives = 10/20 (50%)

Query: 23 LPNVQAAPPPPPPPPPLPPP 42
          L      P PPPPPPP  P 
Sbjct: 16 LHTYTTPPRPPPPPPPWAPK 35


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 8  AKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPM 53
           + R++K + S++   P++   P    P PP    PP   T  P +
Sbjct: 32 PRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQTPQL 77



 Score = 28.9 bits (65), Expect = 5.9
 Identities = 10/29 (34%), Positives = 10/29 (34%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPP 58
           P   P P L  P P   T PP      P
Sbjct: 47 TPSILPLPKLSSPSPPSVTLPPAATTQTP 75



 Score = 28.9 bits (65), Expect = 6.9
 Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 1/67 (1%)

Query: 18 SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
          S+  S  +  + P    P P    P        P ++  P    P   +   PP     T
Sbjct: 15 SSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQT 74

Query: 78 PHVLGPL 84
          P  L PL
Sbjct: 75 PQ-LNPL 80


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 30.6 bits (69), Expect = 1.6
 Identities = 19/63 (30%), Positives = 20/63 (31%), Gaps = 5/63 (7%)

Query: 17  YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMM-----GTPPPPPPMMGTPPPPP 71
           YS     P     PP PP  P  P  P        P         P P  P+    P PP
Sbjct: 119 YSAYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPP 178

Query: 72  PMM 74
           PM 
Sbjct: 179 PMS 181


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 30.9 bits (70), Expect = 1.6
 Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 7/55 (12%)

Query: 24  PNVQAAPPPPPPPP-------PLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           P+     PPPP          P P   P  G   P     PP  P     P   P
Sbjct: 90  PSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARP 144



 Score = 30.6 bits (69), Expect = 2.3
 Identities = 15/63 (23%), Positives = 18/63 (28%), Gaps = 4/63 (6%)

Query: 23  LPNVQAAPP---PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPH 79
           LP  + A P     P P   P      G      +G PP  P        P       P+
Sbjct: 139 LPTARPAYPAYQQRPEPGAWPRAADDYGWQQQR-LGFPPRAPYASPASYAPEQERDREPY 197

Query: 80  VLG 82
             G
Sbjct: 198 DAG 200


>gnl|CDD|181205 PRK08040, PRK08040, putative semialdehyde dehydrogenase;
           Provisional.
          Length = 336

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 22/98 (22%)

Query: 147 RLIDSKRS--QNVGILAQSLHIEFSEIESAIFNMDASVVSLEALQQIYDVRATDEEINLI 204
           RL+D  R   Q+ G     L I  S ++S +F   A +V  EAL+ +    A +E     
Sbjct: 214 RLVDQVRKILQDEG-----LPISVSCVQSPVFYGHAQMVHFEALRPL----AAEE----A 260

Query: 205 RAHLASNSDLQL----DKPEQFLADLADIPHFSERIAC 238
           R  L    D+ L    D P Q + D +  PH S  I C
Sbjct: 261 RDALEQGEDIVLSEENDYPTQ-VGDASGNPHLS--IGC 295


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 1.7
 Identities = 13/36 (36%), Positives = 14/36 (38%), Gaps = 1/36 (2%)

Query: 37  PPLPPPPPMMGTTP-PPMMGTPPPPPPMMGTPPPPP 71
           PP P P  M+G  P  P  G    PP   G    P 
Sbjct: 216 PPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPR 251


>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21). 
          Length = 326

 Score = 30.5 bits (69), Expect = 1.8
 Identities = 14/51 (27%), Positives = 14/51 (27%), Gaps = 11/51 (21%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP 68
                L   QA   PP P  P PP               PP P P     P
Sbjct: 287 DAYNQLLAGQAFNQPPDPQGPAPPAELA-----------PPAPAPPPPANP 326


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 30.8 bits (69), Expect = 1.8
 Identities = 19/57 (33%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMM--GTPPPPPPMMGTPPPPPPMMGTTPHVLG 82
            AP     P    P  P     P P +     PP PP  G PP P P M     V G
Sbjct: 408 TAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPG 464



 Score = 30.8 bits (69), Expect = 1.8
 Identities = 25/73 (34%), Positives = 28/73 (38%), Gaps = 10/73 (13%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTP---PPMMGTPPPP--PPMMGTPPPPPPMMG--- 75
           P   AAP P  P    PP PP  G  P   P M    P P  P  + +    PP +G   
Sbjct: 424 PAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPS 483

Query: 76  -TTPHV-LGPLYW 86
            T  H   GP  W
Sbjct: 484 DTAEHTPSGPRTW 496



 Score = 29.3 bits (65), Expect = 5.2
 Identities = 18/49 (36%), Positives = 18/49 (36%), Gaps = 8/49 (16%)

Query: 26  VQAAPPPPPPPPPLPP---PPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
             A P P   P P  P    PP      PP  G PP P P M    P P
Sbjct: 420 APATPAPSAAPSPRVPWDDAPPA-----PPRSGIPPRPAPRMPEASPVP 463


>gnl|CDD|235166 PRK03837, PRK03837, transcriptional regulator NanR; Provisional.
          Length = 241

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 4/43 (9%)

Query: 195 RATDEEINLIRAHLASNSDLQLDKPEQFLADLADIPHFSERIA 237
            ATDE+I L+R  L  NS   L     F+   +D+  F   IA
Sbjct: 125 HATDEQIALLRKALERNSQ-SLGDNAAFIR--SDM-EFHRVIA 163


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 346 DVDRAASVVFDDIHSQL-STLAKKLDAVTISMNKVVQESKPDHMEPFRTKMESCVKTG 402
           D+D A   +   IH +L S +   L+A+   + +V+QE K D +E     ++   + G
Sbjct: 215 DLDGAGEHLGGRIHDELKSHVLPVLEAIV-ELGQVLQEMK-DGLESVNKTLKELREGG 270


>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15.  The matrix protein,
           p15, is encoded by the gag gene. MA is involved in
           pathogenicity.
          Length = 129

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 12/42 (28%), Positives = 17/42 (40%), Gaps = 5/42 (11%)

Query: 32  PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
           PPP   P   PP ++  +  P   +P P       PP P  +
Sbjct: 92  PPPWVKPFVDPPKVLLPSSTPKPVSPSPS-----APPRPSSL 128


>gnl|CDD|236886 PRK11249, katE, hydroperoxidase II; Provisional.
          Length = 752

 Score = 30.8 bits (70), Expect = 2.0
 Identities = 13/52 (25%), Positives = 14/52 (26%), Gaps = 8/52 (15%)

Query: 30 PPPPPPPPPLPPP----PPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
            P     PL       P M    P      P P P    TPP   P    +
Sbjct: 4  KNPHQHQSPLHDSSEAKPGMDSLAPEDGSHRPAPEP----TPPGAQPTAPGS 51


>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2.  This family of
          proteins is found in eukaryotes. Proteins in this
          family are typically between 165 and 237 amino acids in
          length. CD99L2 and CD99 are involved in
          trans-endothelial migration of neutrophils in vitro and
          in the recruitment of neutrophils into inflamed
          peritoneum.
          Length = 154

 Score = 29.4 bits (66), Expect = 2.1
 Identities = 10/49 (20%), Positives = 11/49 (22%), Gaps = 6/49 (12%)

Query: 28 AAPPPPPPPPPLPPPPPM----MGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          A  P   P  P  P        +             P P     P P P
Sbjct: 12 AMKPTKKPATPKKPSKGDDDFDLEDALDGGNDGGAGPKP--KPRPQPDP 58


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 9/51 (17%), Positives = 14/51 (27%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
           +   P   PP LPP         P           ++      PP+  + 
Sbjct: 13 SSDDSPSLEPPELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISR 63


>gnl|CDD|234012 TIGR02784, addA_alphas, double-strand break repair helicase AddA,
           alphaproteobacterial type.  AddAB, also called RexAB,
           substitutes for RecBCD in several bacterial lineages.
           These DNA recombination proteins act before synapse and
           are particularly important for DNA repair of
           double-stranded breaks by homologous recombination. The
           term AddAB is used broadly, with AddA homologous between
           the alphaproteobacteria (as modeled here) and the
           Firmicutes, while the partner AddB proteins show no
           strong homology across the two groups of species [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 1135

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 7/35 (20%), Positives = 8/35 (22%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
             P        L   P  +    P     P P  P
Sbjct: 917 GLPVEAAQTDTLEALPDWLRAPAPAEPALPRPLAP 951



 Score = 29.7 bits (67), Expect = 4.1
 Identities = 6/35 (17%), Positives = 8/35 (22%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPP 62
                 P   P+           P  +  P P  P
Sbjct: 909 TRRDWDPVGLPVEAAQTDTLEALPDWLRAPAPAEP 943


>gnl|CDD|226435 COG3921, COG3921, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 300

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 10/44 (22%), Positives = 12/44 (27%)

Query: 32  PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMG 75
           P   P      P      P  +     P  P+    PP PP   
Sbjct: 71  PKGLPILALAGPVGELGQPMDLPAPANPGDPLALPEPPSPPTKP 114


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 14/59 (23%), Positives = 16/59 (27%), Gaps = 13/59 (22%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP-------------PPPM 73
            +P   P PP   P PP   +T P               P               PPPM
Sbjct: 121 ISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPSFQQPPPM 179


>gnl|CDD|193414 pfam12941, HCV_NS5a_C, HCV NS5a protein C-terminal region.  This is
           a family of proteins found in the hepatitis C virus.
           This family contains the C-terminal region of the NS5A
           protein. CC The molecular function of the non-structural
           5a protein is uncertain. The NS5a protein is
           phosphorylated when expressed in mammalian cells. It is
           thought to interact with the ds RNA dependent
           (interferon inducible) kinase PKR.
          Length = 244

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 22/55 (40%), Positives = 25/55 (45%), Gaps = 15/55 (27%)

Query: 30  PPPPPPPPPLP----PP--PPMMGT------TPPPMMGTPPPPPPMMGTPPPPPP 72
           PP P  PP LP    P   PP++ T       PP + G   PP    G PP PPP
Sbjct: 99  PPRPKFPPALPIWARPDYNPPLLETWKAPDYEPPTVHGCALPPA---GLPPVPPP 150



 Score = 28.2 bits (63), Expect = 9.4
 Identities = 9/19 (47%), Positives = 10/19 (52%)

Query: 24  PNVQAAPPPPPPPPPLPPP 42
           P V     PP   PP+PPP
Sbjct: 132 PTVHGCALPPAGLPPVPPP 150


>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
          Length = 136

 Score = 28.8 bits (65), Expect = 2.6
 Identities = 10/38 (26%), Positives = 11/38 (28%), Gaps = 6/38 (15%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP 59
              N +  P  P    P PP         P   G  PP
Sbjct: 99  QTGNGRPTPSSPDAAGPAPPDA------TPLAPGERPP 130


>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut.  This
          domain is often found at the very N-terminal of
          argonaut-like proteins.
          Length = 102

 Score = 28.4 bits (63), Expect = 2.6
 Identities = 10/55 (18%), Positives = 13/55 (23%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPL 84
              PP P +P             + T P P     +  PP             L
Sbjct: 31 STGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPESSSLQVTQQFQQL 85


>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 155

 Score = 29.0 bits (66), Expect = 2.7
 Identities = 11/55 (20%), Positives = 12/55 (21%), Gaps = 5/55 (9%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP-----PPMMGT 76
           AA P  P        P              P   P              PM+GT
Sbjct: 35 AAAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGT 89



 Score = 28.6 bits (65), Expect = 4.5
 Identities = 10/48 (20%), Positives = 11/48 (22%), Gaps = 5/48 (10%)

Query: 24 PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP-----PPMMGT 66
          P  Q A   P    P             P              PM+GT
Sbjct: 42 PVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMVGT 89


>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein.  This family
           consists of transformer proteins from several Drosophila
           species and also from Ceratitis capitata (Mediterranean
           fruit fly). The transformer locus (tra) produces an RNA
           processing protein that alternatively splices the
           doublesex pre-mRNA in the sex determination hierarchy of
           Drosophila melanogaster.
          Length = 182

 Score = 29.2 bits (65), Expect = 2.7
 Identities = 21/72 (29%), Positives = 27/72 (37%), Gaps = 9/72 (12%)

Query: 9   KRRKKKETYSNIFSLPNVQAAPPPPPP---------PPPLPPPPPMMGTTPPPMMGTPPP 59
           +RR++  + S     P +   P P P          PPP P    M G  P  M+  P P
Sbjct: 97  ERRRRSRSRSRYSRTPRIITVPVPVPAADYPYAYGWPPPAPQFSGMQGAFPYGMLPRPVP 156

Query: 60  PPPMMGTPPPPP 71
           P       PP P
Sbjct: 157 PYFAPYPRPPAP 168


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.5 bits (66), Expect = 2.7
 Identities = 14/40 (35%), Positives = 15/40 (37%), Gaps = 2/40 (5%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
          PP  P P   PPP P     PP    T      +   PPP
Sbjct: 53 PPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLAL--QPPP 90



 Score = 28.4 bits (63), Expect = 7.3
 Identities = 14/40 (35%), Positives = 15/40 (37%)

Query: 21 FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
          F  P        PPP PP P  PP    T    +   PPP
Sbjct: 51 FCPPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPP 90


>gnl|CDD|178678 PLN03132, PLN03132, NADH dehydrogenase (ubiquinone) flavoprotein
          1; Provisional.
          Length = 461

 Score = 30.2 bits (68), Expect = 2.8
 Identities = 11/19 (57%), Positives = 11/19 (57%)

Query: 26 VQAAPPPPPPPPPLPPPPP 44
           QAA     P PP PPPPP
Sbjct: 7  TQAAATAATPQPPPPPPPP 25



 Score = 29.5 bits (66), Expect = 4.7
 Identities = 9/22 (40%), Positives = 9/22 (40%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGT 48
            A      P P PPPPP   T
Sbjct: 7  TQAAATAATPQPPPPPPPPEKT 28



 Score = 29.1 bits (65), Expect = 4.9
 Identities = 11/21 (52%), Positives = 11/21 (52%)

Query: 21 FSLPNVQAAPPPPPPPPPLPP 41
          FS      A  P PPPPP PP
Sbjct: 5  FSTQAAATAATPQPPPPPPPP 25


>gnl|CDD|190856 pfam04078, Rcd1, Cell differentiation family, Rcd1-like.  Two of
           the members in this family have been characterized as
           being involved in regulation of Ste11 regulated sex
           genes. Mammalian Rcd1 is a novel transcriptional
           cofactor that mediates retinoic acid-induced cell
           differentiation.
          Length = 262

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 301 LEILPKLKDVRSKDNSVTLLHFIVRTYLRNSENP 334
             +L K+ +  ++  S  LL  ++R YLR S+NP
Sbjct: 187 ATVLNKMVEQLTRQPSPRLLKHVIRCYLRLSDNP 220


>gnl|CDD|223067 PHA03380, PHA03380, transactivating tegument protein VP16;
           Provisional.
          Length = 432

 Score = 29.6 bits (67), Expect = 3.2
 Identities = 17/70 (24%), Positives = 22/70 (31%), Gaps = 17/70 (24%)

Query: 11  RKKKETYSNIFSLPNVQAAPPPPPPPPPL-------PPPPPMMGTTPPPMMGTPPPPPPM 63
           R K E YS             P  P   L           P  G+T   ++  P P   +
Sbjct: 332 RAKLEAYSEAH----------PSEPLSVLREHTYARRREKPNYGSTIEALLDPPSPSAVL 381

Query: 64  MGTPPPPPPM 73
            G P PP  +
Sbjct: 382 PGDPVPPLTV 391


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 13/52 (25%), Positives = 15/52 (28%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMM 74
           LP     PPP      LP        TP       PP    +   P  P + 
Sbjct: 62  LPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQ 113



 Score = 29.3 bits (66), Expect = 4.3
 Identities = 13/49 (26%), Positives = 15/49 (30%), Gaps = 3/49 (6%)

Query: 28 AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---GTPPPPPPM 73
           +P  PPP   LP P P    T                  G+P   PP 
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPA 99



 Score = 28.6 bits (64), Expect = 7.3
 Identities = 11/60 (18%), Positives = 14/60 (23%), Gaps = 4/60 (6%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP----PPPMMGTPPPPPPMMGTTPH 79
            +     PPP    P P P P        +          P       PP    +   P 
Sbjct: 49  WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPA 108


>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
          Length = 230

 Score = 29.3 bits (65), Expect = 3.3
 Identities = 17/54 (31%), Positives = 18/54 (33%), Gaps = 5/54 (9%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP-----PMMGTPPPPPPMMGTTPH 79
           P    P P P      G  PP       PP       + G PPP PP     PH
Sbjct: 132 PLHGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPPAPEPKPH 185



 Score = 28.6 bits (63), Expect = 5.7
 Identities = 25/97 (25%), Positives = 35/97 (36%), Gaps = 21/97 (21%)

Query: 27  QAAPPPPP---------PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP----- 72
              P PPP         PP       P +G     + G PPP PP     P P P     
Sbjct: 134 HGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPP----APEPKPHNGDA 189

Query: 73  ---MMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKEL 106
              ++GT   +LG    +   +  +I  D    W++L
Sbjct: 190 THGLLGTVACLLGTWEESFRQLVEDIQEDLDDYWRKL 226


>gnl|CDD|219490 pfam07627, PSCyt3, Protein of unknown function (DUF1588).  A
          region of similarity shared by several Rhodopirellula
          baltica cytochrome-like proteins that are predicted to
          be secreted. These proteins also match pfam07626 and
          pfam07624.
          Length = 102

 Score = 27.9 bits (63), Expect = 3.3
 Identities = 10/23 (43%), Positives = 10/23 (43%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTT 49
             PPPPP  P L P  P   T 
Sbjct: 37 DPPPPPPPNVPALEPDDPGALTL 59


>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
          Length = 955

 Score = 29.8 bits (66), Expect = 3.4
 Identities = 28/102 (27%), Positives = 38/102 (37%), Gaps = 28/102 (27%)

Query: 11  RKKKETYSNIFSLPNVQAAPPPPPP--PPPLPPPP----PMMGTTPPPMMGTPPPPPPMM 64
           +K K  Y   +  P+    P P  P  P P+ P P    P+    P P++  P  P P+ 
Sbjct: 547 QKGKNWYLTSYIEPDEPIIPDPVDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPV- 605

Query: 65  GTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIP----SDTTPL 102
             P P  P                +I  P IP    SDT P+
Sbjct: 606 -DPEPVDP----------------VIPDPTIPDIGQSDTPPI 630


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 29.9 bits (67), Expect = 3.5
 Identities = 30/176 (17%), Positives = 47/176 (26%), Gaps = 12/176 (6%)

Query: 25  NVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP-PPPPPMMGTPPPPPPM-MGTTPHVLG 82
             +  P   P P      P     TP   + TP         TP   P M          
Sbjct: 262 RQKNTPAQTPMPERSWQTPA---QTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKP 318

Query: 83  PLYWTRLIVSPEIPSDTTPLWKELEEVPIN--NLEEFTELFSRQVTAQRPVTRKRQQKSS 140
              +T   +  E           L +      +     +   R+V      +    +  S
Sbjct: 319 EPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDS 378

Query: 141 KVQNVARLIDSKRSQNVGILAQSLHI---EFSEIESAIFNMDASVVSLEALQQIYD 193
               V    + KR +        L +   E S+       M+  V    A+Q+ YD
Sbjct: 379 DEDEVDYEKERKRRREED--KNFLRLKALELSKYAGVNERMEKIVQVTRAMQRTYD 432


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 29.6 bits (66), Expect = 3.5
 Identities = 19/61 (31%), Positives = 23/61 (37%), Gaps = 6/61 (9%)

Query: 23 LPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTP---PPPPPMMGTTPH 79
          L  + A  PP   P P P P   +     P+    P  P  +  P   P P P    TPH
Sbjct: 12 LAGLVANVPPAAAPTPQPNP---VIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPH 68

Query: 80 V 80
          V
Sbjct: 69 V 69


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 29.5 bits (66), Expect = 3.6
 Identities = 20/80 (25%), Positives = 21/80 (26%), Gaps = 29/80 (36%)

Query: 11  RKKKETYSNIFSLPNVQAAPPPP------------PPPPPLPPPPPMMGTTPPPMMGTPP 58
           R KKE              P PP            P P  +PP PP            P 
Sbjct: 52  RSKKEN------------EPAPPSTMIATNTKPFHPAPIDVPPDPP-----AAQEAVQPT 94

Query: 59  PPPPMMGTPPPPPPMMGTTP 78
            PP     P    P    TP
Sbjct: 95  APPSAQSEPERNEPRPEETP 114



 Score = 28.3 bits (63), Expect = 8.5
 Identities = 10/51 (19%), Positives = 14/51 (27%), Gaps = 7/51 (13%)

Query: 40  PPPPPMMGTTPPP-------MMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
            PP  M+ T   P       +   PP     +    PP        +   P
Sbjct: 60  APPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEPRP 110


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 29.5 bits (66), Expect = 3.9
 Identities = 11/51 (21%), Positives = 13/51 (25%), Gaps = 3/51 (5%)

Query: 20  IFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           I +L    + P   P P    P       T P     P          P P
Sbjct: 186 ILTLSVAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAAAK---AQAPAP 233


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 29.3 bits (66), Expect = 3.9
 Identities = 8/30 (26%), Positives = 10/30 (33%), Gaps = 5/30 (16%)

Query: 27  QAAPPPPPPPPPLPPPPPMMGTTPPPMMGT 56
             +P     P P P P P       P +G 
Sbjct: 85  TPSPTTVATPSPSPTPVP-----SSPAVGN 109



 Score = 28.5 bits (64), Expect = 6.3
 Identities = 11/29 (37%), Positives = 12/29 (41%), Gaps = 3/29 (10%)

Query: 48  TTPPPMMGTPPPPPPMMGTPPPPPPMMGT 76
            TP P     P P P   TP P  P +G 
Sbjct: 84  DTPSPTTVATPSPSP---TPVPSSPAVGN 109



 Score = 28.5 bits (64), Expect = 7.2
 Identities = 9/27 (33%), Positives = 12/27 (44%), Gaps = 3/27 (11%)

Query: 40  PPPPPMMGTTPPPMMGTPPPPPPMMGT 66
           P P  +   +P P   TP P  P +G 
Sbjct: 86  PSPTTVATPSPSP---TPVPSSPAVGN 109


>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein). 
          Length = 560

 Score = 29.8 bits (67), Expect = 3.9
 Identities = 20/72 (27%), Positives = 23/72 (31%), Gaps = 22/72 (30%)

Query: 32  PPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIV 91
           PP  P PL PP P                       P  PP  GT   +L  L    L +
Sbjct: 212 PPSQPHPLAPPLP----------------------RPTSPPGPGTGDRLLDLLQGAYLTL 249

Query: 92  SPEIPSDTTPLW 103
           +   PS T   W
Sbjct: 250 NATNPSLTQDCW 261


>gnl|CDD|218566 pfam05349, GATA-N, GATA-type transcription activator, N-terminal.
           GATA transcription factors mediate cell differentiation
           in a diverse range of tissues. Mutation are often
           associated with certain congenital human disorders. The
           six classical vertebrate GATA proteins, GATA-1 to
           GATA-6, are highly homologous and have two tandem zinc
           fingers. The classical GATA transcription factors
           function transcription activators. In lower metazoans
           GATA proteins carry a single canonical zinc finger. This
           family represents the N-terminal domain of the family of
           GATA transcription activators.
          Length = 177

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 12/60 (20%), Positives = 16/60 (26%)

Query: 18  SNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTT 77
           S + SLP +Q              P                P PP+  +    PPM    
Sbjct: 41  SMLPSLPYLQGCEASQQAHALAAHPGWSQAAAESSAFNPGSPHPPVGFSYSHSPPMSNGG 100


>gnl|CDD|180714 PRK06827, PRK06827, phosphoribosylpyrophosphate synthetase;
           Provisional.
          Length = 382

 Score = 29.5 bits (67), Expect = 3.9
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 241 FEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVI 276
            E D S  IA I   LN+  S+ + L  ++ ++K++
Sbjct: 342 IEVDMSKLIARIIDALNHDVSLSKLLDPTDIIQKLL 377


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is
          found in the aflatoxin regulatory protein (AflR) which
          is involved in the regulation of the biosynthesis of
          aflatoxin in the fungal genus Aspergillus. It occurs
          together with the fungal Zn(2)-Cys(6) binuclear cluster
          domain (pfam00172).
          Length = 275

 Score = 29.1 bits (65), Expect = 3.9
 Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 16 TYSNIFSLPNVQAAPPPPPPPPPLPPPP-PMMGTTPPPMMGTPPPPPPMMGT 66
          T S+    P VQ+ P    PP   P  P      +P     +PP  P + G+
Sbjct: 21 TTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGS 72


>gnl|CDD|150201 pfam09447, Cnl2_NKP2, Cnl2/NKP2 family protein.  This family
           includes the Cnl2 kinetochore protein.
          Length = 67

 Score = 26.9 bits (60), Expect = 4.0
 Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 11/55 (20%)

Query: 114 LEEFTELFSRQVTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEF 168
           LE+FTELF R+V           +   +++ + R +  +R + +  +A ++  EF
Sbjct: 22  LEQFTELFPRRV-----------RSEPQIRTLYRELQRQREKVLDKVAANIEQEF 65


>gnl|CDD|233381 TIGR01371, met_syn_B12ind,
           5-methyltetrahydropteroyltriglutamate--homocysteine
           S-methyltransferase.  This model describes the
           cobalamin-independent methionine synthase. A family of
           uncharacterized archaeal proteins is homologous to the
           C-terminal region of this family. That family is
           excluded from this model but, along with this family,
           belongs to Pfam model pfam01717 [Amino acid
           biosynthesis, Aspartate family].
          Length = 750

 Score = 29.6 bits (67), Expect = 4.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 167 EFSEIESAIFNMDASVVSLEA 187
           EF+EI  +I ++DA V+S+EA
Sbjct: 642 EFNEIIESIADLDADVISIEA 662


>gnl|CDD|235709 PRK06132, PRK06132, hypothetical protein; Provisional.
          Length = 359

 Score = 29.3 bits (66), Expect = 4.2
 Identities = 10/51 (19%), Positives = 12/51 (23%), Gaps = 3/51 (5%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            + P + A   P   P P P         P      P          P  P
Sbjct: 179 VAHPLLLA---PKAAPAPAPASDAAANGRPAAPAPAPAEAAWADADAPTGP 226


>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
          Length = 509

 Score = 29.4 bits (67), Expect = 4.2
 Identities = 6/24 (25%), Positives = 6/24 (25%), Gaps = 2/24 (8%)

Query: 24  PNVQAAPPPPP--PPPPLPPPPPM 45
             V AAP      P        P 
Sbjct: 373 IQVSAAPAAAAAAPAAKEEEKKPA 396


>gnl|CDD|152747 pfam12312, NeA_P2, Nepovirus subgroup A polyprotein.  This family
           of proteins is found in viruses. Proteins in this family
           are typically between 259 and 1110 amino acids in
           length. The family is found in association with
           pfam03688, pfam03689, pfam03391. This family is one of
           the polyproteins expressed by Nepoviruses in subgroup A.
          Length = 175

 Score = 28.8 bits (64), Expect = 4.3
 Identities = 11/19 (57%), Positives = 11/19 (57%)

Query: 24  PNVQAAPPPPPPPPPLPPP 42
           P     PPPPPPPP   PP
Sbjct: 113 PGGPCLPPPPPPPPIQKPP 131



 Score = 27.6 bits (61), Expect = 8.9
 Identities = 10/13 (76%), Positives = 11/13 (84%)

Query: 31  PPPPPPPPLPPPP 43
           PPPPPPPP+  PP
Sbjct: 119 PPPPPPPPIQKPP 131


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 29.2 bits (65), Expect = 4.3
 Identities = 17/66 (25%), Positives = 20/66 (30%), Gaps = 6/66 (9%)

Query: 26  VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPP------PPPPMMGTTPH 79
           V+A    PP      P      TTPP      P   P   TP       P  P     P 
Sbjct: 38  VEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQSPTTKQVPT 97

Query: 80  VLGPLY 85
            + P +
Sbjct: 98  EINPKF 103


>gnl|CDD|237859 PRK14908, PRK14908, glycyl-tRNA synthetase; Provisional.
          Length = 1000

 Score = 29.6 bits (67), Expect = 4.3
 Identities = 6/23 (26%), Positives = 8/23 (34%)

Query: 46  MGTTPPPMMGTPPPPPPMMGTPP 68
           +   PPP     P   P +  P 
Sbjct: 290 LLKVPPPPAAVTPKKTPDICGPE 312



 Score = 29.2 bits (66), Expect = 5.8
 Identities = 6/22 (27%), Positives = 8/22 (36%)

Query: 60  PPPMMGTPPPPPPMMGTTPHVL 81
           PPP     P   P +     +L
Sbjct: 294 PPPPAAVTPKKTPDICGPETLL 315


>gnl|CDD|177594 PHA03356, PHA03356, tegument protein UL11; Provisional.
          Length = 93

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 16/48 (33%), Positives = 19/48 (39%), Gaps = 3/48 (6%)

Query: 19 NIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPP---PPPM 63
          N +  P ++    PP   PP P  PP   T      G  P    PPPM
Sbjct: 45 NFYISPELRVVTQPPGRQPPRPSGPPSRHTRHWSPSGARPAPTFPPPM 92


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 29.6 bits (66), Expect = 4.4
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 31  PPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
           P   P     PPP       P  +   PPPP  + +PPP 
Sbjct: 271 PRTSPKRFRSPPPRKARGRSPRRLIRSPPPPGRLRSPPPL 310


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 29.3 bits (66), Expect = 4.6
 Identities = 15/68 (22%), Positives = 21/68 (30%), Gaps = 2/68 (2%)

Query: 35  PPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHV--LGPLYWTRLIVS 92
            PP        + T+    +  PP  P      P   P     P      P    R   S
Sbjct: 55  YPPRETGSGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASS 114

Query: 93  PEIPSDTT 100
           P + +D+T
Sbjct: 115 PRLGADST 122


>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
           homologous glycosyltransferases involved in the final
           stages of the biosynthesis of antibiotics vancomycin and
           related chloroeremomycin. Gtfs transfer sugar moieties
           from an activated NDP-sugar donor to the oxidatively
           cross-linked heptapeptide core of vancomycin group
           antibiotics. The core structure is important for the
           bioactivity of the antibiotics.
          Length = 401

 Score = 29.3 bits (66), Expect = 5.0
 Identities = 16/55 (29%), Positives = 19/55 (34%)

Query: 17  YSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            S++  L     A  PPPP  P        G    P  G PPP   +      PP
Sbjct: 187 GSDVPELYGFSPAVLPPPPDWPRFDLVTGYGFRDVPYNGPPPPELWLFLAAGRPP 241


>gnl|CDD|219979 pfam08704, GCD14, tRNA methyltransferase complex GCD14 subunit.
           GCD14 is a subunit of the tRNA methyltransferase complex
           and is required for 1-methyladenosine modification and
           maturation of initiator methionyl-tRNA.
          Length = 309

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 181 SVVSLEALQQIYDVR-ATDEEINLIRAHLASNSDLQLDKPEQ 221
            + ++E L + YDVR  +   I+L R  L  N   +++ P++
Sbjct: 228 EIETIEVLPRQYDVRTVSLPVIDLGRDTLEENERRRIEGPKE 269


>gnl|CDD|235367 PRK05222, PRK05222,
           5-methyltetrahydropteroyltriglutamate--homocysteine
           S-methyltransferase; Provisional.
          Length = 758

 Score = 29.3 bits (67), Expect = 5.1
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 167 EFSEIESAIFNMDASVVSLEA 187
           EF++I  AI  +DA V+S+E 
Sbjct: 648 EFNDIIDAIAALDADVISIET 668


>gnl|CDD|215382 PLN02712, PLN02712, arogenate dehydrogenase.
          Length = 667

 Score = 29.2 bits (65), Expect = 5.2
 Identities = 15/44 (34%), Positives = 17/44 (38%), Gaps = 1/44 (2%)

Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
          +PPPPP PPP    P    TT  P +           T   P P
Sbjct: 1  SPPPPPQPPPS-LFPTSKPTTMSPRLSLSIKSQSATATDKQPLP 43


>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein.  Rubella virus
           is an enveloped positive-strand RNA virus of the family
           Togaviridae. Virions are composed of three structural
           proteins: a capsid and two membrane-spanning
           glycoproteins, E2 and E1. During virus assembly, the
           capsid interacts with genomic RNA to form nucleocapsids.
           It has been discovered that capsid phosphorylation
           serves to negatively regulate binding of viral genomic
           RNA. This may delay the initiation of nucleocapsid
           assembly until sufficient amounts of virus glycoproteins
           accumulate at the budding site and/or prevent
           non-specific binding to cellular RNA when levels of
           genomic RNA are low. It follows that at a late stage in
           replication, the capsid may undergo dephosphorylation
           before nucleocapsid assembly occurs.
          Length = 300

 Score = 29.1 bits (64), Expect = 5.2
 Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 5/39 (12%)

Query: 27  QAAPPPPPPPPPLPPPPPMM-----GTTPPPMMGTPPPP 60
           Q   P P   PP  P PP M     G+ P P +G P  P
Sbjct: 90  QTPAPKPSRAPPQQPQPPRMQTGRGGSAPRPELGPPTNP 128


>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
          Length = 427

 Score = 29.2 bits (66), Expect = 5.3
 Identities = 7/40 (17%), Positives = 7/40 (17%)

Query: 34 PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPM 73
                          P P   TP   P       P P  
Sbjct: 1  NTSSDPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGR 40



 Score = 28.1 bits (63), Expect = 9.9
 Identities = 10/17 (58%), Positives = 10/17 (58%)

Query: 23  LPNVQAAPPPPPPPPPL 39
           LP VQ A P P PPP  
Sbjct: 268 LPVVQIARPAPQPPPAP 284


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 29.0 bits (65), Expect = 5.6
 Identities = 14/106 (13%), Positives = 23/106 (21%), Gaps = 23/106 (21%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
              +++        P  P           +            +      +    P     
Sbjct: 137 SAQRSSAQHLFYIIPGGPSWR------TSIKPFSSSHYGSHNSSAYSDHLRPLQP----- 185

Query: 84  LYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQVTAQR 129
                   S E+P    P W+      I  LE     FS  V+   
Sbjct: 186 -------YSGELPPPLGPNWQASNSKKIRGLE-----FSMWVSQPN 219


>gnl|CDD|182101 PRK09835, PRK09835, sensor kinase CusS; Provisional.
          Length = 482

 Score = 29.0 bits (65), Expect = 5.6
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 87  TRLIVSPEIPSDTTPLWKELEEVPINNLEEFTEL 120
           T LI   EI    +   KELE+V  +NLEE T +
Sbjct: 278 TNLITQTEIALSQSRSQKELEDVLYSNLEELTRM 311


>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
          Length = 746

 Score = 29.2 bits (65), Expect = 5.6
 Identities = 17/54 (31%), Positives = 22/54 (40%), Gaps = 7/54 (12%)

Query: 32  PPPPPPPLPPPPPMMGTT----PPPMMGT---PPPPPPMMGTPPPPPPMMGTTP 78
           PP  P P PP     GT     P  + G+   P  PP  + +P   P   G+T 
Sbjct: 440 PPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPASPPSNIVSPRTNPAAEGSTA 493


>gnl|CDD|239164 cd02763, MopB_2, The MopB_2 CD includes a group of related
           uncharacterized bacterial molybdopterin-binding
           oxidoreductase-like domains with a putative N-terminal
           iron-sulfur [4Fe-4S] cluster binding site and
           molybdopterin cofactor binding site. These members
           belong to the molybdopterin_binding (MopB) superfamily
           of proteins.
          Length = 679

 Score = 29.0 bits (65), Expect = 6.0
 Identities = 17/43 (39%), Positives = 19/43 (44%), Gaps = 4/43 (9%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPP 70
             PP P   PPLP PP +     P    TP   PP +G P  P
Sbjct: 339 HKPPYPRHIPPLPKPPKIPSADKP---FTPLYGPP-LGWPASP 377


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 12/53 (22%), Positives = 15/53 (28%), Gaps = 2/53 (3%)

Query: 22  SLPNVQAAPPPPPP--PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPP 72
           S    + AP P  P       P  P      P  + +P   P     PP    
Sbjct: 387 SPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARS 439


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 28.9 bits (65), Expect = 6.1
 Identities = 8/57 (14%), Positives = 14/57 (24%), Gaps = 3/57 (5%)

Query: 23  LPNVQAAPPPPPPPPPLPPPPPMMGTT---PPPMMGTPPPPPPMMGTPPPPPPMMGT 76
           + +  +  PP     P P   P+        P           +     P  P+   
Sbjct: 255 IESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAAQIELSAGVLPNAISPATPVRPA 311


>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
          protein.  This model is designed to identify biotin
          carboxyl carrier protein as a peptide of acetyl-CoA
          carboxylase. Scoring below the trusted cutoff is a
          related protein encoded in a region associated with
          polyketide synthesis in the prokaryote
          Saccharopolyspora hirsuta, and a reported
          chloroplast-encoded biotin carboxyl carrier protein
          that may be highly derived from the last common
          ancestral sequence. Scoring below the noise cutoff are
          biotin carboxyl carrier domains of other enzymes such
          as pyruvate carboxylase.The gene name is accB or fabE
          [Fatty acid and phospholipid metabolism, Biosynthesis].
          Length = 156

 Score = 28.3 bits (63), Expect = 6.2
 Identities = 12/51 (23%), Positives = 16/51 (31%), Gaps = 10/51 (19%)

Query: 26 VQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP----------PMMGT 66
           ++A      P P   P       P P +  P P            PM+GT
Sbjct: 40 KKSAVQQAAAPVPAQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGT 90


>gnl|CDD|236063 PRK07586, PRK07586, hypothetical protein; Validated.
          Length = 514

 Score = 29.0 bits (66), Expect = 6.6
 Identities = 8/19 (42%), Positives = 9/19 (47%)

Query: 26  VQAAPPPPPPPPPLPPPPP 44
           + A P  PP   P  PP P
Sbjct: 315 LGAKPAAPPLAAPARPPLP 333


>gnl|CDD|227869 COG5582, COG5582, Uncharacterized conserved protein [Function
           unknown].
          Length = 182

 Score = 28.2 bits (63), Expect = 6.6
 Identities = 26/146 (17%), Positives = 55/146 (37%), Gaps = 14/146 (9%)

Query: 108 EVPINNLEEFTELFSRQVTAQRP----VTRKRQQKSSKVQNVARLIDSKRSQNVGILAQS 163
           EV +   ++F   F  Q   +R     V        + ++NV   ++        ++  +
Sbjct: 3   EVSVQEKKDFIRWFLNQYQLKRREAVWVLNYLMSHDALLENV-HFVEEAEGCPRTLIIST 61

Query: 164 LHIEFSEIESAIFNMDASVVSLEALQQIYDVRATDEEINLIRAHLASNSDLQLDKPEQFL 223
             +     E        ++ +++  +  +D+RA   E   I+ H A  S     K  Q+L
Sbjct: 62  HGVH----EVPFQFYKYNIKTMDPEKAFHDIRANPNEAIYIQLHFAGAS-----KNPQYL 112

Query: 224 ADLADIPHFSERIACFMFEADFSDSI 249
           A L   P   E ++      + +D++
Sbjct: 113 AVLESNPQLPENLSEQTKLKELADAL 138


>gnl|CDD|217769 pfam03867, FTZ, Fushi tarazu (FTZ), N-terminal region.  This region
           contains the important motif (LXXLL) necessary for the
           interaction of FTZ with the nuclear receptor FTZ-F1. FTZ
           is thought to represents a category of LXXLL
           motif-dependent co-activators for nuclear receptors.
          Length = 264

 Score = 28.6 bits (63), Expect = 6.6
 Identities = 9/22 (40%), Positives = 10/22 (45%)

Query: 49  TPPPMMGTPPPPPPMMGTPPPP 70
           TPPP   TP     + G   PP
Sbjct: 204 TPPPAATTPVDGIGIEGISTPP 225


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 29.0 bits (66), Expect = 6.6
 Identities = 21/174 (12%), Positives = 45/174 (25%), Gaps = 9/174 (5%)

Query: 5   DKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM 64
           ++  K   K+E      +     AA                                P  
Sbjct: 41  EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAE 100

Query: 65  GTPPPPPPMMGTTPHVLGPLYWTRLIVSPEIPSDTTPLWKELEEVPINNLEEFTELFSRQ 124
                P       P               +      P  K+ +       +       R+
Sbjct: 101 DEAARPAEAAARRPKAKKA---------AKKKKGPKPKKKKPKRKAARGGKRGKGGKGRR 151

Query: 125 VTAQRPVTRKRQQKSSKVQNVARLIDSKRSQNVGILAQSLHIEFSEIESAIFNM 178
               R   RK+++K    + + R +    +  V  LA+ + ++ +E+   +F +
Sbjct: 152 RRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVKAAEVIKKLFKL 205


>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092).  This
           family consists of several hypothetical proteins of
           unknown function all from photosynthetic organisms
           including plants and cyanobacteria.
          Length = 270

 Score = 28.4 bits (64), Expect = 6.7
 Identities = 8/22 (36%), Positives = 8/22 (36%)

Query: 22  SLPNVQAAPPPPPPPPPLPPPP 43
             P   A  PPP      PP P
Sbjct: 110 QEPGYMALAPPPVALDKPPPQP 131


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
          consists of various plant development proteins which
          are homologues of floricaula (FLO) and Leafy (LFY)
          proteins which are floral meristem identity proteins.
          Mutations in the sequences of these proteins affect
          flower and leaf development.
          Length = 382

 Score = 28.8 bits (65), Expect = 7.0
 Identities = 8/31 (25%), Positives = 8/31 (25%)

Query: 27 QAAPPPPPPPPPLPPPPPMMGTTPPPMMGTP 57
               PPPPP  LPP                
Sbjct: 15 PRPAVPPPPPRLLPPAAVPPPPLAAAAAARK 45


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 28.4 bits (64), Expect = 7.5
 Identities = 11/33 (33%), Positives = 11/33 (33%)

Query: 29 APPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPP 61
           PPP PPPP    PPP              P  
Sbjct: 25 DPPPEPPPPEPAAPPPEPPEPEEVPEEEAAPLT 57


>gnl|CDD|235931 PRK07092, PRK07092, benzoylformate decarboxylase; Reviewed.
          Length = 530

 Score = 28.8 bits (65), Expect = 7.7
 Identities = 19/74 (25%), Positives = 23/74 (31%), Gaps = 14/74 (18%)

Query: 34  PPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSP 93
            PP   P PP              PPP P  G P     ++      L  L     IV  
Sbjct: 335 LPPSARPAPPAR----------PMPPPAPAPGEPLSVAFVL----QTLAALRPADAIVVE 380

Query: 94  EIPSDTTPLWKELE 107
           E PS    + + L 
Sbjct: 381 EAPSTRPAMQEHLP 394



 Score = 28.4 bits (64), Expect = 9.4
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMM 46
           P+ + APP  P PPP P P   +
Sbjct: 337 PSARPAPPARPMPPPAPAPGEPL 359


>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486).  This
           family is made up of members from various Burkholderia
           spp. The function is unknown.
          Length = 245

 Score = 28.3 bits (63), Expect = 7.7
 Identities = 12/52 (23%), Positives = 13/52 (25%), Gaps = 3/52 (5%)

Query: 23  LPNVQAAPPPPPPP---PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
           LP+  AA  P  PP               TPP      PP            
Sbjct: 116 LPSALAADDPQAPPAGATAADAGDAAPDATPPAAGDASPPAAAQAAASAAAA 167


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
          domain family is found in eukaryotes, and is typically
          between 125 and 136 amino acids in length.
          Length = 133

 Score = 27.5 bits (61), Expect = 7.7
 Identities = 9/17 (52%), Positives = 10/17 (58%)

Query: 28 AAPPPPPPPPPLPPPPP 44
          +AP PP  P P P  PP
Sbjct: 15 SAPAPPAGPGPGPNAPP 31


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 27.9 bits (62), Expect = 7.8
 Identities = 18/71 (25%), Positives = 21/71 (29%), Gaps = 10/71 (14%)

Query: 22  SLPNVQAAPPPPPPP----------PPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
                      P  P          P  PPP       P   +  PP PP +    P  P
Sbjct: 129 VPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPFYPLSP 188

Query: 72  PMMGTTPHVLG 82
             +G  PH LG
Sbjct: 189 GGVGQIPHPLG 199



 Score = 27.9 bits (62), Expect = 8.1
 Identities = 14/58 (24%), Positives = 17/58 (29%), Gaps = 9/58 (15%)

Query: 24  PNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMM---------GTPPPPPP 72
           P +    P  P     P PP               P  P++         GTPPP  P
Sbjct: 105 PFLMLPNPYLPNGSLSPLPPSSNKVPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLP 162



 Score = 27.9 bits (62), Expect = 8.6
 Identities = 21/78 (26%), Positives = 29/78 (37%), Gaps = 16/78 (20%)

Query: 23  LPNVQAAPPPPP--------PPPPLPPPPPMM--------GTTPPPMMGTPPPPPPMMGT 66
           LPN   +P PP         PP  + P  P++          TPPP +     P   +  
Sbjct: 114 LPNGSLSPLPPSSNKVPVVQPPHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPR 173

Query: 67  PPPPPPMMGTTPHVLGPL 84
           PP PP +    P   G +
Sbjct: 174 PPHPPDISPFYPLSPGGV 191


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 28.0 bits (62), Expect = 7.9
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 28  AAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPP 71
            A   P P      P      T      T  P      TP   P
Sbjct: 110 VATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQP 153


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 27.8 bits (62), Expect = 8.0
 Identities = 18/106 (16%), Positives = 30/106 (28%), Gaps = 8/106 (7%)

Query: 1   MEYMDKRAKRRKKKETYSNIFSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPPMMGTPPPP 60
           ME  +++AK+ ++        + P   + P          P   +  TTP          
Sbjct: 39  MEQANEKAKKLEQSSD-----AKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTAT 93

Query: 61  PPMMGTPPPPPPMMGTTPHVLGPLYWTRLIVSPE---IPSDTTPLW 103
           P      P P          + P+     +  P        T  LW
Sbjct: 94  PEQKLLAPRPLFTAAREVKTVVPVSSVTPVTPPRPLRTGEQTARLW 139


>gnl|CDD|177445 PHA02662, PHA02662, ORF131 putative membrane protein;
          Provisional.
          Length = 226

 Score = 28.1 bits (62), Expect = 8.2
 Identities = 9/14 (64%), Positives = 9/14 (64%)

Query: 30 PPPPPPPPPLPPPP 43
          PP  PP PP PP P
Sbjct: 2  PPRTPPTPPRPPEP 15


>gnl|CDD|111097 pfam02166, Androgen_recep, Androgen receptor. 
          Length = 423

 Score = 28.6 bits (63), Expect = 8.4
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 21  FSLPNVQAAPPPPPPPP 37
           ++ P   A PPPPPPPP
Sbjct: 339 YNFPLALAGPPPPPPPP 355


>gnl|CDD|218017 pfam04305, DUF455, Protein of unknown function (DUF455). 
          Length = 250

 Score = 28.3 bits (64), Expect = 8.7
 Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 8/51 (15%)

Query: 165 HIEFSEIESAIFNMDASVVSLEALQQIYD--VRATDEEI---NLIRAHLAS 210
           HIE + I+ A+   DA        ++ YD  ++  D+E     L+R  L  
Sbjct: 74  HIELNAIDLAL---DAVARFRGLPREFYDDWLKVADDEAKHFRLLRERLEE 121


>gnl|CDD|222296 pfam13660, DUF4147, Domain of unknown function (DUF4147).  This
           domain is frequently found at the N-terminus of proteins
           carrying the glycerate kinase-like domain MOFRL,
           pfam05161.
          Length = 227

 Score = 28.2 bits (64), Expect = 8.8
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 173 SAIFNMDASVVSLEALQQIYD--VR--ATDEEINLIRAHL 208
           SA+    A  ++LE  Q +    +   A  +EIN +R HL
Sbjct: 121 SALLPAPAEGITLEDKQAVTRALLASGADIQEINAVRKHL 160


>gnl|CDD|216111 pfam00772, DnaB, DnaB-like helicase N terminal domain.  The
           hexameric helicase DnaB unwinds the DNA duplex at the
           Escherichia coli chromosome replication fork. Although
           the mechanism by which DnaB both couples ATP hydrolysis
           to translocation along DNA and denatures the duplex is
           unknown, a change in the quaternary structure of the
           protein involving dimerisation of the N-terminal domain
           has been observed and may occur during the enzymatic
           cycle. This N-terminal domain is required both for
           interaction with other proteins in the primosome and for
           DnaB helicase activity.
          Length = 103

 Score = 26.8 bits (60), Expect = 8.8
 Identities = 9/51 (17%), Positives = 18/51 (35%), Gaps = 6/51 (11%)

Query: 184 SLEALQQIYDV----RATDEEINLI--RAHLASNSDLQLDKPEQFLADLAD 228
              A + I++         + I+ +     L     L+      +LA+LA 
Sbjct: 36  YDPAHRLIFEAILELYEKGKPIDPVTVAEELERKGKLEEVGGIAYLAELAA 86


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score = 28.6 bits (64), Expect = 8.9
 Identities = 15/66 (22%), Positives = 18/66 (27%), Gaps = 5/66 (7%)

Query: 40  PPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVL---GPLYWTRLIVSPEI- 95
            P P +M            P    M     P P     P  L    P   + L + P   
Sbjct: 73  NPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPPPISDLFLPPPPI 132

Query: 96  -PSDTT 100
            P  TT
Sbjct: 133 VPPLTT 138


>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
          Length = 226

 Score = 27.9 bits (62), Expect = 9.0
 Identities = 11/32 (34%), Positives = 14/32 (43%), Gaps = 1/32 (3%)

Query: 21  FSLPNVQAAPPPPPPPPPLPPPPPMMGTTPPP 52
            S P+ +A+P P     P P   P    TP P
Sbjct: 180 LSAPDDEASPQPKSLATPPPVAAP-SRRTPRP 210


>gnl|CDD|223689 COG0616, SppA, Periplasmic serine proteases (ClpP class)
           [Posttranslational modification, protein turnover,
           chaperones / Intracellular trafficking and secretion].
          Length = 317

 Score = 28.1 bits (63), Expect = 9.2
 Identities = 31/143 (21%), Positives = 48/143 (33%), Gaps = 30/143 (20%)

Query: 239 FMFEADFSDSIALIDSKLNNIKSVCEFLMKSEHLKKVIAIILSLGNFMNGGNRLRGQADG 298
           ++       +  L    L   +      +  +   KVIA+I   G  + GG  LR     
Sbjct: 24  WLLSLILLLAPTLKAVGLLAKRLGKLEALLRKRGSKVIAVIHVEGAIVAGGGPLRFIG-- 81

Query: 299 FGLEILPKLKDVRSKDNSVTLLHFIVRTYLR-NSENPLNESLPVPEP-GDVDRAASVVFD 356
            G +I   L+  R+  +   ++       LR NS            P G V   AS   +
Sbjct: 82  -GDDIEEILRAARADPSVKAVV-------LRINS------------PGGSV--VAS---E 116

Query: 357 DIHSQLSTLAKKLDAVTISMNKV 379
            I   L  L  K   V +S+   
Sbjct: 117 LIARALKRLRAK-KPVVVSVGGY 138


>gnl|CDD|226708 COG4257, Vgb, Streptogramin lyase [Defense mechanisms].
          Length = 353

 Score = 28.3 bits (63), Expect = 9.3
 Identities = 11/54 (20%), Positives = 16/54 (29%), Gaps = 3/54 (5%)

Query: 30 PPPPPPPPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPPPPPMMGTTPHVLGP 83
          P              ++G    P  G+ P P       P P    G+ P  + P
Sbjct: 19 PASSSAAANAVGANALIGEFATPAAGSSPAPDGSSAEFPVPN---GSAPFDVAP 69


>gnl|CDD|180315 PRK05933, PRK05933, type III secretion system protein; Validated.
          Length = 372

 Score = 28.3 bits (63), Expect = 9.6
 Identities = 10/24 (41%), Positives = 11/24 (45%)

Query: 20  IFSLPNVQAAPPPPPPPPPLPPPP 43
           I S PN+Q   PP  P    P  P
Sbjct: 280 ITSYPNLQPEDPPEEPSEGGPAAP 303


>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis,
           outer membrane].
          Length = 284

 Score = 28.2 bits (63), Expect = 9.6
 Identities = 14/51 (27%), Positives = 14/51 (27%), Gaps = 12/51 (23%)

Query: 30  PPPPPPPPPLPPPPPMMGT--------TPPPMMGTPPPPPPMMGTPPPPPP 72
           P            PP  G          PPP    P    P    PPP PP
Sbjct: 222 PAGSLECEDQALVPPGDGCGKELASWFEPPP----PGTTKPEPKAPPPLPP 268


>gnl|CDD|185156 PRK15244, PRK15244, virulence protein SpvB; Provisional.
          Length = 591

 Score = 28.5 bits (63), Expect = 9.8
 Identities = 11/16 (68%), Positives = 11/16 (68%)

Query: 50  PPPMMGTPPPPPPMMG 65
           P   M  PPPPPPMMG
Sbjct: 361 PVNNMMPPPPPPPMMG 376



 Score = 28.5 bits (63), Expect = 9.8
 Identities = 11/16 (68%), Positives = 11/16 (68%)

Query: 60  PPPMMGTPPPPPPMMG 75
           P   M  PPPPPPMMG
Sbjct: 361 PVNNMMPPPPPPPMMG 376


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
           Provisional.
          Length = 418

 Score = 28.3 bits (63), Expect = 10.0
 Identities = 7/34 (20%), Positives = 10/34 (29%)

Query: 36  PPPLPPPPPMMGTTPPPMMGTPPPPPPMMGTPPP 69
            PPL     +     P +   P PP   +     
Sbjct: 385 SPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,076,429
Number of extensions: 2415394
Number of successful extensions: 14082
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8932
Number of HSP's successfully gapped: 993
Length of query: 454
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 354
Effective length of database: 6,502,202
Effective search space: 2301779508
Effective search space used: 2301779508
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)