RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15122
         (927 letters)



>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score =  203 bits (518), Expect = 7e-58
 Identities = 77/201 (38%), Positives = 124/201 (61%), Gaps = 2/201 (0%)

Query: 591 PEMYTDCHKNIINGKQACEEVKQSKKLAKILELILLMGNYMNSGSRNGGAFGFEINFLTK 650
            E   +   ++   + A EE+++S+K  K+LELIL +GNYMNSG+R G A GF+++ L K
Sbjct: 173 EEEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTRRGNAKGFKLSSLLK 232

Query: 651 LSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNI 710
           LS TK  +NKTTLLHYLV  I +K P+ L F  EL HV++AA+V  + ++  ++++E  +
Sbjct: 233 LSDTKSTDNKTTLLHYLVKIIREKLPDLLDFSSELSHVEKAAKVDLEQLEKDVKELEKGL 292

Query: 711 KNLETDIQNCKQAPVANENDKFLEIMEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTF 770
           K LE +++        + +DKF+E M+ F +E  +K+  L ++ K  M L+ +L E++  
Sbjct: 293 KKLERELELSALDE--HPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKELTEYFGE 350

Query: 771 DKNIYTLEEFFTDIKTFKDSF 791
           D    + EEFF  ++ F   F
Sbjct: 351 DPKETSPEEFFKILRDFLRMF 371


>gnl|CDD|219000 pfam06367, Drf_FH3, Diaphanous FH3 Domain.  This region is found in
           the Formin-like and and diaphanous proteins.
          Length = 197

 Score =  189 bits (482), Expect = 3e-55
 Identities = 88/223 (39%), Positives = 129/223 (57%), Gaps = 28/223 (12%)

Query: 267 DGHDKVIKAITMSGEL-KGKERFQPVVQGLMVKGN-NEALRTACLQLINAIVATPDDLEF 324
            GH+KV++A     E+ + + RF+ +V  L    N N   + A +Q INA+V +P+DL F
Sbjct: 1   GGHEKVLEATLNFKEVCRERGRFRSLVGALDSSENDNVEYKVATMQFINALVNSPEDLNF 60

Query: 325 RLHLRNEIMRVGLYDLLDALEKDASEDVSVQLKVFIEHKEEDYYEFIQRFDNVRMEIDDV 384
           RLHLR+E  R+GL  +LD L +  +E +  QL++F E++EED  E ++RFD+V +++DD 
Sbjct: 61  RLHLRSEFTRLGLDRILDKLRELENEKLDDQLQIFEENREEDVEELLERFDDVNVDLDDP 120

Query: 385 NDCFETVRNMVMDSACEPYLLSILQHLLFIRDDQNMVMDSACEPYLLSILQHLLFIRDDQ 444
            + FE ++                          N V D+  EP+ LSILQHLL IRDD+
Sbjct: 121 VELFELLK--------------------------NKVKDTEAEPHFLSILQHLLLIRDDE 154

Query: 445 YVRLAYYKLLEECVSQIVLHRGGCDPDFRSSRRFQLDVQPLVE 487
                Y+KLLEE VSQIVLHR   DPDF   +   +D+  L++
Sbjct: 155 EELPQYWKLLEELVSQIVLHRTKPDPDFDERKNLNIDINRLLD 197


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score =  177 bits (452), Expect = 9e-49
 Identities = 88/264 (33%), Positives = 127/264 (48%), Gaps = 45/264 (17%)

Query: 591 PEMYTDCHKNIINGKQACEEVKQSKKLAKILELILLMGNYMNSGSRNGGAFGFEINFLTK 650
            E   D    I   + ACEE+++SKK  K+LELIL +GNYMN GSR G A+GF+++ L K
Sbjct: 173 EEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSRRGQAYGFKLSSLLK 232

Query: 651 LSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNI 710
           LS  K  +NKTTLLH+LV  I +K+                                  +
Sbjct: 233 LSDVKSADNKTTLLHFLVKIIRKKY----------------------------------L 258

Query: 711 KNLETDIQNCKQAPVANENDKFLEIMEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTF 770
             L             N +DKF+E+M+PF K  ++K   L     ++ T +  L E+Y  
Sbjct: 259 GGLS---------DPENLDDKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLVEYYGE 309

Query: 771 DKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENEK-KDKAARK 829
           D    + EEFF D   F   F +A +ENIK  E EE+  + +  +E  E E+   +   +
Sbjct: 310 DPKDTSPEEFFKDFNEFLKEFSKAAEENIKKEEEEEERRK-KLVKETTEYEQSSSRQKER 368

Query: 830 KALIDMTTDQTQQGVMDSLLEALQ 853
              +D   ++   GV+DSLLE L 
Sbjct: 369 NPSMDFEVERDFLGVLDSLLEELG 392


>gnl|CDD|219001 pfam06371, Drf_GBD, Diaphanous GTPase-binding Domain.  This domain
           is bound to by GTP-attached Rho proteins, leading to
           activation of the Drf protein.
          Length = 187

 Score =  140 bits (354), Expect = 6e-38
 Identities = 55/195 (28%), Positives = 102/195 (52%), Gaps = 14/195 (7%)

Query: 75  LEKLNPEQLNQKFEDMLNDMNLSDEKKEPLRRQPLANKKKMLLMHYKGT------VTSYE 128
           L+K + +++ + FE+++ +MNL +EK+ P+R +P+  K ++++ +            S  
Sbjct: 1   LQKPDDDEILKLFEELMEEMNLPEEKRRPMRAKPIEKKWQLVVQYLLTNFQGESLSKSLS 60

Query: 129 NKSKFDKPIEYIQYLSQPELSVNKMYSCIESLRIALTNHPLSWVNEFVLQDNKNFRKYPI 188
             ++   P  Y++ L    +       C+ESLR+AL   P+SWV  F+  +  +     I
Sbjct: 61  KSNETGSPEYYVKKLKDGSIDQK----CLESLRVALRTQPVSWVKRFIGAEGLS-ALLNI 115

Query: 189 AFLYPRFPSRSRNDSRYDRVQYECVRCIRAIMNNTVGLKQMFGQKEALTIVARSLDPNKP 248
                R  S S  D      +YE ++C++A+MNN  G+  + G  E + ++ARSLD  +P
Sbjct: 116 LSKIVRKKSESDEDLD---REYEILKCLKALMNNKFGIDHVLGHPEVILLLARSLDSPRP 172

Query: 249 TVMLEAVKVLAAVCL 263
                A+++L A+CL
Sbjct: 173 KTRKLALELLTALCL 187


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 61.5 bits (149), Expect = 5e-11
 Identities = 31/49 (63%), Positives = 32/49 (65%), Gaps = 8/49 (16%)

Query: 551 AAGIPTGP-----KGGPPPPPPPPGGMG-PPPPPMPGMPG--PPPPPMP 591
           AAGIP  P      G PPPPPP PGG   PPPPP+PG PG  PPPPP P
Sbjct: 81  AAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFP 129



 Score = 60.3 bits (146), Expect = 2e-10
 Identities = 29/51 (56%), Positives = 29/51 (56%), Gaps = 8/51 (15%)

Query: 551 AAGIPTGPKGGP-----PPPPPPPGGMG--PPPPPMPGMPG-PPPPPMPEM 593
            AGIP  P   P     PPPPP PGG G  PPPPP PG PG PPPPP    
Sbjct: 93  GAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGS 143



 Score = 58.8 bits (142), Expect = 5e-10
 Identities = 25/44 (56%), Positives = 26/44 (59%), Gaps = 7/44 (15%)

Query: 554 IPTGPKGGPPPPPPPPGGMG-PPPPPMPGMPGPPP-----PPMP 591
           +P GP G PPPPPP PG  G PPPPP  G P PPP     P  P
Sbjct: 115 LPGGP-GVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAP 157



 Score = 58.0 bits (140), Expect = 9e-10
 Identities = 27/48 (56%), Positives = 29/48 (60%), Gaps = 7/48 (14%)

Query: 551 AAGIPTGP----KGGPPPPPPPPGGMG--PPPPPMPGMPG-PPPPPMP 591
           A  IP  P      G PPPPP PGG G  PPPPP+PG    PPPPP+P
Sbjct: 69  ATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVPPPPPLP 116



 Score = 56.9 bits (137), Expect = 2e-09
 Identities = 26/46 (56%), Positives = 28/46 (60%), Gaps = 6/46 (13%)

Query: 552 AGIPTGPK----GGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
           A IP  P      G PPPPP PG    PPPPP+PG  G PPPPP+P
Sbjct: 46  AAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLP 91



 Score = 56.5 bits (136), Expect = 4e-09
 Identities = 27/48 (56%), Positives = 29/48 (60%), Gaps = 7/48 (14%)

Query: 551 AAGIPTGP----KGGPPPPPPPPGGMG-PPPPPMPGMPG--PPPPPMP 591
            AGIP  P        PPPPP PG  G PPPPP+PG  G  PPPPP+P
Sbjct: 57  VAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLP 104



 Score = 56.1 bits (135), Expect = 4e-09
 Identities = 23/40 (57%), Positives = 25/40 (62%), Gaps = 2/40 (5%)

Query: 555 PTGPKGGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMPE 592
           P       PPPPP PGG G PPPPP+PG    PPPPP+P 
Sbjct: 17  PLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPG 56



 Score = 55.7 bits (134), Expect = 6e-09
 Identities = 25/46 (54%), Positives = 27/46 (58%), Gaps = 6/46 (13%)

Query: 552 AGIPTGPK----GGPPPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
            GIP  P        PPPPP PG  G PPPPP+PG    PPPPP+P
Sbjct: 34  TGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLP 79



 Score = 55.3 bits (133), Expect = 8e-09
 Identities = 27/46 (58%), Positives = 30/46 (65%), Gaps = 6/46 (13%)

Query: 552 AGIPTGPK--GGP--PPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
             IP  P   GG   PPPPP PGG   PPPPP+PG+ G PPPPP+P
Sbjct: 22  VCIPPPPPLPGGTGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPPLP 67



 Score = 52.6 bits (126), Expect = 8e-08
 Identities = 24/40 (60%), Positives = 26/40 (65%), Gaps = 3/40 (7%)

Query: 555 PTGPKGG--PPPPPPPPGGMGPPPPPMPGMPG-PPPPPMP 591
           P  P G   PPPPP P G   PPPPP+PG  G PPPPP+P
Sbjct: 4   PPLPGGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLP 43



 Score = 51.5 bits (123), Expect = 2e-07
 Identities = 22/31 (70%), Positives = 25/31 (80%), Gaps = 2/31 (6%)

Query: 563 PPPPPPPGGMG-PPPPPMPGMPG-PPPPPMP 591
           PPPPP PGG+G PPPPP+PG    PPPPP+P
Sbjct: 1   PPPPPLPGGVGIPPPPPLPGGVCIPPPPPLP 31



 Score = 42.2 bits (99), Expect = 2e-04
 Identities = 15/30 (50%), Positives = 16/30 (53%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
           A GIP  P G   PPPPP G   P  P +P
Sbjct: 131 APGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 52.2 bits (125), Expect = 5e-07
 Identities = 15/37 (40%), Positives = 15/37 (40%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            T      P  PPPP     PP P P  P   PPP P
Sbjct: 49  STAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85



 Score = 49.1 bits (117), Expect = 5e-06
 Identities = 14/34 (41%), Positives = 14/34 (41%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           P   PPPP        P  PP P  P  PP P P
Sbjct: 41  PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQP 74



 Score = 46.4 bits (110), Expect = 3e-05
 Identities = 14/37 (37%), Positives = 14/37 (37%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                    PPPPPP    P P P      PPPPP  
Sbjct: 51  AAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPAD 87



 Score = 46.0 bits (109), Expect = 5e-05
 Identities = 20/45 (44%), Positives = 21/45 (46%), Gaps = 4/45 (8%)

Query: 551 AAGIPTGPKGGPPP---PPPPPGGMGPPPPPMPGMP-GPPPPPMP 591
           AA  P  P   PPP   P P P      PPP P  P  PPPPP+ 
Sbjct: 53  AAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVD 97



 Score = 45.7 bits (108), Expect = 7e-05
 Identities = 18/46 (39%), Positives = 19/46 (41%), Gaps = 1/46 (2%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMPE 592
                  P  P+   P   PPP    P  PPP P  P  PPPP PE
Sbjct: 62  PPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPE 107



 Score = 43.7 bits (103), Expect = 3e-04
 Identities = 15/42 (35%), Positives = 15/42 (35%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            A   P  P   P      P    PPPPP P  P  P P  P
Sbjct: 36  TANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDP 77



 Score = 43.3 bits (102), Expect = 4e-04
 Identities = 14/44 (31%), Positives = 14/44 (31%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           A   A     P   P  PP P        PP P      PPP P
Sbjct: 52  AAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPP 95



 Score = 31.0 bits (70), Expect = 2.5
 Identities = 13/34 (38%), Positives = 13/34 (38%), Gaps = 2/34 (5%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
                  P  P   PPPP  P     PPP P PG
Sbjct: 78  NAAPPPPPADPNAPPPPPVDPNA--PPPPAPEPG 109



 Score = 30.2 bits (68), Expect = 4.1
 Identities = 12/30 (40%), Positives = 12/30 (40%), Gaps = 4/30 (13%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
           P    P  PPPP    P  P  P  P P P
Sbjct: 83  PPPADPNAPPPP----PVDPNAPPPPAPEP 108


>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
           Members of this family belong to the collagen
           superfamily. Collagens are generally extracellular
           structural proteins involved in formation of connective
           tissue structure. The alignment contains 20 copies of
           the G-X-Y repeat that forms a triple helix. The first
           position of the repeat is glycine, the second and third
           positions can be any residue but are frequently proline
           and hydroxyproline. Collagens are post translationally
           modified by proline hydroxylase to form the
           hydroxyproline residues. Defective hydroxylation is the
           cause of scurvy. Some members of the collagen
           superfamily are not involved in connective tissue
           structure but share the same triple helical structure.
          Length = 60

 Score = 45.6 bits (109), Expect = 2e-06
 Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 555 PTGPKG--GPPPPPPPPGGMGPPPPP-MPGMPGPPPPP 589
           P GP G  GPP PP PPG  GPP PP  PG PGPP PP
Sbjct: 2   PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 39



 Score = 45.2 bits (108), Expect = 2e-06
 Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 555 PTGPKG--GPPPPPPPPGGMGPP-PPPMPGMPGPPPPP 589
           P GP G  GPP PP PPG  GPP PP  PG PGPP PP
Sbjct: 11  PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 48



 Score = 45.2 bits (108), Expect = 3e-06
 Identities = 24/41 (58%), Positives = 24/41 (58%), Gaps = 5/41 (12%)

Query: 551 AAGIPTGPKG--GPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
             G P GP G  GPP PP PPG  GPP P  PG PG P PP
Sbjct: 23  PPGPP-GPPGPPGPPGPPGPPGPPGPPGP--PGAPGAPGPP 60



 Score = 41.7 bits (99), Expect = 4e-05
 Identities = 24/39 (61%), Positives = 24/39 (61%), Gaps = 5/39 (12%)

Query: 557 GPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMP 591
           GP G PP PP PPG  GPP    PP  PG PGPP PP P
Sbjct: 1   GPPG-PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGP 38



 Score = 40.9 bits (97), Expect = 8e-05
 Identities = 25/38 (65%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 555 PTGPKG--GPPPPPPPPGGMGPPPPP-MPGMPGPPPPP 589
           P GP G  GPP PP PPG  GPP PP  PG PGPP PP
Sbjct: 5   PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 42


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 44.3 bits (105), Expect = 2e-05
 Identities = 20/47 (42%), Positives = 21/47 (44%), Gaps = 7/47 (14%)

Query: 552 AGIPTGPKGGPPPP-------PPPPGGMGPPPPPMPGMPGPPPPPMP 591
            G P  P    PPP       PPP     PPP P P +PGP  PP P
Sbjct: 43  VGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSP 89



 Score = 40.4 bits (95), Expect = 5e-04
 Identities = 14/44 (31%), Positives = 16/44 (36%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                 +P   +   PPP P P   GP  PP P  P  P    P
Sbjct: 57  VSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPP 100



 Score = 37.0 bits (86), Expect = 0.008
 Identities = 16/40 (40%), Positives = 16/40 (40%), Gaps = 5/40 (12%)

Query: 555 PTGPKGGPPPPPPPPGGMGP-----PPPPMPGMPGPPPPP 589
                  P P PP PG  GP     PP P    P PPP P
Sbjct: 67  SEPAAPPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106



 Score = 35.0 bits (81), Expect = 0.038
 Identities = 15/38 (39%), Positives = 15/38 (39%), Gaps = 4/38 (10%)

Query: 558 PKGGPPPPP----PPPGGMGPPPPPMPGMPGPPPPPMP 591
           P  G P PP    PPP       PP      PPP P P
Sbjct: 41  PPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEP 78



 Score = 34.3 bits (79), Expect = 0.058
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           PPP   P   +   PPP+  +   PPP  P
Sbjct: 40  PPPVGDPRPPVVDTPPPVSAVWVLPPPSEP 69



 Score = 33.9 bits (78), Expect = 0.093
 Identities = 14/47 (29%), Positives = 17/47 (36%), Gaps = 5/47 (10%)

Query: 551 AAGI---PTGPKGGPPPPPPPPGGMGPPPP--PMPGMPGPPPPPMPE 592
            +G     + P+   P  PPP G   PP    P P       PP  E
Sbjct: 22  TSGFSSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSE 68


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
           modification].
          Length = 2365

 Score = 46.5 bits (110), Expect = 7e-05
 Identities = 19/31 (61%), Positives = 19/31 (61%), Gaps = 5/31 (16%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
           P G PPPPPPPP G  PP  P P    PPPP
Sbjct: 5   PPGNPPPPPPPP-GFEPPSQPPP----PPPP 30



 Score = 38.0 bits (88), Expect = 0.026
 Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 4/42 (9%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNI 601
             PP  PPPP    PPPP       PPPPP P +        
Sbjct: 3   SLPPGNPPPP----PPPPGFEPPSQPPPPPPPGVNVKKRSRK 40


>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
          Length = 94

 Score = 40.9 bits (95), Expect = 2e-04
 Identities = 17/36 (47%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           I   PK   PP P PP    P PPP P  P P P P
Sbjct: 3   IFRAPKPPEPPAPLPPA---PVPPPPPAPPAPVPEP 35



 Score = 39.0 bits (90), Expect = 0.001
 Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 4/32 (12%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
              P PP PP     P PP P  P PP PP P
Sbjct: 4   FRAPKPPEPP----APLPPAPVPPPPPAPPAP 31



 Score = 31.2 bits (70), Expect = 0.52
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 4/38 (10%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P    PP P PP    PPP P   +P P   P+  
Sbjct: 9   PPEPPAPLPPAPVPP----PPPAPPAPVPEPTVKPVNA 42


>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region. 
          Length = 57

 Score = 39.8 bits (93), Expect = 2e-04
 Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 6/41 (14%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP------PPP 589
           P       PPPP        PPPP+   P PP      PPP
Sbjct: 16  PPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPP 56



 Score = 39.0 bits (91), Expect = 4e-04
 Identities = 15/34 (44%), Positives = 15/34 (44%), Gaps = 7/34 (20%)

Query: 562 PPPPP-------PPPGGMGPPPPPMPGMPGPPPP 588
           PPPP        PPP     PPPP      PPPP
Sbjct: 24  PPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPPP 57



 Score = 38.6 bits (90), Expect = 5e-04
 Identities = 15/33 (45%), Positives = 16/33 (48%), Gaps = 5/33 (15%)

Query: 562 PPP-----PPPPPGGMGPPPPPMPGMPGPPPPP 589
           PPP     PPPP     PPPP    +   PPPP
Sbjct: 7   PPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPP 39



 Score = 38.2 bits (89), Expect = 7e-04
 Identities = 19/45 (42%), Positives = 20/45 (44%), Gaps = 11/45 (24%)

Query: 562 PPPP-----PPPPGGMGPPPPPMPGM--PGPPPP----PMPEMYT 595
           PPPP     PPPP     PPPP+       PPPP    P P  Y 
Sbjct: 6   PPPPVKQYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSPPPPKYV 50



 Score = 37.1 bits (86), Expect = 0.002
 Identities = 12/37 (32%), Positives = 13/37 (35%), Gaps = 4/37 (10%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
                PPPP         PPP      PPPP    +Y
Sbjct: 1   YHYKSPPPPVKQY----SPPPPYYYKSPPPPVKSPVY 33


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 13/50 (26%), Positives = 15/50 (30%), Gaps = 1/50 (2%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
                 +G    P       P P  G    PPP    P   P P P +Y 
Sbjct: 38  QPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAP-PGVYP 86



 Score = 40.5 bits (95), Expect = 4e-04
 Identities = 20/50 (40%), Positives = 21/50 (42%), Gaps = 6/50 (12%)

Query: 549 QVAAGIPTGPK--GGPPPPPPPPGGMGPPPPPM----PGMPGPPPPPMPE 592
             A G    P   G PPP  P P G+ P PPP        P  PPPP P 
Sbjct: 59  LPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPG 108



 Score = 33.5 bits (77), Expect = 0.12
 Identities = 12/55 (21%), Positives = 18/55 (32%), Gaps = 1/55 (1%)

Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
             ++ +  +     P   + G  PPP     +  P P       PPP   P  Y 
Sbjct: 25  QAMERLAERAQRAQPVSRESGYYPPPGAYVHL-EPLPAYGQYAAPPPYGPPPPYY 78


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 44.0 bits (104), Expect = 3e-04
 Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 3/86 (3%)

Query: 510 EAIMLRQEAEAKLVQAQK---TLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPP 566
           E  +L+    A      +    LE L  G P   +         A   P      P PP 
Sbjct: 352 EMALLKMATLAPGAPVSELLDRLEALERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPA 411

Query: 567 PPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P       P P P  P    PP   
Sbjct: 412 APARPAAARPAPAPAPPAAAAPPARS 437


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 43.1 bits (101), Expect = 4e-04
 Identities = 22/45 (48%), Positives = 22/45 (48%), Gaps = 16/45 (35%)

Query: 562 PPPPPPPPGGMG---------PP-----PPPMPGMPG--PPPPPM 590
           PP PPPPP G G         PP     PPP P  P   PPPPPM
Sbjct: 173 PPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217



 Score = 33.8 bits (77), Expect = 0.33
 Identities = 16/29 (55%), Positives = 16/29 (55%), Gaps = 6/29 (20%)

Query: 557 GPKGGPP-----PPPPPPGG-MGPPPPPM 579
            P G PP     PPP P G  M PPPPPM
Sbjct: 189 KPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217



 Score = 29.2 bits (65), Expect = 9.5
 Identities = 12/20 (60%), Positives = 12/20 (60%)

Query: 572 MGPPPPPMPGMPGPPPPPMP 591
           MGPP P  P  P  PPPP P
Sbjct: 162 MGPPSPWNPRFPPGPPPPPP 181


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 41.0 bits (96), Expect = 4e-04
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
               G+ +G     PP  P PG   PP P  PG
Sbjct: 5   DGTGGVSSGSSAPAPPAGPGPGPNAPPAPAAPG 37



 Score = 31.3 bits (71), Expect = 0.94
 Identities = 12/36 (33%), Positives = 12/36 (33%), Gaps = 1/36 (2%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
             G  TG        P PP G   P P  P  P  P
Sbjct: 2   TGGDGTGGVSSGSSAPAPPAG-PGPGPNAPPAPAAP 36



 Score = 30.5 bits (69), Expect = 1.5
 Identities = 9/37 (24%), Positives = 9/37 (24%), Gaps = 2/37 (5%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            TG  G             P  P     P  PP P  
Sbjct: 1   STGGDGTGGVSSGSSAPAPPAGPGPG--PNAPPAPAA 35


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 43.6 bits (103), Expect = 5e-04
 Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 4/46 (8%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMPE 592
             G P G  G PPPP  P      P        P  P  P PP  E
Sbjct: 700 PGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAE 745



 Score = 37.9 bits (88), Expect = 0.030
 Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            P P  PPGG G  PPP        P P  
Sbjct: 697 APRPGGPPGGGGGLPPPPDLPAAAGPAPCG 726



 Score = 37.5 bits (87), Expect = 0.039
 Identities = 17/65 (26%), Positives = 20/65 (30%), Gaps = 8/65 (12%)

Query: 536 RPVEKNRLDEVK-AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-------P 587
           RP    R D V+     A +       P  PP   GG+ PPP         P        
Sbjct: 673 RPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIAS 732

Query: 588 PPMPE 592
           P  P 
Sbjct: 733 PTAPP 737



 Score = 32.1 bits (73), Expect = 1.8
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
                P  P    P P        P  PP P  PG       E
Sbjct: 709 GLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGAE 751


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 43.8 bits (103), Expect = 6e-04
 Identities = 17/37 (45%), Positives = 17/37 (45%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            P      PP P  PP     P PP P  P PPPPP P
Sbjct: 2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 43.4 bits (102), Expect = 8e-04
 Identities = 26/94 (27%), Positives = 29/94 (30%), Gaps = 25/94 (26%)

Query: 528 TLEDLSSGR--------PVEKNR-LDEVKAQVAAGI-------PTGPKGGPPPPPPPPGG 571
           +LEDLS+GR        P  K R         A G        P  P     P P P   
Sbjct: 364 SLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPV 423

Query: 572 M--------GPPPPPMPGMP-GPPPPPMPEMYTD 596
                     P P   PG   GP PPP  +    
Sbjct: 424 PASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457



 Score = 43.0 bits (101), Expect = 9e-04
 Identities = 16/38 (42%), Positives = 17/38 (44%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P  P    P P PPP     PPPP P  P PP  P  +
Sbjct: 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948



 Score = 41.8 bits (98), Expect = 0.002
 Identities = 19/39 (48%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPG-MPGPPPPPMPE 592
            P  P+  P P  PPP    P PPP P   P PPPPP P+
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940



 Score = 41.5 bits (97), Expect = 0.003
 Identities = 19/45 (42%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 548  AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            A+ A    T     PP  P  P     PPPP P  P PPPPP P+
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQP-QPQPPPPPQPQ 2930



 Score = 40.3 bits (94), Expect = 0.006
 Identities = 15/40 (37%), Positives = 15/40 (37%), Gaps = 1/40 (2%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           V A  P  P   P P   P    GP PPP    P P   P
Sbjct: 423 VPASAPPPPAT-PLPSAEPGSDDGPAPPPERQPPAPATEP 461



 Score = 39.5 bits (92), Expect = 0.009
 Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 1/43 (2%)

Query: 551  AAGIPTGPKGGPPPP-PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            A     G   GP PP P PP    P PPP    P    P    
Sbjct: 2600 APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642



 Score = 38.8 bits (90), Expect = 0.018
 Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 12/62 (19%)

Query: 536  RPVE-----KNRLDEVKAQVAAG-IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            RP E     + R  +   Q A    P   +G P  P PP      P PP    P  PPPP
Sbjct: 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPS-----PLPPDTHAP-DPPPP 2629

Query: 590  MP 591
             P
Sbjct: 2630 SP 2631



 Score = 38.8 bits (90), Expect = 0.020
 Identities = 13/45 (28%), Positives = 16/45 (35%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
             A  P   +G    P    G  GPPPPP    P     P   ++
Sbjct: 254 APAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVW 298



 Score = 38.0 bits (88), Expect = 0.027
 Identities = 20/48 (41%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 545  EVKAQVAAGIPTGPKGGPPPPP-PPPGGMGPPPPPMPGMPGPPPPPMP 591
            +  A V A     P    P  P PPP    P  PP P  PGPPPP +P
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP--PGPPPPSLP 2851



 Score = 38.0 bits (88), Expect = 0.029
 Identities = 14/47 (29%), Positives = 15/47 (31%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDC 597
                   P    PP   P G + PP    P  P PPP P P      
Sbjct: 2807 PPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853



 Score = 37.6 bits (87), Expect = 0.036
 Identities = 18/44 (40%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 548  AQVAAGIPTGP--KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            A     +P GP   GGP  P  PP   GPP P  P  P   PP 
Sbjct: 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781



 Score = 37.2 bits (86), Expect = 0.054
 Identities = 15/32 (46%), Positives = 15/32 (46%), Gaps = 5/32 (15%)

Query: 560  GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            G PPPP PP       PP  P    PPP P P
Sbjct: 2549 GDPPPPLPP-----AAPPAAPDRSVPPPRPAP 2575



 Score = 37.2 bits (86), Expect = 0.058
 Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 4/30 (13%)

Query: 563  PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            PPPPPP     P P P   +   P PP P 
Sbjct: 2701 PPPPPPT----PEPAPHALVSATPLPPGPA 2726



 Score = 36.5 bits (84), Expect = 0.088
 Identities = 12/27 (44%), Positives = 13/27 (48%), Gaps = 2/27 (7%)

Query: 562  PPPPPPPPGGMGPPPPPMPGMPGPPPP 588
            PPPPPP P     P   +   P PP P
Sbjct: 2701 PPPPPPTPEP--APHALVSATPLPPGP 2725



 Score = 36.1 bits (83), Expect = 0.12
 Identities = 14/42 (33%), Positives = 14/42 (33%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            A   P  P    P PPPP        P     P  PPP  P 
Sbjct: 2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653



 Score = 35.7 bits (82), Expect = 0.14
 Identities = 13/34 (38%), Positives = 13/34 (38%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
               P   P PPPPPP    PP  P     G   P
Sbjct: 2922 QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955



 Score = 35.7 bits (82), Expect = 0.16
 Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 3/96 (3%)

Query: 485 LVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLD 544
           L E L E+S+   D      +    E ++ R +A A   + ++     +  R V +  LD
Sbjct: 665 LSEWLLERSRAGPDTLFAPATPTHPEPLLQRVQALAGFARGEEIRAA-AEDREV-RGALD 722

Query: 545 EVKAQV-AAGIPTGPKGGPPPPPPPPGGMGPPPPPM 579
            +   V A     GP    P P  P G   P PPP+
Sbjct: 723 ALARGVDAVARRAGPLTVAPAPAAPGGQGAPRPPPL 758



 Score = 35.3 bits (81), Expect = 0.18
 Identities = 18/54 (33%), Positives = 21/54 (38%)

Query: 537  PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
            P   +  D   A +A      P   P  P PPP    P  PP P  P PP  P+
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852



 Score = 35.3 bits (81), Expect = 0.19
 Identities = 26/108 (24%), Positives = 31/108 (28%), Gaps = 41/108 (37%)

Query: 525  AQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPP--PPPPGGMGP-------- 574
                L +L  G PV +   +      A   P    GGPP P  PP P  + P        
Sbjct: 2466 LSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPV 2525

Query: 575  -------------------------PPPPMPGMPGPP------PPPMP 591
                                     PPPP+P    P       PPP P
Sbjct: 2526 GEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRP 2573



 Score = 35.3 bits (81), Expect = 0.19
 Identities = 12/38 (31%), Positives = 13/38 (34%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P      PP P PPP     P PP+     P     P 
Sbjct: 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956



 Score = 35.3 bits (81), Expect = 0.22
 Identities = 15/46 (32%), Positives = 16/46 (34%), Gaps = 5/46 (10%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPPP----MPGMPGPPP-PPMP 591
            A  +P GP       P  P    PP  P     PG P  P  PP  
Sbjct: 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763



 Score = 34.5 bits (79), Expect = 0.37
 Identities = 18/64 (28%), Positives = 21/64 (32%), Gaps = 13/64 (20%)

Query: 546 VKAQVAAGIPTGPKGG-PPPPPPPPGGMGPPPPPMPG------------MPGPPPPPMPE 592
           V  + A   P   +G   PPPPP          P  G            +P PP PP P 
Sbjct: 260 VVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA 319

Query: 593 MYTD 596
              D
Sbjct: 320 PAGD 323



 Score = 34.1 bits (78), Expect = 0.48
 Identities = 10/34 (29%), Positives = 11/34 (32%)

Query: 558  PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            P    P P P       P PP P       P +P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALP 2736



 Score = 33.8 bits (77), Expect = 0.55
 Identities = 10/39 (25%), Positives = 13/39 (33%), Gaps = 1/39 (2%)

Query: 555  PTGP-KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P G  +  PP   P      P  PP+  +  P      E
Sbjct: 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896



 Score = 33.8 bits (77), Expect = 0.58
 Identities = 12/37 (32%), Positives = 12/37 (32%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            P  P    P PPPPP     PP      P     P  
Sbjct: 2921 PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957



 Score = 33.8 bits (77), Expect = 0.59
 Identities = 15/48 (31%), Positives = 15/48 (31%), Gaps = 6/48 (12%)

Query: 548  AQVAAGIPTGPKGGPPPPPPP-----PGGMGPPP-PPMPGMPGPPPPP 589
               A          P PP  P     PGG   P  PP    P  P PP
Sbjct: 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP 2772



 Score = 33.8 bits (77), Expect = 0.66
 Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 8/43 (18%)

Query: 555  PTGPKGGPPPPPPPPGGMGPP--------PPPMPGMPGPPPPP 589
            P  P   P P         PP         P +P  P PP  P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP 2745



 Score = 33.4 bits (76), Expect = 0.67
 Identities = 15/65 (23%), Positives = 20/65 (30%), Gaps = 8/65 (12%)

Query: 530  EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP-----PPPPMPGMPG 584
             D  +   V + R    +    A   + P   P      P  +G       PPP P  P 
Sbjct: 2653 RDDPAPGRVSRPR--RARRLGRAAQASSPPQRPRRRAARPT-VGSLTSLADPPPPPPTPE 2709

Query: 585  PPPPP 589
            P P  
Sbjct: 2710 PAPHA 2714



 Score = 33.4 bits (76), Expect = 0.84
 Identities = 14/40 (35%), Positives = 14/40 (35%), Gaps = 2/40 (5%)

Query: 555  PTGPKGGPP--PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P  P   PP    P PP     P    P    PP  P PE
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650



 Score = 33.0 bits (75), Expect = 1.1
 Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 11/52 (21%)

Query: 548  AQVAAGIPTGPK-------GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            ++  A  P  P          P           PP  P      PP P  P 
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----PPQPQAPP 2915



 Score = 32.6 bits (74), Expect = 1.3
 Identities = 23/68 (33%), Positives = 23/68 (33%), Gaps = 6/68 (8%)

Query: 523  VQAQKTLEDLSSGRPVEKNRLD-----EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP 577
              A   L D   G PV    L      E  A   AG P  P   P  PP  P    PPP 
Sbjct: 2514 RLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPP-PPLPPAAPPAAPDRSVPPPR 2572

Query: 578  PMPGMPGP 585
            P P    P
Sbjct: 2573 PAPRPSEP 2580



 Score = 32.6 bits (74), Expect = 1.4
 Identities = 9/31 (29%), Positives = 10/31 (32%)

Query: 555  PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
            P  P   P PP  P         P   +P P
Sbjct: 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962



 Score = 32.2 bits (73), Expect = 1.7
 Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 1/47 (2%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPPMP 591
           V+ +V    P       P PPP  G G    P    G  GPPPPP  
Sbjct: 238 VERRVVISHPLRGDIAAPAPPPVVGEGADRAPETARGATGPPPPPEA 284



 Score = 32.2 bits (73), Expect = 1.9
 Identities = 12/45 (26%), Positives = 13/45 (28%), Gaps = 4/45 (8%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
            A   P  P  GPP     P            P P  P  PP  + 
Sbjct: 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVL 2812



 Score = 31.4 bits (71), Expect = 3.2
 Identities = 13/41 (31%), Positives = 14/41 (34%), Gaps = 3/41 (7%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                P GP   P  PP   G   P PP  P     PP  + 
Sbjct: 2747 GPATPGGPAR-PARPPTTAGPPAPAPPAAP--AAGPPRRLT 2784



 Score = 31.4 bits (71), Expect = 3.4
 Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 7/47 (14%)

Query: 551  AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG--MPGPP----PPPMP 591
            A  +P      P  PPPPP G  PP  P+ G   PG      PP   
Sbjct: 2825 AGPLPPPTSAQPTAPPPPP-GPPPPSLPLGGSVAPGGDVRRRPPSRS 2870



 Score = 31.1 bits (70), Expect = 4.2
 Identities = 14/43 (32%), Positives = 15/43 (34%), Gaps = 6/43 (13%)

Query: 555  PTGPKGGPPPPPPPPGGMGP------PPPPMPGMPGPPPPPMP 591
            P  P G PPP  P  G + P       PP       P  P  P
Sbjct: 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881



 Score = 30.3 bits (68), Expect = 6.2
 Identities = 10/57 (17%), Positives = 12/57 (21%), Gaps = 2/57 (3%)

Query: 537  PVEKNRLDEVKAQVAAGIPTGPKG--GPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            P    R            P  P      PP       +         +P P  P  P
Sbjct: 2751 PGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807



 Score = 29.9 bits (67), Expect = 7.7
 Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 7/51 (13%)

Query: 548  AQVAAGIPTGPKGGPPPPPPPPGGM-------GPPPPPMPGMPGPPPPPMP 591
            ++ +   P  P   P     P   +       GP PPP    P  PPPP  
Sbjct: 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844



 Score = 29.9 bits (67), Expect = 8.3
 Identities = 14/49 (28%), Positives = 16/49 (32%), Gaps = 11/49 (22%)

Query: 555  PTGPK--------GGPPPP--PPPPGG-MGPPPPPMPGMPGPPPPPMPE 592
            P+ P           PP    P  P    G P  P P  P PP    P+
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625



 Score = 29.9 bits (67), Expect = 8.7
 Identities = 16/46 (34%), Positives = 16/46 (34%), Gaps = 5/46 (10%)

Query: 551  AAGIPTGPKGGPPP-PPPPPGG----MGPPPPPMPGMPGPPPPPMP 591
            AA     P     P PP  P G     GP  P  P     PP P P
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771



 Score = 29.9 bits (67), Expect = 9.5
 Identities = 10/41 (24%), Positives = 14/41 (34%)

Query: 548  AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
            ++    +P+      PP          PP   P  P PPP 
Sbjct: 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT 2832


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 43.0 bits (101), Expect = 6e-04
 Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 10/60 (16%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPP----------GGMGPPPPPMPGMPGPPPPPMPEM 593
           +   A+    +  GP+  P PP   P          G    P PP+   P P P P+ E+
Sbjct: 148 NSTAARAVVFLVVGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAPHPHPIAEV 207


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 41.4 bits (97), Expect = 7e-04
 Identities = 14/32 (43%), Positives = 15/32 (46%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           G  PPPP     +   P P P M GP  P MP
Sbjct: 39  GPRPPPPYFNSSVASSPQPHPYMWGPQQPMMP 70



 Score = 35.6 bits (82), Expect = 0.068
 Identities = 18/46 (39%), Positives = 19/46 (41%), Gaps = 10/46 (21%)

Query: 557 GPKGGPPP--------PPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
           GP+  PP         P P P   GP  P MP  P   PPP   MY
Sbjct: 39  GPRPPPPYFNSSVASSPQPHPYMWGPQQPMMP--PYGTPPPYAAMY 82



 Score = 32.1 bits (73), Expect = 0.76
 Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 7/58 (12%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPP--PG-----GMGPPPPPMPGMPGPPPPPMPEMY 594
           +E K   ++   +  +  P P   P         G  PPPP         P P P M+
Sbjct: 5   EEGKPSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMW 62



 Score = 32.1 bits (73), Expect = 0.90
 Identities = 26/96 (27%), Positives = 34/96 (35%), Gaps = 15/96 (15%)

Query: 558 PKGGPPPPPP--PPGGMG--PPPPPMPGMPGPPPPPMPE------MYTDCHKNIINGKQA 607
           P G PPP     PPGG+   P  PP      P   P  E      +  +      + K  
Sbjct: 71  PYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAKSSDNKDK 130

Query: 608 CEEVKQSKKLAKILELILLMGNYMN----SGSRNGG 639
              +K+SK     L L +   N  +      S NGG
Sbjct: 131 G-SIKKSKGSDGSLGLAMSGKNGESGKASGSSANGG 165


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
           P  P+   PP P  P  M P PP  P  P PP PP+P M+
Sbjct: 99  PFQPQPLQPPQPQQP--MQPQPPVHPIPPLPPQPPLPPMF 136



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 2/31 (6%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           PP  P PP    PP PPM  M   P PP+  
Sbjct: 118 PPVHPIPPLPPQPPLPPMFPMQ--PLPPLLP 146



 Score = 39.0 bits (91), Expect = 0.004
 Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 5/40 (12%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           +   P   P PP PP     PP PPM  M  P PP +P++
Sbjct: 114 MQPQPPVHPIPPLPPQ----PPLPPMFPMQ-PLPPLLPDL 148



 Score = 38.6 bits (90), Expect = 0.004
 Identities = 15/39 (38%), Positives = 15/39 (38%), Gaps = 4/39 (10%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P  P     P  P P    P  P  P  P PP PPM  M
Sbjct: 104 PLQPPQPQQPMQPQP----PVHPIPPLPPQPPLPPMFPM 138



 Score = 38.2 bits (89), Expect = 0.007
 Identities = 15/45 (33%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query: 555 PTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMPEMYTDCH 598
              P+   P  P PP    PP PP  P  P  P  P+P +  D  
Sbjct: 105 LQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLP 149



 Score = 32.5 bits (74), Expect = 0.51
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
           P PP PP   M P PP +P +P    P  
Sbjct: 128 PQPPLPPMFPMQPLPPLLPDLPLEAWPAT 156



 Score = 32.5 bits (74), Expect = 0.56
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P   P  P  + PP P  P  P PP  P+P +
Sbjct: 95  PAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPL 126



 Score = 31.3 bits (71), Expect = 1.4
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 1/38 (2%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPM-PGMPGPPPPPMP 591
           P  P+    P  P P     P  PM P  P  P PP+P
Sbjct: 90  PNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLP 127



 Score = 29.4 bits (66), Expect = 4.8
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P P   P    P  PP P  P  P PP+  +
Sbjct: 93  PQPAQQPFQPQPLQPPQPQQPMQPQPPVHPI 123


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 43.2 bits (102), Expect = 8e-04
 Identities = 16/43 (37%), Positives = 16/43 (37%), Gaps = 4/43 (9%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPP----MPGMPGPPPPPMPEM 593
           P      PPPP PPP    P P P    M    G P PP    
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAAS 151



 Score = 41.7 bits (98), Expect = 0.002
 Identities = 14/39 (35%), Positives = 14/39 (35%), Gaps = 3/39 (7%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            A         PP P  P     PPP P P  P P P P
Sbjct: 97  PASPAREGSPTPPGPSSPDP---PPPTPPPASPPPSPAP 132



 Score = 40.5 bits (95), Expect = 0.005
 Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCH 598
           +  P GP   P PPPP P    PPP P P +     P          
Sbjct: 105 SPTPPGP-SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA 150



 Score = 37.5 bits (87), Expect = 0.037
 Identities = 15/38 (39%), Positives = 15/38 (39%), Gaps = 3/38 (7%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           PT P   PPP P P            G PGPPP   P 
Sbjct: 119 PTPPPASPPPSPAPD---LSEMLRPVGSPGPPPAASPP 153



 Score = 36.7 bits (85), Expect = 0.066
 Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 12/62 (19%)

Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP-----P 589
           GR   KN     +       P  P+  P    PP      PP   P     P P     P
Sbjct: 863 GRARGKNGRRRPR-------PPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGP 915

Query: 590 MP 591
           MP
Sbjct: 916 MP 917



 Score = 35.5 bits (82), Expect = 0.17
 Identities = 14/61 (22%), Positives = 17/61 (27%)

Query: 531 DLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
             +   P   +   + K   A G   G  G   P PP P        P       PP   
Sbjct: 841 AAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGA 900

Query: 591 P 591
           P
Sbjct: 901 P 901



 Score = 34.8 bits (80), Expect = 0.30
 Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 2/39 (5%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPP 587
           AA  P      PP   P P     P   +  MP  GP P
Sbjct: 885 AAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923



 Score = 34.8 bits (80), Expect = 0.31
 Identities = 10/41 (24%), Positives = 11/41 (26%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            A +  GP     P P  P     P  P         P  P
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378



 Score = 34.4 bits (79), Expect = 0.37
 Identities = 11/50 (22%), Positives = 14/50 (28%), Gaps = 1/50 (2%)

Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP-PPMPGMPGPPPPP 589
             L     + +       + GP      P    P P P  PG    P  P
Sbjct: 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP 312



 Score = 33.2 bits (76), Expect = 0.76
 Identities = 11/52 (21%), Positives = 13/52 (25%)

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           +     A   A    G +   PP P    G   PP      P    P     
Sbjct: 855 KSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPR 906



 Score = 33.2 bits (76), Expect = 0.83
 Identities = 11/41 (26%), Positives = 12/41 (29%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
           A +    P      P  PP  P    PP    P  P    P
Sbjct: 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212



 Score = 33.2 bits (76), Expect = 0.88
 Identities = 11/41 (26%), Positives = 11/41 (26%), Gaps = 4/41 (9%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMP----GPPPPPMP 591
                    PPP  P     P PP    P       P P P
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAP 224



 Score = 32.8 bits (75), Expect = 1.0
 Identities = 10/57 (17%), Positives = 15/57 (26%), Gaps = 1/57 (1%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP-PPPMPGMPGPPPPPMP 591
            P+ +     +  ++            P P           P P P  PG  P P  
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSS 311



 Score = 32.8 bits (75), Expect = 1.0
 Identities = 9/59 (15%), Positives = 15/59 (25%), Gaps = 1/59 (1%)

Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPK-GGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
             S      +     ++  ++   +     G    P P     P P   P    P  P 
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPR 366



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 15/55 (27%), Positives = 15/55 (27%), Gaps = 15/55 (27%)

Query: 552 AGIPTGPKGGPPPPPPP---------------PGGMGPPPPPMPGMPGPPPPPMP 591
              PTGP  GP    P                P        P P  P  P PP P
Sbjct: 65  FEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP 119



 Score = 30.9 bits (70), Expect = 3.8
 Identities = 14/57 (24%), Positives = 17/57 (29%), Gaps = 9/57 (15%)

Query: 545 EVKAQVAAGIPTGPKGGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
              A VA+   +  +   P          P  PP    P  PP    P PP    P 
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPI 213


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 41.0 bits (96), Expect = 0.001
 Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 6/49 (12%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGM------GPPPPPMPGMPGPPPPPMPEM 593
           A+G P     GP     P   M       P   P P M  P P  M  M
Sbjct: 55  ASGRPRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASM 103



 Score = 38.3 bits (89), Expect = 0.009
 Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 1/38 (2%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGP-PPPPMPGMPGPPPPP 589
           G    P G P    P     G   PP M  MP P  PP
Sbjct: 114 GHAGAPAGHPGGGGPQQFRPGAGQPPGMQHMPAPALPP 151



 Score = 30.2 bits (68), Expect = 4.1
 Identities = 14/50 (28%), Positives = 16/50 (32%), Gaps = 1/50 (2%)

Query: 566 PPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK-NIINGKQACEEVKQS 614
             P G  G   P         PP M  M       N+I+     EEV  S
Sbjct: 117 GAPAGHPGGGGPQQFRPGAGQPPGMQHMPAPALPPNVIDTDLIDEEVLMS 166



 Score = 29.1 bits (65), Expect = 8.4
 Identities = 12/43 (27%), Positives = 12/43 (27%), Gaps = 4/43 (9%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMPEM 593
             G      P     G MG   P    P   M  P   P P M
Sbjct: 50  GGGMDASGRPRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFM 92


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 40.6 bits (95), Expect = 0.002
 Identities = 23/66 (34%), Positives = 30/66 (45%), Gaps = 16/66 (24%)

Query: 542 RLDEVKAQVA--------AGIPTGPKGGPPPPP-PPPGGMGPPPP------PMPGMP-GP 585
           +++EV+ +V         A  P  P+  P  PP P PGG+ P  P      P   MP G 
Sbjct: 104 KVEEVRKEVEFFNNYLTDAKRPALPEIKPLQPPKPDPGGLAPGLPGYPPQTPQALMPYGQ 163

Query: 586 PPPPMP 591
           P PPM 
Sbjct: 164 PRPPMM 169



 Score = 36.7 bits (85), Expect = 0.032
 Identities = 17/41 (41%), Positives = 18/41 (43%), Gaps = 2/41 (4%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPPPPMP 591
           G+  G  G PP  P      G P PPM G    GPP PP  
Sbjct: 142 GLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQ 182


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 41.8 bits (98), Expect = 0.002
 Identities = 20/56 (35%), Positives = 22/56 (39%), Gaps = 4/56 (7%)

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP--GPPPPPMPEMYTD 596
           LD+V       + T P G     PPP  G GPP PP    P     PP  P    D
Sbjct: 891 LDQVFGPGTGRVATAPGGDAASAPPP--GAGPPAPPQAVPPPRTTQPPAAPPRGPD 944



 Score = 33.3 bits (76), Expect = 0.85
 Identities = 14/46 (30%), Positives = 17/46 (36%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
              A+  P G     PP   PP     PP   P  P  PP  + E+
Sbjct: 908 GDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAEL 953


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 39.1 bits (91), Expect = 0.002
 Identities = 11/37 (29%), Positives = 13/37 (35%), Gaps = 5/37 (13%)

Query: 562 PPPPPP-----PPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P  PP       P  + PP   MP    P P P   +
Sbjct: 2   PDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYL 38



 Score = 33.7 bits (77), Expect = 0.14
 Identities = 17/51 (33%), Positives = 18/51 (35%), Gaps = 5/51 (9%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMP-GPPPPPMP 591
           V  Q      + P  GP    P    M   P    PPM   P G PPP  P
Sbjct: 18  VPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYP 68



 Score = 32.9 bits (75), Expect = 0.27
 Identities = 8/36 (22%), Positives = 10/36 (27%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P    PP    P    P P P   +P      +   
Sbjct: 14  PSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQ 49



 Score = 31.7 bits (72), Expect = 0.64
 Identities = 10/38 (26%), Positives = 10/38 (26%), Gaps = 5/38 (13%)

Query: 562 PPPPPPPPGGMGPPPPPM-----PGMPGPPPPPMPEMY 594
           PP    P      P P M       M   P    P M 
Sbjct: 19  PPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMA 56


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 39.6 bits (92), Expect = 0.003
 Identities = 20/54 (37%), Positives = 24/54 (44%), Gaps = 5/54 (9%)

Query: 534 SGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
                 ++ L    +  + G P     G PPP  PP  M  PPPP PG PG PP
Sbjct: 131 PSEGSVRSSLSHPSSHSSYGAP-----GVPPPYNPPMLMMKPPPPSPGPPGAPP 179



 Score = 37.3 bits (86), Expect = 0.017
 Identities = 14/40 (35%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-PPPMPEM 593
            +    G P  PPP         P P  PGPP  PP+ E+
Sbjct: 144 SSHSSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPPVREL 183


>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
           (DUF2360).  This is the conserved 140 amino acid region
           of a family of proteins conserved from nematodes to
           humans. One C. elegans member is annotated as a
           Daf-16-dependent longevity protein 1 but this could not
           be confirmed. The function is unknown.
          Length = 147

 Score = 38.5 bits (90), Expect = 0.004
 Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 19/94 (20%)

Query: 506 AKLEEAIMLRQEAEAKLV------QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPK 559
           +KL EA    Q+ E  L        +   LED++                          
Sbjct: 21  SKLVEAERSLQQLETTLSLLEAKLASIPGLEDVTVQTTP------------PPPASAITN 68

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           GGPPPPPP       PPPP      PP  P PE 
Sbjct: 69  GGPPPPPPARAEAASPPPPEAPAE-PPAEPEPEA 101



 Score = 33.5 bits (77), Expect = 0.18
 Identities = 24/79 (30%), Positives = 32/79 (40%), Gaps = 6/79 (7%)

Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP 576
             E+KLV+A+++L+ L +        L E K     G+        PPPP      G PP
Sbjct: 18  VCESKLVEAERSLQQLET-----TLSLLEAKLASIPGLEDVTVQTTPPPPASAITNGGPP 72

Query: 577 PPMPGMPGPPPPPMPEMYT 595
           PP P       PP P    
Sbjct: 73  PPPPARAEAASPP-PPEAP 90


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 39.9 bits (93), Expect = 0.006
 Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 3/51 (5%)

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           R++ ++ ++   IP       P   P P       P     P P P P+  
Sbjct: 375 RVERIETRLDMSIPANLLHNAPQAAPAP---SAAAPEPKHQPAPEPRPVLA 422



 Score = 31.8 bits (72), Expect = 2.0
 Identities = 11/40 (27%), Positives = 12/40 (30%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
               A  AA  P+     P   P P       P P  G P
Sbjct: 391 LLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEP 430



 Score = 30.2 bits (68), Expect = 5.7
 Identities = 11/33 (33%), Positives = 11/33 (33%), Gaps = 4/33 (12%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           G P     P GG    PP  P      PPP   
Sbjct: 511 GKPAAAASPAGGGANAPPAKP----VKPPPSCL 539


>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal. 
          Length = 313

 Score = 39.0 bits (91), Expect = 0.007
 Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 4/30 (13%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
            + +P      P  PPPPP    PPPPP  
Sbjct: 219 KSALPAVSSSAPSAPPPPP----PPPPPSV 244



 Score = 35.2 bits (81), Expect = 0.15
 Identities = 12/37 (32%), Positives = 14/37 (37%), Gaps = 4/37 (10%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
               A+ A    +      PPPPPPP    PPP    
Sbjct: 214 PVAAAKSALPAVSSSAPSAPPPPPPP----PPPSVPT 246



 Score = 30.9 bits (70), Expect = 3.0
 Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 6/39 (15%)

Query: 555 PTGPKGGPPP--PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
             GP        P         PPPP    P PPPP +P
Sbjct: 211 KKGPVAAAKSALPAVSSSAPSAPPPP----PPPPPPSVP 245


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 39.3 bits (92), Expect = 0.011
 Identities = 16/77 (20%), Positives = 18/77 (23%), Gaps = 5/77 (6%)

Query: 518 AEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP-- 575
           AEA     +KT     +  P          A   A  P        P  PP      P  
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP--AAAASAPAAPPAAAPPAPVA 427

Query: 576 -PPPMPGMPGPPPPPMP 591
            P        P   P  
Sbjct: 428 APAAAAPAAAPAAAPAA 444



 Score = 33.5 bits (77), Expect = 0.61
 Identities = 14/57 (24%), Positives = 15/57 (26%)

Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P EK      +A   A  P       P P   P      P   P    P P   P  
Sbjct: 375 PAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAA 431



 Score = 31.2 bits (71), Expect = 2.8
 Identities = 10/58 (17%), Positives = 11/58 (18%), Gaps = 2/58 (3%)

Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPP--PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
                    V    A      P+    P    P P      P P          P  E
Sbjct: 436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEE 493



 Score = 30.8 bits (70), Expect = 3.7
 Identities = 10/81 (12%), Positives = 15/81 (18%), Gaps = 8/81 (9%)

Query: 505 SAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPP 564
            A +         A      A   L      +   +     V+          P+     
Sbjct: 423 PAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVA--------PEPAVAS 474

Query: 565 PPPPPGGMGPPPPPMPGMPGP 585
             P P          P   G 
Sbjct: 475 AAPAPAAAPAAARLTPTEEGD 495


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 38.2 bits (89), Expect = 0.014
 Identities = 34/120 (28%), Positives = 49/120 (40%), Gaps = 20/120 (16%)

Query: 478 FQLDVQPLVEHLA-EKSKTEEDRRVED-----LSAKLEEAIMLRQEAEAKLVQAQKTLED 531
            +L  QP V +L  E+ + +  +   +     +SA  +E   LR +  AK+V AQ   E 
Sbjct: 92  MKLSRQPSVPNLRQEEPRLQSLKPPNEEALQKISALEDELAFLRAQI-AKIVAAQ---EQ 147

Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            +S      +  + V +      P        PP   P    PPPPP P    PPPPP  
Sbjct: 148 SNSTTSDLLSSDESVPSSSTTSFP------ISPPTEEPVLEVPPPPPPP----PPPPPPS 197


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 38.3 bits (89), Expect = 0.018
 Identities = 25/105 (23%), Positives = 38/105 (36%), Gaps = 17/105 (16%)

Query: 489 LAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKA 548
           +  K + ++ RR+E L         L ++ +A            ++G PV +        
Sbjct: 94  VLNKQRGDDQRRIEKLGQDNAA---LAEQVKALGANPV-----TATGEPVPQM------P 139

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM---PGMPGPPPPPM 590
               G    P+ G  P   PP G    PPP    PG    PPP +
Sbjct: 140 ASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQV 184



 Score = 36.7 bits (85), Expect = 0.061
 Identities = 19/92 (20%), Positives = 36/92 (39%), Gaps = 17/92 (18%)

Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAA-----------GIPTGPKGGPPPP 565
           + + +  + ++ L+ L+  R  ++ R++++    AA            +    +  P  P
Sbjct: 80  QMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPVPQMP 139

Query: 566 PPPPGGMGPPPPPMPGMPGPP------PPPMP 591
             PPG  G P P    +  PP      PPP  
Sbjct: 140 ASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTA 171


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 38.0 bits (88), Expect = 0.018
 Identities = 11/41 (26%), Positives = 11/41 (26%), Gaps = 7/41 (17%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPM-------PGMPGPPPP 588
           PT      PP    P        P         G P PP P
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262



 Score = 37.6 bits (87), Expect = 0.030
 Identities = 8/42 (19%), Positives = 9/42 (21%), Gaps = 3/42 (7%)

Query: 553 GIPTGPKGGPPPP---PPPPGGMGPPPPPMPGMPGPPPPPMP 591
             P         P   P P     PP   +P        P  
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQA 247



 Score = 33.0 bits (75), Expect = 0.78
 Identities = 9/35 (25%), Positives = 9/35 (25%), Gaps = 4/35 (11%)

Query: 562 PPPPPPPPGGMGPP----PPPMPGMPGPPPPPMPE 592
           P  P P P     P     P     P     P P 
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPS 239



 Score = 30.3 bits (68), Expect = 5.6
 Identities = 11/47 (23%), Positives = 12/47 (25%), Gaps = 5/47 (10%)

Query: 551 AAGIPTGPKGGPPPP-----PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            +     P      P     P   G   PP P     P     P PE
Sbjct: 231 PSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE 277


>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP).  This entry
           represents proteins encoded by genes which are always
           found in type III secretion operons, although their
           function in the processes of secretion and virulence is
           unclear. Hpa stands for Hrp-associated gene, where Hrp
           stands for hypersensitivity response and virulence. see
           also PMID:18584024.
          Length = 185

 Score = 37.1 bits (86), Expect = 0.020
 Identities = 13/35 (37%), Positives = 13/35 (37%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
             GP G P PP P        P      P PP PP
Sbjct: 25  RAGPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPP 59



 Score = 35.1 bits (81), Expect = 0.078
 Identities = 14/34 (41%), Positives = 14/34 (41%), Gaps = 2/34 (5%)

Query: 561 GPPPPPPPPGGMGPPPPPMPGMPG--PPPPPMPE 592
           GPP  P PPG      P  P  P   P PP  P 
Sbjct: 27  GPPGTPAPPGPAEDAHPEFPERPRDAPAPPAPPR 60



 Score = 29.3 bits (66), Expect = 5.6
 Identities = 8/35 (22%), Positives = 8/35 (22%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P        P  P      P PP        P P 
Sbjct: 37  PAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPL 71


>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 38.0 bits (88), Expect = 0.023
 Identities = 46/211 (21%), Positives = 78/211 (36%), Gaps = 51/211 (24%)

Query: 411 LLFIRDDQ-NMVMDSACEPYLLSILQHLLFIRDDQYVRLAYY-KLLEECVSQIVLHRGGC 468
           L +I D+Q   V D+  E   + +L+   F+ D+ Y    +  KL+E   + +VL+    
Sbjct: 254 LNYIDDEQFFDVTDAIAENDAVKMLEVARFVIDNGYDEQDFLEKLIEHFRNFLVLY---- 309

Query: 469 DPDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKT 528
             + RS+R            L E+ ++ ++R  +  +  L  A +++      L++ Q  
Sbjct: 310 --NLRSTR------------LIERPESVKERYQQS-AGGLTPAAVMQMTD--FLMKTQGE 352

Query: 529 LE---------DLSSGRPVEKNRLD-EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
           L+         +L+  R +E  R D  V    A       K   P  P P    GP  P 
Sbjct: 353 LKFQFEYQFRFELALLRLIELVRNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPE 412

Query: 579 MPGM------------------PGPPPPPMP 591
            PG                   P     P+P
Sbjct: 413 APGARPAELPSPASAPTPEQQPPVARSAPLP 443


>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
          Length = 230

 Score = 37.0 bits (85), Expect = 0.023
 Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 16/67 (23%)

Query: 558 PKGGPPPPPPPPGGMGPPPP-------PMPGM---------PGPPPPPMPEMYTDCHKNI 601
             G P PPP   GG G  PP       P  G          P  PP P P+ +     + 
Sbjct: 133 LHGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPPAPEPKPHNGDATHG 192

Query: 602 INGKQAC 608
           + G  AC
Sbjct: 193 LLGTVAC 199



 Score = 32.8 bits (74), Expect = 0.60
 Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 11/41 (26%)

Query: 556 TGPKGGPPPPPPPPGGMG-----PPPPPMPGMPGPPPPPMP 591
             P+GG    PP   G G     PPP P      P P P P
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHPPPLP------PAPEPKP 184



 Score = 31.6 bits (71), Expect = 1.6
 Identities = 25/93 (26%), Positives = 32/93 (34%), Gaps = 25/93 (26%)

Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAG--------IPTGPKGGPPPPP--------- 566
           QA    +     RP ++++ DE                +   P  G P PP         
Sbjct: 90  QAPLAADADDDPRPGKRSKADEHGPAPGRAALAPFKLDLDQDPLHGDPDPPPGATGGQGE 149

Query: 567 -PPPGGMGPPPPP------MPGMPGPPPPPMPE 592
            PP GG    PP       + G P PP PP PE
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHP-PPLPPAPE 181


>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763).  This
           eukaryotic family of proteins has no known function.
          Length = 91

 Score = 34.7 bits (80), Expect = 0.032
 Identities = 17/50 (34%), Positives = 19/50 (38%), Gaps = 9/50 (18%)

Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPP------GGMGPPPPPMPG 581
           V K      K   +     G +GGP PP          GG GP  PPM G
Sbjct: 44  VTKRYR---KNSSSGKRWDGGRGGPGPPGGGRRMGRIGGGGGPSRPPMAG 90


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 37.0 bits (85), Expect = 0.035
 Identities = 19/41 (46%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           AG P G   G  P    P  M  PPPP P M  P P PMP+
Sbjct: 205 AGQPNGSGQGKVPNHFLPTPMLAPPPP-PPMARPVPLPMPD 244



 Score = 32.0 bits (72), Expect = 1.5
 Identities = 13/44 (29%), Positives = 15/44 (34%), Gaps = 18/44 (40%)

Query: 555 PTGPKGGPP------------------PPPPPPGGMGPPPPPMP 580
           P+  + G P                   PPPPP    P P PMP
Sbjct: 200 PSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPPMARPVPLPMP 243



 Score = 30.0 bits (67), Expect = 6.1
 Identities = 12/36 (33%), Positives = 13/36 (36%), Gaps = 3/36 (8%)

Query: 558 PKGGPPPPPPPPGGMGPPP---PPMPGMPGPPPPPM 590
                    P   G G  P    P P +  PPPPPM
Sbjct: 199 DPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPPPPPM 234


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 35.6 bits (82), Expect = 0.035
 Identities = 18/47 (38%), Positives = 20/47 (42%), Gaps = 7/47 (14%)

Query: 555 PTGPKGGPPPPPPPPGGMGPP--PPPMPGMPGP-----PPPPMPEMY 594
           P  P   P PPP P  G  P   PPP   +P P     PP P  + Y
Sbjct: 78  PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPY 124



 Score = 29.5 bits (66), Expect = 4.5
 Identities = 23/82 (28%), Positives = 27/82 (32%), Gaps = 19/82 (23%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPP-------------PPPGGMGPPPPPM--- 579
           R  E+ R D      +   P  P   PP PP             PPP G  P PPP    
Sbjct: 34  RERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQP 93

Query: 580 ---PGMPGPPPPPMPEMYTDCH 598
              P    PPP  +P      +
Sbjct: 94  GYNPADYPPPPGAVPPPQNYPY 115


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
           includes the B. subtilis YppG protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are typically
           between 115 and 181 amino acids in length. There are two
           completely conserved residues (F and G) that may be
           functionally important.
          Length = 110

 Score = 34.7 bits (80), Expect = 0.037
 Identities = 10/30 (33%), Positives = 10/30 (33%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           PPPP  PP        P    P P   P  
Sbjct: 30  PPPPYSPPQQQQGHFMPPQPQPYPKQSPQQ 59



 Score = 32.0 bits (73), Expect = 0.37
 Identities = 12/44 (27%), Positives = 13/44 (29%), Gaps = 6/44 (13%)

Query: 555 PTGPKGGPPPP---PPPPGGMGPPP---PPMPGMPGPPPPPMPE 592
              P+     P    P    M PPP   P        PP P P 
Sbjct: 9   QYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPY 52



 Score = 28.9 bits (65), Expect = 3.6
 Identities = 9/37 (24%), Positives = 11/37 (29%), Gaps = 3/37 (8%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
            P   +G   PP P P    P   P    P      +
Sbjct: 36  PPQQQQGHFMPPQPQPY---PKQSPQQQQPPQFSSFL 69


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 36.7 bits (85), Expect = 0.038
 Identities = 15/30 (50%), Positives = 15/30 (50%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           P PPPP P  M  PPPP    P  PP   P
Sbjct: 142 PQPPPPAPVVMMQPPPPHAMPPASPPAAQP 171



 Score = 34.4 bits (79), Expect = 0.19
 Identities = 14/44 (31%), Positives = 14/44 (31%), Gaps = 4/44 (9%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           A V    P  P   PP  PP        P   P    PP P  P
Sbjct: 148 APVVMMQPPPPHAMPPASPPAAQPAPSAPASSP----PPTPASP 187



 Score = 34.0 bits (78), Expect = 0.25
 Identities = 11/36 (30%), Positives = 12/36 (33%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCH 598
           PP  PP     P  P     P P  PP  +     H
Sbjct: 162 PPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSH 197



 Score = 30.6 bits (69), Expect = 3.8
 Identities = 11/34 (32%), Positives = 11/34 (32%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           P     PP  PP     P  P    P  P  P P
Sbjct: 156 PPPHAMPPASPPAAQPAPSAPASSPPPTPASPPP 189



 Score = 30.2 bits (68), Expect = 4.4
 Identities = 12/38 (31%), Positives = 12/38 (31%), Gaps = 1/38 (2%)

Query: 555 PTGPKGGPPPPPP-PPGGMGPPPPPMPGMPGPPPPPMP 591
           P      P  PP   P    P   P P    PPP   P
Sbjct: 156 PPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAP 193



 Score = 29.8 bits (67), Expect = 5.4
 Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 16/51 (31%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPP------------PMPGM----PGPPPPP 589
              P    P   PPP    PPP             PM G     P P  PP
Sbjct: 168 AAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSPMAGTFYRSPAPGEPP 218


>gnl|CDD|236511 PRK09429, mepA, penicillin-insensitive murein endopeptidase;
           Reviewed.
          Length = 275

 Score = 36.8 bits (86), Expect = 0.039
 Identities = 18/50 (36%), Positives = 19/50 (38%), Gaps = 15/50 (30%)

Query: 558 PKGGP---PPPPPPPG-GMG-----------PPPPPMPGMPGPPPPPMPE 592
           P G P      PPPPG G G            PP P       PPPP+P 
Sbjct: 217 PAGSPECEDQAPPPPGDGCGAELASWSDAPFEPPKPTTKPKPKPPPPLPP 266


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 37.4 bits (87), Expect = 0.042
 Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 1/48 (2%)

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
           L+EV+AQ+       P+   PP    P GM P     P    P  PP 
Sbjct: 156 LEEVEAQLQ-QRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPG 202



 Score = 34.0 bits (78), Expect = 0.46
 Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 3/46 (6%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           +Q  A  P  P     P  PPP      P     MP PPP P  + 
Sbjct: 224 SQAPAQPPLPP---QLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQ 266



 Score = 33.6 bits (77), Expect = 0.55
 Identities = 11/50 (22%), Positives = 13/50 (26%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGKQA 607
           P+G PP     P    P  PP    P    P   +             Q 
Sbjct: 181 PQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQP 230



 Score = 32.4 bits (74), Expect = 1.2
 Identities = 10/37 (27%), Positives = 13/37 (35%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
              P+   P    P     P  PP+P      PPP+ 
Sbjct: 208 QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQ 244



 Score = 32.4 bits (74), Expect = 1.2
 Identities = 19/84 (22%), Positives = 23/84 (27%), Gaps = 3/84 (3%)

Query: 508 LEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPP 567
           LEE     Q+ +      Q   + L  G P  +        Q     P G    P   P 
Sbjct: 156 LEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF---PQQGPPEQPPGYPQPPQGHPE 212

Query: 568 PPGGMGPPPPPMPGMPGPPPPPMP 591
                   P P      PP PP  
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQL 236



 Score = 32.0 bits (73), Expect = 1.8
 Identities = 16/61 (26%), Positives = 19/61 (31%), Gaps = 13/61 (21%)

Query: 553 GIPTGPKGGPPPPP------------PPPGGMGPPPPPMP-GMPGPPPPPMPEMYTDCHK 599
           G+       PP PP            PPP     P P +P G   P PPP         +
Sbjct: 250 GLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQ 309

Query: 600 N 600
            
Sbjct: 310 Q 310



 Score = 30.1 bits (68), Expect = 6.6
 Identities = 12/41 (29%), Positives = 12/41 (29%), Gaps = 5/41 (12%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP-----PPPM 590
              P   P  P        PPP P      PP     PPP 
Sbjct: 239 QPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQ 279


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 37.0 bits (86), Expect = 0.047
 Identities = 8/42 (19%), Positives = 9/42 (21%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
              A      P             +   P      P PPP P
Sbjct: 267 DAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNP 308


>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 767

 Score = 37.1 bits (86), Expect = 0.048
 Identities = 13/47 (27%), Positives = 14/47 (29%), Gaps = 1/47 (2%)

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           LDE    VAA         P P P       P    +   P   P P
Sbjct: 44  LDEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIA-APAAVPAP 89



 Score = 30.2 bits (68), Expect = 6.0
 Identities = 10/42 (23%), Positives = 14/42 (33%), Gaps = 1/42 (2%)

Query: 551 AAGIPTGPKGGPPP-PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
              +    +   PP P P P  +  P P    +  P   P P
Sbjct: 48  LGAVAASAQAYAPPAPAPLPAALVAPAPAAASIAAPAAVPAP 89


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 37.3 bits (87), Expect = 0.049
 Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 2/51 (3%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM--PGMPGPPPPPMPEM 593
              A   +     P   P P    P     P P       P P P P P  
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPS 443



 Score = 36.9 bits (86), Expect = 0.057
 Identities = 9/45 (20%), Positives = 9/45 (20%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                A          P P PP        P  P  P  P     
Sbjct: 464 AQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508



 Score = 36.9 bits (86), Expect = 0.061
 Identities = 15/78 (19%), Positives = 16/78 (20%), Gaps = 1/78 (1%)

Query: 515 RQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP 574
              A  +   A        S                AA     P   P   P P     P
Sbjct: 424 APAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ-PAPAPAAAPEPTAAPAP 482

Query: 575 PPPPMPGMPGPPPPPMPE 592
            PP  P     P  P   
Sbjct: 483 APPAAPAPAAAPAAPAAP 500



 Score = 36.1 bits (84), Expect = 0.089
 Identities = 10/45 (22%), Positives = 11/45 (24%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
              AA  P+     P   P P                P P P P 
Sbjct: 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA 436



 Score = 36.1 bits (84), Expect = 0.11
 Identities = 14/37 (37%), Positives = 14/37 (37%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
            G P  PPPP    P   P    P  PP    EM  D
Sbjct: 752 AGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788



 Score = 35.7 bits (83), Expect = 0.12
 Identities = 16/48 (33%), Positives = 17/48 (35%), Gaps = 4/48 (8%)

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPPMPE 592
           QV A +   P       PP P   GPP     P  P  P  P  P P 
Sbjct: 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA 630



 Score = 35.7 bits (83), Expect = 0.14
 Identities = 11/42 (26%), Positives = 12/42 (28%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           A+     P  P   P P  P       P  P P     PP  
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712



 Score = 35.3 bits (82), Expect = 0.17
 Identities = 12/40 (30%), Positives = 13/40 (32%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
             P G    PPPPP P     P   P P  P        +
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788



 Score = 35.3 bits (82), Expect = 0.18
 Identities = 5/42 (11%), Positives = 6/42 (14%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
                            P     P          P P   P+
Sbjct: 389 GGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQ 430



 Score = 34.6 bits (80), Expect = 0.28
 Identities = 8/51 (15%), Positives = 12/51 (23%), Gaps = 5/51 (9%)

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           RL+    ++   +      G P    P       P   P      P     
Sbjct: 377 RLE----RLERRLGVAGGAGAPAAAAPS-AAAAAPAAAPAPAAAAPAAAAA 422



 Score = 34.2 bits (79), Expect = 0.34
 Identities = 13/38 (34%), Positives = 13/38 (34%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P      P  PP    P P   P    PP PP  E
Sbjct: 745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782



 Score = 33.4 bits (77), Expect = 0.58
 Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 2/45 (4%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
              +P      P  PP P G    PPPP    P   P   P    
Sbjct: 736 DDPVP--LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778



 Score = 33.4 bits (77), Expect = 0.74
 Identities = 11/44 (25%), Positives = 12/44 (27%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
              +A  P      P PP P         P  P  P  P P   
Sbjct: 726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769



 Score = 33.0 bits (76), Expect = 0.79
 Identities = 13/46 (28%), Positives = 13/46 (28%), Gaps = 1/46 (2%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           A   A  P      P     P     P P P P  P P PP     
Sbjct: 403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA-PAPAPPSPAGN 447



 Score = 33.0 bits (76), Expect = 0.96
 Identities = 10/62 (16%), Positives = 12/62 (19%)

Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           E  ++  P         K         G  G P           PP P       P    
Sbjct: 638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA 697

Query: 590 MP 591
             
Sbjct: 698 PA 699



 Score = 33.0 bits (76), Expect = 0.98
 Identities = 11/49 (22%), Positives = 11/49 (22%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           D      A      P   PP P P          P    P P   P   
Sbjct: 664 DGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG 712



 Score = 32.3 bits (74), Expect = 1.6
 Identities = 11/53 (20%), Positives = 12/53 (22%), Gaps = 4/53 (7%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMPEM 593
                  +     P GG P PPP       P             P P P    
Sbjct: 436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAP 488



 Score = 31.9 bits (73), Expect = 1.8
 Identities = 12/45 (26%), Positives = 12/45 (26%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           A  AA  P          P P     P P P P    P P     
Sbjct: 405 APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAP 449



 Score = 31.9 bits (73), Expect = 2.1
 Identities = 13/38 (34%), Positives = 13/38 (34%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P   P P   P     PP P     P   PPP P 
Sbjct: 742 PPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779



 Score = 31.9 bits (73), Expect = 2.2
 Identities = 10/38 (26%), Positives = 10/38 (26%), Gaps = 1/38 (2%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P     P  P P G     P        P    PE
Sbjct: 615 PAAPAAPAAPAAPAPAG-AAAAPAEASAAPAPGVAAPE 651



 Score = 31.5 bits (72), Expect = 2.3
 Identities = 12/51 (23%), Positives = 13/51 (25%), Gaps = 4/51 (7%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP----GMPGPPPPPMPEM 593
               A G P+ P    P   P P     P P           P P   P  
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496



 Score = 31.5 bits (72), Expect = 2.5
 Identities = 11/47 (23%), Positives = 11/47 (23%), Gaps = 5/47 (10%)

Query: 551 AAGIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPGPPPPPMPE 592
            A    GP       PP     P     P  P  P   G    P   
Sbjct: 593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEA 639



 Score = 31.5 bits (72), Expect = 2.8
 Identities = 14/50 (28%), Positives = 15/50 (30%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
               A  A      P+  P P P P         P  G P PPP   P  
Sbjct: 415 AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464



 Score = 31.1 bits (71), Expect = 3.5
 Identities = 10/46 (21%), Positives = 11/46 (23%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           A  AA  P      P  P         P P      G    P  + 
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721



 Score = 31.1 bits (71), Expect = 3.6
 Identities = 12/43 (27%), Positives = 13/43 (30%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            A      P   P P P PP   G  P      P P   P  +
Sbjct: 423 PAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465



 Score = 30.7 bits (70), Expect = 4.0
 Identities = 12/45 (26%), Positives = 12/45 (26%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           A  A      P     P P P     P PP   G       P P 
Sbjct: 413 AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPP 457



 Score = 30.7 bits (70), Expect = 4.1
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           A   A  P  P       P  P       PP      P   P  
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQ 723



 Score = 30.7 bits (70), Expect = 4.2
 Identities = 14/74 (18%), Positives = 15/74 (20%), Gaps = 1/74 (1%)

Query: 519 EAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
                  +      SSG P E  R     A  A   P    G    P        P    
Sbjct: 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA-GAAAAPAEASAAPAPGVAA 649

Query: 579 MPGMPGPPPPPMPE 592
               P     P   
Sbjct: 650 PEHHPKHVAVPDAS 663



 Score = 30.3 bits (69), Expect = 5.5
 Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 3/55 (5%)

Query: 545 EVKAQVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
              AQ A      P  G    P   PP    G   P        P PP P+   D
Sbjct: 696 AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPD 750



 Score = 30.3 bits (69), Expect = 5.6
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
              A   P  P G  P  P P     PP          PP   
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725



 Score = 30.3 bits (69), Expect = 6.4
 Identities = 9/42 (21%), Positives = 10/42 (23%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            + G    P       P  P     P  P       P  P P
Sbjct: 662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703



 Score = 29.6 bits (67), Expect = 9.1
 Identities = 12/51 (23%), Positives = 14/51 (27%), Gaps = 1/51 (1%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP-PPPPMPGMPGPPPPPMPEM 593
           D+      A     P   P P   P     P PP     M     P M + 
Sbjct: 746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDE 796


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 37.1 bits (86), Expect = 0.051
 Identities = 24/124 (19%), Positives = 44/124 (35%), Gaps = 32/124 (25%)

Query: 493 SKTEEDRRVEDLS------AKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVE-----KN 541
            K E +R V++         +  +A+  + +A++ + Q +K L++L    P +     + 
Sbjct: 540 PKDEVERMVQEAEKFAKEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVEA 599

Query: 542 RLDEVKAQVAAGIPTGPK---------------------GGPPPPPPPPGGMGPPPPPMP 580
           +L E+K  +A+G     K                     G     P P G  G       
Sbjct: 600 KLKELKDAIASGSTQKMKDAMAALNQEVMQIGQSLYNQPGAGGAGPAPGGEAGSSSSSSS 659

Query: 581 GMPG 584
           G  G
Sbjct: 660 GGDG 663


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 35.2 bits (81), Expect = 0.052
 Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 4/51 (7%)

Query: 545 EVKAQVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           EV+A+   GI           P P P P     P P  P  P P P    E
Sbjct: 35  EVEAK-GLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGE 84



 Score = 29.4 bits (66), Expect = 5.3
 Identities = 14/46 (30%), Positives = 15/46 (32%), Gaps = 1/46 (2%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           V+ QV    P  P    P  P P     P P P          PMP
Sbjct: 48  VQEQVPTPAPA-PAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.0 bits (85), Expect = 0.053
 Identities = 14/51 (27%), Positives = 16/51 (31%), Gaps = 8/51 (15%)

Query: 555 PTGPKGGPPPPPPP-----PGG---MGPPPPPMPGMPGPPPPPMPEMYTDC 597
            +G   GPP  P        GG   +GPPP      P  PP          
Sbjct: 440 HSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPG 490



 Score = 34.3 bits (78), Expect = 0.36
 Identities = 13/40 (32%), Positives = 15/40 (37%), Gaps = 2/40 (5%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPP--PPMPGMPGPPPPPMP 591
           +P G    P  PPP P     PP   P+   P P P    
Sbjct: 183 VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPS 222



 Score = 32.7 bits (74), Expect = 1.1
 Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 12/52 (23%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGM------------PGPPPPPMPEM 593
              G     PPP  P      PP    G                P PP+P +
Sbjct: 458 TSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPI 509



 Score = 32.0 bits (72), Expect = 2.1
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKN 600
           +AQ  +  P       P  PP    + PP P MP +  PP  P+P++    HK+
Sbjct: 312 QAQPHSHTPPSQSALQPQQPPREQPL-PPAPSMPHIKPPPTTPIPQLPNQSHKH 364



 Score = 30.8 bits (69), Expect = 3.8
 Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 11/47 (23%)

Query: 555 PTGPKGGPPPPPP-----PPGGMGPP----PPPMPGMPG--PPPPPM 590
           P+ P   PPP  P           PP    P P P MP   PPPP +
Sbjct: 341 PSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPAL 387



 Score = 30.8 bits (69), Expect = 4.3
 Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 10/69 (14%)

Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPP--PPPPPP-----GGMGPPPPPMPGMPGPPP 587
           G P+      + +      + + P   P   P P PP          P PP P    P  
Sbjct: 207 GSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQS 266

Query: 588 P---PMPEM 593
               P P M
Sbjct: 267 SHHGPGPPM 275



 Score = 30.0 bits (67), Expect = 6.7
 Identities = 15/45 (33%), Positives = 15/45 (33%), Gaps = 4/45 (8%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPPMP 591
           A G P       P  PP       PP    P   G  GP PP  P
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPP 508


>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
          Length = 214

 Score = 35.8 bits (82), Expect = 0.057
 Identities = 18/43 (41%), Positives = 19/43 (44%), Gaps = 4/43 (9%)

Query: 549 QVAAGIPTGPKGGP---PPPPPPPGGMGPPPPPMPGMPGPPPP 588
            V  G  +GP   P   P     PGG G P PP P  PGP  P
Sbjct: 159 AVRPGDLSGPGSRPTSGPVAGGLPGG-GAPAPPTPPRPGPTDP 200



 Score = 30.0 bits (67), Expect = 4.9
 Identities = 37/141 (26%), Positives = 51/141 (36%), Gaps = 23/141 (16%)

Query: 471 DFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLE 530
           DF  + R  LD       L E+    EDR + D   +LEEA      A A L+       
Sbjct: 89  DFLIAHRRYLD-----PALGERLDEAEDR-LADQEEQLEEA-----AANACLLSGDGDPA 137

Query: 531 D--LSSGRP-----VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
           D  +S          +      V+    +G  + P  GP     P GG   PP      P
Sbjct: 138 DGWMSPEDSDLLIMWQLGSAPAVRPGDLSGPGSRPTSGPVAGGLPGGGAPAPPT-----P 192

Query: 584 GPPPPPMPEMYTDCHKNIING 604
             P P  P   +DC ++  +G
Sbjct: 193 PRPGPTDPASESDCGRDSRDG 213


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 35.2 bits (80), Expect = 0.059
 Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 8/75 (10%)

Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
             AQK +E   + +   KNR   +     A I   P    P P  PPG  G P P     
Sbjct: 59  DAAQKAVEAAKNQKQGLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSPASQ 118

Query: 583 --------PGPPPPP 589
                   PG    P
Sbjct: 119 LSLVTTVSPGSGLAP 133



 Score = 30.2 bits (67), Expect = 3.2
 Identities = 16/73 (21%), Positives = 23/73 (31%), Gaps = 12/73 (16%)

Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPP---------PPPMPGM 582
           L +     +  ++  K Q         +  P  P      +GPP         PP  PG 
Sbjct: 54  LHAAHDAAQKAVEAAKNQKQG---LKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGA 110

Query: 583 PGPPPPPMPEMYT 595
           P P P     + T
Sbjct: 111 PKPSPASQLSLVT 123


>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
          Length = 262

 Score = 36.0 bits (83), Expect = 0.063
 Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 8/78 (10%)

Query: 513 MLRQEAEAKLVQAQKTLEDLSS-GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGG 571
           ++ Q+    +    +TL+ LS+   P+E             G   G +       P  G 
Sbjct: 165 LVPQKITGDVTATVRTLQGLSTYAVPLEP-------RTTRRGRRRGARRPRSRRRPRGGA 217

Query: 572 MGPPPPPMPGMPGPPPPP 589
            GPP P  P  P  PP  
Sbjct: 218 AGPPVPAEPAAPERPPAD 235


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 130

 Score = 34.8 bits (80), Expect = 0.063
 Identities = 14/52 (26%), Positives = 17/52 (32%), Gaps = 3/52 (5%)

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP---PPPMP 591
           ++E+ A   A  P  P   P P P           P P         P PMP
Sbjct: 18  MEEIGAPAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPMP 69


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 36.6 bits (84), Expect = 0.064
 Identities = 21/73 (28%), Positives = 23/73 (31%), Gaps = 2/73 (2%)

Query: 519 EAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPP 578
            A   +AQ+        RP             A G    P   P    PP    G   PP
Sbjct: 708 AAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP 767

Query: 579 M--PGMPGPPPPP 589
              PG P P PPP
Sbjct: 768 AAAPGAPTPQPPP 780



 Score = 33.9 bits (77), Expect = 0.51
 Identities = 19/53 (35%), Positives = 19/53 (35%), Gaps = 12/53 (22%)

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMG-----PPP--PPMP-----GMPGPPPPP 589
             A G    P   P    PP    G     PPP  PP P     G P P PPP
Sbjct: 748 AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPP 800



 Score = 32.7 bits (74), Expect = 1.2
 Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 3/45 (6%)

Query: 558 PKGGPPPPPPP---PGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK 599
           P+G P P PPP   P  M   P   PG  GP    + ++ T   K
Sbjct: 790 PRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVK 834



 Score = 32.0 bits (72), Expect = 2.0
 Identities = 11/39 (28%), Positives = 11/39 (28%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            A  P  P   PP     P        P    PG   PP
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPP 737



 Score = 31.2 bits (70), Expect = 3.2
 Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 1/41 (2%)

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           Q A G    P   P P  PP    G    P     G   PP
Sbjct: 688 QWAPGTMQPPPRAPTPMRPPAAPPGRAQRPA-AATGRARPP 727


>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518).  This
           domain, which is usually found tandemly repeated, is
           found various receptor co-activating proteins.
          Length = 56

 Score = 32.5 bits (74), Expect = 0.068
 Identities = 16/45 (35%), Positives = 17/45 (37%), Gaps = 1/45 (2%)

Query: 550 VAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPPMPEM 593
              G P  P+  P   P PP  GMG  P P    P  P  PM   
Sbjct: 4   GQMGNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSPQSPMMSP 48



 Score = 29.8 bits (67), Expect = 0.61
 Identities = 10/35 (28%), Positives = 10/35 (28%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P     G  P P       P  P M    GP   P
Sbjct: 22  PPNYGMGQQPDPAFTSPFSPQSPMMSPRMGPSQSP 56


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 36.5 bits (84), Expect = 0.077
 Identities = 21/71 (29%), Positives = 28/71 (39%), Gaps = 5/71 (7%)

Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPP---PPPPPPGGMGPPPPPMP 580
             Q     +SS  PV + + ++   Q     P+ P  G P   PP    GGM P PP + 
Sbjct: 403 MQQGQPGMMSSPSPVPQVQTNQSMPQ--PPQPSVPSPGGPGSQPPQSVSGGMIPSPPALM 460

Query: 581 GMPGPPPPPMP 591
             P P     P
Sbjct: 461 PSPSPQMSQSP 471



 Score = 30.4 bits (68), Expect = 5.6
 Identities = 12/48 (25%), Positives = 13/48 (27%), Gaps = 4/48 (8%)

Query: 548 AQVA-AGIPTGPKGGPPPP---PPPPGGMGPPPPPMPGMPGPPPPPMP 591
            QV     P   +         P     M       PG P  PPP  P
Sbjct: 271 MQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRP 318


>gnl|CDD|219914 pfam08577, PI31_Prot_C, PI31 proteasome regulator.  PI31 is a
           cellular regulator of proteasome formation and of
           proteasome-mediated antigen processing.
          Length = 68

 Score = 32.7 bits (75), Expect = 0.080
 Identities = 15/50 (30%), Positives = 16/50 (32%), Gaps = 10/50 (20%)

Query: 552 AGIPTGPKGGP---PPPPPPPGGM-------GPPPPPMPGMPGPPPPPMP 591
            G+       P   P      GGM       G  PP   G  GPPP   P
Sbjct: 10  PGLGPPDPFDPLPAPLGGNGQGGMIFDPNRPGFGPPRGGGGDGPPPGVPP 59


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 36.0 bits (83), Expect = 0.084
 Identities = 8/45 (17%), Positives = 10/45 (22%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            Q A   P+      PPPP                      P  +
Sbjct: 15  EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKD 59



 Score = 32.6 bits (74), Expect = 1.2
 Identities = 9/48 (18%), Positives = 9/48 (18%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            E  A         P   PP     P      P         P    P
Sbjct: 14  AEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKP 61



 Score = 30.3 bits (68), Expect = 5.8
 Identities = 10/57 (17%), Positives = 11/57 (19%), Gaps = 4/57 (7%)

Query: 539 EKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
           E  +        AA     P      P           P    P    P     P P
Sbjct: 13  EAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKP 69



 Score = 30.3 bits (68), Expect = 6.2
 Identities = 11/35 (31%), Positives = 11/35 (31%)

Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           G    P P PP P     PPPP            P
Sbjct: 12  GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAP 46


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 36.4 bits (84), Expect = 0.089
 Identities = 18/111 (16%), Positives = 24/111 (21%), Gaps = 1/111 (0%)

Query: 481 DVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEK 540
             QP     A  +            A    A    +   A   +     E L++ R    
Sbjct: 382 VAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441

Query: 541 NRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                  A   A     P     P    P  +       P    P   P P
Sbjct: 442 RGPGGAPAPAPAPAAA-PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491



 Score = 34.5 bits (79), Expect = 0.32
 Identities = 12/50 (24%), Positives = 13/50 (26%), Gaps = 4/50 (8%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP----PPPMPEM 593
           A  AA           P P        P    P     P    PPP  E+
Sbjct: 453 APAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEEL 502



 Score = 32.9 bits (75), Expect = 1.0
 Identities = 7/55 (12%), Positives = 8/55 (14%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
                       A  A          PP  P            +   P    P  
Sbjct: 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 17/73 (23%), Positives = 20/73 (27%)

Query: 515 RQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGP 574
           R  A   L  A++       G P          A  A     GP+         P    P
Sbjct: 426 RSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485

Query: 575 PPPPMPGMPGPPP 587
              P P    PPP
Sbjct: 486 AAAPAPADDDPPP 498



 Score = 30.6 bits (69), Expect = 4.5
 Identities = 10/39 (25%), Positives = 10/39 (25%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
             GP G P P P P         P    P P        
Sbjct: 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479



 Score = 30.2 bits (68), Expect = 6.1
 Identities = 9/36 (25%), Positives = 9/36 (25%)

Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
             P   P P    P    P P   P  P   P    
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAA 414


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 35.8 bits (82), Expect = 0.100
 Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 16/48 (33%)

Query: 555 PTGPKGGPP-PPPPPPGGMGPP---------------PPPMPGMPGPP 586
           P G  G     P P P    P                PPP P  P  P
Sbjct: 310 PAGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVP 357



 Score = 35.0 bits (80), Expect = 0.19
 Identities = 12/54 (22%), Positives = 16/54 (29%), Gaps = 1/54 (1%)

Query: 535 GRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
           G    + + +    +  A        G  P P    G      P PG P P P 
Sbjct: 279 GDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDG-AAGGEPKPGPPRPAPD 331



 Score = 34.6 bits (79), Expect = 0.25
 Identities = 16/72 (22%), Positives = 22/72 (30%), Gaps = 12/72 (16%)

Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKG----GPPP--PPPPPGGMGPPP------P 577
           +D    +P      D    +   G   GP+     G     P P P    P        P
Sbjct: 280 DDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPEGWP 339

Query: 578 PMPGMPGPPPPP 589
            +  +  PPP P
Sbjct: 340 SLEAITFPPPTP 351



 Score = 30.4 bits (68), Expect = 4.9
 Identities = 13/30 (43%), Positives = 13/30 (43%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           PP      G  PPP P  G P PP    PE
Sbjct: 255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPE 284



 Score = 30.4 bits (68), Expect = 5.7
 Identities = 12/43 (27%), Positives = 13/43 (30%), Gaps = 8/43 (18%)

Query: 554 IPTGPKGGPPPPP--------PPPGGMGPPPPPMPGMPGPPPP 588
            P  P GG P PP        P PG +    P          P
Sbjct: 266 PPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGP 308


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 34.9 bits (80), Expect = 0.11
 Identities = 24/121 (19%), Positives = 37/121 (30%), Gaps = 16/121 (13%)

Query: 484 PLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRL 543
           P  E   +KS+     ++ED+S K ++    +    +           L +G        
Sbjct: 70  PRYESYRDKSR----EKLEDVSHKHQDMGYYKGPPYSGYPFLMLPNPYLPNGSLSPLPPS 125

Query: 544 DEVKAQVAAGIPTGPKGGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPP-MPEM 593
                 V    P       P          P  PP  +     P  G+P PP PP +   
Sbjct: 126 SNKVPVVQP--PHHVHPLTPLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPF 183

Query: 594 Y 594
           Y
Sbjct: 184 Y 184


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 35.0 bits (81), Expect = 0.11
 Identities = 21/98 (21%), Positives = 32/98 (32%), Gaps = 10/98 (10%)

Query: 499 RRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDL---SSGRPVEKNR--LDEV--KAQVA 551
           RR  D    L + I++++ A   L QA   +E+L   +     + +   L  +       
Sbjct: 37  RRQPDAPYYLVQTILVQEAA---LKQANARIEELEAQAQHPQSQSSGGFLSGMFGGGAPR 93

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
                     P PP  P  G G P     G       P
Sbjct: 94  PPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQP 131


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 35.8 bits (82), Expect = 0.11
 Identities = 16/42 (38%), Positives = 19/42 (45%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           ++  P  P     P P  P    PP PP  G+P  P P MPE
Sbjct: 417 SSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPE 458



 Score = 32.0 bits (72), Expect = 1.8
 Identities = 18/42 (42%), Positives = 18/42 (42%), Gaps = 3/42 (7%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           A  AA  P  P    PP PP  G    PP P P MP   P P
Sbjct: 425 APSAAPSPRVPWDDAPPAPPRSGI---PPRPAPRMPEASPVP 463


>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
           C4-dicarboxylate transport system [Signal transduction
           mechanisms].
          Length = 603

 Score = 35.8 bits (83), Expect = 0.12
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 12/87 (13%)

Query: 471 DFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLS---AKLEEAIMLRQEAEAKLVQAQK 527
            +   RR        ++    + +   + R  DL+   A+L+  I  R++AEA L +AQ 
Sbjct: 314 LWLRRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQD 373

Query: 528 TLEDLSSGRPVEKNRLDEVKAQVAAGI 554
            L        V+  +L     Q++AGI
Sbjct: 374 EL--------VQAGKL-AALGQMSAGI 391


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 35.7 bits (82), Expect = 0.12
 Identities = 15/39 (38%), Positives = 18/39 (46%), Gaps = 1/39 (2%)

Query: 555 PTGPKGGPPPPPPPPGGM-GPPPPPMPGMPGPPPPPMPE 592
           P  P   P  P  PP  +  PP PP+     PPP P P+
Sbjct: 223 PFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQ 261



 Score = 31.8 bits (72), Expect = 2.0
 Identities = 16/70 (22%), Positives = 22/70 (31%), Gaps = 1/70 (1%)

Query: 522 LVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG 581
           L     +L   +S  P  +         +    P  P   P   P PP          P 
Sbjct: 197 LPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIPPVIQYVAPPP 256

Query: 582 MPGPPPPPMP 591
           +P PP P +P
Sbjct: 257 VP-PPQPIIP 265


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 35.6 bits (83), Expect = 0.12
 Identities = 12/47 (25%), Positives = 15/47 (31%)

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            DE   ++  G P+   GG  PP        P     PG     P  
Sbjct: 380 PDEALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAA 426



 Score = 35.3 bits (82), Expect = 0.19
 Identities = 13/49 (26%), Positives = 14/49 (28%), Gaps = 1/49 (2%)

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
           RL E       G   GP GG   P  P     P         G P   +
Sbjct: 386 RLQEGPPSPGGG-GGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433



 Score = 34.5 bits (80), Expect = 0.33
 Identities = 14/51 (27%), Positives = 15/51 (29%), Gaps = 10/51 (19%)

Query: 551 AAGIPT----------GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           AA +PT          GP         PPGG G P  P            P
Sbjct: 374 AADLPTPDEALRRLQEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAP 424



 Score = 32.6 bits (75), Expect = 1.2
 Identities = 10/42 (23%), Positives = 12/42 (28%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
             G P G  G P  P            P  G P      +P+
Sbjct: 397 GGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPD 438


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 35.6 bits (82), Expect = 0.13
 Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 4/28 (14%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            P PPPPP    PPP P      P  P 
Sbjct: 270 EPQPPPPP----PPPEPPEPEEEPDEPD 293



 Score = 35.2 bits (81), Expect = 0.19
 Identities = 12/38 (31%), Positives = 13/38 (34%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P   PPP PP P      P       G     +PE
Sbjct: 271 PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308



 Score = 34.8 bits (80), Expect = 0.23
 Identities = 13/31 (41%), Positives = 14/31 (45%), Gaps = 4/31 (12%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P P PPP    PPPPP P  P   P    +
Sbjct: 268 EPEPQPPP----PPPPPEPPEPEEEPDEPDQ 294



 Score = 34.4 bits (79), Expect = 0.28
 Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 4/46 (8%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIIN 603
           P+  P PPPPPP    PP PP P      P        +    I  
Sbjct: 267 PEPEPQPPPPPP----PPEPPEPEEEPDEPDQTDPDDGEETDQIPE 308


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 34.7 bits (80), Expect = 0.18
 Identities = 13/37 (35%), Positives = 13/37 (35%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           G   G  GG   P PPP G     PP  G    P   
Sbjct: 200 GGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGPSSG 236



 Score = 33.9 bits (78), Expect = 0.30
 Identities = 15/36 (41%), Positives = 15/36 (41%), Gaps = 4/36 (11%)

Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           G  GG  P PPPPG     PPP     GP   P   
Sbjct: 205 GGGGGSGPGPPPPGFKSSFPPP----YGPGAGPSSG 236



 Score = 32.0 bits (73), Expect = 1.4
 Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPP-PMPGMPGPPPPPMP 591
           AG   G  GG         G GPPPP      P P  P   
Sbjct: 192 AGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAG 232


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 35.2 bits (81), Expect = 0.18
 Identities = 13/41 (31%), Positives = 14/41 (34%)

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           +  P        PP  P      PPP  P    PP P  PE
Sbjct: 386 STRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPE 426



 Score = 33.2 bits (76), Expect = 0.64
 Identities = 26/111 (23%), Positives = 41/111 (36%), Gaps = 14/111 (12%)

Query: 494 KTEEDRRVEDLSAKLEEAIMLRQEAE----AKLVQAQKTLEDL-------SSGR-PVEKN 541
            +  DR + DL+A  +E   L++ ++      L +  K    L       S G+ P+E  
Sbjct: 297 NSGADRSLLDLTA--DEKAALQKVSQIANLEALTKWVKAFSQLDFQLRTTSYGQLPLELA 354

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            ++ +   V A  P  P    P P  P       P        PP  P+ E
Sbjct: 355 VIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRE 405



 Score = 33.2 bits (76), Expect = 0.74
 Identities = 13/48 (27%), Positives = 13/48 (27%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
            K   AA  P  P   P   P        PP         PPP  P  
Sbjct: 369 AKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416



 Score = 32.5 bits (74), Expect = 1.1
 Identities = 10/35 (28%), Positives = 10/35 (28%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P  P      PPP P     PP P      P    
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 10/46 (21%), Positives = 14/46 (30%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
             A+  A  P+  +  P P   P        PP   +     PP  
Sbjct: 367 QPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPV 412



 Score = 30.9 bits (70), Expect = 3.2
 Identities = 12/53 (22%), Positives = 12/53 (22%)

Query: 537 PVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P E  R       V       P    P   P       P    P    P PP 
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPK 451



 Score = 30.5 bits (69), Expect = 4.5
 Identities = 18/63 (28%), Positives = 19/63 (30%), Gaps = 17/63 (26%)

Query: 547 KAQVAAGIPTGP---KGGPPPPPPPPGGMGPPPPPMPGMP--------------GPPPPP 589
           KA  AA IP      +   PPP PP     P P      P                PP P
Sbjct: 390 KAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449

Query: 590 MPE 592
             E
Sbjct: 450 PKE 452


>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus.  This family includes the
           NUT protein. The gene encoding for NUT protein (Nuclear
           Testis protein) is found fused to BRD3 or BRD4 genes, in
           some aggressive types of carcinoma, due to chromosomal
           translocations. Proteins of the BRD family contain two
           bromodomains that bind transcriptionally active
           chromatin through associations with acetylated histones
           H3 and H4. Such proteins are crucial for the regulation
           of cell cycle progression. On the other hand, little is
           known about NUT protein. NUT is known to have a Nuclear
           Export Sequence (NES) as well as a Nuclear Localization
           Signal (NLS), both located towards the C-terminal end of
           the protein. A fused NUT-GFP protein showed either
           cytoplasmic or nuclear localization, suggesting that it
           is subject to nuclear/cytoplasmic shuttling. Consistent
           with this possibility, treatment with leptomycin B an
           inhibitor of CRM1-dependent nuclear export resulted in
           re-distribution of NUT-GFP to the nucleus. Inspection of
           NUT revealed a C-terminal sequence similar to known
           nuclear export sequences (NES) which are often regulated
           by phosphorylation.
          Length = 328

 Score = 34.9 bits (80), Expect = 0.19
 Identities = 12/32 (37%), Positives = 12/32 (37%), Gaps = 1/32 (3%)

Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
           T   G P PPPPPP     P         P P
Sbjct: 148 TWILGLPLPPPPPPVAQLVPIVS-LENAWPGP 178



 Score = 29.2 bits (65), Expect = 9.8
 Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 1/30 (3%)

Query: 562 PPPPPPPPGGM-GPPPPPMPGMPGPPPPPM 590
           PP P PP      PPPP +     P  P +
Sbjct: 25  PPAPQPPDQPFWEPPPPLVTAGFSPGNPLV 54


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 35.2 bits (81), Expect = 0.19
 Identities = 15/60 (25%), Positives = 15/60 (25%), Gaps = 12/60 (20%)

Query: 548 AQVAAGIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPG-------PPPPPMPEMYT 595
            Q   G P G       P P     P    G  P      P          PP  P MY 
Sbjct: 406 QQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYP 465


>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 165 and 237 amino acids in length.
           CD99L2 and CD99 are involved in trans-endothelial
           migration of neutrophils in vitro and in the recruitment
           of neutrophils into inflamed peritoneum.
          Length = 154

 Score = 33.6 bits (77), Expect = 0.19
 Identities = 10/44 (22%), Positives = 10/44 (22%), Gaps = 3/44 (6%)

Query: 552 AGIPTGPKGGPPP---PPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
              P  P  G           GG      P P     P P  P 
Sbjct: 19  PATPKKPSKGDDDFDLEDALDGGNDGGAGPKPKPRPQPDPGNPG 62


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 33.3 bits (76), Expect = 0.19
 Identities = 12/37 (32%), Positives = 14/37 (37%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           P  P   P    P P    PPP   P +P  P   +P
Sbjct: 87  PPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVP 123



 Score = 33.3 bits (76), Expect = 0.20
 Identities = 21/66 (31%), Positives = 27/66 (40%), Gaps = 1/66 (1%)

Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
           +A+K L  L+   P E       ++  A  +PT P    P  PP P    P P   P   
Sbjct: 74  EARKILAPLTPPAPPEPVTPPTAQSP-APAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPS 132

Query: 584 GPPPPP 589
            PP PP
Sbjct: 133 DPPQPP 138



 Score = 31.7 bits (72), Expect = 0.67
 Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 3/41 (7%)

Query: 555 PTGPKGGPPPPP---PPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P     P P    PP    P  PP P    P P   P 
Sbjct: 90  PVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           PP PP P        P P +P PPP   P
Sbjct: 84  PPAPPEPVTPPTAQSPAPAVPTPPPTSTP 112



 Score = 29.8 bits (67), Expect = 3.3
 Identities = 13/38 (34%), Positives = 14/38 (36%), Gaps = 1/38 (2%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  P+   PP    P    P PPP    P  PP P   
Sbjct: 85  PAPPEPVTPPTAQSPAPAVPTPPP-TSTPAVPPAPAAA 121


>gnl|CDD|115029 pfam06345, Drf_DAD, DRF Autoregulatory Domain.  This motif is found
           in Diaphanous-related formins. It binds the N-terminal
           GTPase-binding domain; this link is broken when
           GTP-bound Rho binds to the GBD and activates the
           protein. The addition of DAD to mammalian cells induces
           actin filament formation, stabilises microtubules, and
           activates serum-response mediated transcription.
          Length = 15

 Score = 30.4 bits (68), Expect = 0.21
 Identities = 12/13 (92%), Positives = 13/13 (100%)

Query: 843 GVMDSLLEALQTG 855
           GVMDSLLEALQ+G
Sbjct: 1   GVMDSLLEALQSG 13


>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut.  This
           domain is often found at the very N-terminal of
           argonaut-like proteins.
          Length = 102

 Score = 32.6 bits (74), Expect = 0.22
 Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 9/44 (20%)

Query: 560 GGPP---------PPPPPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
           GGPP         PP     G G          GPP P +PE++
Sbjct: 1   GGPPEYQGRGRGGPPQQGGRGGGGGGRGGGSTGGPPRPSVPELH 44



 Score = 30.7 bits (69), Expect = 0.79
 Identities = 12/47 (25%), Positives = 13/47 (27%), Gaps = 10/47 (21%)

Query: 553 GIPTGPKGGPPPPPPP----------PGGMGPPPPPMPGMPGPPPPP 589
           G   G  GGPP P  P             +   P P        PP 
Sbjct: 26  GRGGGSTGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPE 72


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 34.1 bits (78), Expect = 0.22
 Identities = 13/38 (34%), Positives = 13/38 (34%), Gaps = 1/38 (2%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGM-GPPPPPMPGMPGPPP 587
           A     GP GG   P    G   GP      G  GPP 
Sbjct: 32  ANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPG 69



 Score = 32.9 bits (75), Expect = 0.50
 Identities = 10/40 (25%), Positives = 10/40 (25%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            A G P            P GG G P        GP    
Sbjct: 21  PAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGG 60



 Score = 32.5 bits (74), Expect = 0.85
 Identities = 12/41 (29%), Positives = 12/41 (29%), Gaps = 5/41 (12%)

Query: 553 GIPTGPKGGPPPPP-----PPPGGMGPPPPPMPGMPGPPPP 588
           G P G  GGP         P  GG G    P  G       
Sbjct: 37  GGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEA 77


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 34.3 bits (78), Expect = 0.23
 Identities = 35/134 (26%), Positives = 47/134 (35%), Gaps = 34/134 (25%)

Query: 491 EKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
           EK   EE  R E ++ +L+  I  + E +A  +   + +      +P   N  +  + QV
Sbjct: 99  EKRLLEEKERQEQMAEQLQR-ISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQPQV 157

Query: 551 AAGIPTGPK----------GGPPPP-----------PPPPGGMGPPP-----PPMPGM-- 582
                 GP           G P  P           PP   GM P        PMPGM  
Sbjct: 158 Q--PQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPP 215

Query: 583 -PGPPPPP--MPEM 593
            PG    P  MP M
Sbjct: 216 RPGFNQNPNMMPNM 229



 Score = 31.6 bits (71), Expect = 1.9
 Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 2/42 (4%)

Query: 555 PTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPPPPPMPEMY 594
           P   + G  P P  PP  G    P  MP M  P   P P  +
Sbjct: 200 PNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGF 241



 Score = 31.2 bits (70), Expect = 2.2
 Identities = 14/34 (41%), Positives = 14/34 (41%), Gaps = 3/34 (8%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPPP---PPMPGMP 583
           G P GP     P   P  GM PPP    P  G P
Sbjct: 245 GTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFP 278



 Score = 29.3 bits (65), Expect = 8.7
 Identities = 13/41 (31%), Positives = 13/41 (31%), Gaps = 2/41 (4%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPPP--PPMPGMPGPPPPPMP 591
           G    P G   P  P    M   P   P  GM  PP    P
Sbjct: 233 GFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGP 273


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.1 bits (81), Expect = 0.23
 Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 6/78 (7%)

Query: 486 VEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGR-----PVEK 540
            +   ++     DRR+E L  +LEE     +E E KL + +K  +   SG+      VEK
Sbjct: 468 DKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEK 527

Query: 541 NRLDEV-KAQVAAGIPTG 557
             L+ + +A+   GI  G
Sbjct: 528 LTLEAIEEAEEEYGIKEG 545


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 35.0 bits (81), Expect = 0.23
 Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 8/66 (12%)

Query: 493 SKTEEDRRVEDLSA------KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEV 546
           S+ E +R V++  A      K +E I  R  A++   QA+KTL++     P E+   +++
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEK--EKI 556

Query: 547 KAQVAA 552
           +  VA 
Sbjct: 557 EKAVAE 562


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 34.8 bits (80), Expect = 0.25
 Identities = 15/51 (29%), Positives = 16/51 (31%), Gaps = 7/51 (13%)

Query: 549 QVAAGIPTGPKGGPPPPPPP-------PGGMGPPPPPMPGMPGPPPPPMPE 592
            +   I   P  G P PPPP       P    P   P  G  GP     P 
Sbjct: 81  PIRIAITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPP 131


>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720).  This
           domain is found in different combinations with cortical
           patch components EF hand, SH3 and ENTH and is therefore
           likely to be involved in cytoskeletal processes. This
           family contains many hypothetical proteins.
          Length = 73

 Score = 31.3 bits (71), Expect = 0.29
 Identities = 9/44 (20%), Positives = 12/44 (27%), Gaps = 5/44 (11%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPP-----PPMPGMPGPPPPPMPEM 593
            TG +      P   G + P P     P   G         P+ 
Sbjct: 5   QTGYQPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQ 48


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.4 bits (80), Expect = 0.31
 Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 10/82 (12%)

Query: 472 FRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLE----EAIMLRQEAEAKLVQAQK 527
           F  ++R  L    ++E  A+K   E+  ++ +L A LE    E     +EAEA L +A+K
Sbjct: 491 FEIAKRLGLPEN-IIEE-AKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEK 548

Query: 528 TLEDLSSGRPVEKNRLDEVKAQ 549
             E+L      +K +L E + +
Sbjct: 549 LKEELEE----KKEKLQEEEDK 566


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 32.6 bits (73), Expect = 0.37
 Identities = 9/49 (18%), Positives = 10/49 (20%), Gaps = 1/49 (2%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPG-MPGPPPPPMPE 592
           E   +V A    G      P               P         P PE
Sbjct: 45  EKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPE 93



 Score = 32.6 bits (73), Expect = 0.38
 Identities = 5/35 (14%), Positives = 6/35 (17%), Gaps = 1/35 (2%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            +  P                 P  P       PE
Sbjct: 68  EEPEPEEEGEEEPEPEEEGEEEPE-PEETGEEEPE 101



 Score = 30.7 bits (68), Expect = 1.9
 Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 4/42 (9%)

Query: 555 PTGP-KGGPPPPPPP---PGGMGPPPPPMPGMPGPPPPPMPE 592
             G  +  P         P   G   P     P P P P PE
Sbjct: 74  EEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPE 115



 Score = 29.2 bits (64), Expect = 6.3
 Identities = 12/38 (31%), Positives = 13/38 (34%), Gaps = 1/38 (2%)

Query: 556 TGPKGGPPPPP-PPPGGMGPPPPPMPGMPGPPPPPMPE 592
             P+      P P   G   P P     P P P P PE
Sbjct: 80  PEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPE 117


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 33.5 bits (77), Expect = 0.40
 Identities = 14/65 (21%), Positives = 17/65 (26%)

Query: 527 KTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
           +  ED         N      A  A+  P+ P    P  P  P     P  P      P 
Sbjct: 156 EEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA 215

Query: 587 PPPMP 591
           P    
Sbjct: 216 PSSFQ 220



 Score = 32.3 bits (74), Expect = 0.96
 Identities = 14/41 (34%), Positives = 17/41 (41%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
             + +P  P       PPP       P P PG   PPPPP+
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPV 248



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 13/42 (30%), Positives = 14/42 (33%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
               P+ P     PP P       PPP       P PPP P 
Sbjct: 200 PPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241



 Score = 32.0 bits (73), Expect = 1.4
 Identities = 10/33 (30%), Positives = 10/33 (30%), Gaps = 2/33 (6%)

Query: 562 PPPPPPPPGGMGPP--PPPMPGMPGPPPPPMPE 592
            PP P       PP  P        PP P  P 
Sbjct: 212 LPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPP 244



 Score = 31.6 bits (72), Expect = 2.0
 Identities = 13/73 (17%), Positives = 19/73 (26%)

Query: 517 EAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPP 576
           E +     A    ++   G   +           ++         P  P  P     PP 
Sbjct: 156 EEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA 215

Query: 577 PPMPGMPGPPPPP 589
           P       PPP P
Sbjct: 216 PSSFQSDTPPPSP 228



 Score = 30.0 bits (68), Expect = 5.3
 Identities = 17/87 (19%), Positives = 25/87 (28%), Gaps = 10/87 (11%)

Query: 520 AKLVQAQKTLEDLSSGRPVEKNRLDEV--KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPP 577
           A++ +A K  ED + G P+++   D            P          P  P    P  P
Sbjct: 136 ARIHKALKEGEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVP 195

Query: 578 PMPG--------MPGPPPPPMPEMYTD 596
             P              PP      +D
Sbjct: 196 SFPSPPEDPSSPSDSSLPPAPSSFQSD 222



 Score = 29.7 bits (67), Expect = 6.9
 Identities = 8/35 (22%), Positives = 8/35 (22%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
               PP P       PP        P P P     
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAP 243



 Score = 29.3 bits (66), Expect = 9.5
 Identities = 10/37 (27%), Positives = 12/37 (32%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           P+   G P  P PP     P    +P  P       P
Sbjct: 188 PSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTP 224


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 34.3 bits (78), Expect = 0.41
 Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 4/38 (10%)

Query: 558 PKGGPPPPP--PPPGGMGPPPP--PMPGMPGPPPPPMP 591
           P   PPP    P  G   P     P PG   P PP  P
Sbjct: 736 PDQAPPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAP 773


>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
          Length = 248

 Score = 33.4 bits (76), Expect = 0.41
 Identities = 13/23 (56%), Positives = 13/23 (56%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPG 584
           PPPPPPPP    P PP     PG
Sbjct: 120 PPPPPPPPARAEPAPPVARPAPG 142



 Score = 32.6 bits (74), Expect = 0.66
 Identities = 13/28 (46%), Positives = 13/28 (46%), Gaps = 5/28 (17%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           PPPPPPP     PP    P  P   P P
Sbjct: 119 PPPPPPP-----PPARAEPAPPVARPAP 141



 Score = 30.3 bits (68), Expect = 3.8
 Identities = 12/27 (44%), Positives = 12/27 (44%), Gaps = 4/27 (14%)

Query: 565 PPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           PPPPP    PPPP       P   P P
Sbjct: 119 PPPPP----PPPPARAEPAPPVARPAP 141


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 34.1 bits (79), Expect = 0.43
 Identities = 11/43 (25%), Positives = 11/43 (25%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            A  A       K    P P PP    P  PP P         
Sbjct: 53  AAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA 95



 Score = 32.9 bits (76), Expect = 0.90
 Identities = 10/46 (21%), Positives = 10/46 (21%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
               AA  P  P    P  PP P             P       P 
Sbjct: 64  AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPA 109



 Score = 32.6 bits (75), Expect = 1.3
 Identities = 17/71 (23%), Positives = 19/71 (26%), Gaps = 5/71 (7%)

Query: 526 QKTLEDLSSGRP-----VEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
           Q+ L D +S  P               A  AA          P   P       P P  P
Sbjct: 16  QQYLADPNSVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP 75

Query: 581 GMPGPPPPPMP 591
               P  PP P
Sbjct: 76  AAAAPAAPPKP 86



 Score = 32.2 bits (74), Expect = 1.7
 Identities = 11/61 (18%), Positives = 12/61 (19%), Gaps = 1/61 (1%)

Query: 529 LEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
             D   G            A  A+     P    P  P P       P   P  P     
Sbjct: 33  FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK-PAAAAA 91

Query: 589 P 589
            
Sbjct: 92  A 92



 Score = 31.0 bits (71), Expect = 4.0
 Identities = 11/41 (26%), Positives = 11/41 (26%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           AA         P P PP       PP P         P  P
Sbjct: 59  AAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAP 99



 Score = 30.6 bits (70), Expect = 4.5
 Identities = 8/44 (18%), Positives = 8/44 (18%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           A  AA             P  P    P        P P      
Sbjct: 49  AAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92



 Score = 30.6 bits (70), Expect = 5.1
 Identities = 8/45 (17%), Positives = 9/45 (20%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
            A+  A     P     P  PP           P  P        
Sbjct: 63  AAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 33.6 bits (76), Expect = 0.45
 Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 1/36 (2%)

Query: 551 AAGIPT-GPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
           AA +P   P     PP  P      P  P+P    P
Sbjct: 106 AAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQP 141



 Score = 33.2 bits (75), Expect = 0.72
 Identities = 9/30 (30%), Positives = 11/30 (36%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P P P P     PP         P  P+P+
Sbjct: 108 PVPAPAPAPAAAPPAAPAPAADTPAAPIPD 137


>gnl|CDD|227934 COG5647, COG5647, Cullin, a subunit of E3 ubiquitin ligase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 773

 Score = 34.0 bits (78), Expect = 0.46
 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 3/49 (6%)

Query: 7   LEEQENMLEKLNPE---QLNQKFEDMLNDMNLSDEKKEPLRRQPLANKK 52
            + +  M+  L      +   K E M  D++LS E  E  +  P +  K
Sbjct: 482 AQAELKMISMLKKVCGQEFTSKLEGMFRDISLSSEFTEAFQHSPQSYNK 530



 Score = 33.2 bits (76), Expect = 0.79
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)

Query: 61  GTVTSYEN-KKQENMLEKLNPEQLNQKFEDMLNDMNLSDEKKEPLRRQPLANKK 113
           G   S +   K  +ML+K+  ++   K E M  D++LS E  E  +  P +  K
Sbjct: 477 GRSASAQAELKMISMLKKVCGQEFTSKLEGMFRDISLSSEFTEAFQHSPQSYNK 530


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 33.8 bits (78), Expect = 0.47
 Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
           P  P+  PPPPPPPP     PP             + E+
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300



 Score = 31.9 bits (73), Expect = 1.8
 Identities = 12/28 (42%), Positives = 12/28 (42%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P PP PP     PPPPP      PP   
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDE 286



 Score = 31.5 bits (72), Expect = 2.1
 Identities = 14/30 (46%), Positives = 14/30 (46%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P PP PP    PPPPP P     PP    E
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEE 288



 Score = 31.1 bits (71), Expect = 3.0
 Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 5/43 (11%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
           V A  A  +P  P+     PPPPP     PPP     P     
Sbjct: 250 VLAPRATRLPAPPQPPEEEPPPPP-----PPPEDDDDPPEDEE 287



 Score = 30.0 bits (68), Expect = 6.9
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 6/51 (11%)

Query: 542 RLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            ++E    +AA +   P+    P PP P    PPPPP      PPP    +
Sbjct: 237 AVEEEDLALAARLVLAPRATRLPAPPQPPEEEPPPPP------PPPEDDDD 281


>gnl|CDD|223006 PHA03179, PHA03179, UL43 envelope protein; Provisional.
          Length = 387

 Score = 33.7 bits (77), Expect = 0.47
 Identities = 9/33 (27%), Positives = 9/33 (27%), Gaps = 5/33 (15%)

Query: 557 GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
               G      PP          P  P P PPP
Sbjct: 202 VIGAGESLSRRPPED-----AERPTAPDPEPPP 229


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 33.4 bits (76), Expect = 0.49
 Identities = 14/41 (34%), Positives = 14/41 (34%), Gaps = 4/41 (9%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGP----PPPPMPEMY 594
                PP  P P  M   PP     P      PP P P MY
Sbjct: 187 QPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMY 227



 Score = 33.0 bits (75), Expect = 0.73
 Identities = 13/39 (33%), Positives = 14/39 (35%), Gaps = 2/39 (5%)

Query: 555 PTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPPPPPMP 591
               +  P   P  PPP    PPP   P    P PPP  
Sbjct: 109 SVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQV 147



 Score = 30.7 bits (69), Expect = 4.2
 Identities = 15/45 (33%), Positives = 16/45 (35%), Gaps = 2/45 (4%)

Query: 555 PTGPKGGPPPPPPP--PGGMGPPPPPMPGMPGPPPPPMPEMYTDC 597
           P+    GPP P P    G  G P    P    PPP    E Y   
Sbjct: 212 PSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYS 256



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 11/41 (26%), Positives = 12/41 (29%), Gaps = 4/41 (9%)

Query: 555 PTGPKGGPPPPPPPPGGMGPP----PPPMPGMPGPPPPPMP 591
           P   +     P PPP          PP  P     PPP   
Sbjct: 131 PPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQ 171


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 33.2 bits (76), Expect = 0.51
 Identities = 10/36 (27%), Positives = 11/36 (30%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
               P  P  PP    P P   P     P  P P+ 
Sbjct: 66  QPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP 101



 Score = 29.0 bits (65), Expect = 9.4
 Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)

Query: 534 SGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
               V    +      +AA +   P   P P P PP    P PP  P  P  P PP P+
Sbjct: 28  QEDFVGIELVPLAVFLLAAKVLEAPTEEPQPEPEPPE-EQPKPPTEPETPPEPTPPKPK 85


>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
           RnfC [Energy production and conversion].
          Length = 529

 Score = 33.5 bits (77), Expect = 0.51
 Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 1/107 (0%)

Query: 784 IKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQG 843
           ++ F+    +   +  +L++AEE   R      + E EK ++  R K             
Sbjct: 423 VQYFRQEKAEILAQRQELKKAEEAKTRFEARTARLEREKAERIPRMKKAAVKRLSFVTAA 482

Query: 844 VMDSLLEALQTGRPKKTGSSIKSVGCPSHS-ALQTGSAFTREQRRKR 889
           ++D++  AL+  + KK  +   S    +      +G    RE R  +
Sbjct: 483 LIDAIAAALERVKEKKAYAKQPSAALGAPILPSNSGLVAAREARAYQ 529


>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
          Length = 136

 Score = 31.9 bits (73), Expect = 0.51
 Identities = 9/30 (30%), Positives = 9/30 (30%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
             G P P  P   G  PP          PP
Sbjct: 101 GNGRPTPSSPDAAGPAPPDATPLAPGERPP 130



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 10/34 (29%), Positives = 10/34 (29%), Gaps = 4/34 (11%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
             G      P    P     PP   P  PG  PP
Sbjct: 101 GNGRPTPSSPDAAGP----APPDATPLAPGERPP 130


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 33.7 bits (77), Expect = 0.51
 Identities = 14/42 (33%), Positives = 15/42 (35%), Gaps = 5/42 (11%)

Query: 553 GIPTGPKGGPPPPPP-----PPGGMGPPPPPMPGMPGPPPPP 589
           G P G +G   P  P          GPP P     P   PPP
Sbjct: 18  GGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59



 Score = 32.5 bits (74), Expect = 1.0
 Identities = 14/48 (29%), Positives = 14/48 (29%), Gaps = 11/48 (22%)

Query: 555 PTGPKGGPPPPPPPPGGMG-----PP-----PPPMPGMPGPPPPPMPE 592
           P G  GGP     P G  G      P         PG P P     P 
Sbjct: 8   PLGA-GGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPS 54



 Score = 30.6 bits (69), Expect = 4.7
 Identities = 9/31 (29%), Positives = 9/31 (29%), Gaps = 3/31 (9%)

Query: 561 GPPPPPPPPGGMGPPPP---PMPGMPGPPPP 588
           GPP P         PPP         G   P
Sbjct: 43  GPPVPEDYDAPSHRPPPYGGSNGDRHGGYQP 73


>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
          Length = 1250

 Score = 33.9 bits (77), Expect = 0.52
 Identities = 18/43 (41%), Positives = 22/43 (51%), Gaps = 5/43 (11%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGK 605
           PP PP P    P P P P  P P P P PE  T  ++ ++N K
Sbjct: 920 PPSPPDP---DPTPDPDP-TPDPDPTPDPEP-TPAYQPVLNAK 957


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 33.7 bits (77), Expect = 0.52
 Identities = 12/46 (26%), Positives = 13/46 (28%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           K   +A          PPPP       P   P P    P  PP   
Sbjct: 201 KPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSSG 246


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 33.7 bits (77), Expect = 0.52
 Identities = 13/55 (23%), Positives = 14/55 (25%), Gaps = 3/55 (5%)

Query: 537 PVEKNRL--DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           P  + R   D     V       P   P  P  P G   P  P     P     P
Sbjct: 317 PDTEGREGDDLAPTAVCTDAGLLP-DTPLLPDAPEGSSDPVVPVAAATPVDASLP 370


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 33.5 bits (77), Expect = 0.55
 Identities = 14/36 (38%), Positives = 15/36 (41%), Gaps = 5/36 (13%)

Query: 562 PPPPPPP----PGGMGPPP-PPMPGMPGPPPPPMPE 592
           P P P P    PG +GP P  P    P PP P    
Sbjct: 227 PLPSPQPVRHSPGILGPSPLHPHTTRPSPPRPAFSR 262



 Score = 30.8 bits (70), Expect = 3.5
 Identities = 14/40 (35%), Positives = 15/40 (37%), Gaps = 5/40 (12%)

Query: 557 GPKGGPPPPPPP--PG---GMGPPPPPMPGMPGPPPPPMP 591
            PKG    P     P     + PPPPP P  P   P   P
Sbjct: 44  HPKGTSSFPQLLRHPSLRLRLHPPPPPCPQNPRDFPSLQP 83


>gnl|CDD|144655 pfam01141, Gag_p12, Gag polyprotein, inner coat protein p12.  The
           retroviral p12 is a virion structural protein. p12 is
           proline rich. The function carried out by p12 in
           assembly and replication is unknown. p12 is associated
           with pathogenicity of the virus.
          Length = 85

 Score = 31.0 bits (70), Expect = 0.58
 Identities = 10/30 (33%), Positives = 10/30 (33%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
           P  GP P PP   G           P P P
Sbjct: 34  PYRGPGPSPPAGEGDEEEAATTSEAPDPSP 63


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 33.4 bits (76), Expect = 0.59
 Identities = 13/59 (22%), Positives = 16/59 (27%), Gaps = 4/59 (6%)

Query: 538 VEKNRLDEVKAQVA----AGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
             ++   EV   +A    A            P P   G  P P P    P   P P   
Sbjct: 185 APEDDTVEVGTVLAIIGDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP 243



 Score = 31.5 bits (71), Expect = 2.1
 Identities = 12/52 (23%), Positives = 15/52 (28%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
           E +A   +   + P   P    P      P P P P     P    P    D
Sbjct: 212 EEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGD 263



 Score = 30.8 bits (69), Expect = 4.7
 Identities = 13/54 (24%), Positives = 16/54 (29%), Gaps = 4/54 (7%)

Query: 539 EKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           + N      A+  A  P+     P P P        P PP P     P P    
Sbjct: 202 DANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAP----APAPAKTA 251


>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15.  The matrix protein,
           p15, is encoded by the gag gene. MA is involved in
           pathogenicity.
          Length = 129

 Score = 31.7 bits (72), Expect = 0.62
 Identities = 15/38 (39%), Positives = 17/38 (44%), Gaps = 5/38 (13%)

Query: 562 PPP--PP--PPPGGMGPPPPPMPGMPGPPPPPMP-EMY 594
           PPP   P   PP  + P   P P  P P  PP P  +Y
Sbjct: 92  PPPWVKPFVDPPKVLLPSSTPKPVSPSPSAPPRPSSLY 129


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
           genes is always found in type III secretion operons,
           althought its function in the processes of secretion and
           virulence is unclear. Hpa stands for Hrp-associated
           gene, where Hrp stands for hypersensitivity response and
           virulence.
          Length = 201

 Score = 32.6 bits (74), Expect = 0.64
 Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 1/55 (1%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPPPP 589
            P    R     AQ+           P P PPPP     P P        PPP  
Sbjct: 14  DPARPARRRTPLAQLRRRDALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEA 68



 Score = 31.8 bits (72), Expect = 1.0
 Identities = 12/42 (28%), Positives = 12/42 (28%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
                  PP P PPP      P P        PPP      D
Sbjct: 32  DALAYAPPPRPEPPPPCDEDRPEPRADTRASDPPPEAPTDAD 73


>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668).  Members
           in this family of proteins are annotated as Cysteine and
           tyrosine-rich protein 1, however currently no function
           is known.
          Length = 154

 Score = 32.1 bits (73), Expect = 0.68
 Identities = 18/64 (28%), Positives = 20/64 (31%), Gaps = 17/64 (26%)

Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPP-------------PPPPGGMGPPPPPMPGMPG 584
           ++ NR   V    AA I        PPP             PPP      P P       
Sbjct: 88  MKNNRGTRVGVIRAAHINAISYPMAPPPYTYDHEMEYPTDLPPPYS----PAPQASAQRS 143

Query: 585 PPPP 588
           PPPP
Sbjct: 144 PPPP 147



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 11/24 (45%), Positives = 11/24 (45%), Gaps = 2/24 (8%)

Query: 562 PPP--PPPPPGGMGPPPPPMPGMP 583
           PPP  P P       PPPP PG  
Sbjct: 129 PPPYSPAPQASAQRSPPPPYPGNS 152


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 32.2 bits (73), Expect = 0.69
 Identities = 13/41 (31%), Positives = 16/41 (39%)

Query: 556 TGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
             P+   P    PP     P  P PG P  P  P+P +  D
Sbjct: 116 QNPQPQQPTHTQPPVQPQQPADPQPGQPMFPMQPLPPLVPD 156


>gnl|CDD|220403 pfam09788, Tmemb_55A, Transmembrane protein 55A.  Members of this
           family catalyze the hydrolysis of the 4-position
           phosphate of phosphatidylinositol 4,5-bisphosphate, in
           the reaction: 1-phosphatidyl-myo-inositol
           4,5-bisphosphate + H(2)O =
           1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
          Length = 258

 Score = 32.9 bits (75), Expect = 0.71
 Identities = 11/37 (29%), Positives = 14/37 (37%)

Query: 550 VAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPP 586
              G+  G +G  PP  PP      P P +     PP
Sbjct: 17  GGTGVSPGDEGAGPPSKPPYDQNFAPFPVLLSELPPP 53


>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
           Provisional.
          Length = 589

 Score = 33.2 bits (76), Expect = 0.73
 Identities = 30/129 (23%), Positives = 43/129 (33%), Gaps = 22/129 (17%)

Query: 478 FQLDVQPLVEHLAEKSK--TEED-RRVEDLSA-----KLEEAIM-------LRQEAEAKL 522
           +QL + P+ E          EED RR+ D SA       E+ +        L +E +   
Sbjct: 8   YQLPIVPIYEGHVRNILRCPEEDLRRLRDDSALRLRRYREDLLRDRLLRRRLGEELDDLQ 67

Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
            + Q   EDL S       R+ E +A +      G  GG           G     +   
Sbjct: 68  KRLQTECEDLRS-------RVSEAEALLLHDASGGEGGGATNGGEVNVDGGADRTWLAQS 120

Query: 583 PGPPPPPMP 591
           P  P    P
Sbjct: 121 PERPADGGP 129


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members of
           this family confer resistance to the metalloid element
           tellurium and its salts.
          Length = 98

 Score = 30.8 bits (70), Expect = 0.73
 Identities = 11/26 (42%), Positives = 11/26 (42%), Gaps = 3/26 (11%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPP 587
           P  P PPP    P PP     P  PP
Sbjct: 1   PAAPVPPP---APAPPAPAPPPAAPP 23


>gnl|CDD|223054 PHA03359, PHA03359, UL17 tegument protein; Provisional.
          Length = 686

 Score = 33.1 bits (76), Expect = 0.75
 Identities = 32/159 (20%), Positives = 50/159 (31%), Gaps = 49/159 (30%)

Query: 454 LEECVSQIVLHRGGCDPDFRSSRRFQLDVQPLV------EHLAEKSKTEEDRRVEDLSAK 507
           LE C+ +    R   DP   +      + + L          A+    E+DR        
Sbjct: 196 LEACLGE--DGRSNGDPVLSADAYIDPE-EQLTAPGGFDTPAADARIEEDDR-------- 244

Query: 508 LEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKG------- 560
               I+        L++  +  E ++S  PV      EV   VA+G+  G +        
Sbjct: 245 ---EIL-------SLIR--RAAEVVASRHPVRAAAPREVGGAVASGLRQGARALGAAGAA 292

Query: 561 GPPPPPP----------PPGG---MGPPPPPMPGMPGPP 586
           G                PPG    +G   P +P   GP 
Sbjct: 293 GAASGAADADAVLAGLEPPGKGRFVGSGNPRVPDPEGPV 331


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
           Provisional.
          Length = 418

 Score = 32.9 bits (75), Expect = 0.76
 Identities = 21/100 (21%), Positives = 33/100 (33%), Gaps = 7/100 (7%)

Query: 492 KSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVA 551
           + K  E  + E   A+  EA    Q A +    ++ +    SS +  +        +  A
Sbjct: 320 RVKHPEAGKEEPDGARNAEAKEPAQPATST--SSKGS----SSAQNKDSGSTGPGSSLAA 373

Query: 552 AGIPT-GPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
           A         G PP          P P +   P PP  P+
Sbjct: 374 ASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPL 413


>gnl|CDD|238560 cd01140, FatB, Siderophore binding protein FatB.  These proteins
           have been shown to function as ABC-type initial
           receptors in the siderophore-mediated iron uptake in
           some eubacterial species.  They belong to the TroA
           superfamily of periplasmic metal binding proteins that
           share a distinct fold and ligand binding mechanism. A
           typical TroA protein is comprised of two globular
           subdomains connected by a single helix and can bind
           their ligands in the cleft between these domains.
          Length = 270

 Score = 32.6 bits (75), Expect = 0.77
 Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 1/78 (1%)

Query: 762 GDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREAREKAENE 821
           G LAE Y   K I    +   D+K + +S  Q  +   K+   EE++  +    + +  E
Sbjct: 80  GRLAEKYDELKKIAPTIDLGADLKNYLESVKQNIETLGKIFGKEEEAKELVAEIDASIAE 139

Query: 822 KKDKAARK-KALIDMTTD 838
            K  A  K KAL+ +   
Sbjct: 140 AKSAAKGKKKALVVLVNG 157


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 32.9 bits (75), Expect = 0.79
 Identities = 12/37 (32%), Positives = 12/37 (32%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
                  PP  P PP    P  P       PP PP P
Sbjct: 4   ARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGP 40



 Score = 30.2 bits (68), Expect = 4.5
 Identities = 12/49 (24%), Positives = 16/49 (32%), Gaps = 8/49 (16%)

Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMP 580
            ++G    + R          G P G       PP PP G+   P   P
Sbjct: 48  RAAGPAAPRRRPR--------GCPAGVTFSSSAPPRPPLGLDDAPAATP 88



 Score = 29.1 bits (65), Expect = 9.8
 Identities = 14/38 (36%), Positives = 14/38 (36%), Gaps = 6/38 (15%)

Query: 558 PKGGPPPPPPPPGGMGPP------PPPMPGMPGPPPPP 589
           P     P PP P     P       PP P  PG  PPP
Sbjct: 9   PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46


>gnl|CDD|215605 PLN03153, PLN03153, hypothetical protein; Provisional.
          Length = 537

 Score = 33.0 bits (75), Expect = 0.88
 Identities = 9/35 (25%), Positives = 11/35 (31%), Gaps = 4/35 (11%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P   P PPPP      P  P    +       + E
Sbjct: 79  PIVTPLPPPPSS----PSLPSSLLLDHFRNRSLSE 109


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 33.0 bits (75), Expect = 0.89
 Identities = 28/98 (28%), Positives = 33/98 (33%), Gaps = 23/98 (23%)

Query: 507 KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGI--PTGPKGGPPP 564
           K+ E I    E  A L   ++ L  + S R          K   AA    P   K   P 
Sbjct: 301 KVVEVI---AETTAPLTPMEELLAKIPSQRV-------PPKESDAADGPKPVPTKPVTPE 350

Query: 565 PPPPPGGMGPPPP----PMPGMP-------GPPPPPMP 591
            P PP    PP P    P P  P        PP  P+P
Sbjct: 351 APSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIP 388


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 32.9 bits (75), Expect = 0.96
 Identities = 15/83 (18%), Positives = 18/83 (21%), Gaps = 17/83 (20%)

Query: 528 TLEDLSSGRPVEKNR--------LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM 579
           T  DL   R    ++        L                GGP  P      MGPP  P 
Sbjct: 189 TNPDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPP 248

Query: 580 ---------PGMPGPPPPPMPEM 593
                      +        P  
Sbjct: 249 SRYRPAYEAAPLAPAISSYGPAG 271


>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
           protease mycosin.  Members of this family are
           subtilisin-related serine proteases, found strictly in
           the Actinobacteria and associated with type VII
           secretion operons. The designation mycosin is used for
           members from Mycobacterium [Protein fate, Protein and
           peptide secretion and trafficking, Protein fate, Protein
           modification and repair].
          Length = 350

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 564 PPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            PP     + P P P   +  P PPP P+
Sbjct: 286 LPPEDGRPLRPAPAPARPVAAPAPPPPPD 314



 Score = 32.3 bits (74), Expect = 1.2
 Identities = 17/42 (40%), Positives = 19/42 (45%), Gaps = 9/42 (21%)

Query: 550 VAA--GIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           VAA  G    P+ G P  P P      P  P+   P PPPPP
Sbjct: 279 VAALTGEL-PPEDGRPLRPAP-----APARPVAA-PAPPPPP 313


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 26/124 (20%), Positives = 52/124 (41%), Gaps = 18/124 (14%)

Query: 742 EVRQKITLLSNMSKNMMTLYGDLAE-------------FYTFDKNIYTLEEFFTDIKTFK 788
           E R   TL    S N++   G ++E                 D+ +  L E  T +KT+ 
Sbjct: 82  ESRDHSTLSDIYSNNLVQRLGQMSEDLQRISKKCREIGSQLHDELLRVLNELQTAMKTYH 141

Query: 789 DSFYQAWQENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQGVMDSL 848
               ++     KL+EAE++     E +E++  +K +++   K  I+   ++ Q    ++ 
Sbjct: 142 TYHAESKSAERKLKEAEKQ----EEKQEQSPEKKLERSRSSK-KIEKEVEKRQAKYSEAK 196

Query: 849 LEAL 852
           L+  
Sbjct: 197 LKCT 200


>gnl|CDD|202620 pfam03411, Peptidase_M74, Penicillin-insensitive murein
           endopeptidase. 
          Length = 240

 Score = 32.2 bits (73), Expect = 1.1
 Identities = 12/45 (26%), Positives = 14/45 (31%), Gaps = 8/45 (17%)

Query: 555 PTGPKGGPPPPPPPPGGMGP-------PPPPMPGMPGP-PPPPMP 591
               +    P  P   G G        PP P    P    PPP+P
Sbjct: 184 ADSLECEDQPLVPSGDGCGAELQSWFEPPKPGTTKPEKKTPPPLP 228


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 32.3 bits (73), Expect = 1.2
 Identities = 9/42 (21%), Positives = 9/42 (21%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHK 599
           P    P   P       P  P        P   P      HK
Sbjct: 216 PTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHK 257



 Score = 30.4 bits (68), Expect = 4.5
 Identities = 10/44 (22%), Positives = 10/44 (22%)

Query: 549 QVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           Q  A  PT       P   P      P            PP  E
Sbjct: 210 QNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTE 253


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 27/138 (19%), Positives = 50/138 (36%), Gaps = 29/138 (21%)

Query: 476 RRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAK-------------LEEAIMLRQEAEAKL 522
           R  + D+  +V   AEK K E++   E + AK             L++  +  + +++  
Sbjct: 511 RLSKADIDRMVNE-AEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDSDK 569

Query: 523 VQAQKTLED----LSSGRPVEKN----RLDEVKA-------QVAAGIPTGPKGGPPPPPP 567
              +K +++    L   +  EK     +  EV++       ++      G  GG P   P
Sbjct: 570 ATIEKAIDEALEWLEKNQLAEKEEFEHKQKEVESVCNPIMTKMYQAAGGGMPGGMPGGMP 629

Query: 568 PPGGMGPPPPPMPGMPGP 585
                G  P       GP
Sbjct: 630 GGMPGGAGPAGAGASSGP 647


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 31.5 bits (71), Expect = 1.3
 Identities = 20/43 (46%), Positives = 21/43 (48%), Gaps = 5/43 (11%)

Query: 551 AAGIPTGPKGGPPPP---PPPPGGMGP-PPPPMPGMPGPPPPP 589
           A G   G +   PPP   PPPPGG G     P PG PGP P  
Sbjct: 118 AKGEGEGHEPEDPPPEDTPPPPGGEGEVEGGPSPG-PGPGPLD 159



 Score = 29.1 bits (65), Expect = 7.3
 Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 3/37 (8%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPP---MPGMPGPPPP 588
           PT P     PPP PP    PP         +   PPP
Sbjct: 54  PTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPP 90



 Score = 28.8 bits (64), Expect = 9.8
 Identities = 12/28 (42%), Positives = 12/28 (42%)

Query: 561 GPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
           G P  PP        PPP P  PG PP 
Sbjct: 48  GYPFCPPTTPHPSSQPPPCPPSPGHPPQ 75


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 32.1 bits (73), Expect = 1.3
 Identities = 23/111 (20%), Positives = 36/111 (32%), Gaps = 4/111 (3%)

Query: 483 QPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNR 542
           Q   EH A +   +   + E  SA+++  +    +        Q T   L   +P E+ R
Sbjct: 74  QFTQEHEAARQSPQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEAR 133

Query: 543 LDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
              +     A  P       P P      + P  P     P     P PE+
Sbjct: 134 QPVL--PQEAPAPQPVHSAAPQPAVQT--VQPAVPEQQVQPEEVVEPAPEV 180


>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter.  The Epithelial Na+
           Channel (ENaC) Family (TC 1.A.06)The ENaC family
           consists of sodium channels from animals and has no
           recognizable homologues in other eukaryotes or bacteria.
           The vertebrate ENaC proteins from epithelial cells
           cluster tightly together on the phylogenetic tree:
           voltage-insensitive ENaC homologues are also found in
           the brain. Eleven sequenced C. elegans proteins,
           including the degenerins, are distantly related to the
           vertebrate proteins as well as to each other. At least
           some ofthese proteins form part of a mechano-transducing
           complex for touch sensitivity. Other members of the ENaC
           family, the acid-sensing ion channels, ASIC1-3,are homo-
           or hetero-oligomeric neuronal H+-gated channels that
           mediate pain sensation in response to tissue acidosis.
           The homologous Helix aspersa(FMRF-amide)-activated Na+
           channel is the first peptide neurotransmitter-gated
           ionotropic receptor to be sequenced.Mammalian ENaC is
           important for the maintenance of Na+ balance and the
           regulation of blood pressure. Three homologous ENaC
           subunits, a, b and g, havebeen shown to assemble to form
           the highly Na+-selective channel.This model is designed
           from the vertebrate members of the ENaC family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 595

 Score = 32.4 bits (74), Expect = 1.4
 Identities = 11/34 (32%), Positives = 13/34 (38%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
            T P      PP  P  +  P      +PG PPP
Sbjct: 557 DTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPP 590


>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
           [Intracellular trafficking and secretion].
          Length = 360

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 11/38 (28%), Positives = 15/38 (39%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           I    +        PP  +    PP+P +P  PP P P
Sbjct: 44  IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPP 81



 Score = 31.7 bits (72), Expect = 1.7
 Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 2/57 (3%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
             ++  +      Q  + +  G    PP P  PP  + PP P   G P  P   + E
Sbjct: 44  IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPP--LPPPLPVDLGAPVLPDQQVEE 98


>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 427

 Score = 32.0 bits (72), Expect = 1.7
 Identities = 16/40 (40%), Positives = 16/40 (40%), Gaps = 2/40 (5%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPM 590
           A   P        P P  P   G PP PM  MP P  PPM
Sbjct: 375 ATAEPAFKSAMAIPMPSMPHVQGFPPFPM--MPLPQMPPM 412


>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
           Provisional.
          Length = 464

 Score = 31.8 bits (73), Expect = 1.7
 Identities = 10/49 (20%), Positives = 11/49 (22%), Gaps = 2/49 (4%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           D   A  AA         P            P P  P  P       P+
Sbjct: 86  DAGAAPAAA--AEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPD 132



 Score = 31.4 bits (72), Expect = 2.4
 Identities = 7/48 (14%), Positives = 10/48 (20%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           E  +   A      +     P            P P  P  P   +  
Sbjct: 82  ESASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAA 129


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 31.3 bits (71), Expect = 1.7
 Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 19/100 (19%)

Query: 503 DLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRL---------DEVKAQV-AA 552
           +   + +EA  LR   EA+L   ++T+ D  S  P    R          +E+K+ +  A
Sbjct: 45  EFRRQFDEA--LR---EAELDDVRQTISDARSLNPRTSLRQAMNPLRQAGNEIKSDLQKA 99

Query: 553 GIPTGPKGGPPPPPP----PPGGMGPPPPPMPGMPGPPPP 588
              + P     P        P G+   PP +P     P  
Sbjct: 100 TSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAV 139


>gnl|CDD|146151 pfam03363, Herpes_LP, Herpesvirus leader protein. 
          Length = 177

 Score = 31.0 bits (70), Expect = 1.9
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 4/46 (8%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKG----GPPPPPPPPGGMGPPPP 577
           R V +  L E + +V +G P+GP+G     P P  P P G+GP  P
Sbjct: 45  RRVRRRVLLEEEEEVVSGPPSGPRGDPSEAPGPSRPGPPGLGPEGP 90


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 31.9 bits (72), Expect = 2.0
 Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 1/28 (3%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
             PP  P P     PPPPP    PG  P
Sbjct: 428 RQPPGAPAPRRDNDPPPPPRA-RPGSTP 454


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 14/30 (46%), Positives = 14/30 (46%), Gaps = 5/30 (16%)

Query: 564 PPPPPPGGMGPP-----PPPMPGMPGPPPP 588
           P PP PG   PP     PPP    P  PPP
Sbjct: 147 PMPPQPGMPAPPYSLQYPPPGLLQPQGPPP 176


>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family.  The herpesvirus
           UL25 gene product is a virion component involved in
           virus penetration and capsid assembly. The product of
           the UL25 gene is required for packaging but not cleavage
           of replicated viral DNA. This family includes a number
           of herpesvirus proteins: EHV-1 36, EBV BVRF1, HCMV UL77,
           ILTV ORF2, and VZV gene 34.
          Length = 538

 Score = 31.6 bits (72), Expect = 2.0
 Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 14/116 (12%)

Query: 470 PDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTL 529
            D  S    +          A + +T  ++     +    +   L+ E +      Q   
Sbjct: 21  CDEPSLFETRPR-------FALRLRTRAEQLRLAKA----KKRALKAELDNLEQLHQTRS 69

Query: 530 EDLSSG-RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPG 584
            ++ +  RP+E N+L+EV A     + T  +   P     P        P  G+ G
Sbjct: 70  AEIDADLRPIE-NQLEEV-ADPLDLLETAARAAEPADAAQPDATAGEAAPGEGVGG 123


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 31.0 bits (70), Expect = 2.1
 Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 17/81 (20%)

Query: 860 TGSSIKSVGCPSHSALQTGSAFTREQRRKRQN-----------------DRPMGAERRAQ 902
            GSSI+S+   +    Q+ +     ++RK                    D  MG   R  
Sbjct: 99  PGSSIQSLPSTTGVRPQSSAENANSEKRKLLVIRVQTALVVFGYYGGIIDGIMGEATRVA 158

Query: 903 LNRSRSRNGIVITRELSNEVL 923
           LNR +  NG+ +T  +  + L
Sbjct: 159 LNRYQMDNGLPVTGTIDTQTL 179


>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597).  This
           family of proteins is found in bacteria, eukaryotes and
           viruses. Proteins in this family are typically between
           126 and 281 amino acids in length. The function of this
           domain is unknown. The structure of this domain has been
           found to contain five helices with a long flexible loop
           between helices one and two.
          Length = 124

 Score = 30.0 bits (68), Expect = 2.1
 Identities = 6/31 (19%), Positives = 7/31 (22%)

Query: 559 KGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
           +     P P       P       P   PP 
Sbjct: 12  EAAAAAPAPAAAPATAPAAAAAAAPAATPPA 42



 Score = 29.6 bits (67), Expect = 3.4
 Identities = 5/27 (18%), Positives = 5/27 (18%)

Query: 562 PPPPPPPPGGMGPPPPPMPGMPGPPPP 588
            P P   P               PP  
Sbjct: 17  APAPAAAPATAPAAAAAAAPAATPPAA 43


>gnl|CDD|221172 pfam11701, UNC45-central, Myosin-binding striated muscle assembly
           central.  The UNC-45 or small muscle protein 1 of
           C.elegans is expressed in two forms from different
           genomic positions in mammals, as a general tissue
           protein UNC-45a and a specific form Unc-45b expressed
           only in striated and skeletal muscle. All members carry
           up to three amino-terminal tetratricopeptide repeat
           (TPR) domains towards their N-terminal, a UCS domain at
           the C-terminal that contains a number of Arm repeats
           pfam00514 and this central region of approximately 400
           residues. Both the general form and the muscle form of
           UNC-45 function in myotube formation through cell
           fusion. Myofibril formation requires both GC and SM
           UNC-45, consistent with the fact that the cytoskeleton
           is necessary for the development and maintenance of
           organised myofibrils. The S. pombe Rng3p, is crucial for
           cell shape, normal actin cytoskeleton, and contractile
           ring assembly, and is essential for assembly of the
           myosin II-containing progenitors of the contractile
           ring. Widespread defects in the cytoskeleton are found
           in null mutants of all three fungal proteins. Mammalian
           Unc45 is found to act as a specific chaperone during the
           folding of myosin and the assembly of striated muscle by
           forming a stable complex with the general chaperone
           Hsp90. The exact function of this central region is not
           known.
          Length = 155

 Score = 30.3 bits (69), Expect = 2.2
 Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 10/63 (15%)

Query: 252 LEAVKVLAAVCLIPPDGHDKVIKAITMSGELKGKERFQPVVQGLMVKGNNEALRTACLQL 311
           + AV  L A+   PPD    V   + +S      E F   +  L+ +     L+ A L+L
Sbjct: 61  INAVSALTALFPGPPD----VGATLFLS------EGFLESLLPLVARSERRKLQLAALEL 110

Query: 312 INA 314
           ++A
Sbjct: 111 LSA 113


>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
          Length = 290

 Score = 31.1 bits (70), Expect = 2.2
 Identities = 18/85 (21%), Positives = 24/85 (28%), Gaps = 10/85 (11%)

Query: 515 RQEAEAKLVQAQKTLED--LSSGRPVEKNR-LDEVKAQVAAGIPTGPKGGPPPPPPPPGG 571
           R +AE +     +   D    +G   E  R  D   ++     P     GP      P G
Sbjct: 54  RSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAG 113

Query: 572 MG-------PPPPPMPGMPGPPPPP 589
                     P P  P  P     P
Sbjct: 114 SRGRGRRAESPSPRDPPNPKGASAP 138


>gnl|CDD|214545 smart00182, CULLIN, Cullin. 
          Length = 143

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 7/52 (13%)

Query: 8  EEQENMLEKLNPE---QLNQKFEDMLNDMNLS----DEKKEPLRRQPLANKK 52
          + +ENM+ KL  E   +   K E M  D++LS       K+ L   P A   
Sbjct: 29 DAEENMITKLKQECGYEFTSKLERMFRDISLSKDLNQSFKDMLENNPSAKPI 80



 Score = 29.6 bits (67), Expect = 3.9
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 7/51 (13%)

Query: 70  KQENMLEKLNPE---QLNQKFEDMLNDMNLS----DEKKEPLRRQPLANKK 113
            +ENM+ KL  E   +   K E M  D++LS       K+ L   P A   
Sbjct: 30  AEENMITKLKQECGYEFTSKLERMFRDISLSKDLNQSFKDMLENNPSAKPI 80


>gnl|CDD|226293 COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis,
           outer membrane].
          Length = 284

 Score = 31.3 bits (71), Expect = 2.4
 Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 9/46 (19%)

Query: 555 PTGPKGGPPPPPPPPG-GMGP-------PPPPMPGMPGPP-PPPMP 591
           P G          PPG G G        PPPP    P P  PPP+P
Sbjct: 222 PAGSLECEDQALVPPGDGCGKELASWFEPPPPGTTKPEPKAPPPLP 267


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 31.0 bits (69), Expect = 2.4
 Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 4/42 (9%)

Query: 552 AGIPTGPKGGPPPPPPPPGGMGPPP----PPMPGMPGPPPPP 589
           A  PT P     P P  P    PP     PP P +P P P P
Sbjct: 119 ATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAP 160



 Score = 30.2 bits (67), Expect = 4.2
 Identities = 17/47 (36%), Positives = 18/47 (38%), Gaps = 11/47 (23%)

Query: 556 TGPKGGPPPPP------PPPGGMGPPPPPMPGMPG-----PPPPPMP 591
           T P   P  PP      PPP     P  P P  P      PP PP+P
Sbjct: 108 TCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLP 154


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.4 bits (72), Expect = 2.4
 Identities = 13/57 (22%), Positives = 14/57 (24%)

Query: 536 RPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            P       EV  Q AA   +      P     P      PPP    P   P     
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLP 416


>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092).  This
           family consists of several hypothetical proteins of
           unknown function all from photosynthetic organisms
           including plants and cyanobacteria.
          Length = 270

 Score = 31.1 bits (71), Expect = 2.5
 Identities = 13/27 (48%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 566 PPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           P  PG M   PPP+     PPP P+PE
Sbjct: 109 PQEPGYMALAPPPVALDK-PPPQPLPE 134


>gnl|CDD|220871 pfam10759, DUF2587, Protein of unknown function (DUF2587).  This is
           a bacterial family of proteins with no known function.
          Length = 168

 Score = 30.5 bits (69), Expect = 2.5
 Identities = 10/27 (37%), Positives = 11/27 (40%)

Query: 568 PPGGMGPPPPPMPGMPGPPPPPMPEMY 594
            P G+G  PP  P      PPP    Y
Sbjct: 141 LPPGVGIAPPGQPQGARGGPPPGTGQY 167


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 31.3 bits (71), Expect = 2.5
 Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
           EV++  +A        G P P   P     PP P P      PP + 
Sbjct: 373 EVRSLRSAPTAAATAAGAPLPDFDPRP-RGPPAPEPARSAEAPPLVA 418



 Score = 30.2 bits (68), Expect = 6.4
 Identities = 15/73 (20%), Positives = 17/73 (23%), Gaps = 3/73 (4%)

Query: 524 QAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
            A    +       +EK       A  AA    G    P P   P     P P P     
Sbjct: 356 PAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGA---PLPDFDPRPRGPPAPEPARSAE 412

Query: 584 GPPPPPMPEMYTD 596
            PP          
Sbjct: 413 APPLVAPAAAPAG 425


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.3 bits (71), Expect = 2.6
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 793 QAWQENIKLREAEEKSIRVREAREKAENEKKDKA-ARKKA 831
           QA +E    + A E   +  EA++KAE E K KA A+ KA
Sbjct: 151 QAEEEAKA-KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKA 189


>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region.  The region
           described in this family is found towards the N-terminus
           of various eukaryotic fork head/HNF-3-related
           transcription factors (which contain the pfam00250
           domain). These proteins play key roles in embryogenesis,
           maintenance of differentiated cell states, and
           tumorigenesis.
          Length = 137

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 14/48 (29%), Positives = 14/48 (29%), Gaps = 4/48 (8%)

Query: 551 AAGIPTGPKGGPPPPPP--PPGGMGP--PPPPMPGMPGPPPPPMPEMY 594
            AG P G  G         P G MG   P P   G    P   M    
Sbjct: 69  GAGSPMGMMGMSSMGTSLSPSGTMGAMGPMPAGSGGSLSPNMSMSRAS 116



 Score = 29.6 bits (66), Expect = 4.3
 Identities = 6/43 (13%), Positives = 7/43 (16%)

Query: 551 AAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEM 593
            +               P    G     M GM          M
Sbjct: 32  TSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMSPGYGGAGSPM 74


>gnl|CDD|99790 cd06193, siderophore_interacting, Siderophore interacting proteins
           share the domain structure of the ferredoxin reductase
           like family. Siderophores are produced in various
           bacteria (and some plants) to extract iron from hosts.
           Binding constants are high, so iron can be pilfered from
           transferrin and lactoferrin for bacterial uptake,
           contributing to pathogen virulence. Ferredoxin reductase
           (FNR), an FAD and NAD(P) binding protein, was intially
           identified as a chloroplast reductase activity,
           catalyzing the electron transfer from reduced
           iron-sulfur protein ferredoxin to NADP+ as the final
           step in the electron transport mechanism of photosystem
           I. FNR transfers electrons from reduced ferredoxin to
           FAD (forming FADH2 via a semiquinone intermediate) and
           then transfers a hydride ion to convert NADP+ to NADPH.
           FNR has since been shown to utilize a variety of
           electron acceptors and donors and has a variety of
           physiological functions including nitrogen assimilation,
           dinitrogen fixation, steroid hydroxylation, fatty acid
           metabolism, oxygenase activity, and methane assimilation
           in a variety of organisms. FNR has an NAD(P)-binding
           sub-domain of the alpha/beta class and a discrete
           (usually N-terminal) flavin sub-domain which vary in
           orientation with respect  to the NAD(P) binding domain.
           The N-terminal moeity may contain a flavin prosthetic
           group (as in flavoenzymes) or use flavin as a substrate.
           Because flavins such as FAD can exist in oxidized,
           semiquinone (one-electron reduced), or fully reduced
           hydroquinone forms, FNR can interact with one and two
           electron carriers. FNR has a strong preference for
           NADP(H) vs NAD(H).
          Length = 235

 Score = 30.7 bits (70), Expect = 2.6
 Identities = 12/31 (38%), Positives = 13/31 (41%), Gaps = 2/31 (6%)

Query: 567 PPPGGMGPPPPPMPGMPGPPPPPMPEM--YT 595
           P PG   P  P +     PP  P P M  YT
Sbjct: 38  PDPGQAPPVLPVLGRRRWPPEEPRPVMRTYT 68


>gnl|CDD|188647 cd05396, An_peroxidase_like, Animal heme peroxidases and related
           proteins.  A diverse family of enzymes, which includes
           prostaglandin G/H synthase, thyroid peroxidase,
           myeloperoxidase, linoleate diol synthase,
           lactoperoxidase, peroxinectin, peroxidasin, and others.
           Despite its name, this family is not restricted to
           metazoans: members are found in fungi, plants, and
           bacteria as well.
          Length = 370

 Score = 31.2 bits (71), Expect = 2.7
 Identities = 14/60 (23%), Positives = 20/60 (33%), Gaps = 4/60 (6%)

Query: 755 KNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQA----WQENIKLREAEEKSIR 810
           K        +  F   D   Y L EFFT    F  S          EN + +E  +  ++
Sbjct: 138 KFTDPRDDLVLLFPDPDVVPYVLSEFFTAAYRFGHSLVPEGVDRIDENGQPKEIPDVPLK 197


>gnl|CDD|214009 cd12810, Esterase_713_like-3, Uncharacterized enzymes similar to
           novel bacterial esterase that cleaves esters on
           halogenated cyclic compounds.  This family contains
           uncharacterized proteins similar to a novel bacterial
           esterase (Alcaligenes esterase 713) with the alpha/beta
           hydrolase fold but does not contain the GXSXXG
           pentapeptide around the active site serine residue as
           commonly seen in other enzymes of this class. Esterase
           713 shows negligible sequence homology to other esterase
           and lipase enzymes. It is active as a dimer and cleaves
           esters on halogenated cyclic compounds though its
           natural substrate is unknown.
          Length = 328

 Score = 31.0 bits (71), Expect = 2.8
 Identities = 24/77 (31%), Positives = 25/77 (32%), Gaps = 28/77 (36%)

Query: 544 DEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIIN 603
           D VKA VA      P GGP PP         PPPP P   GP  P    +          
Sbjct: 197 DNVKAIVA----YEPGGGPFPPG------EEPPPPPPSAFGPLGPVGVPLE--------- 237

Query: 604 GKQACEEVKQSKKLAKI 620
                       KL KI
Sbjct: 238 ---------DFLKLTKI 245


>gnl|CDD|177663 PLN00026, PLN00026, aquaporin  NIP; Provisional.
          Length = 298

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 7/23 (30%), Positives = 8/23 (34%)

Query: 568 PPGGMGPPPPPMPGMPGPPPPPM 590
                 P  P  P  PG P  P+
Sbjct: 1   MDNEEVPSAPSTPATPGTPGGPL 23


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 31.2 bits (71), Expect = 3.1
 Identities = 14/24 (58%), Positives = 14/24 (58%), Gaps = 3/24 (12%)

Query: 557 GPKGGPPPPPPPPGGMGPPPPPMP 580
           GPK GP P PPPP    P P P P
Sbjct: 6   GPKPGPTPTPPPP---APTPEPTP 26



 Score = 30.5 bits (69), Expect = 4.8
 Identities = 14/33 (42%), Positives = 14/33 (42%), Gaps = 7/33 (21%)

Query: 560 GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            G P P P P    P PPP    P P P P P 
Sbjct: 2   DGEPGPKPGP---TPTPPP----PAPTPEPTPA 27


>gnl|CDD|233515 TIGR01659, sex-lethal, sex-lethal family splicing factor.  This
           model describes the sex-lethal family of splicing
           factors found in Dipteran insects. The sex-lethal
           phenotype, however, may be limited to the Melanogasters
           and closely related species. In Drosophila the protein
           acts as an inhibitor of splicing. This subfamily is most
           closely related to the ELAV/HUD subfamily of splicing
           factors (TIGR01661).
          Length = 346

 Score = 30.8 bits (69), Expect = 3.2
 Identities = 8/21 (38%), Positives = 8/21 (38%)

Query: 553 GIPTGPKGGPPPPPPPPGGMG 573
           G      G    PP PP  MG
Sbjct: 298 GNMGMAGGSGMNPPNPPAFMG 318


>gnl|CDD|180801 PRK07033, PRK07033, hypothetical protein; Provisional.
          Length = 427

 Score = 30.8 bits (70), Expect = 3.4
 Identities = 11/34 (32%), Positives = 11/34 (32%)

Query: 558 PKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMP 591
             G     PP PG   P   P  G P  P P   
Sbjct: 8   SAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRG 41



 Score = 30.4 bits (69), Expect = 5.0
 Identities = 8/36 (22%), Positives = 8/36 (22%)

Query: 554 IPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
            P     G   PP P        P       P P  
Sbjct: 5   DPFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGR 40



 Score = 30.4 bits (69), Expect = 5.0
 Identities = 8/32 (25%), Positives = 8/32 (25%), Gaps = 4/32 (12%)

Query: 564 PPPPPPGGMGPPPP----PMPGMPGPPPPPMP 591
           P     GG  PP P    P        P    
Sbjct: 6   PFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPR 37



 Score = 29.2 bits (66), Expect = 9.7
 Identities = 8/36 (22%), Positives = 8/36 (22%), Gaps = 4/36 (11%)

Query: 565 PPPPPGGMGPPPPP----MPGMPGPPPPPMPEMYTD 596
           P     G   PP P        P    P  P     
Sbjct: 6   PFSAGSGGFVPPNPGDRTPAAAPAAGAPFQPRPGRG 41


>gnl|CDD|227444 COG5113, UFD2, Ubiquitin fusion degradation protein 2
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 929

 Score = 31.1 bits (70), Expect = 3.4
 Identities = 29/154 (18%), Positives = 53/154 (34%), Gaps = 18/154 (11%)

Query: 402 PYLLSILQHLLFIRDDQNMVMDSACEPYLLSILQH-LLFIRDDQYVRLAYYKLLEECVSQ 460
           P L   L + L +  D+ M M+S     +  I +   +F R      +A+Y  +E     
Sbjct: 549 PMLNRKLDYYLSLGRDE-MRMES--RSIIHDIFKEGKVFSRWLLPALMAFYIEIESTGQS 605

Query: 461 IVLHRGGCDPDFRSSR-------RFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIM 513
              +      D  + R        F+       E L+    T     V+  +  L +   
Sbjct: 606 TQFY------DKFNIRFIICMMKDFEYKQPSYSEGLSSIKDTNLPFFVKFDAKMLNDLTR 659

Query: 514 LRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVK 547
           L  EA  +LV+    ++ L +      N  + + 
Sbjct: 660 LLDEALKELVEEHN-IQSLLADAISNSNISERIG 692


>gnl|CDD|206165 pfam13995, YebF, YebF-like protein.  The YebF-like protein family
           appears to be a group of colicin immunity proteins. As
           well as YebF the family includes cmi, the colicin M
           immunity protein. This domain family is found in
           bacteria, and is approximately 80 amino acids in length.
           The alignment contains two conserved cysteine residues
           that form a disulphide bond in the solved structure.
          Length = 89

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 10/58 (17%)

Query: 674 KFPECLKFGDELLHVDRAARVSTDVIQNSIRQMENNIKNLETDIQNCKQAPVANENDK 731
           KFP C    D L +   AA+V  D +QN I + +++ K L T        PVA  N  
Sbjct: 4   KFPSC----DGLTYEQVAAQVKRDFLQNRIPRWDDDKKLLGTAK------PVAWVNVN 51


>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
          Length = 935

 Score = 31.2 bits (70), Expect = 3.5
 Identities = 14/57 (24%), Positives = 17/57 (29%), Gaps = 17/57 (29%)

Query: 553 GIPTGPKGGPPPPPPPPGGMGPP-----------------PPPMPGMPGPPPPPMPE 592
           G+    +    P P P G +  P                 P PMP  P P P    E
Sbjct: 486 GVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALE 542


>gnl|CDD|217044 pfam02451, Nodulin, Nodulin.  Nodulin is a plant protein of unknown
           function. It is induced during nodulation in legume
           roots after rhizobium infection.
          Length = 177

 Score = 30.0 bits (67), Expect = 3.6
 Identities = 26/116 (22%), Positives = 38/116 (32%), Gaps = 7/116 (6%)

Query: 475 SRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEE---AIMLRQEAEAKLVQAQKTLED 531
           S +F+  V     H+AE   T +     +          A  L    E  L   + +L D
Sbjct: 6   SPKFKKFVTHCTSHVAETCSTTDPSHGSEALHNQGPLGLAFCLFDSMEKCLADHKASLVD 65

Query: 532 LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPP 587
           LS   P   ++LD  ++     I   P    PP  P    +    P  P  P    
Sbjct: 66  LSLFFPRRSSKLDPHQSL-PGSIQLPP---IPPKSPFEPSISRVIPWPPIQPRDQG 117


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 31.2 bits (71), Expect = 3.6
 Identities = 29/213 (13%), Positives = 74/213 (34%), Gaps = 4/213 (1%)

Query: 633 SGSRNGGAFGFEINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAA 692
            G   GG+     + L +    +++E K   L   +  +E+   E  K  +EL       
Sbjct: 658 GGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL 717

Query: 693 RVSTDVIQNSIRQMENNIKNLETDIQNCKQ--APVANENDKFLEIMEPFAKEVRQKITLL 750
           R   + +   I  +  ++  LE +++  ++  A ++ E  +    +E   + + +    L
Sbjct: 718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777

Query: 751 SNMSKNMMTLYGDLAEFYT-FDKNIYTLEEFFTDIKTFKDSFYQAWQENIKL-REAEEKS 808
           +     +  L   + +           L+E   ++    +      +    L R      
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837

Query: 809 IRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
            R+ +  E+ E   +D  +    + ++     +
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870



 Score = 30.4 bits (69), Expect = 5.1
 Identities = 13/50 (26%), Positives = 27/50 (54%)

Query: 501 VEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
           +E L A+LEE     +E E++L + ++ LE L S     + ++  +  ++
Sbjct: 353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEI 402



 Score = 29.6 bits (67), Expect = 9.1
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 4/60 (6%)

Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQ 549
           AE+   E    +++L  KLEE  +   E E ++ + QK L  L++    E +RL++ K  
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN----EISRLEQQKQI 306


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 30.4 bits (69), Expect = 3.6
 Identities = 15/46 (32%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 548 AQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPM-PGMPGPPPPPMPE 592
             V AG    P   P    PP   + P P P+ P  P P   P P+
Sbjct: 73  EAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPK 118



 Score = 30.0 bits (68), Expect = 4.5
 Identities = 16/48 (33%), Positives = 18/48 (37%), Gaps = 1/48 (2%)

Query: 545 EVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            V+A  AA     P    PP  P      P  PP P  P   P P P+
Sbjct: 74  AVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKP-KPVEKPKPKPK 120


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 30.5 bits (69), Expect = 3.7
 Identities = 8/34 (23%), Positives = 10/34 (29%), Gaps = 4/34 (11%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTD 596
           P  P PP         +   P P   P+  M   
Sbjct: 143 PAAPEPP----SYEETIKPGPAPVEEPVDSMAIA 172


>gnl|CDD|152747 pfam12312, NeA_P2, Nepovirus subgroup A polyprotein.  This family
           of proteins is found in viruses. Proteins in this family
           are typically between 259 and 1110 amino acids in
           length. The family is found in association with
           pfam03688, pfam03689, pfam03391. This family is one of
           the polyproteins expressed by Nepoviruses in subgroup A.
          Length = 175

 Score = 29.9 bits (67), Expect = 3.9
 Identities = 12/19 (63%), Positives = 12/19 (63%), Gaps = 2/19 (10%)

Query: 559 KGGP--PPPPPPPGGMGPP 575
            GGP  PPPPPPP    PP
Sbjct: 113 PGGPCLPPPPPPPPIQKPP 131


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 30.8 bits (69), Expect = 3.9
 Identities = 29/111 (26%), Positives = 43/111 (38%), Gaps = 3/111 (2%)

Query: 476 RRFQLDVQPLVEHLAE---KSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDL 532
           +R +L+ +   +   E   K++ E   R   L A+      L  E +A L   +  L   
Sbjct: 305 QRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKE 364

Query: 533 SSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMP 583
              +  E  +L    A   + + T  K    P  PP   MGPPP P P  P
Sbjct: 365 LEAKKREAEQLRMELAIRISALDTCIKAKSLPAMPPSRPMGPPPNPPPIDP 415


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 30.8 bits (69), Expect = 4.1
 Identities = 22/69 (31%), Positives = 25/69 (36%), Gaps = 4/69 (5%)

Query: 523 VQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGM 582
             AQ+     S   P     LD  K+      P  PK  PPPP  P     P  P +   
Sbjct: 601 RSAQRPTRPKSPKLP---ELLDIPKSPKRPESPKSPKR-PPPPQRPSSPERPEGPKIIKS 656

Query: 583 PGPPPPPMP 591
           P PP  P P
Sbjct: 657 PKPPKSPKP 665


>gnl|CDD|99780 cd06183, cyt_b5_reduct_like, Cytochrome b5 reductase catalyzes the
           reduction of 2 molecules of cytochrome b5 using NADH as
           an electron donor. Like ferredoxin reductases, these
           proteins have an N-terminal FAD binding subdomain and a
           C-terminal NADH binding subdomain, separated by a cleft,
           which accepts FAD. The NADH-binding moiety interacts
           with part of the FAD and resembles a Rossmann fold.
           However, NAD is bound differently than in canonical
           Rossmann fold proteins. Nitrate reductases,
           flavoproteins similar to pyridine nucleotide cytochrome
           reductases, catalyze the reduction of nitrate to
           nitrite. The enzyme can be divided into three functional
           fragments that bind the cofactors molybdopterin,
           heme-iron, and FAD/NADH.
          Length = 234

 Score = 30.2 bits (69), Expect = 4.1
 Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 10/59 (16%)

Query: 306 TACLQLINAIVATPDDLEFRLHL------RNEIMRVGLYDLLDALEKDASEDVSVQLKV 358
           T  LQLI AI+  P+D   ++ L        +I+   L + LD L K   +   V   +
Sbjct: 117 TPMLQLIRAILKDPEDKT-KISLLYANRTEEDIL---LREELDELAKKHPDRFKVHYVL 171


>gnl|CDD|184753 PRK14581, hmsF, outer membrane N-deacetylase; Provisional.
          Length = 672

 Score = 30.7 bits (69), Expect = 4.1
 Identities = 26/107 (24%), Positives = 43/107 (40%), Gaps = 17/107 (15%)

Query: 736 MEPFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAW 795
           + PF  EVRQ+I          + +Y D+A     D  IY  +   +D       F  A 
Sbjct: 435 LSPFNPEVRQRI----------IDIYRDMAYSAPIDGIIYHDDAVMSD-------FEDAS 477

Query: 796 QENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQQ 842
            + I+  E       +   R+  E  ++    + K LID T + T++
Sbjct: 478 PDAIRAYEKAGFPGSITTIRQDPEMMQRWTRYKSKYLIDFTNELTRE 524


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 30.2 bits (68), Expect = 4.1
 Identities = 15/48 (31%), Positives = 16/48 (33%), Gaps = 1/48 (2%)

Query: 546 VKAQVAAGIPTGPK-GGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
           V  Q  A  P      G P     P  + PP  P  G     P P PE
Sbjct: 89  VDPQPVAQPPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPE 136


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.7 bits (70), Expect = 4.2
 Identities = 21/67 (31%), Positives = 24/67 (35%), Gaps = 7/67 (10%)

Query: 555 PTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYTDCHKNIINGKQACEEV--K 612
            T P   PPPPPP         P        PPPP+PE            K +  E   K
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPE-----EPTPSPTKDSSPEEIDK 599

Query: 613 QSKKLAK 619
            +K LA 
Sbjct: 600 AAKNLAD 606



 Score = 29.5 bits (67), Expect = 8.9
 Identities = 15/42 (35%), Positives = 16/42 (38%)

Query: 547 KAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPP 588
            +  A   P   K  PPP P PP        P P  P PPP 
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPT 558


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 30.6 bits (69), Expect = 4.3
 Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 32/133 (24%)

Query: 477 RFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGR 536
           +F++D++P       K K E            E+       A+ KL+ + K L D     
Sbjct: 12  KFKIDMEPDETVKELKEKIEA-----------EQGKDAYPVAQQKLIYSGKILSDDK--- 57

Query: 537 PVEKNRLDE-------------VKAQVAAGIPT---GPKGGPPPPPPPPGGMGPPPPP-- 578
            V++ ++ E                +VA    T    P   P PP  P  GM   P    
Sbjct: 58  TVKEYKIKEKDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAV 117

Query: 579 MPGMPGPPPPPMP 591
               P        
Sbjct: 118 EEKSPSEESATAT 130


>gnl|CDD|139164 PRK12700, flgH, flagellar basal body L-ring protein; Reviewed.
          Length = 230

 Score = 30.0 bits (67), Expect = 4.3
 Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 556 TGPKGGPPPPPPPPGG 571
           TGP   PPPPPP P  
Sbjct: 32  TGPLTAPPPPPPQPSA 47


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.5 bits (69), Expect = 4.3
 Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 15/89 (16%)

Query: 803 EAEEKSIRVREAREKAENEKKDKAAR-------------KKALIDMTTDQTQQGVMDSLL 849
           EAE K     EA++KA  E K  AA+                  +          +D L 
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLF 263

Query: 850 EALQTGR--PKKTGSSIKSVGCPSHSALQ 876
             L +G+  PK  G +  +    + +   
Sbjct: 264 GGLDSGKNAPKTGGGAKGNGAQGAGAGNG 292


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 30.7 bits (70), Expect = 4.4
 Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 3/104 (2%)

Query: 738 PFAKEVRQKITLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQE 797
              K  R+ +  LS + K+   L   LA F    +      EF T  K       +  ++
Sbjct: 362 SKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEE---EIEFLTGSKKATKKIKKIVEK 418

Query: 798 NIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
             K RE E+K  + +    K + E++++   KK       ++  
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 30.5 bits (70), Expect = 4.6
 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 8/66 (12%)

Query: 493 SKTEEDRRVEDLSA------KLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEV 546
           S  E +R V+D  A      K +E +  R +A++ + Q +KTL++L  G  V  +  +++
Sbjct: 501 SDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKEL--GDKVPADEKEKI 558

Query: 547 KAQVAA 552
           +A +  
Sbjct: 559 EAAIKE 564


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 30.4 bits (69), Expect = 4.6
 Identities = 16/65 (24%), Positives = 27/65 (41%)

Query: 488 HLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVK 547
             A+      D     L+A  +     +QE E  L Q  K L+  +  +     +LDE++
Sbjct: 60  QQAQNQTATNDALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQ 119

Query: 548 AQVAA 552
            +VA 
Sbjct: 120 QKVAT 124


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 30.4 bits (69), Expect = 4.6
 Identities = 36/184 (19%), Positives = 59/184 (32%), Gaps = 36/184 (19%)

Query: 638 GGAFGFEINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTD 697
           GGA    I   T     K+      +   L+D  + K    L             ++   
Sbjct: 432 GGA---RIKG-TIELPFKE------VCEKLLDKDKNKPFIKLILLSNNSQEKAVLKL--- 478

Query: 698 VIQNSIRQMENNIKNLETDIQNCKQAPVANE---------NDKFLEIMEPFAKEVRQKI- 747
                I++  + IK  ETD    ++     +         + K   I E F +     + 
Sbjct: 479 --LQKIKKNNDFIKEFETDALKLQEILEKVDSKSEKLEKISAKIDNIKELFDESKMSILQ 536

Query: 748 -TLLSNMSKNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENI-KLREAE 805
             L   +    +     +A  Y    NI   E+F    K +  + +  W E I    EA+
Sbjct: 537 EILQPALHHIELN----IARIYKL--NIKDKEDFL--NKLYIKA-HLEWLEIIYGKLEAQ 587

Query: 806 EKSI 809
            KSI
Sbjct: 588 IKSI 591


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 29.9 bits (68), Expect = 4.7
 Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 16/71 (22%)

Query: 473 RSSRRFQLDVQPLVEHLAEKSKTEEDR--------------RVEDLSAKLEEAIMLRQEA 518
           RS  R    ++ L E  AEK+  E +R              ++E L A+L+EA  + +EA
Sbjct: 54  RSEERLATALEKLEE--AEKAADESERGRKVLENRSLKDDEKMEQLEAQLKEAKEIAEEA 111

Query: 519 EAKLVQAQKTL 529
           + K  +  + L
Sbjct: 112 DRKYEEVARKL 122


>gnl|CDD|236522 PRK09450, cyaA, adenylate cyclase; Provisional.
          Length = 830

 Score = 30.7 bits (70), Expect = 4.8
 Identities = 17/80 (21%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 659 NKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVI---QNSIRQMENNIKNLET 715
           N+   LH+     EQ   + LK     +H D A   S +V    Q+    +   ++ L +
Sbjct: 605 NEVRTLHF---EGEQALLDALKTILGKMHQDAAPPPSVEVFCYSQHLRGLIRTRVQQLVS 661

Query: 716 DIQNCKQAPVANENDKFLEI 735
           +    +      E  K L +
Sbjct: 662 ECIELRLGSTRQEPGKALRV 681


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 30.5 bits (69), Expect = 5.0
 Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 22/201 (10%)

Query: 644 EINFLTKLSSTKDIENKTTLLHYLVDTIEQKFPECLKFGDELLHVDRAARVSTDVIQNSI 703
           +++ L K    +       L   L + +E          +EL  + +   +  +  +  +
Sbjct: 171 KLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEEL 230

Query: 704 RQMENNIKNLETDIQNCKQ------APVANENDKFLEIMEPFAKEVRQKITLLS---NMS 754
            Q    ++    +++  K+      A +       LE ++   +E+R+   LL       
Sbjct: 231 EQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKI 290

Query: 755 KNMMTLYGDLAEFYTFDKNIYTLEEFFTDIKTFKDSFYQAWQENIKLREAEEKSIRVREA 814
           + +  L  ++ E            E    +    +   +      KL+  EE+  ++ E 
Sbjct: 291 ERLEELEREIEELEE-------ELEGLRALLEELEELLE------KLKSLEERLEKLEEK 337

Query: 815 REKAENEKKDKAARKKALIDM 835
            EK E+E ++ A  K  L  +
Sbjct: 338 LEKLESELEELAEEKNELAKL 358


>gnl|CDD|219944 pfam08633, Rox3, Rox3 mediator complex subunit.  The mediator
           complex is part of the RNA polymerase II holoenzyme.
           Rox3 is a subunit of the mediator complex.
          Length = 210

 Score = 29.8 bits (67), Expect = 5.1
 Identities = 12/68 (17%), Positives = 16/68 (23%), Gaps = 7/68 (10%)

Query: 522 LVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPP-------PPGGMGP 574
           L+        ++ G     +    V  Q     P G         P          G G 
Sbjct: 97  LLSELSRARTMARGTVPNMDWCSSVLGQFERSYPNGFLNSAGFAVPDLAFDLDGTQGNGR 156

Query: 575 PPPPMPGM 582
           P    PG 
Sbjct: 157 PKGGNPGA 164


>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p.  In Saccharomyces
           cerevisiae, Sec2p is a GDP/GTP exchange factor for
           Sec4p, which is required for vesicular transport at the
           post-Golgi stage of yeast secretion.
          Length = 90

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 8/51 (15%)

Query: 489 LAEKSKTEEDRRVEDLSAKL--------EEAIMLRQEAEAKLVQAQKTLED 531
            AEK K + ++ VEDL+A L         +A   R+  E K  + ++ L++
Sbjct: 12  RAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAVEIKNEKLEEQLKE 62


>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
          Length = 508

 Score = 30.2 bits (68), Expect = 5.9
 Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQ 549
           A+ +     R    L A L +    +  A+A+L +A K L   ++     K R +E + +
Sbjct: 209 AKTAAAAAAREAAPLKASLRKLERAKARADAELKRADKAL--AAAKTDEAKARAEERQQK 266

Query: 550 VAA 552
            A 
Sbjct: 267 AAQ 269


>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 211

 Score = 29.4 bits (66), Expect = 6.0
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPG 570
           ++     P E + LDE  A  AA + + P   P P  P P 
Sbjct: 88  DEPRRTEPQEPDPLDESPAS-AAPVASAPAPAPSPQSPKPA 127


>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
           Provisional.
          Length = 445

 Score = 30.2 bits (68), Expect = 6.1
 Identities = 19/72 (26%), Positives = 27/72 (37%), Gaps = 10/72 (13%)

Query: 520 AKLVQAQKTLEDLSSGRPVEKNRLDEVKAQVAAG---IPTGPKGGPPPPPPPP--GGMGP 574
            K  Q  + + DL+     E  + + VK Q  +    + T     PPPPPPPP       
Sbjct: 90  PKDAQTLRLVVDLT-----ENGKTEAVKRQNGSNYTVVFTINADVPPPPPPPPVVAKRVE 144

Query: 575 PPPPMPGMPGPP 586
            P  +      P
Sbjct: 145 TPAVVAPRVSEP 156


>gnl|CDD|216175 pfam00888, Cullin, Cullin family. 
          Length = 603

 Score = 30.3 bits (68), Expect = 6.2
 Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 3/31 (9%)

Query: 11  ENMLEKL---NPEQLNQKFEDMLNDMNLSDE 38
           + M+EKL      Q   K E M  D++LS E
Sbjct: 413 KKMIEKLKQECGYQFTSKLERMFKDISLSKE 443



 Score = 30.3 bits (68), Expect = 6.2
 Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 3/31 (9%)

Query: 72  ENMLEKL---NPEQLNQKFEDMLNDMNLSDE 99
           + M+EKL      Q   K E M  D++LS E
Sbjct: 413 KKMIEKLKQECGYQFTSKLERMFKDISLSKE 443


>gnl|CDD|236098 PRK07789, PRK07789, acetolactate synthase 1 catalytic subunit;
           Validated.
          Length = 612

 Score = 29.9 bits (68), Expect = 6.8
 Identities = 10/32 (31%), Positives = 10/32 (31%)

Query: 564 PPPPPPGGMGPPPPPMPGMPGPPPPPMPEMYT 595
           P PP P       PP       P    PE  T
Sbjct: 1   PSPPTPAAAASAAPPPAAPAARPRIVAPERMT 32


>gnl|CDD|183198 PRK11560, PRK11560, phosphoethanolamine transferase; Provisional.
          Length = 558

 Score = 30.0 bits (68), Expect = 6.9
 Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 8/57 (14%)

Query: 91  LNDMNLSDEKKEPLRRQPLANKKKMLLMHYKGTVTSYENKSKFDKPIEYIQYLSQPE 147
           ++DM L DE K+ L R P  + K ++++H KG+  +Y  +     P  + +Y  QPE
Sbjct: 369 VDDMLLVDEMKQSLGRNP--DGKHLIILHTKGSHYNYTQRY----PRSFARY--QPE 417


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.0 bits (68), Expect = 7.1
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 479 QLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSSG--- 535
            LD+Q L       ++TE   R++     L+ A+  +++AE +LVQA   LE+       
Sbjct: 587 SLDLQRLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAE 646

Query: 536 --RPVEKNRLDEVKAQVA 551
               +++ RLD  + Q  
Sbjct: 647 ARTALKQARLDLQRLQNE 664


>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
          Length = 249

 Score = 29.4 bits (66), Expect = 7.1
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)

Query: 538 VEKNRLDEVKAQVAAGIPTGPKGGPPPPP--PPPGGMGPPPPPMPGMPGPP 586
           +  +R++++ A +AA +P  P  G    P  P P    P P   P  P  P
Sbjct: 88  IRASRIEDIAAAIAADLPVLPPSGARERPLEPSPAEAEPEPAKAPEKPPEP 138


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 28.9 bits (65), Expect = 7.2
 Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 3/71 (4%)

Query: 67  ENKKQENMLEKLNPEQLNQKFEDMLNDMNLSDEK---KEPLRRQPLANKKKMLLMHYKGT 123
           E ++ E M EK   E  +++F+    +    DE+   K+  +RQ    KKK      KG 
Sbjct: 48  EYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGN 107

Query: 124 VTSYENKSKFD 134
               +  SK  
Sbjct: 108 KKEEKEGSKSS 118


>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form.  The
           Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
           2.A.36) The CPA1 family is a large family of proteins
           derived from Gram-positive and Gram-negative bacteria,
           blue green bacteria, yeast, plants and animals.
           Transporters from eukaryotes have been functionally
           characterized, and all of these catalyze Na+:H+
           exchange. Their primary physiological functions may be
           in (1) cytoplasmic pH regulation, extruding the H+
           generated during metabolism, and (2) salt tolerance (in
           plants), due to Na+ uptake into vacuoles. This model is
           specific for the bacterial members of this family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 525

 Score = 29.9 bits (67), Expect = 7.2
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 500 RVEDLSAKLEEAIMLR-QEAEAKLVQAQKTLEDLSSGRPVEKNRLDEVKAQV 550
            V + S +  E I+ R   A +      K +E L    PV +  L E+ A++
Sbjct: 408 FVSEHSERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARI 459


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 30.0 bits (68), Expect = 7.4
 Identities = 15/45 (33%), Positives = 24/45 (53%)

Query: 490 AEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTLEDLSS 534
           AEK K++    V+DL A+L++    +  AE K  Q +  L +L  
Sbjct: 150 AEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQV 194


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 30.0 bits (67), Expect = 7.8
 Identities = 12/40 (30%), Positives = 13/40 (32%)

Query: 546 VKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGP 585
             A V  G    P   PPP    P  + PP    P    P
Sbjct: 365 APAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEINP 404


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 29.3 bits (66), Expect = 7.8
 Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 5/30 (16%)

Query: 563 PPPPPPPGGMGPPPPPMPGMPGPPPPPMPE 592
            PP        PPP P+   P P P P+PE
Sbjct: 59  EPPQAVQ----PPPEPVV-EPEPEPEPIPE 83


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 30.0 bits (67), Expect = 7.8
 Identities = 28/186 (15%), Positives = 50/186 (26%), Gaps = 36/186 (19%)

Query: 470 PDFRSSRRFQLDVQPLVEHLAEKSKTEEDRRVEDLSAKLEEAIMLRQEAEAKLVQAQKTL 529
                S  ++   + L + L    K             +E+ ++  ++  + +   +   
Sbjct: 286 GSSDESPEWKTFYEALADQLNNLYKLLRTIYKHKDETVIEQYLIEGRKLFSTINGLKAHN 345

Query: 530 EDLSSGRPVEKNRLDEVKAQVAAGIPTGPKGGP------------PPPPPPPGGMGPPPP 577
           E L +      +R+    A+VA         GP            P   P    M   PP
Sbjct: 346 EILKTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPP 405

Query: 578 ------------------------PMPGMPGPPPPPMPEMYTDCHKNIINGKQACEEVKQ 613
                                   P P  P PP P  P +      N++      E   +
Sbjct: 406 VPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHE 465

Query: 614 SKKLAK 619
            K+   
Sbjct: 466 RKRKRG 471


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 29.3 bits (66), Expect = 7.8
 Identities = 11/45 (24%), Positives = 19/45 (42%)

Query: 797 ENIKLREAEEKSIRVREAREKAENEKKDKAARKKALIDMTTDQTQ 841
           E IK   AEEK     E   + E  ++++      L++ + D   
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEEILTGNPLLNTSGDFKV 196


>gnl|CDD|171770 PRK12851, groEL, chaperonin GroEL; Reviewed.
          Length = 541

 Score = 29.7 bits (67), Expect = 8.2
 Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 558 PKGGPPPPPPPPGGMG 573
           PK  P PP PP GGM 
Sbjct: 525 PKKEPAPPAPPGGGMD 540


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 29.6 bits (66), Expect = 8.3
 Identities = 11/32 (34%), Positives = 12/32 (37%), Gaps = 2/32 (6%)

Query: 558 PKGGPPPPPPPPGGMGP--PPPPMPGMPGPPP 587
            +    PP P P  M    P  P PG    PP
Sbjct: 210 GQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPP 241


>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4.  This
           is a family of E## ubiquitin ligase enzymes.
          Length = 794

 Score = 30.0 bits (68), Expect = 8.3
 Identities = 12/45 (26%), Positives = 21/45 (46%), Gaps = 2/45 (4%)

Query: 801 LREAEEKSIRVREAREKAENEKKDKA--ARKKALIDMTTDQTQQG 843
           L E E  + ++ E R++   EK+  A   R+K L  +    +  G
Sbjct: 400 LAEKEGVAKKIDEVRDETRAEKRRLAMAMREKQLQALGMRTSSGG 444


>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 233

 Score = 29.1 bits (65), Expect = 8.5
 Identities = 29/122 (23%), Positives = 42/122 (34%), Gaps = 20/122 (16%)

Query: 485 LVEHLAEKSKTEEDRRVEDLSAK-----------LEEAIMLRQEAEAKLVQAQKTLED-- 531
           L   L +    E D + E L A+           L + ++++++A  K     K LE   
Sbjct: 12  LFHRLKKAEANERDPQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRI 71

Query: 532 --LSSGRPVEKNRLDEVKAQVAAGIPTGPKGGPPPPPPPPGGMGPPPPPMPGMPGPPPPP 589
             L +G     + L       A    T  +  PP   PPP     PP       GP   P
Sbjct: 72  AILQAGEAGSGSFLSN-----AFKWGTPQEPAPPANAPPPKEPAAPPSWRSSPAGPTTQP 126

Query: 590 MP 591
            P
Sbjct: 127 SP 128


>gnl|CDD|184062 PRK13455, PRK13455, F0F1 ATP synthase subunit B; Provisional.
          Length = 184

 Score = 29.0 bits (65), Expect = 8.6
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 9/38 (23%)

Query: 498 DRRVEDLSAKLEEAIMLRQEAE---------AKLVQAQ 526
           D+R E + ++LEEA  LR+EA+          + VQ Q
Sbjct: 57  DKRAEGIRSELEEARALREEAQTLLASYERKQREVQEQ 94


>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
           1).  Dystroglycan is one of the dystrophin-associated
           glycoproteins, which is encoded by a 5.5 kb transcript
           in human. The protein product is cleaved into two
           non-covalently associated subunits, [alpha] (N-terminal)
           and [beta] (C-terminal). In skeletal muscle the
           dystroglycan complex works as a transmembrane linkage
           between the extracellular matrix and the cytoskeleton.
           [alpha]-dystroglycan is extracellular and binds to
           merosin ([alpha]-2 laminin) in the basement membrane,
           while [beta]-dystroglycan is a transmembrane protein and
           binds to dystrophin, which is a large rod-like
           cytoskeletal protein, absent in Duchenne muscular
           dystrophy patients. Dystrophin binds to intracellular
           actin cables. In this way, the dystroglycan complex,
           which links the extracellular matrix to the
           intracellular actin cables, is thought to provide
           structural integrity in muscle tissues. The dystroglycan
           complex is also known to serve as an agrin receptor in
           muscle, where it may regulate agrin-induced
           acetylcholine receptor clustering at the neuromuscular
           junction. There is also evidence which suggests the
           function of dystroglycan as a part of the signal
           transduction pathway because it is shown that Grb2, a
           mediator of the Ras-related signal pathway, can interact
           with the cytoplasmic domain of dystroglycan. In general,
           aberrant expression of dystrophin-associated protein
           complex underlies the pathogenesis of Duchenne muscular
           dystrophy, Becker muscular dystrophy and severe
           childhood autosomal recessive muscular dystrophy.
           Interestingly, no genetic disease has been described for
           either [alpha]- or [beta]-dystroglycan. Dystroglycan is
           widely distributed in non-muscle tissues as well as in
           muscle tissues. During epithelial morphogenesis of
           kidney, the dystroglycan complex is shown to act as a
           receptor for the basement membrane. Dystroglycan
           expression in mouse brain and neural retina has also
           been reported. However, the physiological role of
           dystroglycan in non-muscle tissues has remained unclear.
          Length = 290

 Score = 29.6 bits (66), Expect = 8.6
 Identities = 16/38 (42%), Positives = 16/38 (42%), Gaps = 3/38 (7%)

Query: 555 PTGPKGGPPPPPPPPG-GMGPPPPPMPGMPGPPP--PP 589
           P  P   PPPP   P  G G  P  M     PPP  PP
Sbjct: 253 PNAPPYQPPPPFTTPMEGKGSRPKNMTPYRSPPPYVPP 290


>gnl|CDD|221441 pfam12145, Med12-LCEWAV, Eukaryotic Mediator 12 subunit domain.
           This domain is found in eukaryotes, and is typically
           between 325 and 354 amino acids in length. The function
           of this particular region of the Mediator subunit Med12
           is not known, but there is a conserved sequence motif:
           LCEWAV, from which the name derives.
          Length = 471

 Score = 29.4 bits (66), Expect = 9.5
 Identities = 11/34 (32%), Positives = 13/34 (38%), Gaps = 2/34 (5%)

Query: 543 LDEVKAQVAAGIPT--GPKGGPPPPPPPPGGMGP 574
           L + KA  + GI T  G  G       PP    P
Sbjct: 25  LSDTKAHTSHGIKTQDGTNGPTTSSGTPPTSSSP 58


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.134    0.380 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,715,060
Number of extensions: 4834122
Number of successful extensions: 13587
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11184
Number of HSP's successfully gapped: 708
Length of query: 927
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 821
Effective length of database: 6,236,078
Effective search space: 5119820038
Effective search space used: 5119820038
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.6 bits)