RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17777
         (1375 letters)



>gnl|CDD|128421 smart00111, C4, C-terminal tandem repeated domain in type 4
            procollagens.  Duplicated domain in C-terminus of type 4
            collagens. Mutations in alpha-5 collagen IV are
            associated with X-linked Alport syndrome.
          Length = 114

 Score =  187 bits (476), Expect = 2e-55
 Identities = 68/116 (58%), Positives = 81/116 (69%), Gaps = 3/116 (2%)

Query: 1260 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 1319
              VIAVHSQ+  +P+CP GW  LW GYSF+MHT  G  G GQ L SPGSCLE FR  PFI
Sbjct: 1    GFVIAVHSQTTNVPQCPAGWVELWTGYSFLMHTGNGE-GHGQDLGSPGSCLERFRTMPFI 59

Query: 1320 ECNGEHGSCHYFANKL-SFWLATIDPQDQWSRPQQQTLKSGNLRQRISRCQVCIKR 1374
            ECNG  G C+Y +    SFWL+TI+P DQ++ P+  T K+G+LR  ISRCQVC K 
Sbjct: 60   ECNG-RGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAGDLRPYISRCQVCEKP 114



 Score =  178 bits (454), Expect = 2e-52
 Identities = 65/111 (58%), Positives = 78/111 (70%), Gaps = 3/111 (2%)

Query: 908  ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 967
              VIAVHSQ+  +P+CP GW  LW GYSF+MHT  G  G GQ L SPGSCLE FR  PFI
Sbjct: 1    GFVIAVHSQTTNVPQCPAGWVELWTGYSFLMHTGNGE-GHGQDLGSPGSCLERFRTMPFI 59

Query: 968  ECNGEHGSCHYFANKL-SFWLATIDPQDQWSRPQQQTLKSGNLRQRISRCQ 1017
            ECNG  G C+Y +    SFWL+TI+P DQ++ P+  T K+G+LR  ISRCQ
Sbjct: 60   ECNG-RGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAGDLRPYISRCQ 109



 Score =  161 bits (410), Expect = 2e-46
 Identities = 57/120 (47%), Positives = 70/120 (58%), Gaps = 13/120 (10%)

Query: 1147 GILMVKHSQTAEVPSCTDDKTNTQFTKLWEGYSLLYVEGNEQAHNQDLGFAGSCVRKFST 1206
            G ++  HSQT  VP C        + +LW GYS L   GN + H QDLG  GSC+ +F T
Sbjct: 1    GFVIAVHSQTTNVPQCPAG-----WVELWTGYSFLMHTGNGEGHGQDLGSPGSCLERFRT 55

Query: 1207 MPFLFCDPNNVCNYASRNDRSYWLSTEE-------PMPMMPVESTQIQKFISRCVVCEVP 1259
            MPF+ C+   VCNYASRND S+WLST E       P PM P     ++ +ISRC VCE P
Sbjct: 56   MPFIECNGRGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAG-DLRPYISRCQVCEKP 114



 Score = 55.5 bits (134), Expect = 3e-09
 Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 8/44 (18%)

Query: 871 RNDRSYWLSTEE-------PMPMMPVESTQIQKFISRCVVCEVP 907
           RND S+WLST E       P PM P     ++ +ISRC VCE P
Sbjct: 72  RNDYSFWLSTIEPSDQFTAPRPMTPKAG-DLRPYISRCQVCEKP 114


>gnl|CDD|144854 pfam01413, C4, C-terminal tandem repeated domain in type 4
            procollagen.  Duplicated domain in C-terminus of type 4
            collagens. Mutations in alpha-5 collagen IV are
            associated with X-linked Alport syndrome.
          Length = 110

 Score =  183 bits (467), Expect = 4e-54
 Identities = 59/115 (51%), Positives = 73/115 (63%), Gaps = 7/115 (6%)

Query: 1147 GILMVKHSQTAEVPSCTDDKTNTQFTKLWEGYSLLYVEGNEQAHNQDLGFAGSCVRKFST 1206
            G L+  HSQT ++P C        ++ LW GYS L   GN + H QDLG  GSC+ +F T
Sbjct: 1    GFLIAVHSQTTQIPQCPAG-----WSSLWTGYSFLMHTGNAEGHGQDLGSPGSCLERFRT 55

Query: 1207 MPFLFCDPNNVCNYASRNDRSYWLST--EEPMPMMPVESTQIQKFISRCVVCEVP 1259
            MPF+ C+ N VCNYASRND S+WLST    PMPM P    +I+ +ISRC VCE P
Sbjct: 56   MPFIECNGNGVCNYASRNDYSFWLSTIEPMPMPMTPKAGREIRPYISRCQVCEAP 110



 Score =  179 bits (456), Expect = 1e-52
 Identities = 65/117 (55%), Positives = 78/117 (66%), Gaps = 9/117 (7%)

Query: 1260 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 1319
              +IAVHSQ+  IP+CP GW+SLW GYSF+MHT    +G GQ L SPGSCLE FR  PFI
Sbjct: 1    GFLIAVHSQTTQIPQCPAGWSSLWTGYSFLMHT-GNAEGHGQDLGSPGSCLERFRTMPFI 59

Query: 1320 ECNGEHGSCHYFA-NKLSFWLATIDPQDQWSRPQQQTLKSG-NLRQRISRCQVCIKR 1374
            ECNG +G C+Y + N  SFWL+TI+P      P   T K+G  +R  ISRCQVC   
Sbjct: 60   ECNG-NGVCNYASRNDYSFWLSTIEPM-----PMPMTPKAGREIRPYISRCQVCEAP 110



 Score =  171 bits (435), Expect = 8e-50
 Identities = 63/112 (56%), Positives = 76/112 (67%), Gaps = 9/112 (8%)

Query: 908  ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 967
              +IAVHSQ+  IP+CP GW+SLW GYSF+MHT    +G GQ L SPGSCLE FR  PFI
Sbjct: 1    GFLIAVHSQTTQIPQCPAGWSSLWTGYSFLMHT-GNAEGHGQDLGSPGSCLERFRTMPFI 59

Query: 968  ECNGEHGSCHYFA-NKLSFWLATIDPQDQWSRPQQQTLKSG-NLRQRISRCQ 1017
            ECNG +G C+Y + N  SFWL+TI+P      P   T K+G  +R  ISRCQ
Sbjct: 60   ECNG-NGVCNYASRNDYSFWLSTIEPM-----PMPMTPKAGREIRPYISRCQ 105



 Score = 68.5 bits (168), Expect = 8e-14
 Identities = 22/39 (56%), Positives = 26/39 (66%), Gaps = 2/39 (5%)

Query: 871 RNDRSYWLST--EEPMPMMPVESTQIQKFISRCVVCEVP 907
           RND S+WLST    PMPM P    +I+ +ISRC VCE P
Sbjct: 72  RNDYSFWLSTIEPMPMPMTPKAGREIRPYISRCQVCEAP 110


>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
            Members of this family belong to the collagen
            superfamily. Collagens are generally extracellular
            structural proteins involved in formation of connective
            tissue structure. The alignment contains 20 copies of the
            G-X-Y repeat that forms a triple helix. The first
            position of the repeat is glycine, the second and third
            positions can be any residue but are frequently proline
            and hydroxyproline. Collagens are post translationally
            modified by proline hydroxylase to form the
            hydroxyproline residues. Defective hydroxylation is the
            cause of scurvy. Some members of the collagen superfamily
            are not involved in connective tissue structure but share
            the same triple helical structure.
          Length = 60

 Score = 42.1 bits (100), Expect = 5e-05
 Identities = 34/58 (58%), Positives = 34/58 (58%)

Query: 1060 GKPGQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPG 1117
            G PG PG  GPPGP G PG  G PG  G  G  GP G  GP GPPG  G PG PG PG
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 42.1 bits (100), Expect = 5e-05
 Identities = 33/58 (56%), Positives = 33/58 (56%)

Query: 678 GLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPG 735
           G PG  G PGP GPPGP G  G PG  G  G  G PG PG PG PG PG  G  G PG
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 40.6 bits (96), Expect = 1e-04
 Identities = 35/60 (58%), Positives = 35/60 (58%)

Query: 49  GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLP 108
           G PG  G  GPPG  GPPG PG PG  G  G PGP G PG  GP GP G  G PG PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 40.6 bits (96), Expect = 1e-04
 Identities = 35/58 (60%), Positives = 35/58 (60%)

Query: 52  GLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPG 109
           G  G  GPPG  GPPG PG PG  G  G PGP G PG  GP GP G  GPPG PG PG
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 40.2 bits (95), Expect = 2e-04
 Identities = 34/58 (58%), Positives = 34/58 (58%)

Query: 1063 GQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDG 1120
            G PG  GPPGP G PG  G PG  G  G  GP G  GP GPPG  G PGPPG PG  G
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 40.2 bits (95), Expect = 2e-04
 Identities = 34/60 (56%), Positives = 34/60 (56%)

Query: 1057 GREGKPGQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRP 1116
            G  G PG PG  GPPGP G PG  G PG  G  G  GP G  GP GPPG  G PG PG P
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 39.8 bits (94), Expect = 3e-04
 Identities = 33/60 (55%), Positives = 33/60 (55%)

Query: 687 GPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLP 746
           GP GPPGP G  G PG  G  G  G PG PG PG PG PG  G  G PG  G  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 39.0 bits (92), Expect = 6e-04
 Identities = 32/58 (55%), Positives = 32/58 (55%)

Query: 538 GSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMG 595
           G  G PGPPG  GP G PG  G  G PGP GP G PGP G  G  GP G  G PG  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 38.6 bits (91), Expect = 0.001
 Identities = 31/58 (53%), Positives = 31/58 (53%)

Query: 520 GQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVG 577
           G PG  G  G PG  G  G  G PGPPG  GP G PG  G  G PGP GP G PG  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 38.2 bits (90), Expect = 0.001
 Identities = 32/58 (55%), Positives = 32/58 (55%)

Query: 541 GEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
           G PGPPG  GP G PG  G  G PGP GP G PGP G  G  GP G  G PG  G  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 38.2 bits (90), Expect = 0.001
 Identities = 31/60 (51%), Positives = 31/60 (51%)

Query: 532 GERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEP 591
           G  G  G  G PGPPG  GP G PG  G  G PGP GP G PGP G  G  G  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 38.2 bits (90), Expect = 0.001
 Identities = 33/60 (55%), Positives = 33/60 (55%)

Query: 675 GLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLP 734
           G  G PG  G PGP GPPGP G  G PG  G  G  G PG PG PG PG PG  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.9 bits (89), Expect = 0.001
 Identities = 34/60 (56%), Positives = 34/60 (56%)

Query: 669 GIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQR 728
           G PG PG  G PG  G PGP GPPGP G  G PG  G  G  G PG PG PG PG PG  
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.9 bits (89), Expect = 0.002
 Identities = 33/59 (55%), Positives = 33/59 (55%)

Query: 535 GSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGP 593
           G  G  G PGPPG  GP G PG  G  G PGP GP G PGP G  G  GP G  G PGP
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 37.9 bits (89), Expect = 0.002
 Identities = 35/60 (58%), Positives = 35/60 (58%)

Query: 1069 GPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPP 1128
            GPPGP G PG  G PG  G  G  GP G  GP GPPG  G PGPPG PG  G  G  GPP
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.5 bits (88), Expect = 0.002
 Identities = 34/58 (58%), Positives = 34/58 (58%)

Query: 141 GPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKG 198
           GPPG  G PGPPG  G PG  G  GPPG PG  GP G  G PG PG  G PG PG  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 37.5 bits (88), Expect = 0.002
 Identities = 33/60 (55%), Positives = 33/60 (55%)

Query: 663 GDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLP 722
           G  G  G PG PG  G PG  G PGP GPPGP G  G PG  G  G  G PG PG PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.5 bits (88), Expect = 0.002
 Identities = 30/58 (51%), Positives = 30/58 (51%)

Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPG 1078
            G PG  GP G  G  G PG PG PG  G  G  G  G  G PG PG  GPPG  G PG
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 37.1 bits (87), Expect = 0.002
 Identities = 32/59 (54%), Positives = 32/59 (54%)

Query: 526 GYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGP 584
           G  G PG  G  G  G PGPPG  GP G PG  G  G PGP GP G PGP G  G  GP
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 28/56 (50%), Positives = 28/56 (50%)

Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGP 569
           G  G  G PG  G  G PG  G  G  G PGPPG  GP G PG  G  G PG  GP
Sbjct: 4   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 29/60 (48%), Positives = 29/60 (48%)

Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNP 573
           G  G  G PG  G  G PG  G  G  G PGPPG  GP G PG  G  G PG  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 31/60 (51%), Positives = 31/60 (51%)

Query: 553 GVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP 612
           G PG  G  G PGP GP G PGP G  G  GP G  G PGP G  G  G  G  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 32/59 (54%), Positives = 32/59 (54%)

Query: 648 GKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGE 706
           G  G PG PG  G PG  G  G PG PG  G PG  G PGP GPPGP G  G PG  G 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 36.7 bits (86), Expect = 0.003
 Identities = 34/60 (56%), Positives = 34/60 (56%)

Query: 135 GEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYP 194
           G  G  GPPG  G PGPPG  G PG  G  GPPG PG  GP G  G PG PG  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 36.7 bits (86), Expect = 0.004
 Identities = 32/55 (58%), Positives = 32/55 (58%)

Query: 147 GKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG 201
           G PGPPG  G PG  G  GPPG PG  GP G  G PG PG  G PG PGP G  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPG 55



 Score = 36.7 bits (86), Expect = 0.004
 Identities = 31/59 (52%), Positives = 31/59 (52%)

Query: 529 GEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGE 587
           G PG  G  G  G PGPPG  GP G PG  G  G PGP GP G PGP G  G  G  G 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 36.3 bits (85), Expect = 0.005
 Identities = 30/56 (53%), Positives = 30/56 (53%)

Query: 42 KGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAG 97
           G  G PG PG  G  GPPG  GPPG PG PG  G  G PGP G PG  G  G  G
Sbjct: 3  PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 35.9 bits (84), Expect = 0.006
 Identities = 30/58 (51%), Positives = 30/58 (51%)

Query: 123 GKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDG 180
           G  G  G PG  G  G  GPPG  G PGPPG  G PG  G  GPPG PG  G  G  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 35.9 bits (84), Expect = 0.007
 Identities = 30/58 (51%), Positives = 30/58 (51%)

Query: 559 GSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKG 616
           G  G PGP GP G PGP G  G  GP G  G PGP G  G  G  GP G PG P   G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 35.9 bits (84), Expect = 0.007
 Identities = 33/59 (55%), Positives = 33/59 (55%)

Query: 672 GLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGE 730
           G PG  G PG  G PGP GPPGP G  G PG  G  G  G PG PG PG PG PG  G 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 35.5 bits (83), Expect = 0.008
 Identities = 32/60 (53%), Positives = 32/60 (53%)

Query: 654 GEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLP 713
           G PG  G PG  G  G PG PG  G PG  G PGP GPPGP G  G PG  G  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 35.5 bits (83), Expect = 0.009
 Identities = 29/58 (50%), Positives = 29/58 (50%)

Query: 113 KGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDP 170
            G  G PG  G  G  G PG  G  G  GPPG  G PGPPG  G PG  G  G PG P
Sbjct: 3   PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 35.5 bits (83), Expect = 0.009
 Identities = 34/60 (56%), Positives = 34/60 (56%)

Query: 761 GEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLP 820
           G  G  G  GP G PGPPG PGP GPPG  G PG  G PG  GP GPPG  G  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 35.5 bits (83), Expect = 0.011
 Identities = 28/57 (49%), Positives = 28/57 (49%)

Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLP 1077
            G PG  GP G  G  G PG PG PG  G  G  G  G  G PG PG  G PG  G P
Sbjct: 4    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 35.2 bits (82), Expect = 0.012
 Identities = 30/60 (50%), Positives = 30/60 (50%)

Query: 636 GEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPR 695
           G  G  G  GP G  G PG PG  G PG  G  G PG PG  G PG  G PG  G PGP 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 35.2 bits (82), Expect = 0.015
 Identities = 31/58 (53%), Positives = 31/58 (53%)

Query: 1033 GFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGARGDKG 1090
            G  G PG PG PG  G  G  G  G  G PG PG  GPPGP G PG  G PGA G  G
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 34.8 bits (81), Expect = 0.019
 Identities = 32/53 (60%), Positives = 32/53 (60%)

Query: 770 GPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLPGP 822
           GP G PGPPG PGP GPPG  G PG  G PG  GP GPPG  G  G  G PG 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGA 53



 Score = 34.4 bits (80), Expect = 0.024
 Identities = 30/58 (51%), Positives = 30/58 (51%)

Query: 1024 GLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAG 1081
            G  GP G  G  G PG PG PG  G  G  G  G  G PG PG  GPPGP G PG  G
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 34.4 bits (80), Expect = 0.029
 Identities = 33/56 (58%), Positives = 33/56 (58%)

Query: 767 GDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLPGP 822
           G  GP G PGPPG PGP GPPG  G PG  G PG  GP GPPG  G  G  G PG 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGA 56



 Score = 33.6 bits (78), Expect = 0.041
 Identities = 29/50 (58%), Positives = 30/50 (60%)

Query: 43 GDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGP 92
          G  G PG PG  G  GPPG  GPPG PG PG  G  G PGP G PG+ GP
Sbjct: 10 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 33.2 bits (77), Expect = 0.070
 Identities = 31/56 (55%), Positives = 31/56 (55%)

Query: 625 LPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLP 680
            PGPPGPPG PG  G  G  GP G  G PG PG  G PG  G  G PG PG  G P
Sbjct: 5   PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 32.5 bits (75), Expect = 0.13
 Identities = 28/48 (58%), Positives = 28/48 (58%)

Query: 1090 GDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAP 1137
            G  GP G  GP GPPG  G PGPPG PG  G  G  GPPG  G PG P
Sbjct: 1    GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 48



 Score = 32.1 bits (74), Expect = 0.14
 Identities = 27/51 (52%), Positives = 27/51 (52%)

Query: 705 GEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIP 755
           G  G  G PG PG PG PG PG  G  G PG  G  G PG PG  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 51



 Score = 31.3 bits (72), Expect = 0.32
 Identities = 34/62 (54%), Positives = 34/62 (54%), Gaps = 2/62 (3%)

Query: 601 GREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNG 660
           G  GP G PG P P G  G PG   PGPPGPPG PG  G  G  GP G  G PG PG  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGP--PGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58

Query: 661 LP 662
            P
Sbjct: 59  PP 60



 Score = 31.3 bits (72), Expect = 0.36
 Identities = 33/60 (55%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 598 GDKGREGPRGLPGYPSPKGDKGEPGIS-LPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEP 656
           G  G  GP G PG P P G  G PG    PGPPGPPG PG  G  G  GP G  GAPG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 30.9 bits (71), Expect = 0.38
 Identities = 27/60 (45%), Positives = 27/60 (45%)

Query: 351 GEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPP 410
           G PG  G  G  G       PG  G  G  G  G PG PGPPG PG  G  G  G  GPP
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 30.9 bits (71), Expect = 0.47
 Identities = 34/62 (54%), Positives = 34/62 (54%), Gaps = 2/62 (3%)

Query: 732 GLPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDG 791
           G PG  G  G PG PG  G PG P  G PG  G  G  GP G PGPPG PGP G PG  G
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPP--GPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58

Query: 792 LP 793
            P
Sbjct: 59  PP 60



 Score = 30.5 bits (70), Expect = 0.52
 Identities = 31/68 (45%), Positives = 31/68 (45%), Gaps = 10/68 (14%)

Query: 3  GLDGLKGELGPPGYPGSIGFPGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGE 62
          G  G  G  GPPG PG  G PG  G  G  GP           G PG PG  G  GPPG 
Sbjct: 1  GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP----------GPPGPPGPPGPPGPPGP 50

Query: 63 RGPPGIPG 70
           G PG PG
Sbjct: 51 PGAPGAPG 58



 Score = 30.1 bits (69), Expect = 0.78
 Identities = 31/61 (50%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 183 GIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDK 242
           G PG  G PG PGP G  G    GPPG  GP G P G  G  G PG PG PG  G  G  
Sbjct: 1   GPPGPPGPPGPPGPPGPPGP--PGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGAPGAP 57

Query: 243 G 243
           G
Sbjct: 58  G 58



 Score = 29.8 bits (68), Expect = 1.3
 Identities = 33/73 (45%), Positives = 33/73 (45%), Gaps = 13/73 (17%)

Query: 384 GLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAVWTSDNLIKGERGPPGLKGDIGRDGPMGP 443
           G PG PGPPG PG  G  G  G  GPPG               GPPG  G  G  GP GP
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPG-------------PPGPPGPPGPPGPPGPPGP 47

Query: 444 PGPKGVIGLTGDR 456
           PGP G  G  G  
Sbjct: 48  PGPPGAPGAPGPP 60



 Score = 29.0 bits (66), Expect = 1.9
 Identities = 28/54 (51%), Positives = 28/54 (51%)

Query: 702 GRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIP 755
           G  G  G  G PG PG PG PG PG  G  G PG  G  G PG PG  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAP 54



 Score = 29.0 bits (66), Expect = 2.1
 Identities = 36/72 (50%), Positives = 37/72 (51%), Gaps = 12/72 (16%)

Query: 162 GERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEK 221
           G  GPPG PG  GP G  G PG PG  G PG PGP G        PPG  GP G P    
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPG--------PPGPPGPPGPP---- 48

Query: 222 GFRGSPGLPGTP 233
           G  G+PG PG P
Sbjct: 49  GPPGAPGAPGPP 60



 Score = 28.6 bits (65), Expect = 2.8
 Identities = 27/63 (42%), Positives = 27/63 (42%), Gaps = 7/63 (11%)

Query: 371 PGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAVWTSDNLIKGERGPPG 430
           PG  G  G  G  G PG PGPPG PG  G  G  G  GPPG            G  G PG
Sbjct: 3   PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGP-------PGPPGAPG 55

Query: 431 LKG 433
             G
Sbjct: 56  APG 58



 Score = 27.5 bits (62), Expect = 6.6
 Identities = 28/60 (46%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 189 GLPGYPGPKGDQGLS-ITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQ 247
           G PG PGP G  G     GPPG  GP G P G  G  G PG PG PG  G  G  G+   
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 46.1 bits (110), Expect = 1e-04
 Identities = 34/227 (14%), Positives = 41/227 (18%), Gaps = 19/227 (8%)

Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFG 648
             PG  G  G               P           P  P P G      E       G
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAA---PAAPAPAGAAAAPAEASAAPAPG 646

Query: 649 KQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKG 708
                  P    +P      G  G P   G       P    P  P    G         
Sbjct: 647 VAAPEHHPKHVAVPD--ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA-- 702

Query: 709 NAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGD 768
               P     P          +           +P          +P    P +  D   
Sbjct: 703 ----PAPAATPPAGQADDPAAQPPQAAQGASAPSPA-----ADDPVPLPPEPDDPPDPAG 753

Query: 769 LGPFGYPGP---PGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQG 812
                 P P   P       PP       E      A  +     + 
Sbjct: 754 APAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800



 Score = 42.3 bits (100), Expect = 0.002
 Identities = 42/220 (19%), Positives = 48/220 (21%), Gaps = 12/220 (5%)

Query: 626 PGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGD 685
            GP                GP  +   P  P     P      G    P         G 
Sbjct: 588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGV 647

Query: 686 PGPLGPPGPRGF-DGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPG 744
             P   P      D + G DG    AG       P  P         G    +       
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA 707

Query: 745 LPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGP 804
            P    A   PA   P          P      P  P P  PP   G P +   P    P
Sbjct: 708 TPPAGQADD-PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766

Query: 805 VGPPGLQGLKGDKGLPGPYASEVLQGPKENRDPRDGMDPG 844
              P             P  S   +  +   D    MD  
Sbjct: 767 AAAPA----------AAPPPSPPSEEEEMAEDDAPSMDDE 796



 Score = 41.1 bits (97), Expect = 0.005
 Identities = 32/193 (16%), Positives = 34/193 (17%), Gaps = 4/193 (2%)

Query: 541 GEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDK 600
             PG  G  GP         +    P  P     P              E       G  
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAA-AAPAEASAAPAPGVA 648

Query: 601 GREGPRGLPGYPSP--KGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGL 658
             E        P     GD          P  PP  P         G    Q AP     
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708

Query: 659 NGLPGDIGEQGIPGLPGLQGLPGQK-GDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPG 717
                       P              D     PP P       G   +      P    
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768

Query: 718 QPGLPGFPGQRGE 730
            P     P    E
Sbjct: 769 APAAAPPPSPPSE 781



 Score = 37.3 bits (87), Expect = 0.072
 Identities = 28/179 (15%), Positives = 34/179 (18%), Gaps = 11/179 (6%)

Query: 518 EKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVG 577
           E  +P        P      G+   P         GV                       
Sbjct: 611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670

Query: 578 RR-GDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPG 636
            + G   P      P P       G    +     P+P      P      P   P    
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ-----PAPAPAATPPAGQADDPAAQPPQAA 725

Query: 637 EKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPR 695
           +               P EP     P     Q  P        P     P    PP P 
Sbjct: 726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA-----PAPAAAPAAAPPPSPP 779



 Score = 36.9 bits (86), Expect = 0.091
 Identities = 34/172 (19%), Positives = 40/172 (23%), Gaps = 7/172 (4%)

Query: 684 GDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNP 743
             PG  G  GP      P   G    A  P  P  P  P  P   G    P        P
Sbjct: 590 PAPGAAGGEGPPA----PASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAP 645

Query: 744 GLPGKIGAPGIPAIGMPGEKGDKG--DLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGL 801
           G+      P   A+    + GD      G      PP  P P  P    G       P  
Sbjct: 646 GVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705

Query: 802 -AGPVGPPGLQGLKGDKGLPGPYASEVLQGPKENRDPRDGMDPGVYPDLWDP 852
            A P                   ++           P +  DP         
Sbjct: 706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ 757



 Score = 35.3 bits (82), Expect = 0.26
 Identities = 39/234 (16%), Positives = 47/234 (20%), Gaps = 21/234 (8%)

Query: 23  PGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPG 82
           P      GE  PA        +   P  P        P   G    P         G   
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAA 649

Query: 83  PRGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGP 142
           P   P  V     +                                 AP  A      G 
Sbjct: 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPP-------------PAPAPAAPAAPAGA 696

Query: 143 PGLIGKPGPPGKAGLPGVEGERG-PPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG 201
                 P P         +     PP                +P    LP  P    D  
Sbjct: 697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP----LPPEPDDPPDPA 752

Query: 202 LSITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEE 255
            +   PP    P    A       SP        + +      E    RRD+EE
Sbjct: 753 GAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED---RRDAEE 803



 Score = 35.3 bits (82), Expect = 0.30
 Identities = 24/137 (17%), Positives = 31/137 (22%), Gaps = 11/137 (8%)

Query: 517 GEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFM-------GSKGEPGPRGP 569
                P   G    P E  +  + G   P        VP             G   P  P
Sbjct: 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP 681

Query: 570 QGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP----SPKGDKGEPGISL 625
              P P       G    +  P P         + P   P       S      +  + L
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPL 741

Query: 626 PGPPGPPGYPGEKGEKG 642
           P  P  P  P     + 
Sbjct: 742 PPEPDDPPDPAGAPAQP 758



 Score = 33.8 bits (78), Expect = 0.71
 Identities = 28/158 (17%), Positives = 33/158 (20%), Gaps = 10/158 (6%)

Query: 479 GWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTG 538
             P      G    P     +             H  K         G  G P + G   
Sbjct: 622 AAPAAPAPAGAAAAPA----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAA 677

Query: 539 SKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
               P  P    P    G   ++  P P       G         P   +G   P     
Sbjct: 678 PAAPPPAPAPAAPAAPAGAAPAQPAPAPA-ATPPAGQADDPAAQPPQAAQGASAPSPAAD 736

Query: 599 DKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPG 636
           D        LP  P    D        P PP P     
Sbjct: 737 DPVP-----LPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769



 Score = 32.3 bits (74), Expect = 2.2
 Identities = 19/126 (15%), Positives = 22/126 (17%), Gaps = 5/126 (3%)

Query: 1023 PGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGI 1082
              +A P    G  G P   G                    G    +  P P   P     
Sbjct: 655  KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714

Query: 1083 PGARGDK-----GDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAP 1137
                        G   P        P     +  P              P        AP
Sbjct: 715  DDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAP 774

Query: 1138 CESTPD 1143
              S P 
Sbjct: 775  PPSPPS 780



 Score = 31.9 bits (73), Expect = 2.8
 Identities = 38/224 (16%), Positives = 47/224 (20%), Gaps = 10/224 (4%)

Query: 55  GRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPGITIKG 114
           G          P  P   G   +   P     P +     PA   G    P         
Sbjct: 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA---GAAAAPAEASAAPAP 645

Query: 115 EKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQG 174
               P  H K+            G  G P   G   P      P       P G    Q 
Sbjct: 646 GVAAPEHHPKHVAVPDASD----GGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701

Query: 175 -PQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTP 233
            P      P     D     P                      P       G+P  P  P
Sbjct: 702 APAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPP 761

Query: 234 GLKGEQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVPRE 277
                     +         EEE  +  D+  + D +D     E
Sbjct: 762 PAPAPAAAPAAAPPPSPPSEEEEMAE--DDAPSMDDEDRRDAEE 803



 Score = 30.7 bits (70), Expect = 6.1
 Identities = 16/103 (15%), Positives = 18/103 (17%), Gaps = 3/103 (2%)

Query: 1026 AGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGA 1085
            A PA      G+   P         G    E        P                 P A
Sbjct: 622  AAPAAPAP-AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAA 680

Query: 1086 RGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPP 1128
                          P G    +  P P   P           P
Sbjct: 681  PPPAPAPAAP--AAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 43.1 bits (101), Expect = 0.001
 Identities = 38/162 (23%), Positives = 48/162 (29%), Gaps = 17/162 (10%)

Query: 551 PRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPG 610
           PR +P         G   P G+       G    +   G  GP G     G       P 
Sbjct: 363 PRLMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPA 422

Query: 611 YPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGI 670
            P+P      P   +P    PP  P          P      P    + G P  +     
Sbjct: 423 TPAP---SAAPSPRVPWDDAPPAPPRSGI-----PPRPAPRMPEASPVPGAPDSVASAS- 473

Query: 671 PGLPGLQGLPGQKGDPGPLGPPGPRGFDG----TPGRDGEKG 708
              P   G P    D     P GPR +DG      GR+G+ G
Sbjct: 474 -DAPPTLGDP---SDTAEHTPSGPRTWDGFLEFCQGRNGQGG 511



 Score = 38.1 bits (88), Expect = 0.035
 Identities = 28/124 (22%), Positives = 38/124 (30%), Gaps = 9/124 (7%)

Query: 147 GKPGPPGKAGLPG-VEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSIT 205
           G    P  +   G   G       PG QGPQG     G+      P  P P       + 
Sbjct: 376 GGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRV- 434

Query: 206 GPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERE 265
             P +  P   P      R +P +P    + G      S +     D+     D  D  E
Sbjct: 435 --PWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASAS-----DAPPTLGDPSDTAE 487

Query: 266 NQDS 269
           +  S
Sbjct: 488 HTPS 491



 Score = 33.1 bits (75), Expect = 1.3
 Identities = 30/121 (24%), Positives = 35/121 (28%), Gaps = 8/121 (6%)

Query: 1024 GLAGPAGEKGFRG-SPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGI 1082
            G A         G + G   T    G QG +G+    G          P P   P     
Sbjct: 376  GGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPS---- 431

Query: 1083 PGARGDKGDWGPQGLDGPVGPPGLRGEPGP-PGRPGLDGLQGNTGPPGENGLPGAPCEST 1141
            P    D     P     P  P     E  P PG P  D +   +  P   G P    E T
Sbjct: 432  PRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAP--DSVASASDAPPTLGDPSDTAEHT 489

Query: 1142 P 1142
            P
Sbjct: 490  P 490



 Score = 30.0 bits (67), Expect = 9.5
 Identities = 26/123 (21%), Positives = 34/123 (27%), Gaps = 12/123 (9%)

Query: 517 GEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGV---PGFMGSKGEPGPRGPQGNP 573
           G    P G   +G      ++G       PG  GP+G     G   S   P    P   P
Sbjct: 376 GGASAPSGSAAEG-----PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAP 430

Query: 574 GPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDK----GEPGISLPGPP 629
            P     D  P   +    P             G P   +   D     G+P  +    P
Sbjct: 431 SPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTP 490

Query: 630 GPP 632
             P
Sbjct: 491 SGP 493


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 41.1 bits (96), Expect = 0.004
 Identities = 36/172 (20%), Positives = 48/172 (27%), Gaps = 4/172 (2%)

Query: 559 GSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDK 618
           G K E G  GP G+           P G   E        +     P     +  P    
Sbjct: 82  GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPP 141

Query: 619 GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGA-PGEPGLNGLPGDIGEQGIPGLPGLQ 677
             PG   P PP        +     + P  +      EP  +    D         P   
Sbjct: 142 SHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSS 201

Query: 678 GLPGQKGD-PGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQR 728
             P    D PG    P P+       +   +     P +P + G P FPG R
Sbjct: 202 PPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDE-PTEPEREG-PPFPGHR 251



 Score = 40.7 bits (95), Expect = 0.004
 Identities = 30/146 (20%), Positives = 40/146 (27%), Gaps = 3/146 (2%)

Query: 497 GEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPG 556
           GEK    +G P  +G+  V      P G   +   G      S   P  P  + P   P 
Sbjct: 82  GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSP- 140

Query: 557 FMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKG 616
              S   P    P  +  P   +  +  +    E  P           P       S   
Sbjct: 141 --PSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198

Query: 617 DKGEPGISLPGPPGPPGYPGEKGEKG 642
               P  S P  PG P  P  +    
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPS 224



 Score = 38.4 bits (89), Expect = 0.022
 Identities = 32/173 (18%), Positives = 40/173 (23%), Gaps = 9/173 (5%)

Query: 84  RGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPP 143
            G     G GGP+G          P  +   E+   GL  +N    +P          PP
Sbjct: 81  HGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP--ESPASHSPPP 138

Query: 144 GLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLS 203
                PG P +   P                P  +D           P    P   Q  +
Sbjct: 139 SPPSHPG-PHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197

Query: 204 ITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEE 256
               P    P   P  E G   SP     P                     E 
Sbjct: 198 ----PTSSPPPQSPPDEPGEPQSPTPQQAPS--PNTQQAVEHEDEPTEPEREG 244



 Score = 36.9 bits (85), Expect = 0.068
 Identities = 34/174 (19%), Positives = 43/174 (24%), Gaps = 14/174 (8%)

Query: 49  GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFP---GSVGPGGPAGDIGPPGVP 105
           G+    G+ GP G          P   G          P       P  PA    PP  P
Sbjct: 82  GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPP 141

Query: 106 GLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERG 165
             PG     E   P  H  +             +  P        P      P  +    
Sbjct: 142 SHPG---PHEPAPPESHNPSPNQQPSSFLQPSHEDSP------EEPEPPTSEPEPDSPGP 192

Query: 166 PPGDPGIQGPQGKDGI--PGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLP 217
           P  +     P  +     PG P        P P   Q +     P E   +G P
Sbjct: 193 PQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPP 246



 Score = 36.9 bits (85), Expect = 0.071
 Identities = 29/162 (17%), Positives = 40/162 (24%), Gaps = 3/162 (1%)

Query: 473 SRHGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPG 532
           SRHG+K   G  G  G                 E   +G          P        P 
Sbjct: 79  SRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPP 138

Query: 533 ERGSTGSKGEPGPPGRNGP---RGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKG 589
              S     EP PP  + P   +    F+    E  P  P+             P  E  
Sbjct: 139 SPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198

Query: 590 EPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGP 631
              P   +       P+      +P  +  +       P  P
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEP 240



 Score = 35.7 bits (82), Expect = 0.18
 Identities = 35/158 (22%), Positives = 41/158 (25%), Gaps = 14/158 (8%)

Query: 496 KGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVP 555
           +G +      E      HG K E+GQ GG    G       T S         +G    P
Sbjct: 64  QGHRQTESDTETAEESRHGEKEERGQ-GGPSGSGSESVGSPTPSPSGSAEELASG--LSP 120

Query: 556 GFMGSKGEPGPRGPQGNPGPVGRRGDTGPM-----------GEKGEPGPMGFTGDKGREG 604
                     P      P P    G   P                   P      +  E 
Sbjct: 121 ENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEP 180

Query: 605 PRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKG 642
           P   P   SP   + E   S P P  PP  PGE     
Sbjct: 181 PTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPT 218



 Score = 32.6 bits (74), Expect = 1.5
 Identities = 25/144 (17%), Positives = 36/144 (25%), Gaps = 8/144 (5%)

Query: 173 QGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGT 232
           QG     G   +      P     +   GLS      E      P         P  P  
Sbjct: 89  QGGPSGSGSESVGSPTPSPSGSAEELASGLS-----PENTSGSSPESPASHSPPPSPPSH 143

Query: 233 PGLKG---EQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVPRETKESRHPYLLQE 289
           PG       +    S  Q      +   +D  +E E   S+         +S  P     
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPP 203

Query: 290 RKDLLDHQGDKGWPGIDGTPGQDG 313
            +   D  G+   P     P  + 
Sbjct: 204 PQSPPDEPGEPQSPTPQQAPSPNT 227



 Score = 32.2 bits (73), Expect = 1.9
 Identities = 22/125 (17%), Positives = 27/125 (21%), Gaps = 2/125 (1%)

Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLA 1080
            G   +  P             G            S      P  P     P     P   
Sbjct: 96   GSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESH 155

Query: 1081 GIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGL--PGAPC 1138
                 +       P   D P  P     EP P            + PP ++    PG P 
Sbjct: 156  NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQ 215

Query: 1139 ESTPD 1143
              TP 
Sbjct: 216  SPTPQ 220



 Score = 31.1 bits (70), Expect = 4.8
 Identities = 22/153 (14%), Positives = 34/153 (22%), Gaps = 11/153 (7%)

Query: 124 KNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPG 183
           ++G     G  G  G          P P G A                 + P      P 
Sbjct: 80  RHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS-HSPPP 138

Query: 184 IPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTP-----GLKGE 238
            P     P  P P          P   + P                P T           
Sbjct: 139 SPPSHPGPHEPAPPESHN-----PSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPP 193

Query: 239 QGDKGSEAQLERRDSEEETKDQKDERENQDSQD 271
           Q +  + +   +   +E  + Q    +   S +
Sbjct: 194 QSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPN 226



 Score = 30.3 bits (68), Expect = 8.4
 Identities = 25/153 (16%), Positives = 40/153 (26%), Gaps = 8/153 (5%)

Query: 319 GEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKG 378
           GEK    +G P  +G+  V      P G     E    G   ++        P +     
Sbjct: 82  GEKEERGQGGPSGSGSESVGSPTPSPSG---SAEELASGLSPENTSGSSPESPASHSPPP 138

Query: 379 DRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAV-----WTSDNLIKGERGPPGLKG 433
                  P  P PP +         +    P  +D+       TS+        P     
Sbjct: 139 SPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198

Query: 434 DIGRDGPMGPPGPKGVIGLTGDRGFNGESGEKV 466
                    P  P      T  +  +  + + V
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAV 231



 Score = 29.9 bits (67), Expect = 9.9
 Identities = 22/164 (13%), Positives = 28/164 (17%), Gaps = 4/164 (2%)

Query: 15  GYPGSIGFPGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGE 74
           G P   G   V         ++  +        P          P     PP  P  PG 
Sbjct: 90  GGPSGSGSESVGSPTPSPSGSAEEL---ASGLSPENTSGSSPESPASHSPPPSPPSHPGP 146

Query: 75  KGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLA 134
                       P                    P  +            +      P  +
Sbjct: 147 HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS 206

Query: 135 GEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGK 178
                 G P        P       VE E  P        P   
Sbjct: 207 PPDEP-GEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPG 249


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 39.1 bits (92), Expect = 0.019
 Identities = 39/268 (14%), Positives = 68/268 (25%), Gaps = 45/268 (16%)

Query: 147 GKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKD-GLPGYPGPKGDQGLSIT 205
           G                R             +        K    P           + +
Sbjct: 61  GGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAAS 120

Query: 206 GPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERE 265
            P   +  +    GE   RG+    G  G +     +   A+    +  +E + + D  +
Sbjct: 121 APEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDRED 180

Query: 266 NQDSQDIEVPRETKESRHPYLLQERKDLLDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGI 325
            Q   +    R  +E R                           G D     + E+G   
Sbjct: 181 RQAEAERGE-RGRREERGR------------------------DGDDRDRRDRREQGDRR 215

Query: 326 KGEPGINGTHGVKGEKGQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGL 385
           +     +G     G++        + +   D  + D G R      G  G +G R +D  
Sbjct: 216 EERGRRDG-----GDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD-- 268

Query: 386 PGIPGPPGAPGMRGRDGDAGRDGPPGKD 413
                       R R G  G DG   ++
Sbjct: 269 ------------RDRRGRRGGDGGNERE 284



 Score = 31.0 bits (71), Expect = 5.7
 Identities = 20/92 (21%), Positives = 32/92 (34%)

Query: 529 GEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEK 588
            +  ER   G     G   + G  G      ++ +   R  +       RRGD      +
Sbjct: 125 AQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE 184

Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGE 620
            E G  G   ++GR+G          +GD+ E
Sbjct: 185 AERGERGRREERGRDGDDRDRRDRREQGDRRE 216


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 38.8 bits (90), Expect = 0.026
 Identities = 62/297 (20%), Positives = 74/297 (24%), Gaps = 31/297 (10%)

Query: 531  PGERGSTGSKGEPG-PPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKG 589
            P E   T     P  PP    PR  P     +G+P    P     P     D  P     
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPR-APV--DDRGDPRGPAPPSPLPPDTHAPDPPP----- 2628

Query: 590  EPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLP----------GPPGPPGYPGEKG 639
             P P     +     P  +P    P+ D     +S P              PP  P  + 
Sbjct: 2629 -PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687

Query: 640  EKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLP-------GLQGLPGQKGDPGPLGPP 692
             +  +G       P  P     P       +   P         Q  P     P P   P
Sbjct: 2688 ARPTVGSLTSLADPPPPPPT--PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP 2745

Query: 693  GPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAP 752
                  G P R         P  P  P  P     R           +    LP      
Sbjct: 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805

Query: 753  GIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPG 809
              PA  +           P G   PP    P  PP   G P      G  G V P G
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG--GSVAPGG 2860



 Score = 31.1 bits (70), Expect = 6.6
 Identities = 41/224 (18%), Positives = 55/224 (24%), Gaps = 20/224 (8%)

Query: 543  PGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGR 602
              PP   GP            P  R  +     +    ++ P        P         
Sbjct: 2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817

Query: 603  EGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQG----------A 652
              P   P  P P     +P    P P  PP      G     G   ++           A
Sbjct: 2818 LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA 2877

Query: 653  PGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGL 712
            P  P +  L      +              +  P P  PP P+     P           
Sbjct: 2878 PARPPVRRLARPAVSRSTESFA--LPPDQPERPPQPQAPPPPQ---PQPQPPPPPQPQPP 2932

Query: 713  PGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGL----PGKIGAP 752
            P  P +P  P  P      G     G    P L    PG++  P
Sbjct: 2933 PPPPPRPQPPLAP-TTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975


>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
          Length = 416

 Score = 37.7 bits (87), Expect = 0.040
 Identities = 26/101 (25%), Positives = 30/101 (29%), Gaps = 16/101 (15%)

Query: 539 SKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
           S G P PPG + P   P       E G  G +      G           GEP P     
Sbjct: 271 SGGSPAPPGDDRPEAKPE--PGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPG---- 324

Query: 599 DKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKG 639
                     P  P+P  D+ E   SL     PP  P    
Sbjct: 325 ----------PPRPAPDADRPEGWPSLEAITFPPPTPATPA 355



 Score = 33.8 bits (77), Expect = 0.60
 Identities = 27/93 (29%), Positives = 30/93 (32%), Gaps = 5/93 (5%)

Query: 95  PAGDIGPPGVPGLPGITIKGEKG--LPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPP 152
           PA   G P  PG      K E G    G  G+       G      D    G   KPGPP
Sbjct: 268 PAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGE-PKPGPP 326

Query: 153 GKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIP 185
             A  P  +   G P    I  P      P +P
Sbjct: 327 RPA--PDADRPEGWPSLEAITFPPPTPATPAVP 357


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 35.6 bits (82), Expect = 0.19
 Identities = 38/185 (20%), Positives = 45/185 (24%), Gaps = 35/185 (18%)

Query: 61  GERGPPGIPGFPGEKGDLGAPGPRGFPG-------SVGPGGPAGDIGPPGVPGLPGITIK 113
           GE  PP        + +L  PG R + G          PG P  D  P   P  P    +
Sbjct: 93  GEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQR 152

Query: 114 GEKG-LPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKP----GPPGKAGLPGVEGERGPPG 168
            E G  P      G          +  Y  P            P  AG P  +  R    
Sbjct: 153 PEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYD 212

Query: 169 DPG-----------------------IQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSIT 205
            P                         +G  G       P     P  PGP   Q     
Sbjct: 213 HPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAP 272

Query: 206 GPPGE 210
           GP   
Sbjct: 273 GPGEP 277



 Score = 35.6 bits (82), Expect = 0.23
 Identities = 54/217 (24%), Positives = 62/217 (28%), Gaps = 36/217 (16%)

Query: 539 SKGEPGPPGRNGPR------GVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPG 592
           S GEP PP  +  R        PG    +G  GPR     PG    R D  P      P 
Sbjct: 91  SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGL--PRQDQLPTARPAYP- 147

Query: 593 PMGFTGDKGREGPRGLPG-YPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQG 651
                        R  PG +P    D G     L  PP  P Y              +  
Sbjct: 148 ---------AYQQRPEPGAWPRAADDYGWQQQRLGFPPRAP-YASPASYAPEQERDREPY 197

Query: 652 APGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPG----PRGFDGTPGRDGEK 707
             G P          +   P        P +     P  PPG     RG  G P RD   
Sbjct: 198 DAGRPEY---DQRRRDYDHPR--PDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDA- 251

Query: 708 GNAGLPGQPGQPGLPG-FPGQRGEMGLPGIEGEKGNP 743
                P  P +P  PG    Q      PG    + NP
Sbjct: 252 -----PVVPIRPSAPGPLAAQPAPAPGPGEPTARLNP 283



 Score = 32.9 bits (75), Expect = 1.5
 Identities = 41/170 (24%), Positives = 48/170 (28%), Gaps = 32/170 (18%)

Query: 518 EKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPR-GVPGFMGSKGEPGP-------RGP 569
           E  +PG   Y+G  G R      G P        R   P +   + EPG         G 
Sbjct: 109 ELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQ-QRPEPGAWPRAADDYGW 167

Query: 570 QGNP-GPVGRRGDTGPMGEKGEPGPMGFTGDKGR---EGPRGLPGYPSPKGDK------- 618
           Q    G   R     P     E        D GR   +  R    +P P  D+       
Sbjct: 168 QQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTD 227

Query: 619 --------GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGA----PGEP 656
                   G      PGPP     P         GP   Q A    PGEP
Sbjct: 228 RPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEP 277


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 35.4 bits (81), Expect = 0.30
 Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 10/134 (7%)

Query: 161 EGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG--LSITGPPGERGPDGLPA 218
           EGE     D  ++G +G +G     G+         +G        T    E G   L A
Sbjct: 780 EGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNA 839

Query: 219 GEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVP--- 275
             +G        G  G  G   D G   + E  + EEE +++++E E ++ ++ E P   
Sbjct: 840 ENQG-EAKQDEKGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896

Query: 276 --RETKESRHPYLL 287
              ET++ +  YL 
Sbjct: 897 EWPETRQKQAIYLF 910


>gnl|CDD|218168 pfam04602, Arabinose_trans, Mycobacterial cell wall arabinan
           synthesis protein.  Arabinosyltransferase is involved in
           arabinogalactan (AG) biosynthesis pathway in
           mycobacteria. AG is a component of the macromolecular
           assembly of the mycolyl-AG-peptidoglycan complex of the
           cell wall. This enzyme has important clinical
           applications as it is believed to be the target of the
           antimycobacterial drug Ethambutol.
          Length = 1075

 Score = 34.0 bits (78), Expect = 0.75
 Identities = 26/71 (36%), Positives = 32/71 (45%), Gaps = 8/71 (11%)

Query: 676 LQGLPGQKGDPGPLGPPGPRGF--DGTPGR---DGEKGNAGLPG--QPGQPGLPGF-PGQ 727
           L  LPGQ G  GPLG   P GF  +G P     D  + N G P      +P +  +  G 
Sbjct: 736 LTPLPGQFGPLGPLGGVNPIGFTPNGVPDDLVADPVRENPGQPNSDYSDKPNVTDWEAGT 795

Query: 728 RGEMGLPGIEG 738
            G +  PGI G
Sbjct: 796 GGGLAAPGING 806


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 33.8 bits (77), Expect = 0.77
 Identities = 49/262 (18%), Positives = 69/262 (26%), Gaps = 15/262 (5%)

Query: 565 GPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP--SPKGD----K 618
           G RGPQ  P   G         + G PG             +   G     P        
Sbjct: 83  GTRGPQMGPMGPGPGR--PMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGT 140

Query: 619 GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQG 678
            +PG    G         +  +   +GP   Q      G+N        Q  P   G  G
Sbjct: 141 MQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPG 200

Query: 679 LPGQKGDP------GPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMG 732
           +PG  G         P G            ++ ++        P          Q+G+ G
Sbjct: 201 MPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGG 260

Query: 733 LPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGL 792
           +   + + G   +P     P                +      G  G   PMGPP     
Sbjct: 261 MQPQQMQGGQMQVP-MQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPG 319

Query: 793 PGEIGFPGLAGPVGPPGLQGLK 814
               G   +   V   G Q LK
Sbjct: 320 AVPQGGQAVQQGVMSAGQQQLK 341



 Score = 31.5 bits (71), Expect = 3.7
 Identities = 29/135 (21%), Positives = 44/135 (32%), Gaps = 4/135 (2%)

Query: 79  GAPGPRGFPGSVGPGGPAGD-IGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEK 137
           G  GP+  P   GPG P G  +G PG       ++     +P      G      +   +
Sbjct: 83  GTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGTMQ 142

Query: 138 GDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIP---GIPGKDGLPGYP 194
                 G++ +     ++  P   G +         G       P     P + G PG P
Sbjct: 143 PGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMP 202

Query: 195 GPKGDQGLSITGPPG 209
           G  G   +   G PG
Sbjct: 203 GGGGQGQMQQQGQPG 217


>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
          Length = 293

 Score = 33.2 bits (76), Expect = 0.93
 Identities = 16/53 (30%), Positives = 18/53 (33%)

Query: 528 KGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRG 580
            G  G RG     G  G  G  G  G  G    +G  G  G +G  G  G   
Sbjct: 4   GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56



 Score = 32.0 bits (73), Expect = 2.2
 Identities = 18/56 (32%), Positives = 18/56 (32%)

Query: 49  GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGV 104
              G  G  G     G  G  G  G  G  G    RG  G  G  G  G  GP  V
Sbjct: 2   MGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGKV 57



 Score = 31.6 bits (72), Expect = 2.7
 Identities = 17/52 (32%), Positives = 19/52 (36%)

Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPG 565
           G  G +G   G G  G  G  G  G +G     GR G  G  G  G  G   
Sbjct: 5   GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56



 Score = 30.9 bits (70), Expect = 4.0
 Identities = 18/52 (34%), Positives = 20/52 (38%)

Query: 556 GFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRG 607
           GF G +G     G  G  G  GR G  G    +G  G  G  G  G  GP  
Sbjct: 5   GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56



 Score = 29.7 bits (67), Expect = 9.8
 Identities = 18/52 (34%), Positives = 18/52 (34%)

Query: 357 GAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDG 408
           G  G  GG   G   G RG  G  G  G     G  G  G RG  G  G   
Sbjct: 5   GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56


>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
          Length = 456

 Score = 32.5 bits (74), Expect = 1.8
 Identities = 21/84 (25%), Positives = 23/84 (27%), Gaps = 2/84 (2%)

Query: 492 IPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGP 551
           IPG   E    IK EP  NG     G     GG   + +   R   G            P
Sbjct: 373 IPGY--EPDPSIKAEPIQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKP 430

Query: 552 RGVPGFMGSKGEPGPRGPQGNPGP 575
               G     GE   R     P  
Sbjct: 431 SRRLGDAKPAGEQQRRRRPRKPAA 454



 Score = 30.5 bits (69), Expect = 6.6
 Identities = 20/84 (23%), Positives = 27/84 (32%), Gaps = 6/84 (7%)

Query: 157 LPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG------LSITGPPGE 210
           +PG E +     +P   G Q + G     G         P+  +G            P  
Sbjct: 373 IPGYEPDPSIKAEPIQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSR 432

Query: 211 RGPDGLPAGEKGFRGSPGLPGTPG 234
           R  D  PAGE+  R  P  P    
Sbjct: 433 RLGDAKPAGEQQRRRRPRKPAAAQ 456


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 32.3 bits (73), Expect = 2.2
 Identities = 28/102 (27%), Positives = 32/102 (31%), Gaps = 17/102 (16%)

Query: 166 PPGDPGIQGPQGKDGIPGIPGKDGLPG-------YPGPKGDQGLSITGPPGERGPDGLPA 218
            P D   +G    D   G    D   G       Y G       +  G   E   DG P 
Sbjct: 60  APADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGGKDEIMADGAPK 119

Query: 219 GEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQ 260
              G R          LK  +G  G E + ERR  EEE   Q
Sbjct: 120 A--GVRD--------ELKDGEGPPGGEGEAERRADEEELAQQ 151


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.2 bits (70), Expect = 4.5
 Identities = 59/259 (22%), Positives = 77/259 (29%), Gaps = 18/259 (6%)

Query: 531 PGERGSTGSKGEPGPPGRNGPRGVPGFM--GSKGEPGPRGPQGNPGPVGRRGDTGPMGEK 588
           P  R    S   PGPP  +  +  P F+   S   P P G   +  P             
Sbjct: 259 PSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSH 318

Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFG 648
             P     +  + ++ PR  P  P+P     +P  + P P  P              PF 
Sbjct: 319 TPPSQ---SALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFP 375

Query: 649 KQGA-----PGEPGLNGLPGDIGEQGIPGLPGLQGLP-GQKGDPGPLGPPGPRGFDGTPG 702
           +  +     P    L+ LP        P  P LQ +P  Q     P  PP        P 
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHP--PPLQLMPQSQPLQSVPAQPPVLTQSQSLPP 433

Query: 703 RDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIPAIGMPGE 762
           +     ++GL   P Q      P   G  GLP I      P LP    A    A      
Sbjct: 434 KASTHPHSGLHSGPPQSPFAQHPFTSG--GLPAIGPP---PSLPTSTPAAPPRASSGSQP 488

Query: 763 KGDKGDLGPFGYPGPPGLP 781
            G             P LP
Sbjct: 489 PGSALPSSGGCAGPGPPLP 507


>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
           Members of this family include high molecular weight
           subunits of glutenin. This group of gluten proteins is
           thought to be largely responsible for the elastic
           properties of gluten, and hence, doughs. Indeed,
           glutenin high molecular weight subunits are classified
           as elastomeric proteins, because the glutenin network
           can withstand significant deformations without breaking,
           and return to the original conformation when the stress
           is removed. Elastomeric proteins differ considerably in
           amino acid sequence, but they are all polymers whose
           subunits consist of elastomeric domains, composed of
           repeated motifs, and non-elastic domains that mediate
           cross-linking between the subunits. The elastomeric
           domain motifs are all rich in glycine residues in
           addition to other hydrophobic residues. High molecular
           weight glutenin subunits have an extensive central
           elastomeric domain, flanked by two terminal non-elastic
           domains that form disulphide cross-links. The central
           elastomeric domain is characterized by the following
           three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
           possesses overlapping beta-turns within and between the
           repeated motifs, and assumes a regular helical secondary
           structure with a diameter of approx. 1.9 nm and a pitch
           of approx. 1.5 nm.
          Length = 779

 Score = 31.2 bits (69), Expect = 5.5
 Identities = 54/244 (22%), Positives = 86/244 (35%), Gaps = 2/244 (0%)

Query: 486 TPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGP 545
           +  Q G     G+     +G+PG   T  ++  +GQPG      +   +G    + +   
Sbjct: 440 SLQQPGQGQQPGQWQQPGQGQPGYYPTSLLQPGQGQPGHDPASLQQPGQGQQPGQLQQPA 499

Query: 546 PGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGP 605
            G+ G +   G  G +     +G Q   G  G++   G  G++   G     G++G++  
Sbjct: 500 QGQPGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQGQQGQQPGQGQHPAQGEQGQQPG 559

Query: 606 RGLPGYPSPKGDKGEPGISL--PGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPG 663
           +G  G    +G +   G     P  P   G   + G+    G + + G          P 
Sbjct: 560 QGQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQWQQPGQGQPGYYLTSPL 619

Query: 664 DIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPG 723
            +G+      P     PGQ   PG     G       P      G A   GQ  Q G   
Sbjct: 620 QLGQGQQGYYPTSLQQPGQGQQPGQWQQSGQGQHGHCPMSPQLSGQAQQLGQGQQIGQVL 679

Query: 724 FPGQ 727
            PGQ
Sbjct: 680 QPGQ 683


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 30.9 bits (70), Expect = 5.6
 Identities = 22/107 (20%), Positives = 31/107 (28%), Gaps = 14/107 (13%)

Query: 538 GSKGEP---GPPGRNGPRGVP-GFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGP 593
           G+ G+P   G   +    G P G+      P P GP G   P G      PM     P  
Sbjct: 393 GAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLA----PMNAVRAPSR 448

Query: 594 MGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGE 640
                 +       +     P   +  P       P P     + G+
Sbjct: 449 NA----QNAAQKPPMQPVMYPPNYQSLP--LSQDLPQPQSTASQGGQ 489


>gnl|CDD|184518 PRK14119, gpmA, phosphoglyceromutase; Provisional.
          Length = 228

 Score = 30.2 bits (68), Expect = 6.1
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 269 SQDIEVPRETKESRHPYLLQERKDLLDHQ 297
           S D++ P ET+E R  YL   R + LD +
Sbjct: 116 SYDVKPPAETEEQREAYLADRRYNHLDKR 144


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.8 bits (69), Expect = 6.7
 Identities = 59/245 (24%), Positives = 83/245 (33%), Gaps = 35/245 (14%)

Query: 236 KGEQGDKGSEAQLERRDSEEETKDQ-----KDERENQDSQDIEVPR--ETKESRHPYLLQ 288
           +  Q  KGS+   E  D + E +D      KD + ++ +Q         T         Q
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQ 163

Query: 289 ERKDL-LDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGI 347
            +  L       +  P +  +    G+PGL G  G G+ G  G     GV        G+
Sbjct: 164 SQTGLGASGSHAQQDPAVSQSGVV-GVPGL-GVPGVGVPGGGGAGALPGV--------GV 213

Query: 348 GYKGEPGQDGAKGDSGGRCIGCLPGARGEKGD------RGKDGLPGIPGPPGAPGMRGRD 401
           G  G     G  G  G   +G L      +G       R  DG    PG  G PG+R  D
Sbjct: 214 GRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQDDQERDGDGRVIEPG-VGLPGVRVGD 272

Query: 402 GDAGR--DGPPGKDAVWTSDNLIKGERGPPGLKGDIGR-------DGPMGPPG-PKGVIG 451
             +      P G     T  +      G PG               GP+  PG P+ + G
Sbjct: 273 STSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITG 332

Query: 452 LTGDR 456
             G+R
Sbjct: 333 QMGER 337


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 30.6 bits (70), Expect = 6.9
 Identities = 29/100 (29%), Positives = 43/100 (43%)

Query: 286 LLQERKDLLDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPG 345
            L+E++ L D          +   G+      +G +G G  G+ G  G    +G++GQ G
Sbjct: 596 TLREQQGLSDETFRDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGG 655

Query: 346 GIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGL 385
             G      +  A  D  GR  G LPG  GE G+  +D L
Sbjct: 656 QGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEAARDAL 695



 Score = 30.3 bits (69), Expect = 8.3
 Identities = 24/67 (35%), Positives = 29/67 (43%), Gaps = 2/67 (2%)

Query: 342 GQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRD 401
           GQPG  G +G+  Q G +G  G    G L   +    D       G+PG  G  G   RD
Sbjct: 634 GQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEAARD 693

Query: 402 --GDAGR 406
             G AGR
Sbjct: 694 ALGRAGR 700


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family consists
            of several Mycoplasma species specific Cytadhesin P32 and
            P30 proteins. P30 has been found to be membrane
            associated and localised on the tip organelle. It is
            thought that it is important in cytadherence and
            virulence.
          Length = 279

 Score = 30.0 bits (67), Expect = 7.5
 Identities = 26/95 (27%), Positives = 35/95 (36%)

Query: 1008 NLRQRISRCQLFLGYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGY 1067
            N+  R    Q+    PG+       GF   PG+P  PG         +  R G   QPG 
Sbjct: 181  NMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGG 240

Query: 1068 RGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVG 1102
               PG    P +   PG   ++G   P  + GP  
Sbjct: 241  FNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRA 275



 Score = 30.0 bits (67), Expect = 9.2
 Identities = 30/131 (22%), Positives = 47/131 (35%), Gaps = 8/131 (6%)

Query: 48  PGKPGLEGRNGP-PGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPG 106
             +P ++ + GP P +R  P   GFP +      PG    P  +   G   +   PG   
Sbjct: 152 NPQPQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMP--GMPPNQMRPGFNP 209

Query: 107 LPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGP 166
           +PG+  +   G             PG   + G +  PG    P    + G    +G   P
Sbjct: 210 MPGMPPR--PGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPP 267

Query: 167 PGDPGIQGPQG 177
              P + GP+ 
Sbjct: 268 ---PHMAGPRA 275


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.5 bits (69), Expect = 8.2
 Identities = 18/118 (15%), Positives = 23/118 (19%)

Query: 1028 PAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGARG 1087
               E  FR    L  +           S+ +       G     GP   PG A  P    
Sbjct: 790  VRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPAR 849

Query: 1088 DKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAPCESTPDYM 1145
                   +         G  G   P              P      P A   +     
Sbjct: 850  SSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRP 907


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.145    0.469 

Gapped
Lambda     K      H
   0.267   0.0613    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 74,565,324
Number of extensions: 7833643
Number of successful extensions: 6331
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5263
Number of HSP's successfully gapped: 392
Length of query: 1375
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1266
Effective length of database: 6,103,016
Effective search space: 7726418256
Effective search space used: 7726418256
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 65 (29.1 bits)