RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17777
(1375 letters)
>gnl|CDD|128421 smart00111, C4, C-terminal tandem repeated domain in type 4
procollagens. Duplicated domain in C-terminus of type 4
collagens. Mutations in alpha-5 collagen IV are
associated with X-linked Alport syndrome.
Length = 114
Score = 187 bits (476), Expect = 2e-55
Identities = 68/116 (58%), Positives = 81/116 (69%), Gaps = 3/116 (2%)
Query: 1260 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 1319
VIAVHSQ+ +P+CP GW LW GYSF+MHT G G GQ L SPGSCLE FR PFI
Sbjct: 1 GFVIAVHSQTTNVPQCPAGWVELWTGYSFLMHTGNGE-GHGQDLGSPGSCLERFRTMPFI 59
Query: 1320 ECNGEHGSCHYFANKL-SFWLATIDPQDQWSRPQQQTLKSGNLRQRISRCQVCIKR 1374
ECNG G C+Y + SFWL+TI+P DQ++ P+ T K+G+LR ISRCQVC K
Sbjct: 60 ECNG-RGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAGDLRPYISRCQVCEKP 114
Score = 178 bits (454), Expect = 2e-52
Identities = 65/111 (58%), Positives = 78/111 (70%), Gaps = 3/111 (2%)
Query: 908 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 967
VIAVHSQ+ +P+CP GW LW GYSF+MHT G G GQ L SPGSCLE FR PFI
Sbjct: 1 GFVIAVHSQTTNVPQCPAGWVELWTGYSFLMHTGNGE-GHGQDLGSPGSCLERFRTMPFI 59
Query: 968 ECNGEHGSCHYFANKL-SFWLATIDPQDQWSRPQQQTLKSGNLRQRISRCQ 1017
ECNG G C+Y + SFWL+TI+P DQ++ P+ T K+G+LR ISRCQ
Sbjct: 60 ECNG-RGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAGDLRPYISRCQ 109
Score = 161 bits (410), Expect = 2e-46
Identities = 57/120 (47%), Positives = 70/120 (58%), Gaps = 13/120 (10%)
Query: 1147 GILMVKHSQTAEVPSCTDDKTNTQFTKLWEGYSLLYVEGNEQAHNQDLGFAGSCVRKFST 1206
G ++ HSQT VP C + +LW GYS L GN + H QDLG GSC+ +F T
Sbjct: 1 GFVIAVHSQTTNVPQCPAG-----WVELWTGYSFLMHTGNGEGHGQDLGSPGSCLERFRT 55
Query: 1207 MPFLFCDPNNVCNYASRNDRSYWLSTEE-------PMPMMPVESTQIQKFISRCVVCEVP 1259
MPF+ C+ VCNYASRND S+WLST E P PM P ++ +ISRC VCE P
Sbjct: 56 MPFIECNGRGVCNYASRNDYSFWLSTIEPSDQFTAPRPMTPKAG-DLRPYISRCQVCEKP 114
Score = 55.5 bits (134), Expect = 3e-09
Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 8/44 (18%)
Query: 871 RNDRSYWLSTEE-------PMPMMPVESTQIQKFISRCVVCEVP 907
RND S+WLST E P PM P ++ +ISRC VCE P
Sbjct: 72 RNDYSFWLSTIEPSDQFTAPRPMTPKAG-DLRPYISRCQVCEKP 114
>gnl|CDD|144854 pfam01413, C4, C-terminal tandem repeated domain in type 4
procollagen. Duplicated domain in C-terminus of type 4
collagens. Mutations in alpha-5 collagen IV are
associated with X-linked Alport syndrome.
Length = 110
Score = 183 bits (467), Expect = 4e-54
Identities = 59/115 (51%), Positives = 73/115 (63%), Gaps = 7/115 (6%)
Query: 1147 GILMVKHSQTAEVPSCTDDKTNTQFTKLWEGYSLLYVEGNEQAHNQDLGFAGSCVRKFST 1206
G L+ HSQT ++P C ++ LW GYS L GN + H QDLG GSC+ +F T
Sbjct: 1 GFLIAVHSQTTQIPQCPAG-----WSSLWTGYSFLMHTGNAEGHGQDLGSPGSCLERFRT 55
Query: 1207 MPFLFCDPNNVCNYASRNDRSYWLST--EEPMPMMPVESTQIQKFISRCVVCEVP 1259
MPF+ C+ N VCNYASRND S+WLST PMPM P +I+ +ISRC VCE P
Sbjct: 56 MPFIECNGNGVCNYASRNDYSFWLSTIEPMPMPMTPKAGREIRPYISRCQVCEAP 110
Score = 179 bits (456), Expect = 1e-52
Identities = 65/117 (55%), Positives = 78/117 (66%), Gaps = 9/117 (7%)
Query: 1260 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 1319
+IAVHSQ+ IP+CP GW+SLW GYSF+MHT +G GQ L SPGSCLE FR PFI
Sbjct: 1 GFLIAVHSQTTQIPQCPAGWSSLWTGYSFLMHT-GNAEGHGQDLGSPGSCLERFRTMPFI 59
Query: 1320 ECNGEHGSCHYFA-NKLSFWLATIDPQDQWSRPQQQTLKSG-NLRQRISRCQVCIKR 1374
ECNG +G C+Y + N SFWL+TI+P P T K+G +R ISRCQVC
Sbjct: 60 ECNG-NGVCNYASRNDYSFWLSTIEPM-----PMPMTPKAGREIRPYISRCQVCEAP 110
Score = 171 bits (435), Expect = 8e-50
Identities = 63/112 (56%), Positives = 76/112 (67%), Gaps = 9/112 (8%)
Query: 908 ANVIAVHSQSVVIPECPVGWNSLWIGYSFVMHTAAGGDGGGQSLASPGSCLEEFRATPFI 967
+IAVHSQ+ IP+CP GW+SLW GYSF+MHT +G GQ L SPGSCLE FR PFI
Sbjct: 1 GFLIAVHSQTTQIPQCPAGWSSLWTGYSFLMHT-GNAEGHGQDLGSPGSCLERFRTMPFI 59
Query: 968 ECNGEHGSCHYFA-NKLSFWLATIDPQDQWSRPQQQTLKSG-NLRQRISRCQ 1017
ECNG +G C+Y + N SFWL+TI+P P T K+G +R ISRCQ
Sbjct: 60 ECNG-NGVCNYASRNDYSFWLSTIEPM-----PMPMTPKAGREIRPYISRCQ 105
Score = 68.5 bits (168), Expect = 8e-14
Identities = 22/39 (56%), Positives = 26/39 (66%), Gaps = 2/39 (5%)
Query: 871 RNDRSYWLST--EEPMPMMPVESTQIQKFISRCVVCEVP 907
RND S+WLST PMPM P +I+ +ISRC VCE P
Sbjct: 72 RNDYSFWLSTIEPMPMPMTPKAGREIRPYISRCQVCEAP 110
>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
Members of this family belong to the collagen
superfamily. Collagens are generally extracellular
structural proteins involved in formation of connective
tissue structure. The alignment contains 20 copies of the
G-X-Y repeat that forms a triple helix. The first
position of the repeat is glycine, the second and third
positions can be any residue but are frequently proline
and hydroxyproline. Collagens are post translationally
modified by proline hydroxylase to form the
hydroxyproline residues. Defective hydroxylation is the
cause of scurvy. Some members of the collagen superfamily
are not involved in connective tissue structure but share
the same triple helical structure.
Length = 60
Score = 42.1 bits (100), Expect = 5e-05
Identities = 34/58 (58%), Positives = 34/58 (58%)
Query: 1060 GKPGQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPG 1117
G PG PG GPPGP G PG G PG G G GP G GP GPPG G PG PG PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 42.1 bits (100), Expect = 5e-05
Identities = 33/58 (56%), Positives = 33/58 (56%)
Query: 678 GLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPG 735
G PG G PGP GPPGP G G PG G G G PG PG PG PG PG G G PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 40.6 bits (96), Expect = 1e-04
Identities = 35/60 (58%), Positives = 35/60 (58%)
Query: 49 GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLP 108
G PG G GPPG GPPG PG PG G G PGP G PG GP GP G G PG PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 40.6 bits (96), Expect = 1e-04
Identities = 35/58 (60%), Positives = 35/58 (60%)
Query: 52 GLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPG 109
G G GPPG GPPG PG PG G G PGP G PG GP GP G GPPG PG PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 40.2 bits (95), Expect = 2e-04
Identities = 34/58 (58%), Positives = 34/58 (58%)
Query: 1063 GQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDG 1120
G PG GPPGP G PG G PG G G GP G GP GPPG G PGPPG PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 40.2 bits (95), Expect = 2e-04
Identities = 34/60 (56%), Positives = 34/60 (56%)
Query: 1057 GREGKPGQPGYRGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRP 1116
G G PG PG GPPGP G PG G PG G G GP G GP GPPG G PG PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 39.8 bits (94), Expect = 3e-04
Identities = 33/60 (55%), Positives = 33/60 (55%)
Query: 687 GPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLP 746
GP GPPGP G G PG G G G PG PG PG PG PG G G PG G G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 39.0 bits (92), Expect = 6e-04
Identities = 32/58 (55%), Positives = 32/58 (55%)
Query: 538 GSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMG 595
G G PGPPG GP G PG G G PGP GP G PGP G G GP G G PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 38.6 bits (91), Expect = 0.001
Identities = 31/58 (53%), Positives = 31/58 (53%)
Query: 520 GQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVG 577
G PG G G PG G G G PGPPG GP G PG G G PGP GP G PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 38.2 bits (90), Expect = 0.001
Identities = 32/58 (55%), Positives = 32/58 (55%)
Query: 541 GEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
G PGPPG GP G PG G G PGP GP G PGP G G GP G G PG G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 38.2 bits (90), Expect = 0.001
Identities = 31/60 (51%), Positives = 31/60 (51%)
Query: 532 GERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEP 591
G G G G PGPPG GP G PG G G PGP GP G PGP G G G G G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 38.2 bits (90), Expect = 0.001
Identities = 33/60 (55%), Positives = 33/60 (55%)
Query: 675 GLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLP 734
G G PG G PGP GPPGP G G PG G G G PG PG PG PG PG G G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.9 bits (89), Expect = 0.001
Identities = 34/60 (56%), Positives = 34/60 (56%)
Query: 669 GIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQR 728
G PG PG G PG G PGP GPPGP G G PG G G G PG PG PG PG PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.9 bits (89), Expect = 0.002
Identities = 33/59 (55%), Positives = 33/59 (55%)
Query: 535 GSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGP 593
G G G PGPPG GP G PG G G PGP GP G PGP G G GP G G PGP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 37.9 bits (89), Expect = 0.002
Identities = 35/60 (58%), Positives = 35/60 (58%)
Query: 1069 GPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPP 1128
GPPGP G PG G PG G G GP G GP GPPG G PGPPG PG G G GPP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.5 bits (88), Expect = 0.002
Identities = 34/58 (58%), Positives = 34/58 (58%)
Query: 141 GPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKG 198
GPPG G PGPPG G PG G GPPG PG GP G G PG PG G PG PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 37.5 bits (88), Expect = 0.002
Identities = 33/60 (55%), Positives = 33/60 (55%)
Query: 663 GDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLP 722
G G G PG PG G PG G PGP GPPGP G G PG G G G PG PG PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.5 bits (88), Expect = 0.002
Identities = 30/58 (51%), Positives = 30/58 (51%)
Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPG 1078
G PG GP G G G PG PG PG G G G G G PG PG GPPG G PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 37.1 bits (87), Expect = 0.002
Identities = 32/59 (54%), Positives = 32/59 (54%)
Query: 526 GYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGP 584
G G PG G G G PGPPG GP G PG G G PGP GP G PGP G G GP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 37.1 bits (87), Expect = 0.003
Identities = 28/56 (50%), Positives = 28/56 (50%)
Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGP 569
G G G PG G G PG G G G PGPPG GP G PG G G PG GP
Sbjct: 4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 37.1 bits (87), Expect = 0.003
Identities = 29/60 (48%), Positives = 29/60 (48%)
Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNP 573
G G G PG G G PG G G G PGPPG GP G PG G G PG G G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.1 bits (87), Expect = 0.003
Identities = 31/60 (51%), Positives = 31/60 (51%)
Query: 553 GVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP 612
G PG G G PGP GP G PGP G G GP G G PGP G G G G G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 37.1 bits (87), Expect = 0.003
Identities = 32/59 (54%), Positives = 32/59 (54%)
Query: 648 GKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGE 706
G G PG PG G PG G G PG PG G PG G PGP GPPGP G G PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 36.7 bits (86), Expect = 0.003
Identities = 34/60 (56%), Positives = 34/60 (56%)
Query: 135 GEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYP 194
G G GPPG G PGPPG G PG G GPPG PG GP G G PG PG G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 36.7 bits (86), Expect = 0.004
Identities = 32/55 (58%), Positives = 32/55 (58%)
Query: 147 GKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG 201
G PGPPG G PG G GPPG PG GP G G PG PG G PG PGP G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPG 55
Score = 36.7 bits (86), Expect = 0.004
Identities = 31/59 (52%), Positives = 31/59 (52%)
Query: 529 GEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGE 587
G PG G G G PGPPG GP G PG G G PGP GP G PGP G G G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 36.3 bits (85), Expect = 0.005
Identities = 30/56 (53%), Positives = 30/56 (53%)
Query: 42 KGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAG 97
G G PG PG G GPPG GPPG PG PG G G PGP G PG G G G
Sbjct: 3 PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 35.9 bits (84), Expect = 0.006
Identities = 30/58 (51%), Positives = 30/58 (51%)
Query: 123 GKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDG 180
G G G PG G G GPPG G PGPPG G PG G GPPG PG G G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 35.9 bits (84), Expect = 0.007
Identities = 30/58 (51%), Positives = 30/58 (51%)
Query: 559 GSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKG 616
G G PGP GP G PGP G G GP G G PGP G G G GP G PG P G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 35.9 bits (84), Expect = 0.007
Identities = 33/59 (55%), Positives = 33/59 (55%)
Query: 672 GLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGE 730
G PG G PG G PGP GPPGP G G PG G G G PG PG PG PG PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 35.5 bits (83), Expect = 0.008
Identities = 32/60 (53%), Positives = 32/60 (53%)
Query: 654 GEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLP 713
G PG G PG G G PG PG G PG G PGP GPPGP G G PG G G G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 35.5 bits (83), Expect = 0.009
Identities = 29/58 (50%), Positives = 29/58 (50%)
Query: 113 KGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDP 170
G G PG G G G PG G G GPPG G PGPPG G PG G G PG P
Sbjct: 3 PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 35.5 bits (83), Expect = 0.009
Identities = 34/60 (56%), Positives = 34/60 (56%)
Query: 761 GEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLP 820
G G G GP G PGPPG PGP GPPG G PG G PG GP GPPG G G G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 35.5 bits (83), Expect = 0.011
Identities = 28/57 (49%), Positives = 28/57 (49%)
Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLP 1077
G PG GP G G G PG PG PG G G G G G PG PG G PG G P
Sbjct: 4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 35.2 bits (82), Expect = 0.012
Identities = 30/60 (50%), Positives = 30/60 (50%)
Query: 636 GEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPR 695
G G G GP G G PG PG G PG G G PG PG G PG G PG G PGP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 35.2 bits (82), Expect = 0.015
Identities = 31/58 (53%), Positives = 31/58 (53%)
Query: 1033 GFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGARGDKG 1090
G G PG PG PG G G G G G PG PG GPPGP G PG G PGA G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 34.8 bits (81), Expect = 0.019
Identities = 32/53 (60%), Positives = 32/53 (60%)
Query: 770 GPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLPGP 822
GP G PGPPG PGP GPPG G PG G PG GP GPPG G G G PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGA 53
Score = 34.4 bits (80), Expect = 0.024
Identities = 30/58 (51%), Positives = 30/58 (51%)
Query: 1024 GLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAG 1081
G GP G G G PG PG PG G G G G G PG PG GPPGP G PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Score = 34.4 bits (80), Expect = 0.029
Identities = 33/56 (58%), Positives = 33/56 (58%)
Query: 767 GDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQGLKGDKGLPGP 822
G GP G PGPPG PGP GPPG G PG G PG GP GPPG G G G PG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGA 56
Score = 33.6 bits (78), Expect = 0.041
Identities = 29/50 (58%), Positives = 30/50 (60%)
Query: 43 GDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGP 92
G G PG PG G GPPG GPPG PG PG G G PGP G PG+ GP
Sbjct: 10 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
Score = 33.2 bits (77), Expect = 0.070
Identities = 31/56 (55%), Positives = 31/56 (55%)
Query: 625 LPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLP 680
PGPPGPPG PG G G GP G G PG PG G PG G G PG PG G P
Sbjct: 5 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 32.5 bits (75), Expect = 0.13
Identities = 28/48 (58%), Positives = 28/48 (58%)
Query: 1090 GDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAP 1137
G GP G GP GPPG G PGPPG PG G G GPPG G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 48
Score = 32.1 bits (74), Expect = 0.14
Identities = 27/51 (52%), Positives = 27/51 (52%)
Query: 705 GEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIP 755
G G G PG PG PG PG PG G G PG G G PG PG G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 51
Score = 31.3 bits (72), Expect = 0.32
Identities = 34/62 (54%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 601 GREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNG 660
G GP G PG P P G G PG PGPPGPPG PG G G GP G G PG PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGP--PGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Query: 661 LP 662
P
Sbjct: 59 PP 60
Score = 31.3 bits (72), Expect = 0.36
Identities = 33/60 (55%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 598 GDKGREGPRGLPGYPSPKGDKGEPGIS-LPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEP 656
G G GP G PG P P G G PG PGPPGPPG PG G G GP G GAPG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 30.9 bits (71), Expect = 0.38
Identities = 27/60 (45%), Positives = 27/60 (45%)
Query: 351 GEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPP 410
G PG G G G PG G G G G PG PGPPG PG G G G GPP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60
Score = 30.9 bits (71), Expect = 0.47
Identities = 34/62 (54%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 732 GLPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDG 791
G PG G G PG PG G PG P G PG G G GP G PGPPG PGP G PG G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPP--GPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58
Query: 792 LP 793
P
Sbjct: 59 PP 60
Score = 30.5 bits (70), Expect = 0.52
Identities = 31/68 (45%), Positives = 31/68 (45%), Gaps = 10/68 (14%)
Query: 3 GLDGLKGELGPPGYPGSIGFPGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGE 62
G G G GPPG PG G PG G G GP G PG PG G GPPG
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP----------GPPGPPGPPGPPGPPGP 50
Query: 63 RGPPGIPG 70
G PG PG
Sbjct: 51 PGAPGAPG 58
Score = 30.1 bits (69), Expect = 0.78
Identities = 31/61 (50%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 183 GIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDK 242
G PG G PG PGP G G GPPG GP G P G G G PG PG PG G G
Sbjct: 1 GPPGPPGPPGPPGPPGPPGP--PGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGAPGAP 57
Query: 243 G 243
G
Sbjct: 58 G 58
Score = 29.8 bits (68), Expect = 1.3
Identities = 33/73 (45%), Positives = 33/73 (45%), Gaps = 13/73 (17%)
Query: 384 GLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAVWTSDNLIKGERGPPGLKGDIGRDGPMGP 443
G PG PGPPG PG G G G GPPG GPPG G G GP GP
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPG-------------PPGPPGPPGPPGPPGPPGP 47
Query: 444 PGPKGVIGLTGDR 456
PGP G G G
Sbjct: 48 PGPPGAPGAPGPP 60
Score = 29.0 bits (66), Expect = 1.9
Identities = 28/54 (51%), Positives = 28/54 (51%)
Query: 702 GRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIP 755
G G G G PG PG PG PG PG G G PG G G PG PG G PG P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAP 54
Score = 29.0 bits (66), Expect = 2.1
Identities = 36/72 (50%), Positives = 37/72 (51%), Gaps = 12/72 (16%)
Query: 162 GERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEK 221
G GPPG PG GP G G PG PG G PG PGP G PPG GP G P
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPG--------PPGPPGPPGPP---- 48
Query: 222 GFRGSPGLPGTP 233
G G+PG PG P
Sbjct: 49 GPPGAPGAPGPP 60
Score = 28.6 bits (65), Expect = 2.8
Identities = 27/63 (42%), Positives = 27/63 (42%), Gaps = 7/63 (11%)
Query: 371 PGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAVWTSDNLIKGERGPPG 430
PG G G G G PG PGPPG PG G G G GPPG G G PG
Sbjct: 3 PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGP-------PGPPGAPG 55
Query: 431 LKG 433
G
Sbjct: 56 APG 58
Score = 27.5 bits (62), Expect = 6.6
Identities = 28/60 (46%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 189 GLPGYPGPKGDQGLS-ITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQ 247
G PG PGP G G GPPG GP G P G G G PG PG PG G G G+
Sbjct: 1 GPPGPPGPPGPPGPPGPPGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 46.1 bits (110), Expect = 1e-04
Identities = 34/227 (14%), Positives = 41/227 (18%), Gaps = 19/227 (8%)
Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFG 648
PG G G P P P P G E G
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAA---PAAPAPAGAAAAPAEASAAPAPG 646
Query: 649 KQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKG 708
P +P G G P G P P P G
Sbjct: 647 VAAPEHHPKHVAVPD--ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA-- 702
Query: 709 NAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGD 768
P P + +P +P P + D
Sbjct: 703 ----PAPAATPPAGQADDPAAQPPQAAQGASAPSPA-----ADDPVPLPPEPDDPPDPAG 753
Query: 769 LGPFGYPGP---PGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPGLQG 812
P P P PP E A + +
Sbjct: 754 APAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800
Score = 42.3 bits (100), Expect = 0.002
Identities = 42/220 (19%), Positives = 48/220 (21%), Gaps = 12/220 (5%)
Query: 626 PGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGD 685
GP GP + P P P G P G
Sbjct: 588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGV 647
Query: 686 PGPLGPPGPRGF-DGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPG 744
P P D + G DG AG P P G +
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA 707
Query: 745 LPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGP 804
P A PA P P P P P PP G P + P P
Sbjct: 708 TPPAGQADD-PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766
Query: 805 VGPPGLQGLKGDKGLPGPYASEVLQGPKENRDPRDGMDPG 844
P P S + + D MD
Sbjct: 767 AAAPA----------AAPPPSPPSEEEEMAEDDAPSMDDE 796
Score = 41.1 bits (97), Expect = 0.005
Identities = 32/193 (16%), Positives = 34/193 (17%), Gaps = 4/193 (2%)
Query: 541 GEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDK 600
PG G GP + P P P E G
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAA-AAPAEASAAPAPGVA 648
Query: 601 GREGPRGLPGYPSP--KGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGL 658
E P GD P PP P G Q AP
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708
Query: 659 NGLPGDIGEQGIPGLPGLQGLPGQK-GDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPG 717
P D PP P G + P
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768
Query: 718 QPGLPGFPGQRGE 730
P P E
Sbjct: 769 APAAAPPPSPPSE 781
Score = 37.3 bits (87), Expect = 0.072
Identities = 28/179 (15%), Positives = 34/179 (18%), Gaps = 11/179 (6%)
Query: 518 EKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVG 577
E +P P G+ P GV
Sbjct: 611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
Query: 578 RR-GDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPG 636
+ G P P P G + P+P P P P
Sbjct: 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ-----PAPAPAATPPAGQADDPAAQPPQAA 725
Query: 637 EKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPR 695
+ P EP P Q P P P PP P
Sbjct: 726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA-----PAPAAAPAAAPPPSPP 779
Score = 36.9 bits (86), Expect = 0.091
Identities = 34/172 (19%), Positives = 40/172 (23%), Gaps = 7/172 (4%)
Query: 684 GDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNP 743
PG G GP P G A P P P P P G P P
Sbjct: 590 PAPGAAGGEGPPA----PASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAP 645
Query: 744 GLPGKIGAPGIPAIGMPGEKGDKG--DLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGL 801
G+ P A+ + GD G PP P P P G P
Sbjct: 646 GVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
Query: 802 -AGPVGPPGLQGLKGDKGLPGPYASEVLQGPKENRDPRDGMDPGVYPDLWDP 852
A P ++ P + DP
Sbjct: 706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQ 757
Score = 35.3 bits (82), Expect = 0.26
Identities = 39/234 (16%), Positives = 47/234 (20%), Gaps = 21/234 (8%)
Query: 23 PGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPG 82
P GE PA + P P P G P G
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAA 649
Query: 83 PRGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGP 142
P P V + AP A G
Sbjct: 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPP-------------PAPAPAAPAAPAGA 696
Query: 143 PGLIGKPGPPGKAGLPGVEGERG-PPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG 201
P P + PP +P LP P D
Sbjct: 697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP----LPPEPDDPPDPA 752
Query: 202 LSITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEE 255
+ PP P A SP + + E RRD+EE
Sbjct: 753 GAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED---RRDAEE 803
Score = 35.3 bits (82), Expect = 0.30
Identities = 24/137 (17%), Positives = 31/137 (22%), Gaps = 11/137 (8%)
Query: 517 GEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFM-------GSKGEPGPRGP 569
P G P E + + G P VP G P P
Sbjct: 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP 681
Query: 570 QGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP----SPKGDKGEPGISL 625
P P G + P P + P P S + + L
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPL 741
Query: 626 PGPPGPPGYPGEKGEKG 642
P P P P +
Sbjct: 742 PPEPDDPPDPAGAPAQP 758
Score = 33.8 bits (78), Expect = 0.71
Identities = 28/158 (17%), Positives = 33/158 (20%), Gaps = 10/158 (6%)
Query: 479 GWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTG 538
P G P + H K G G P + G
Sbjct: 622 AAPAAPAPAGAAAAPA----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAA 677
Query: 539 SKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
P P P G ++ P P G P +G P
Sbjct: 678 PAAPPPAPAPAAPAAPAGAAPAQPAPAPA-ATPPAGQADDPAAQPPQAAQGASAPSPAAD 736
Query: 599 DKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPG 636
D LP P D P PP P
Sbjct: 737 DPVP-----LPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769
Score = 32.3 bits (74), Expect = 2.2
Identities = 19/126 (15%), Positives = 22/126 (17%), Gaps = 5/126 (3%)
Query: 1023 PGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGI 1082
+A P G G P G G + P P P
Sbjct: 655 KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
Query: 1083 PGARGDK-----GDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAP 1137
G P P + P P AP
Sbjct: 715 DDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAP 774
Query: 1138 CESTPD 1143
S P
Sbjct: 775 PPSPPS 780
Score = 31.9 bits (73), Expect = 2.8
Identities = 38/224 (16%), Positives = 47/224 (20%), Gaps = 10/224 (4%)
Query: 55 GRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPGITIKG 114
G P P G + P P + PA G P
Sbjct: 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA---GAAAAPAEASAAPAP 645
Query: 115 EKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQG 174
P H K+ G G P G P P P G Q
Sbjct: 646 GVAAPEHHPKHVAVPDASD----GGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701
Query: 175 -PQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTP 233
P P D P P G+P P P
Sbjct: 702 APAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPP 761
Query: 234 GLKGEQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVPRE 277
+ EEE + D+ + D +D E
Sbjct: 762 PAPAPAAAPAAAPPPSPPSEEEEMAE--DDAPSMDDEDRRDAEE 803
Score = 30.7 bits (70), Expect = 6.1
Identities = 16/103 (15%), Positives = 18/103 (17%), Gaps = 3/103 (2%)
Query: 1026 AGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGA 1085
A PA G+ P G E P P A
Sbjct: 622 AAPAAPAP-AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAA 680
Query: 1086 RGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPP 1128
P G + P P P P
Sbjct: 681 PPPAPAPAAP--AAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 43.1 bits (101), Expect = 0.001
Identities = 38/162 (23%), Positives = 48/162 (29%), Gaps = 17/162 (10%)
Query: 551 PRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPG 610
PR +P G P G+ G + G GP G G P
Sbjct: 363 PRLMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPA 422
Query: 611 YPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGI 670
P+P P +P PP P P P + G P +
Sbjct: 423 TPAP---SAAPSPRVPWDDAPPAPPRSGI-----PPRPAPRMPEASPVPGAPDSVASAS- 473
Query: 671 PGLPGLQGLPGQKGDPGPLGPPGPRGFDG----TPGRDGEKG 708
P G P D P GPR +DG GR+G+ G
Sbjct: 474 -DAPPTLGDP---SDTAEHTPSGPRTWDGFLEFCQGRNGQGG 511
Score = 38.1 bits (88), Expect = 0.035
Identities = 28/124 (22%), Positives = 38/124 (30%), Gaps = 9/124 (7%)
Query: 147 GKPGPPGKAGLPG-VEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSIT 205
G P + G G PG QGPQG G+ P P P +
Sbjct: 376 GGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRV- 434
Query: 206 GPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERE 265
P + P P R +P +P + G S + D+ D D E
Sbjct: 435 --PWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASAS-----DAPPTLGDPSDTAE 487
Query: 266 NQDS 269
+ S
Sbjct: 488 HTPS 491
Score = 33.1 bits (75), Expect = 1.3
Identities = 30/121 (24%), Positives = 35/121 (28%), Gaps = 8/121 (6%)
Query: 1024 GLAGPAGEKGFRG-SPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGI 1082
G A G + G T G QG +G+ G P P P
Sbjct: 376 GGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPS---- 431
Query: 1083 PGARGDKGDWGPQGLDGPVGPPGLRGEPGP-PGRPGLDGLQGNTGPPGENGLPGAPCEST 1141
P D P P P E P PG P D + + P G P E T
Sbjct: 432 PRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAP--DSVASASDAPPTLGDPSDTAEHT 489
Query: 1142 P 1142
P
Sbjct: 490 P 490
Score = 30.0 bits (67), Expect = 9.5
Identities = 26/123 (21%), Positives = 34/123 (27%), Gaps = 12/123 (9%)
Query: 517 GEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGV---PGFMGSKGEPGPRGPQGNP 573
G P G +G ++G PG GP+G G S P P P
Sbjct: 376 GGASAPSGSAAEG-----PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAP 430
Query: 574 GPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDK----GEPGISLPGPP 629
P D P + P G P + D G+P + P
Sbjct: 431 SPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTP 490
Query: 630 GPP 632
P
Sbjct: 491 SGP 493
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 41.1 bits (96), Expect = 0.004
Identities = 36/172 (20%), Positives = 48/172 (27%), Gaps = 4/172 (2%)
Query: 559 GSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKGDK 618
G K E G GP G+ P G E + P + P
Sbjct: 82 GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPP 141
Query: 619 GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGA-PGEPGLNGLPGDIGEQGIPGLPGLQ 677
PG P PP + + P + EP + D P
Sbjct: 142 SHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSS 201
Query: 678 GLPGQKGD-PGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQR 728
P D PG P P+ + + P +P + G P FPG R
Sbjct: 202 PPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDE-PTEPEREG-PPFPGHR 251
Score = 40.7 bits (95), Expect = 0.004
Identities = 30/146 (20%), Positives = 40/146 (27%), Gaps = 3/146 (2%)
Query: 497 GEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPG 556
GEK +G P +G+ V P G + G S P P + P P
Sbjct: 82 GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSP- 140
Query: 557 FMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYPSPKG 616
S P P + P + + + E P P S
Sbjct: 141 --PSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198
Query: 617 DKGEPGISLPGPPGPPGYPGEKGEKG 642
P S P PG P P +
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPS 224
Score = 38.4 bits (89), Expect = 0.022
Identities = 32/173 (18%), Positives = 40/173 (23%), Gaps = 9/173 (5%)
Query: 84 RGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPP 143
G G GGP+G P + E+ GL +N +P PP
Sbjct: 81 HGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP--ESPASHSPPP 138
Query: 144 GLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLS 203
PG P + P P +D P P Q +
Sbjct: 139 SPPSHPG-PHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197
Query: 204 ITGPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEE 256
P P P E G SP P E
Sbjct: 198 ----PTSSPPPQSPPDEPGEPQSPTPQQAPS--PNTQQAVEHEDEPTEPEREG 244
Score = 36.9 bits (85), Expect = 0.068
Identities = 34/174 (19%), Positives = 43/174 (24%), Gaps = 14/174 (8%)
Query: 49 GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFP---GSVGPGGPAGDIGPPGVP 105
G+ G+ GP G P G P P PA PP P
Sbjct: 82 GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPP 141
Query: 106 GLPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERG 165
PG E P H + + P P P +
Sbjct: 142 SHPG---PHEPAPPESHNPSPNQQPSSFLQPSHEDSP------EEPEPPTSEPEPDSPGP 192
Query: 166 PPGDPGIQGPQGKDGI--PGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLP 217
P + P + PG P P P Q + P E +G P
Sbjct: 193 PQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPP 246
Score = 36.9 bits (85), Expect = 0.071
Identities = 29/162 (17%), Positives = 40/162 (24%), Gaps = 3/162 (1%)
Query: 473 SRHGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPG 532
SRHG+K G G G E +G P P
Sbjct: 79 SRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPP 138
Query: 533 ERGSTGSKGEPGPPGRNGP---RGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKG 589
S EP PP + P + F+ E P P+ P E
Sbjct: 139 SPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198
Query: 590 EPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGP 631
P + P+ +P + + P P
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEP 240
Score = 35.7 bits (82), Expect = 0.18
Identities = 35/158 (22%), Positives = 41/158 (25%), Gaps = 14/158 (8%)
Query: 496 KGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVP 555
+G + E HG K E+GQ GG G T S +G P
Sbjct: 64 QGHRQTESDTETAEESRHGEKEERGQ-GGPSGSGSESVGSPTPSPSGSAEELASG--LSP 120
Query: 556 GFMGSKGEPGPRGPQGNPGPVGRRGDTGPM-----------GEKGEPGPMGFTGDKGREG 604
P P P G P P + E
Sbjct: 121 ENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEP 180
Query: 605 PRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKG 642
P P SP + E S P P PP PGE
Sbjct: 181 PTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPT 218
Score = 32.6 bits (74), Expect = 1.5
Identities = 25/144 (17%), Positives = 36/144 (25%), Gaps = 8/144 (5%)
Query: 173 QGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGT 232
QG G + P + GLS E P P P
Sbjct: 89 QGGPSGSGSESVGSPTPSPSGSAEELASGLS-----PENTSGSSPESPASHSPPPSPPSH 143
Query: 233 PGLKG---EQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVPRETKESRHPYLLQE 289
PG + S Q + +D +E E S+ +S P
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPP 203
Query: 290 RKDLLDHQGDKGWPGIDGTPGQDG 313
+ D G+ P P +
Sbjct: 204 PQSPPDEPGEPQSPTPQQAPSPNT 227
Score = 32.2 bits (73), Expect = 1.9
Identities = 22/125 (17%), Positives = 27/125 (21%), Gaps = 2/125 (1%)
Query: 1021 GYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLA 1080
G + P G S P P P P
Sbjct: 96 GSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESH 155
Query: 1081 GIPGARGDKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGL--PGAPC 1138
+ P D P P EP P + PP ++ PG P
Sbjct: 156 NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQ 215
Query: 1139 ESTPD 1143
TP
Sbjct: 216 SPTPQ 220
Score = 31.1 bits (70), Expect = 4.8
Identities = 22/153 (14%), Positives = 34/153 (22%), Gaps = 11/153 (7%)
Query: 124 KNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPG 183
++G G G G P P G A + P P
Sbjct: 80 RHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS-HSPPP 138
Query: 184 IPGKDGLPGYPGPKGDQGLSITGPPGERGPDGLPAGEKGFRGSPGLPGTP-----GLKGE 238
P P P P P + P P T
Sbjct: 139 SPPSHPGPHEPAPPESHN-----PSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPP 193
Query: 239 QGDKGSEAQLERRDSEEETKDQKDERENQDSQD 271
Q + + + + +E + Q + S +
Sbjct: 194 QSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPN 226
Score = 30.3 bits (68), Expect = 8.4
Identities = 25/153 (16%), Positives = 40/153 (26%), Gaps = 8/153 (5%)
Query: 319 GEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKG 378
GEK +G P +G+ V P G E G ++ P +
Sbjct: 82 GEKEERGQGGPSGSGSESVGSPTPSPSG---SAEELASGLSPENTSGSSPESPASHSPPP 138
Query: 379 DRGKDGLPGIPGPPGAPGMRGRDGDAGRDGPPGKDAV-----WTSDNLIKGERGPPGLKG 433
P P PP + + P +D+ TS+ P
Sbjct: 139 SPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198
Query: 434 DIGRDGPMGPPGPKGVIGLTGDRGFNGESGEKV 466
P P T + + + + V
Sbjct: 199 TSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAV 231
Score = 29.9 bits (67), Expect = 9.9
Identities = 22/164 (13%), Positives = 28/164 (17%), Gaps = 4/164 (2%)
Query: 15 GYPGSIGFPGVKGDKGERGPASPVINVKGDKGEPGKPGLEGRNGPPGERGPPGIPGFPGE 74
G P G V ++ + P P PP P PG
Sbjct: 90 GGPSGSGSESVGSPTPSPSGSAEEL---ASGLSPENTSGSSPESPASHSPPPSPPSHPGP 146
Query: 75 KGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLA 134
P P + + P +
Sbjct: 147 HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS 206
Query: 135 GEKGDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGK 178
G P P VE E P P
Sbjct: 207 PPDEP-GEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPG 249
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 39.1 bits (92), Expect = 0.019
Identities = 39/268 (14%), Positives = 68/268 (25%), Gaps = 45/268 (16%)
Query: 147 GKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIPGKD-GLPGYPGPKGDQGLSIT 205
G R + K P + +
Sbjct: 61 GGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAAS 120
Query: 206 GPPGERGPDGLPAGEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERE 265
P + + GE RG+ G G + + A+ + +E + + D +
Sbjct: 121 APEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDRED 180
Query: 266 NQDSQDIEVPRETKESRHPYLLQERKDLLDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGI 325
Q + R +E R G D + E+G
Sbjct: 181 RQAEAERGE-RGRREERGR------------------------DGDDRDRRDRREQGDRR 215
Query: 326 KGEPGINGTHGVKGEKGQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGL 385
+ +G G++ + + D + D G R G G +G R +D
Sbjct: 216 EERGRRDG-----GDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD-- 268
Query: 386 PGIPGPPGAPGMRGRDGDAGRDGPPGKD 413
R R G G DG ++
Sbjct: 269 ------------RDRRGRRGGDGGNERE 284
Score = 31.0 bits (71), Expect = 5.7
Identities = 20/92 (21%), Positives = 32/92 (34%)
Query: 529 GEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEK 588
+ ER G G + G G ++ + R + RRGD +
Sbjct: 125 AQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE 184
Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGE 620
E G G ++GR+G +GD+ E
Sbjct: 185 AERGERGRREERGRDGDDRDRRDRREQGDRRE 216
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 38.8 bits (90), Expect = 0.026
Identities = 62/297 (20%), Positives = 74/297 (24%), Gaps = 31/297 (10%)
Query: 531 PGERGSTGSKGEPG-PPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKG 589
P E T P PP PR P +G+P P P D P
Sbjct: 2577 PSEPAVTSRARRPDAPPQSARPR-APV--DDRGDPRGPAPPSPLPPDTHAPDPPP----- 2628
Query: 590 EPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLP----------GPPGPPGYPGEKG 639
P P + P +P P+ D +S P PP P +
Sbjct: 2629 -PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687
Query: 640 EKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLP-------GLQGLPGQKGDPGPLGPP 692
+ +G P P P + P Q P P P P
Sbjct: 2688 ARPTVGSLTSLADPPPPPPT--PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP 2745
Query: 693 GPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAP 752
G P R P P P P R + LP
Sbjct: 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
Query: 753 GIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGLPGEIGFPGLAGPVGPPG 809
PA + P G PP P PP G P G G V P G
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG--GSVAPGG 2860
Score = 31.1 bits (70), Expect = 6.6
Identities = 41/224 (18%), Positives = 55/224 (24%), Gaps = 20/224 (8%)
Query: 543 PGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGR 602
PP GP P R + + ++ P P
Sbjct: 2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817
Query: 603 EGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQG----------A 652
P P P P +P P P PP G G ++ A
Sbjct: 2818 LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA 2877
Query: 653 PGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGL 712
P P + L + + P P PP P+ P
Sbjct: 2878 PARPPVRRLARPAVSRSTESFA--LPPDQPERPPQPQAPPPPQ---PQPQPPPPPQPQPP 2932
Query: 713 PGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGL----PGKIGAP 752
P P +P P P G G P L PG++ P
Sbjct: 2933 PPPPPRPQPPLAP-TTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 37.7 bits (87), Expect = 0.040
Identities = 26/101 (25%), Positives = 30/101 (29%), Gaps = 16/101 (15%)
Query: 539 SKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTG 598
S G P PPG + P P E G G + G GEP P
Sbjct: 271 SGGSPAPPGDDRPEAKPE--PGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPG---- 324
Query: 599 DKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKG 639
P P+P D+ E SL PP P
Sbjct: 325 ----------PPRPAPDADRPEGWPSLEAITFPPPTPATPA 355
Score = 33.8 bits (77), Expect = 0.60
Identities = 27/93 (29%), Positives = 30/93 (32%), Gaps = 5/93 (5%)
Query: 95 PAGDIGPPGVPGLPGITIKGEKG--LPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPP 152
PA G P PG K E G G G+ G D G KPGPP
Sbjct: 268 PAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGE-PKPGPP 326
Query: 153 GKAGLPGVEGERGPPGDPGIQGPQGKDGIPGIP 185
A P + G P I P P +P
Sbjct: 327 RPA--PDADRPEGWPSLEAITFPPPTPATPAVP 357
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 35.6 bits (82), Expect = 0.19
Identities = 38/185 (20%), Positives = 45/185 (24%), Gaps = 35/185 (18%)
Query: 61 GERGPPGIPGFPGEKGDLGAPGPRGFPG-------SVGPGGPAGDIGPPGVPGLPGITIK 113
GE PP + +L PG R + G PG P D P P P +
Sbjct: 93 GEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQR 152
Query: 114 GEKG-LPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKP----GPPGKAGLPGVEGERGPPG 168
E G P G + Y P P AG P + R
Sbjct: 153 PEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYD 212
Query: 169 DPG-----------------------IQGPQGKDGIPGIPGKDGLPGYPGPKGDQGLSIT 205
P +G G P P PGP Q
Sbjct: 213 HPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAP 272
Query: 206 GPPGE 210
GP
Sbjct: 273 GPGEP 277
Score = 35.6 bits (82), Expect = 0.23
Identities = 54/217 (24%), Positives = 62/217 (28%), Gaps = 36/217 (16%)
Query: 539 SKGEPGPPGRNGPR------GVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPG 592
S GEP PP + R PG +G GPR PG R D P P
Sbjct: 91 SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGL--PRQDQLPTARPAYP- 147
Query: 593 PMGFTGDKGREGPRGLPG-YPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQG 651
R PG +P D G L PP P Y +
Sbjct: 148 ---------AYQQRPEPGAWPRAADDYGWQQQRLGFPPRAP-YASPASYAPEQERDREPY 197
Query: 652 APGEPGLNGLPGDIGEQGIPGLPGLQGLPGQKGDPGPLGPPG----PRGFDGTPGRDGEK 707
G P + P P + P PPG RG G P RD
Sbjct: 198 DAGRPEY---DQRRRDYDHPR--PDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDA- 251
Query: 708 GNAGLPGQPGQPGLPG-FPGQRGEMGLPGIEGEKGNP 743
P P +P PG Q PG + NP
Sbjct: 252 -----PVVPIRPSAPGPLAAQPAPAPGPGEPTARLNP 283
Score = 32.9 bits (75), Expect = 1.5
Identities = 41/170 (24%), Positives = 48/170 (28%), Gaps = 32/170 (18%)
Query: 518 EKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPR-GVPGFMGSKGEPGP-------RGP 569
E +PG Y+G G R G P R P + + EPG G
Sbjct: 109 ELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQ-QRPEPGAWPRAADDYGW 167
Query: 570 QGNP-GPVGRRGDTGPMGEKGEPGPMGFTGDKGR---EGPRGLPGYPSPKGDK------- 618
Q G R P E D GR + R +P P D+
Sbjct: 168 QQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTD 227
Query: 619 --------GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGA----PGEP 656
G PGPP P GP Q A PGEP
Sbjct: 228 RPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEP 277
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 35.4 bits (81), Expect = 0.30
Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 10/134 (7%)
Query: 161 EGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG--LSITGPPGERGPDGLPA 218
EGE D ++G +G +G G+ +G T E G L A
Sbjct: 780 EGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNA 839
Query: 219 GEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQKDERENQDSQDIEVP--- 275
+G G G G D G + E + EEE +++++E E ++ ++ E P
Sbjct: 840 ENQG-EAKQDEKGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896
Query: 276 --RETKESRHPYLL 287
ET++ + YL
Sbjct: 897 EWPETRQKQAIYLF 910
>gnl|CDD|218168 pfam04602, Arabinose_trans, Mycobacterial cell wall arabinan
synthesis protein. Arabinosyltransferase is involved in
arabinogalactan (AG) biosynthesis pathway in
mycobacteria. AG is a component of the macromolecular
assembly of the mycolyl-AG-peptidoglycan complex of the
cell wall. This enzyme has important clinical
applications as it is believed to be the target of the
antimycobacterial drug Ethambutol.
Length = 1075
Score = 34.0 bits (78), Expect = 0.75
Identities = 26/71 (36%), Positives = 32/71 (45%), Gaps = 8/71 (11%)
Query: 676 LQGLPGQKGDPGPLGPPGPRGF--DGTPGR---DGEKGNAGLPG--QPGQPGLPGF-PGQ 727
L LPGQ G GPLG P GF +G P D + N G P +P + + G
Sbjct: 736 LTPLPGQFGPLGPLGGVNPIGFTPNGVPDDLVADPVRENPGQPNSDYSDKPNVTDWEAGT 795
Query: 728 RGEMGLPGIEG 738
G + PGI G
Sbjct: 796 GGGLAAPGING 806
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 33.8 bits (77), Expect = 0.77
Identities = 49/262 (18%), Positives = 69/262 (26%), Gaps = 15/262 (5%)
Query: 565 GPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRGLPGYP--SPKGD----K 618
G RGPQ P G + G PG + G P
Sbjct: 83 GTRGPQMGPMGPGPGR--PMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGT 140
Query: 619 GEPGISLPGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPGDIGEQGIPGLPGLQG 678
+PG G + + +GP Q G+N Q P G G
Sbjct: 141 MQPGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPG 200
Query: 679 LPGQKGDP------GPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPGFPGQRGEMG 732
+PG G P G ++ ++ P Q+G+ G
Sbjct: 201 MPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGG 260
Query: 733 LPGIEGEKGNPGLPGKIGAPGIPAIGMPGEKGDKGDLGPFGYPGPPGLPGPMGPPGLDGL 792
+ + + G +P P + G G PMGPP
Sbjct: 261 MQPQQMQGGQMQVP-MQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPG 319
Query: 793 PGEIGFPGLAGPVGPPGLQGLK 814
G + V G Q LK
Sbjct: 320 AVPQGGQAVQQGVMSAGQQQLK 341
Score = 31.5 bits (71), Expect = 3.7
Identities = 29/135 (21%), Positives = 44/135 (32%), Gaps = 4/135 (2%)
Query: 79 GAPGPRGFPGSVGPGGPAGD-IGPPGVPGLPGITIKGEKGLPGLHGKNGRDGAPGLAGEK 137
G GP+ P GPG P G +G PG ++ +P G + +
Sbjct: 83 GTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNVRGQMPMGAAGMGPHQMSRVGTMQ 142
Query: 138 GDYGPPGLIGKPGPPGKAGLPGVEGERGPPGDPGIQGPQGKDGIP---GIPGKDGLPGYP 194
G++ + ++ P G + G P P + G PG P
Sbjct: 143 PGGQAGGMMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMP 202
Query: 195 GPKGDQGLSITGPPG 209
G G + G PG
Sbjct: 203 GGGGQGQMQQQGQPG 217
>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
Length = 293
Score = 33.2 bits (76), Expect = 0.93
Identities = 16/53 (30%), Positives = 18/53 (33%)
Query: 528 KGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRG 580
G G RG G G G G G G +G G G +G G G
Sbjct: 4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56
Score = 32.0 bits (73), Expect = 2.2
Identities = 18/56 (32%), Positives = 18/56 (32%)
Query: 49 GKPGLEGRNGPPGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGV 104
G G G G G G G G G RG G G G G GP V
Sbjct: 2 MGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGKV 57
Score = 31.6 bits (72), Expect = 2.7
Identities = 17/52 (32%), Positives = 19/52 (36%)
Query: 514 GVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGPRGVPGFMGSKGEPG 565
G G +G G G G G G G +G GR G G G G G
Sbjct: 5 GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56
Score = 30.9 bits (70), Expect = 4.0
Identities = 18/52 (34%), Positives = 20/52 (38%)
Query: 556 GFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGPRG 607
GF G +G G G G GR G G +G G G G G GP
Sbjct: 5 GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56
Score = 29.7 bits (67), Expect = 9.8
Identities = 18/52 (34%), Positives = 18/52 (34%)
Query: 357 GAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRDGDAGRDG 408
G G GG G G RG G G G G G G RG G G
Sbjct: 5 GFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGK 56
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 32.5 bits (74), Expect = 1.8
Identities = 21/84 (25%), Positives = 23/84 (27%), Gaps = 2/84 (2%)
Query: 492 IPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGPPGRNGP 551
IPG E IK EP NG G GG + + R G P
Sbjct: 373 IPGY--EPDPSIKAEPIQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKP 430
Query: 552 RGVPGFMGSKGEPGPRGPQGNPGP 575
G GE R P
Sbjct: 431 SRRLGDAKPAGEQQRRRRPRKPAA 454
Score = 30.5 bits (69), Expect = 6.6
Identities = 20/84 (23%), Positives = 27/84 (32%), Gaps = 6/84 (7%)
Query: 157 LPGVEGERGPPGDPGIQGPQGKDGIPGIPGKDGLPGYPGPKGDQG------LSITGPPGE 210
+PG E + +P G Q + G G P+ +G P
Sbjct: 373 IPGYEPDPSIKAEPIQNGRQQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSR 432
Query: 211 RGPDGLPAGEKGFRGSPGLPGTPG 234
R D PAGE+ R P P
Sbjct: 433 RLGDAKPAGEQQRRRRPRKPAAAQ 456
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 32.3 bits (73), Expect = 2.2
Identities = 28/102 (27%), Positives = 32/102 (31%), Gaps = 17/102 (16%)
Query: 166 PPGDPGIQGPQGKDGIPGIPGKDGLPG-------YPGPKGDQGLSITGPPGERGPDGLPA 218
P D +G D G D G Y G + G E DG P
Sbjct: 60 APADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGGKDEIMADGAPK 119
Query: 219 GEKGFRGSPGLPGTPGLKGEQGDKGSEAQLERRDSEEETKDQ 260
G R LK +G G E + ERR EEE Q
Sbjct: 120 A--GVRD--------ELKDGEGPPGGEGEAERRADEEELAQQ 151
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.2 bits (70), Expect = 4.5
Identities = 59/259 (22%), Positives = 77/259 (29%), Gaps = 18/259 (6%)
Query: 531 PGERGSTGSKGEPGPPGRNGPRGVPGFM--GSKGEPGPRGPQGNPGPVGRRGDTGPMGEK 588
P R S PGPP + + P F+ S P P G + P
Sbjct: 259 PSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSH 318
Query: 589 GEPGPMGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGEKGLIGPFG 648
P + + ++ PR P P+P +P + P P P PF
Sbjct: 319 TPPSQ---SALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFP 375
Query: 649 KQGA-----PGEPGLNGLPGDIGEQGIPGLPGLQGLP-GQKGDPGPLGPPGPRGFDGTPG 702
+ + P L+ LP P P LQ +P Q P PP P
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHP--PPLQLMPQSQPLQSVPAQPPVLTQSQSLPP 433
Query: 703 RDGEKGNAGLPGQPGQPGLPGFPGQRGEMGLPGIEGEKGNPGLPGKIGAPGIPAIGMPGE 762
+ ++GL P Q P G GLP I P LP A A
Sbjct: 434 KASTHPHSGLHSGPPQSPFAQHPFTSG--GLPAIGPP---PSLPTSTPAAPPRASSGSQP 488
Query: 763 KGDKGDLGPFGYPGPPGLP 781
G P LP
Sbjct: 489 PGSALPSSGGCAGPGPPLP 507
>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
Members of this family include high molecular weight
subunits of glutenin. This group of gluten proteins is
thought to be largely responsible for the elastic
properties of gluten, and hence, doughs. Indeed,
glutenin high molecular weight subunits are classified
as elastomeric proteins, because the glutenin network
can withstand significant deformations without breaking,
and return to the original conformation when the stress
is removed. Elastomeric proteins differ considerably in
amino acid sequence, but they are all polymers whose
subunits consist of elastomeric domains, composed of
repeated motifs, and non-elastic domains that mediate
cross-linking between the subunits. The elastomeric
domain motifs are all rich in glycine residues in
addition to other hydrophobic residues. High molecular
weight glutenin subunits have an extensive central
elastomeric domain, flanked by two terminal non-elastic
domains that form disulphide cross-links. The central
elastomeric domain is characterized by the following
three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
possesses overlapping beta-turns within and between the
repeated motifs, and assumes a regular helical secondary
structure with a diameter of approx. 1.9 nm and a pitch
of approx. 1.5 nm.
Length = 779
Score = 31.2 bits (69), Expect = 5.5
Identities = 54/244 (22%), Positives = 86/244 (35%), Gaps = 2/244 (0%)
Query: 486 TPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGIGYKGEPGERGSTGSKGEPGP 545
+ Q G G+ +G+PG T ++ +GQPG + +G + +
Sbjct: 440 SLQQPGQGQQPGQWQQPGQGQPGYYPTSLLQPGQGQPGHDPASLQQPGQGQQPGQLQQPA 499
Query: 546 PGRNGPRGVPGFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGPMGFTGDKGREGP 605
G+ G + G G + +G Q G G++ G G++ G G++G++
Sbjct: 500 QGQPGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQGQQGQQPGQGQHPAQGEQGQQPG 559
Query: 606 RGLPGYPSPKGDKGEPGISL--PGPPGPPGYPGEKGEKGLIGPFGKQGAPGEPGLNGLPG 663
+G G +G + G P P G + G+ G + + G P
Sbjct: 560 QGQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQWQQPGQGQPGYYLTSPL 619
Query: 664 DIGEQGIPGLPGLQGLPGQKGDPGPLGPPGPRGFDGTPGRDGEKGNAGLPGQPGQPGLPG 723
+G+ P PGQ PG G P G A GQ Q G
Sbjct: 620 QLGQGQQGYYPTSLQQPGQGQQPGQWQQSGQGQHGHCPMSPQLSGQAQQLGQGQQIGQVL 679
Query: 724 FPGQ 727
PGQ
Sbjct: 680 QPGQ 683
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 30.9 bits (70), Expect = 5.6
Identities = 22/107 (20%), Positives = 31/107 (28%), Gaps = 14/107 (13%)
Query: 538 GSKGEP---GPPGRNGPRGVP-GFMGSKGEPGPRGPQGNPGPVGRRGDTGPMGEKGEPGP 593
G+ G+P G + G P G+ P P GP G P G PM P
Sbjct: 393 GAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLA----PMNAVRAPSR 448
Query: 594 MGFTGDKGREGPRGLPGYPSPKGDKGEPGISLPGPPGPPGYPGEKGE 640
+ + P + P P P + G+
Sbjct: 449 NA----QNAAQKPPMQPVMYPPNYQSLP--LSQDLPQPQSTASQGGQ 489
>gnl|CDD|184518 PRK14119, gpmA, phosphoglyceromutase; Provisional.
Length = 228
Score = 30.2 bits (68), Expect = 6.1
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 269 SQDIEVPRETKESRHPYLLQERKDLLDHQ 297
S D++ P ET+E R YL R + LD +
Sbjct: 116 SYDVKPPAETEEQREAYLADRRYNHLDKR 144
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.8 bits (69), Expect = 6.7
Identities = 59/245 (24%), Positives = 83/245 (33%), Gaps = 35/245 (14%)
Query: 236 KGEQGDKGSEAQLERRDSEEETKDQ-----KDERENQDSQDIEVPR--ETKESRHPYLLQ 288
+ Q KGS+ E D + E +D KD + ++ +Q T Q
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQ 163
Query: 289 ERKDL-LDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPGGI 347
+ L + P + + G+PGL G G G+ G G GV G+
Sbjct: 164 SQTGLGASGSHAQQDPAVSQSGVV-GVPGL-GVPGVGVPGGGGAGALPGV--------GV 213
Query: 348 GYKGEPGQDGAKGDSGGRCIGCLPGARGEKGD------RGKDGLPGIPGPPGAPGMRGRD 401
G G G G G +G L +G R DG PG G PG+R D
Sbjct: 214 GRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQDDQERDGDGRVIEPG-VGLPGVRVGD 272
Query: 402 GDAGR--DGPPGKDAVWTSDNLIKGERGPPGLKGDIGR-------DGPMGPPG-PKGVIG 451
+ P G T + G PG GP+ PG P+ + G
Sbjct: 273 STSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITG 332
Query: 452 LTGDR 456
G+R
Sbjct: 333 QMGER 337
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 30.6 bits (70), Expect = 6.9
Identities = 29/100 (29%), Positives = 43/100 (43%)
Query: 286 LLQERKDLLDHQGDKGWPGIDGTPGQDGIPGLKGEKGHGIKGEPGINGTHGVKGEKGQPG 345
L+E++ L D + G+ +G +G G G+ G G +G++GQ G
Sbjct: 596 TLREQQGLSDETFRDLQEQFNAQRGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGG 655
Query: 346 GIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGL 385
G + A D GR G LPG GE G+ +D L
Sbjct: 656 QGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEAARDAL 695
Score = 30.3 bits (69), Expect = 8.3
Identities = 24/67 (35%), Positives = 29/67 (43%), Gaps = 2/67 (2%)
Query: 342 GQPGGIGYKGEPGQDGAKGDSGGRCIGCLPGARGEKGDRGKDGLPGIPGPPGAPGMRGRD 401
GQPG G +G+ Q G +G G G L + D G+PG G G RD
Sbjct: 634 GQPGQQGQQGQGQQQGQQGQGGQGGQGSLAERQQALRDELGRQRGGLPGMGGEAGEAARD 693
Query: 402 --GDAGR 406
G AGR
Sbjct: 694 ALGRAGR 700
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family consists
of several Mycoplasma species specific Cytadhesin P32 and
P30 proteins. P30 has been found to be membrane
associated and localised on the tip organelle. It is
thought that it is important in cytadherence and
virulence.
Length = 279
Score = 30.0 bits (67), Expect = 7.5
Identities = 26/95 (27%), Positives = 35/95 (36%)
Query: 1008 NLRQRISRCQLFLGYPGLAGPAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGY 1067
N+ R Q+ PG+ GF PG+P PG + R G QPG
Sbjct: 181 NMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGG 240
Query: 1068 RGPPGPQGLPGLAGIPGARGDKGDWGPQGLDGPVG 1102
PG P + PG ++G P + GP
Sbjct: 241 FNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRA 275
Score = 30.0 bits (67), Expect = 9.2
Identities = 30/131 (22%), Positives = 47/131 (35%), Gaps = 8/131 (6%)
Query: 48 PGKPGLEGRNGP-PGERGPPGIPGFPGEKGDLGAPGPRGFPGSVGPGGPAGDIGPPGVPG 106
+P ++ + GP P +R P GFP + PG P + G + PG
Sbjct: 152 NPQPQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMP--GMPPNQMRPGFNP 209
Query: 107 LPGITIKGEKGLPGLHGKNGRDGAPGLAGEKGDYGPPGLIGKPGPPGKAGLPGVEGERGP 166
+PG+ + G PG + G + PG P + G +G P
Sbjct: 210 MPGMPPR--PGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPP 267
Query: 167 PGDPGIQGPQG 177
P + GP+
Sbjct: 268 ---PHMAGPRA 275
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 30.5 bits (69), Expect = 8.2
Identities = 18/118 (15%), Positives = 23/118 (19%)
Query: 1028 PAGEKGFRGSPGLPGTPGLKGEQGDKGSEGREGKPGQPGYRGPPGPQGLPGLAGIPGARG 1087
E FR L + S+ + G GP PG A P
Sbjct: 790 VRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPAR 849
Query: 1088 DKGDWGPQGLDGPVGPPGLRGEPGPPGRPGLDGLQGNTGPPGENGLPGAPCESTPDYM 1145
+ G G P P P A +
Sbjct: 850 SSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRP 907
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.145 0.469
Gapped
Lambda K H
0.267 0.0613 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 74,565,324
Number of extensions: 7833643
Number of successful extensions: 6331
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5263
Number of HSP's successfully gapped: 392
Length of query: 1375
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1266
Effective length of database: 6,103,016
Effective search space: 7726418256
Effective search space used: 7726418256
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 65 (29.1 bits)