RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5478
(1009 letters)
>gnl|CDD|215716 pfam00100, Zona_pellucida, Zona pellucida-like domain.
Length = 252
Score = 120 bits (302), Expect = 3e-30
Identities = 48/266 (18%), Positives = 85/266 (31%), Gaps = 39/266 (14%)
Query: 1 MRVNIEFDRPF-YGMIFSKGYYSDSHCVHLKPGTGHLSATFEIFLNSCGMSSSANHAVGG 59
M V + D F G+ S + D C T A F L+SCG +
Sbjct: 7 MTVVVSKDLTFTPGLYLSSLHLGDPSCRP--VETNSTHAIFRFPLSSCGTTR-------- 56
Query: 60 VTTPSGSYVENTIIIQYDPYVQEV----WDQARKLRCTWYDFYEKAVTFRPFQVDMLHAV 115
T G EN I+ Q V D+ + C Y +
Sbjct: 57 QRTGDGIVYENEIVSQPSVSVGPPITRDSDRRLPVSC----SYTSRSVSVSSVAVLPTVP 112
Query: 116 TANFLG---DNLQCWMQIQVGKGPWASEV-SGIVKIGQTMTMVLAIKDDENKFD--MLVR 169
+ +L +++ + +++G + + + + +
Sbjct: 113 PVSPSPSGEGSLTFSLRLY--TDDSYTSDYPVTIELGDPLYVEATVSVLPRSDPLVLFLD 170
Query: 170 NCVAHDGKRAP----IQLVDQHGCVVRPKIMSKFQKIKNFGPSASVVSFAYFQAFKFPDS 225
+C A G L+ ++GC V S + + F+ F+AF+FP
Sbjct: 171 SCWATPGPDPDSSPRYDLI-ENGCPVDGDSTSTLS---HPVGESHTARFS-FKAFRFPGD 225
Query: 226 MN-VHFQCVIQVCRYNCPE--PRCGG 248
+ V+ C ++VC + P P C
Sbjct: 226 SSQVYIHCSVKVCDKSDPSCKPTCSR 251
Score = 110 bits (276), Expect = 7e-27
Identities = 42/246 (17%), Positives = 77/246 (31%), Gaps = 38/246 (15%)
Query: 515 YYSDSHCVHLKPGTGHLSATFEIFLNSCGMSSSANHAVGGVTTPSGSYVENTIIIQYDPY 574
+ D C T A F L+SCG + T G EN I+ Q
Sbjct: 27 HLGDPSCRP--VETNSTHAIFRFPLSSCGTTR--------QRTGDGIVYENEIVSQPSVS 76
Query: 575 VQEV----WDQARKLRCTWYDFYEKAVTFRPFQVDMLHAVTANFLG---DNLQCWMQIQV 627
V D+ + C Y + + +L +++
Sbjct: 77 VGPPITRDSDRRLPVSC----SYTSRSVSVSSVAVLPTVPPVSPSPSGEGSLTFSLRLY- 131
Query: 628 GKGPWASEV-SGIVKIGQTMTMVLAIKDDENKFD--MLVRNCVAHDGKRAP----IQLVD 680
+ +++G + + + + + +C A G L+
Sbjct: 132 -TDDSYTSDYPVTIELGDPLYVEATVSVLPRSDPLVLFLDSCWATPGPDPDSSPRYDLI- 189
Query: 681 QHGCVVRPKIMSKFQKIKNFGPSASVVSFAYFQAFKFPDSMN-VHFQCVIQVCRYNCPE- 738
++GC V S + + F+ F+AF+FP + V+ C ++VC + P
Sbjct: 190 ENGCPVDGDSTSTLS---HPVGESHTARFS-FKAFRFPGDSSQVYIHCSVKVCDKSDPSC 245
Query: 739 -PRCGG 743
P C
Sbjct: 246 KPTCSR 251
>gnl|CDD|214579 smart00241, ZP, Zona pellucida (ZP) domain. ZP proteins are
responsible for sperm-adhesion fo the zona pellucida. ZP
domains are also present in multidomain transmembrane
proteins such as glycoprotein GP2, uromodulin and
TGF-beta receptor type III (betaglycan).
Length = 252
Score = 73.6 bits (181), Expect = 2e-14
Identities = 55/262 (20%), Positives = 88/262 (33%), Gaps = 33/262 (12%)
Query: 1 MRVNIEFDRPFYGMIFSKGYY-SDSHCVHLKPGTGHLSATFEIFLNSCGMSSSANHAVGG 59
M V++ D F G I KG D C +FE+ LN CG N
Sbjct: 7 MVVSVSTDLLFPGGINVKGLTLGDPSCRPQFTDATSAFVSFEVPLNGCGTRRQVN----- 61
Query: 60 VTTPSGSYVENTIIIQYDPY--VQEVWDQARKLRCTWYDFYEKA--VTFRPFQVDMLHAV 115
P G NT+++ + A +C FY + V+ +
Sbjct: 62 ---PDGIVYSNTLVVSPFHPGFITRDDRAAYHFQC----FYPENEKVSLNLDVSTIPPTE 114
Query: 116 TANFLGDNLQCWMQIQVGKGPWASEV-SGIVKIGQTMTMVLAIK-DDENKFDMLVRNCVA 173
++ L C ++ + S S +G + D+ +LV NC A
Sbjct: 115 LSSVSEGPLTCSYRLYKD-DSFGSPYQSADYVLGDPVYHEWECDGADDPPLGLLVDNCYA 173
Query: 174 HDGKRA----PIQLVDQHGCVVRPKIMSKFQKIKNFGPSASVVSFAYFQAFKFPDSMNVH 229
G ++D +GC V + S + + + + FKF D V+
Sbjct: 174 TPGPDPSSGPKYFIID-NGCPVDGYLDS----TIPYNSNPLHRARFSVKVFKFADRSLVY 228
Query: 230 FQCVIQVCR----YNCPEPRCG 247
F C I++C +C P C
Sbjct: 229 FHCQIRLCDKDDGSSCDGPACS 250
Score = 66.3 bits (162), Expect = 9e-12
Identities = 47/244 (19%), Positives = 78/244 (31%), Gaps = 32/244 (13%)
Query: 513 TRYYSDSHCVHLKPGTGHLSATFEIFLNSCGMSSSANHAVGGVTTPSGSYVENTIIIQYD 572
D C +FE+ LN CG N P G NT+++
Sbjct: 25 GLTLGDPSCRPQFTDATSAFVSFEVPLNGCGTRRQVN--------PDGIVYSNTLVVSPF 76
Query: 573 PY--VQEVWDQARKLRCTWYDFYEKA--VTFRPFQVDMLHAVTANFLGDNLQCWMQIQVG 628
+ A +C FY + V+ + ++ L C ++
Sbjct: 77 HPGFITRDDRAAYHFQC----FYPENEKVSLNLDVSTIPPTELSSVSEGPLTCSYRLYKD 132
Query: 629 KGPWASEV-SGIVKIGQTMTMVLAIK-DDENKFDMLVRNCVAHDGKRA----PIQLVDQH 682
+ S S +G + D+ +LV NC A G ++D +
Sbjct: 133 -DSFGSPYQSADYVLGDPVYHEWECDGADDPPLGLLVDNCYATPGPDPSSGPKYFIID-N 190
Query: 683 GCVVRPKIMSKFQKIKNFGPSASVVSFAYFQAFKFPDSMNVHFQCVIQVCR----YNCPE 738
GC V + S + + + + FKF D V+F C I++C +C
Sbjct: 191 GCPVDGYLDS----TIPYNSNPLHRARFSVKVFKFADRSLVYFHCQIRLCDKDDGSSCDG 246
Query: 739 PRCG 742
P C
Sbjct: 247 PACS 250
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 42.2 bits (99), Expect = 0.002
Identities = 39/138 (28%), Positives = 46/138 (33%), Gaps = 25/138 (18%)
Query: 265 GPPPP-PPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPA---ASV 320
PPPP PP + VP P PR PS PA ++P+ P A A V
Sbjct: 2550 DPPPPLPPAAPPAAPDRSVPP------PRPAPR-PSEPAVTSRARRPDAPPQSARPRAPV 2602
Query: 321 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGR 380
+ +P PAP P D PPPP P A E
Sbjct: 2603 DDRGDPRGPAPPSPLPP---DTHAPDPPPPSPSPAAN-----------EPDPHPPPTVPP 2648
Query: 381 PRSVEIFPDELRGARRRR 398
P P R +R RR
Sbjct: 2649 PERPRDDPAPGRVSRPRR 2666
Score = 42.2 bits (99), Expect = 0.002
Identities = 39/138 (28%), Positives = 46/138 (33%), Gaps = 25/138 (18%)
Query: 760 GPPPP-PPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPA---ASV 815
PPPP PP + VP P PR PS PA ++P+ P A A V
Sbjct: 2550 DPPPPLPPAAPPAAPDRSVPP------PRPAPR-PSEPAVTSRARRPDAPPQSARPRAPV 2602
Query: 816 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGR 875
+ +P PAP P D PPPP P A E
Sbjct: 2603 DDRGDPRGPAPPSPLPP---DTHAPDPPPPSPSPAAN-----------EPDPHPPPTVPP 2648
Query: 876 PRSVEIFPDELRGARRRR 893
P P R +R RR
Sbjct: 2649 PERPRDDPAPGRVSRPRR 2666
Score = 38.4 bits (89), Expect = 0.027
Identities = 20/108 (18%), Positives = 30/108 (27%), Gaps = 17/108 (15%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSG--------------PAGAYSEQ--- 308
P PPPP + +E PE P+ A A S
Sbjct: 2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP 2683
Query: 309 KPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 356
+ S+++ +P P P + +PP P A
Sbjct: 2684 RRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQA 2731
Score = 38.4 bits (89), Expect = 0.027
Identities = 20/108 (18%), Positives = 30/108 (27%), Gaps = 17/108 (15%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSG--------------PAGAYSEQ--- 803
P PPPP + +E PE P+ A A S
Sbjct: 2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP 2683
Query: 804 KPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 851
+ S+++ +P P P + +PP P A
Sbjct: 2684 RRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQA 2731
Score = 38.0 bits (88), Expect = 0.035
Identities = 25/110 (22%), Positives = 35/110 (31%), Gaps = 11/110 (10%)
Query: 263 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEIN-PRHPSGPAGAYSEQKPEVVPAPAASVS 321
P P P V S P P+ PR P G P P+P +
Sbjct: 2572 RPAPRPSEPAVTSRARRPDAP-------PQSARPRAPVDDRGDPRGPAP---PSPLPPDT 2621
Query: 322 NSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 371
++P+P P+P ++ +PPP P V R G
Sbjct: 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG 2671
Score = 38.0 bits (88), Expect = 0.035
Identities = 25/110 (22%), Positives = 35/110 (31%), Gaps = 11/110 (10%)
Query: 758 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEIN-PRHPSGPAGAYSEQKPEVVPAPAASVS 816
P P P V S P P+ PR P G P P+P +
Sbjct: 2572 RPAPRPSEPAVTSRARRPDAP-------PQSARPRAPVDDRGDPRGPAP---PSPLPPDT 2621
Query: 817 NSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 866
++P+P P+P ++ +PPP P V R G
Sbjct: 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG 2671
Score = 35.7 bits (82), Expect = 0.19
Identities = 27/108 (25%), Positives = 40/108 (37%), Gaps = 7/108 (6%)
Query: 265 GPP-PPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNS 323
GPP P PP + G + P ++ S P+ P V APAA++ +
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLT---RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 324 PNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 371
+P+ P P +S + PP P P +V G V
Sbjct: 2822 ASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSVAPGGDVRRRP 2866
Score = 35.7 bits (82), Expect = 0.19
Identities = 27/108 (25%), Positives = 40/108 (37%), Gaps = 7/108 (6%)
Query: 760 GPP-PPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNS 818
GPP P PP + G + P ++ S P+ P V APAA++ +
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLT---RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 819 PNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 866
+P+ P P +S + PP P P +V G V
Sbjct: 2822 ASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSVAPGGDVRRRP 2866
Score = 34.5 bits (79), Expect = 0.39
Identities = 18/95 (18%), Positives = 30/95 (31%), Gaps = 8/95 (8%)
Query: 266 PPPPPPHVNS------IGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 319
P P PH G P P+GPA +P P A
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766
Query: 320 VSNSP--NPSSPAPQKSSPSSISDNEVHMPPPPLP 352
+ +P P++ P++ + +++ P P
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
Score = 34.5 bits (79), Expect = 0.39
Identities = 18/95 (18%), Positives = 30/95 (31%), Gaps = 8/95 (8%)
Query: 761 PPPPPPHVNS------IGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 814
P P PH G P P+GPA +P P A
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766
Query: 815 VSNSP--NPSSPAPQKSSPSSISDNEVHMPPPPLP 847
+ +P P++ P++ + +++ P P
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
Score = 33.8 bits (77), Expect = 0.60
Identities = 23/108 (21%), Positives = 29/108 (26%), Gaps = 1/108 (0%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
PEP A L G + P P G GGP P P +GP
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA-RPPTTAGP 2766
Query: 302 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPP 349
P P + + S SP +D + P
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAP 2814
Score = 33.8 bits (77), Expect = 0.60
Identities = 23/108 (21%), Positives = 29/108 (26%), Gaps = 1/108 (0%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
PEP A L G + P P G GGP P P +GP
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA-RPPTTAGP 2766
Query: 797 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPP 844
P P + + S SP +D + P
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAP 2814
Score = 33.4 bits (76), Expect = 0.88
Identities = 19/101 (18%), Positives = 28/101 (27%), Gaps = 5/101 (4%)
Query: 252 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 311
A F G G P S VP P P P+ P + +
Sbjct: 388 ARHAATPFARGPGGDDQTRPAAPVPAS---VPTPAPTPVPASAPPPPATPLPSAEPGSDD 444
Query: 312 VVPAPAASVSNSP--NPSSPAPQKSSPSSISDNEVHMPPPP 350
P +P P+ P ++ ++ PP P
Sbjct: 445 GPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEP 485
Score = 33.4 bits (76), Expect = 0.88
Identities = 19/101 (18%), Positives = 28/101 (27%), Gaps = 5/101 (4%)
Query: 747 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 806
A F G G P S VP P P P+ P + +
Sbjct: 388 ARHAATPFARGPGGDDQTRPAAPVPAS---VPTPAPTPVPASAPPPPATPLPSAEPGSDD 444
Query: 807 VVPAPAASVSNSP--NPSSPAPQKSSPSSISDNEVHMPPPP 845
P +P P+ P ++ ++ PP P
Sbjct: 445 GPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEP 485
Score = 31.8 bits (72), Expect = 2.7
Identities = 29/119 (24%), Positives = 34/119 (28%), Gaps = 27/119 (22%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
P G AP G GPPPPP P P G
Sbjct: 258 PPVVGEGADRAP----ETARGATGPPPPPE----------------AAAPNGAAAPPDGV 297
Query: 302 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQT 360
GA P +PAP P+P PAP + ++ PLP Y
Sbjct: 298 WGAALAGAPLALPAP-------PDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPL 349
Score = 31.8 bits (72), Expect = 2.7
Identities = 29/119 (24%), Positives = 34/119 (28%), Gaps = 27/119 (22%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
P G AP G GPPPPP P P G
Sbjct: 258 PPVVGEGADRAP----ETARGATGPPPPPE----------------AAAPNGAAAPPDGV 297
Query: 797 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQT 855
GA P +PAP P+P PAP + ++ PLP Y
Sbjct: 298 WGAALAGAPLALPAP-------PDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPL 349
Score = 31.4 bits (71), Expect = 3.2
Identities = 37/167 (22%), Positives = 49/167 (29%), Gaps = 25/167 (14%)
Query: 252 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 311
P P PP + G P P P+ PA +
Sbjct: 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRR 2782
Query: 312 VVPAPAASVSNS----PNPSSPA-------------PQKSSPSSIS---DNEVHMPPPPL 351
+ AS+S S P+P PA P +SP+ + PPP
Sbjct: 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
Query: 352 PGHGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGARRRR 398
PG + G+V+ GGD V RS P RR
Sbjct: 2843 PGPPPPSLPL--GGSVA-PGGD-VRRRPPSRSPAAKPAAPARPPVRR 2885
Score = 31.4 bits (71), Expect = 3.2
Identities = 37/167 (22%), Positives = 49/167 (29%), Gaps = 25/167 (14%)
Query: 747 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 806
P P PP + G P P P+ PA +
Sbjct: 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRR 2782
Query: 807 VVPAPAASVSNS----PNPSSPA-------------PQKSSPSSIS---DNEVHMPPPPL 846
+ AS+S S P+P PA P +SP+ + PPP
Sbjct: 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
Query: 847 PGHGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGARRRR 893
PG + G+V+ GGD V RS P RR
Sbjct: 2843 PGPPPPSLPL--GGSVA-PGGD-VRRRPPSRSPAAKPAAPARPPVRR 2885
Score = 31.4 bits (71), Expect = 3.6
Identities = 24/77 (31%), Positives = 28/77 (36%), Gaps = 17/77 (22%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQK-----PEVVPAPAASV 320
PPPPPP P P P +P P+GA + P V P V
Sbjct: 2931 PPPPPP---------PRPQ--PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979
Query: 321 SNSPNPSSPAPQKSSPS 337
P PS AP S+P
Sbjct: 2980 P-QPAPSREAPASSTPP 2995
Score = 31.4 bits (71), Expect = 3.6
Identities = 24/77 (31%), Positives = 28/77 (36%), Gaps = 17/77 (22%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQK-----PEVVPAPAASV 815
PPPPPP P P P +P P+GA + P V P V
Sbjct: 2931 PPPPPP---------PRPQ--PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979
Query: 816 SNSPNPSSPAPQKSSPS 832
P PS AP S+P
Sbjct: 2980 P-QPAPSREAPASSTPP 2995
Score = 31.1 bits (70), Expect = 4.8
Identities = 17/85 (20%), Positives = 23/85 (27%), Gaps = 9/85 (10%)
Query: 267 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 326
P S F P P P P + +P P P P P
Sbjct: 2889 PAVSRSTES---------FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
Query: 327 SSPAPQKSSPSSISDNEVHMPPPPL 351
P + P+ + +P P L
Sbjct: 2940 QPPLAPTTDPAGAGEPSGAVPQPWL 2964
Score = 31.1 bits (70), Expect = 4.8
Identities = 17/85 (20%), Positives = 23/85 (27%), Gaps = 9/85 (10%)
Query: 762 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 821
P S F P P P P + +P P P P P
Sbjct: 2889 PAVSRSTES---------FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
Query: 822 SSPAPQKSSPSSISDNEVHMPPPPL 846
P + P+ + +P P L
Sbjct: 2940 QPPLAPTTDPAGAGEPSGAVPQPWL 2964
Score = 30.7 bits (69), Expect = 5.6
Identities = 28/94 (29%), Positives = 35/94 (37%), Gaps = 14/94 (14%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEV----VPAPAASV- 320
PPPPPP + P G E + P GA + V VP PA S
Sbjct: 2932 PPPPPPRPQPPLA----PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
Query: 321 ----SNSPNPSSPAPQKSS-PSSISDNEVHMPPP 349
S P + SS SS++ +E PPP
Sbjct: 2988 APASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
Score = 30.7 bits (69), Expect = 5.6
Identities = 28/94 (29%), Positives = 35/94 (37%), Gaps = 14/94 (14%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEV----VPAPAASV- 815
PPPPPP + P G E + P GA + V VP PA S
Sbjct: 2932 PPPPPPRPQPPLA----PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE 2987
Query: 816 ----SNSPNPSSPAPQKSS-PSSISDNEVHMPPP 844
S P + SS SS++ +E PPP
Sbjct: 2988 APASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 39.1 bits (91), Expect = 0.003
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 301 PAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPP 349
A EQ P PAPA +V ++P P +PA +P+S +N V P P
Sbjct: 44 DLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92
Score = 39.1 bits (91), Expect = 0.003
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 796 PAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPP 844
A EQ P PAPA +V ++P P +PA +P+S +N V P P
Sbjct: 44 DLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMP 92
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 40.8 bits (95), Expect = 0.004
Identities = 21/108 (19%), Positives = 37/108 (34%), Gaps = 14/108 (12%)
Query: 284 DFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNE 343
D P+ A SE P+ + + P P+ + ++P++
Sbjct: 202 DANAAPAEPAEEE-APAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAA----- 255
Query: 344 VHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVT-----LGGRPRSVEI 386
P G Y T + ++G DL T +GGR R ++
Sbjct: 256 ---AAPVSSGDSGPYVTPLVRKLAKDKGVDLSTVKGTGVGGRIRKQDV 300
Score = 40.8 bits (95), Expect = 0.004
Identities = 21/108 (19%), Positives = 37/108 (34%), Gaps = 14/108 (12%)
Query: 779 DFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNE 838
D P+ A SE P+ + + P P+ + ++P++
Sbjct: 202 DANAAPAEPAEEE-APAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAA----- 255
Query: 839 VHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVT-----LGGRPRSVEI 881
P G Y T + ++G DL T +GGR R ++
Sbjct: 256 ---AAPVSSGDSGPYVTPLVRKLAKDKGVDLSTVKGTGVGGRIRKQDV 300
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 39.6 bits (93), Expect = 0.009
Identities = 25/125 (20%), Positives = 34/125 (27%), Gaps = 16/125 (12%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEIN------- 294
P S P P + GG G+P
Sbjct: 621 PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAA 680
Query: 295 ---------PRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVH 345
P P+G A A P P + + P A S+PS +D+ V
Sbjct: 681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Query: 346 MPPPP 350
+PP P
Sbjct: 741 LPPEP 745
Score = 39.6 bits (93), Expect = 0.009
Identities = 25/125 (20%), Positives = 34/125 (27%), Gaps = 16/125 (12%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEIN------- 789
P S P P + GG G+P
Sbjct: 621 PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAA 680
Query: 790 ---------PRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVH 840
P P+G A A P P + + P A S+PS +D+ V
Sbjct: 681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Query: 841 MPPPP 845
+PP P
Sbjct: 741 LPPEP 745
Score = 34.2 bits (79), Expect = 0.37
Identities = 18/116 (15%), Positives = 31/116 (26%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 325
P P + +E G PE +P+H + P + AA + P
Sbjct: 625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
Query: 326 PSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGRP 381
P+ AP + ++ + P G + P
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Score = 34.2 bits (79), Expect = 0.37
Identities = 18/116 (15%), Positives = 31/116 (26%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 820
P P + +E G PE +P+H + P + AA + P
Sbjct: 625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
Query: 821 PSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGRP 876
P+ AP + ++ + P G + P
Sbjct: 685 PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Score = 30.3 bits (69), Expect = 5.7
Identities = 24/104 (23%), Positives = 31/104 (29%), Gaps = 13/104 (12%)
Query: 251 GAPQLGGGFGSGEYGPPPPPPHVNSIGSEY---GVPDFGGPGYPEINPRHPSGPAGAYSE 307
A G PP S S VP P P P+ P
Sbjct: 705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP----- 759
Query: 308 QKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPL 351
P PAPAA+ + +P PS P+ + D+ M
Sbjct: 760 --PPPAPAPAAAPAAAPPPSPPSEE---EEMAEDDAPSMDDEDR 798
Score = 30.3 bits (69), Expect = 5.7
Identities = 24/104 (23%), Positives = 31/104 (29%), Gaps = 13/104 (12%)
Query: 746 GAPQLGGGFGSGEYGPPPPPPHVNSIGSEY---GVPDFGGPGYPEINPRHPSGPAGAYSE 802
A G PP S S VP P P P+ P
Sbjct: 705 PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP----- 759
Query: 803 QKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPL 846
P PAPAA+ + +P PS P+ + D+ M
Sbjct: 760 --PPPAPAPAAAPAAAPPPSPPSEE---EEMAEDDAPSMDDEDR 798
Score = 30.3 bits (69), Expect = 6.9
Identities = 18/110 (16%), Positives = 28/110 (25%), Gaps = 7/110 (6%)
Query: 247 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPR----HPSGPA 302
G GAP + P P + + P+ P
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAP---AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
Query: 303 GAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
+ P PAA+ S P P+ A + + + P P
Sbjct: 445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAP 494
Score = 30.3 bits (69), Expect = 6.9
Identities = 18/110 (16%), Positives = 28/110 (25%), Gaps = 7/110 (6%)
Query: 742 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPR----HPSGPA 797
G GAP + P P + + P+ P
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAP---AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
Query: 798 GAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
+ P PAA+ S P P+ A + + + P P
Sbjct: 445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAP 494
Score = 30.0 bits (68), Expect = 9.2
Identities = 19/97 (19%), Positives = 26/97 (26%), Gaps = 1/97 (1%)
Query: 257 GGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAP 316
G G PP P + E P P+G A A +E P
Sbjct: 589 GPAPGAAGGEGPPAPASSGPPEEAARPA-APAAPAAPAAPAPAGAAAAPAEASAAPAPGV 647
Query: 317 AASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
AA + + + P + P P P
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
Score = 30.0 bits (68), Expect = 9.2
Identities = 19/97 (19%), Positives = 26/97 (26%), Gaps = 1/97 (1%)
Query: 752 GGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAP 811
G G PP P + E P P+G A A +E P
Sbjct: 589 GPAPGAAGGEGPPAPASSGPPEEAARPA-APAAPAAPAAPAPAGAAAAPAEASAAPAPGV 647
Query: 812 AASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
AA + + + P + P P P
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 38.7 bits (90), Expect = 0.017
Identities = 26/122 (21%), Positives = 33/122 (27%), Gaps = 17/122 (13%)
Query: 252 APQLGGGFGSGEYGP---PPP-------PPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
AP+L F YGP PP P G + P +
Sbjct: 289 APELLWSFLGQFYGPQSTIPPRIVVPWLPDTEGREGDD-LAPTAVCTDAGLLPDTPLLPD 347
Query: 302 A-GAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMP-----PPPLPGHG 355
A S+ V A S + AP + S +D V P G
Sbjct: 348 APEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAGAAPKG 407
Query: 356 AT 357
AT
Sbjct: 408 AT 409
Score = 38.7 bits (90), Expect = 0.017
Identities = 26/122 (21%), Positives = 33/122 (27%), Gaps = 17/122 (13%)
Query: 747 APQLGGGFGSGEYGP---PPP-------PPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
AP+L F YGP PP P G + P +
Sbjct: 289 APELLWSFLGQFYGPQSTIPPRIVVPWLPDTEGREGDD-LAPTAVCTDAGLLPDTPLLPD 347
Query: 797 A-GAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMP-----PPPLPGHG 850
A S+ V A S + AP + S +D V P G
Sbjct: 348 APEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAGAAPKG 407
Query: 851 AT 852
AT
Sbjct: 408 AT 409
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 38.6 bits (90), Expect = 0.019
Identities = 22/94 (23%), Positives = 26/94 (27%), Gaps = 13/94 (13%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP-EVVPAPAASVSNS 323
P P N G GGPG P P PA + P + +
Sbjct: 640 KTALPRPDYNRGGEA------GGPGVPG--PVPVGMPAHTARPSRVARGDPVRPTAHHAA 691
Query: 324 PNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGAT 357
APQ P +PPPP A
Sbjct: 692 L----RAPQAPRPGGPPGGGGGLPPPPDLPAAAG 721
Score = 38.6 bits (90), Expect = 0.019
Identities = 22/94 (23%), Positives = 26/94 (27%), Gaps = 13/94 (13%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP-EVVPAPAASVSNS 818
P P N G GGPG P P PA + P + +
Sbjct: 640 KTALPRPDYNRGGEA------GGPGVPG--PVPVGMPAHTARPSRVARGDPVRPTAHHAA 691
Query: 819 PNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGAT 852
APQ P +PPPP A
Sbjct: 692 L----RAPQAPRPGGPPGGGGGLPPPPDLPAAAG 721
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 38.6 bits (90), Expect = 0.021
Identities = 21/95 (22%), Positives = 24/95 (25%), Gaps = 14/95 (14%)
Query: 260 GSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 319
PPPP P P P P S P PAA
Sbjct: 109 PGPSSPDPPPPT-----------PPPASPP-PSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
Query: 320 VSNSPNPSSPA--PQKSSPSSISDNEVHMPPPPLP 352
S + S A Q + P S + P P
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPA 191
Score = 38.6 bits (90), Expect = 0.021
Identities = 21/95 (22%), Positives = 24/95 (25%), Gaps = 14/95 (14%)
Query: 755 GSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 814
PPPP P P P P S P PAA
Sbjct: 109 PGPSSPDPPPPT-----------PPPASPP-PSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
Query: 815 VSNSPNPSSPA--PQKSSPSSISDNEVHMPPPPLP 847
S + S A Q + P S + P P
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPA 191
Score = 32.5 bits (74), Expect = 1.4
Identities = 19/112 (16%), Positives = 25/112 (22%), Gaps = 1/112 (0%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
P P PPP + GP P + P
Sbjct: 100 PAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159
Query: 302 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
A S+ A S + +P P S + P PP
Sbjct: 160 AAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPS-TPPAAASPRPPRRS 210
Score = 32.5 bits (74), Expect = 1.4
Identities = 19/112 (16%), Positives = 25/112 (22%), Gaps = 1/112 (0%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
P P PPP + GP P + P
Sbjct: 100 PAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159
Query: 797 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
A S+ A S + +P P S + P PP
Sbjct: 160 AAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPS-TPPAAASPRPPRRS 210
Score = 32.1 bits (73), Expect = 2.0
Identities = 26/163 (15%), Positives = 43/163 (26%), Gaps = 23/163 (14%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
P P D G PP + + R + P
Sbjct: 126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP----------AAVASDAASSRQAALP 175
Query: 302 AGAYSEQKPEVVPAPAASVSNSPNP--SSPAPQKSSPSSISDNEVHMPPPPLPG------ 353
+ E PA ++P S P++SSP S S P P PG
Sbjct: 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADD 231
Query: 354 -HGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGAR 395
++ + + + G + RP + + +
Sbjct: 232 AGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
Score = 32.1 bits (73), Expect = 2.0
Identities = 26/163 (15%), Positives = 43/163 (26%), Gaps = 23/163 (14%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
P P D G PP + + R + P
Sbjct: 126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP----------AAVASDAASSRQAALP 175
Query: 797 AGAYSEQKPEVVPAPAASVSNSPNP--SSPAPQKSSPSSISDNEVHMPPPPLPG------ 848
+ E PA ++P S P++SSP S S P P PG
Sbjct: 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADD 231
Query: 849 -HGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGAR 890
++ + + + G + RP + + +
Sbjct: 232 AGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
Score = 31.7 bits (72), Expect = 2.6
Identities = 27/158 (17%), Positives = 43/158 (27%), Gaps = 12/158 (7%)
Query: 241 CPEPRCGGDYGAPQLGGGFGSGEYGP-PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPS 299
CP PR ++ G P P +S P PG +
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314
Query: 300 GPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQ 359
+ + + S S+S + SS + + ++ S P P
Sbjct: 315 SSSSS----------SSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS 364
Query: 360 TVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGARRR 397
+++ S GRP ARRR
Sbjct: 365 P-RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR 401
Score = 31.7 bits (72), Expect = 2.6
Identities = 27/158 (17%), Positives = 43/158 (27%), Gaps = 12/158 (7%)
Query: 736 CPEPRCGGDYGAPQLGGGFGSGEYGP-PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPS 794
CP PR ++ G P P +S P PG +
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314
Query: 795 GPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQ 854
+ + + S S+S + SS + + ++ S P P
Sbjct: 315 SSSSS----------SSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS 364
Query: 855 TVKRKGTVSEEGGDLVTLGGRPRSVEIFPDELRGARRR 892
+++ S GRP ARRR
Sbjct: 365 P-RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR 401
Score = 29.8 bits (67), Expect = 9.9
Identities = 15/88 (17%), Positives = 19/88 (21%), Gaps = 5/88 (5%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 324
G P S + E +P P + P P AS SP
Sbjct: 76 GTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-----PPPPTPPPASPPPSP 130
Query: 325 NPSSPAPQKSSPSSISDNEVHMPPPPLP 352
P + S P
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGAS 158
Score = 29.8 bits (67), Expect = 9.9
Identities = 15/88 (17%), Positives = 19/88 (21%), Gaps = 5/88 (5%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 819
G P S + E +P P + P P AS SP
Sbjct: 76 GTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-----PPPPTPPPASPPPSP 130
Query: 820 NPSSPAPQKSSPSSISDNEVHMPPPPLP 847
P + S P
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGAS 158
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 36.6 bits (85), Expect = 0.052
Identities = 22/100 (22%), Positives = 27/100 (27%), Gaps = 16/100 (16%)
Query: 251 GAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP 310
G P S S GVP F P +P S P
Sbjct: 165 TTNSDNSFPGEDADPASASPSDPPS--SSPGVPSFPSPPEDPSSPSDSSLP--------- 213
Query: 311 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 350
PAP++ S P ++PS PPP
Sbjct: 214 ---PAPSSF--QSDTPPPSPESPTNPSPPPGPAAPPPPPV 248
Score = 36.6 bits (85), Expect = 0.052
Identities = 22/100 (22%), Positives = 27/100 (27%), Gaps = 16/100 (16%)
Query: 746 GAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP 805
G P S S GVP F P +P S P
Sbjct: 165 TTNSDNSFPGEDADPASASPSDPPS--SSPGVPSFPSPPEDPSSPSDSSLP--------- 213
Query: 806 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 845
PAP++ S P ++PS PPP
Sbjct: 214 ---PAPSSF--QSDTPPPSPESPTNPSPPPGPAAPPPPPV 248
Score = 33.1 bits (76), Expect = 0.75
Identities = 21/97 (21%), Positives = 23/97 (23%), Gaps = 4/97 (4%)
Query: 262 GEYGPPPPPPH--VNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 319
GE P PP D PG + S P P
Sbjct: 145 GEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDP 204
Query: 320 VSNSPNPSSPAPQK--SSPSSISDNEVHMPPPPLPGH 354
S S + PAP S S P PP
Sbjct: 205 SSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241
Score = 33.1 bits (76), Expect = 0.75
Identities = 21/97 (21%), Positives = 23/97 (23%), Gaps = 4/97 (4%)
Query: 757 GEYGPPPPPPH--VNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAAS 814
GE P PP D PG + S P P
Sbjct: 145 GEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDP 204
Query: 815 VSNSPNPSSPAPQK--SSPSSISDNEVHMPPPPLPGH 849
S S + PAP S S P PP
Sbjct: 205 SSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241
Score = 30.8 bits (70), Expect = 3.6
Identities = 12/84 (14%), Positives = 20/84 (23%), Gaps = 5/84 (5%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYP-EINPRHPSGPAGAYSEQ----KPEVVPAPAASV 320
P P + D P P P + + P P P
Sbjct: 191 SPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQ 250
Query: 321 SNSPNPSSPAPQKSSPSSISDNEV 344
+ + P P +S + +
Sbjct: 251 VPPLSTAKPTPPSASATPAPIGGI 274
Score = 30.8 bits (70), Expect = 3.6
Identities = 12/84 (14%), Positives = 20/84 (23%), Gaps = 5/84 (5%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYP-EINPRHPSGPAGAYSEQ----KPEVVPAPAASV 815
P P + D P P P + + P P P
Sbjct: 191 SPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQ 250
Query: 816 SNSPNPSSPAPQKSSPSSISDNEV 839
+ + P P +S + +
Sbjct: 251 VPPLSTAKPTPPSASATPAPIGGI 274
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 35.4 bits (81), Expect = 0.14
Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 14/98 (14%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 301
P + G + G P P D G P+ P P+
Sbjct: 281 DRPEAKPEPGPVEDGAPGRETGGEGEGPEP---------AGRDGAAGGEPKPGPPRPAPD 331
Query: 302 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSI 339
A +PE P+ A P P++PA ++ P +
Sbjct: 332 A-----DRPEGWPSLEAITFPPPTPATPAVPRARPVIV 364
Score = 35.4 bits (81), Expect = 0.14
Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 14/98 (14%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGP 796
P + G + G P P D G P+ P P+
Sbjct: 281 DRPEAKPEPGPVEDGAPGRETGGEGEGPEP---------AGRDGAAGGEPKPGPPRPAPD 331
Query: 797 AGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSI 834
A +PE P+ A P P++PA ++ P +
Sbjct: 332 A-----DRPEGWPSLEAITFPPPTPATPAVPRARPVIV 364
Score = 32.3 bits (73), Expect = 1.4
Identities = 22/90 (24%), Positives = 23/90 (25%), Gaps = 9/90 (10%)
Query: 263 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSN 322
G P PP P G E GP A + P
Sbjct: 271 SGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGE-GEGPEPAGRDGAAGGEP-------- 321
Query: 323 SPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
P P PAP P E PPP P
Sbjct: 322 KPGPPRPAPDADRPEGWPSLEAITFPPPTP 351
Score = 32.3 bits (73), Expect = 1.4
Identities = 22/90 (24%), Positives = 23/90 (25%), Gaps = 9/90 (10%)
Query: 758 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSN 817
G P PP P G E GP A + P
Sbjct: 271 SGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGE-GEGPEPAGRDGAAGGEP-------- 321
Query: 818 SPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
P P PAP P E PPP P
Sbjct: 322 KPGPPRPAPDADRPEGWPSLEAITFPPPTP 351
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 35.6 bits (82), Expect = 0.15
Identities = 22/92 (23%), Positives = 26/92 (28%), Gaps = 9/92 (9%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPS---GPAGAYSEQKPEVVPA-PAASV 320
G P PPP SE +P G Y + P +Q P PA PA
Sbjct: 93 GEPAPPPPHARRTSEPELPRPGRRPYEGY-GGPRADDRPPGLPRQDQLPTARPAYPAYQQ 151
Query: 321 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
P A + PP P
Sbjct: 152 RPEPGAWPRAADDYGW----QQQRLGFPPRAP 179
Score = 35.6 bits (82), Expect = 0.15
Identities = 22/92 (23%), Positives = 26/92 (28%), Gaps = 9/92 (9%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPS---GPAGAYSEQKPEVVPA-PAASV 815
G P PPP SE +P G Y + P +Q P PA PA
Sbjct: 93 GEPAPPPPHARRTSEPELPRPGRRPYEGY-GGPRADDRPPGLPRQDQLPTARPAYPAYQQ 151
Query: 816 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
P A + PP P
Sbjct: 152 RPEPGAWPRAADDYGW----QQQRLGFPPRAP 179
Score = 32.5 bits (74), Expect = 1.2
Identities = 26/124 (20%), Positives = 32/124 (25%), Gaps = 18/124 (14%)
Query: 242 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSG- 300
PR DYG Q GF P P + E + G PE + R
Sbjct: 157 AWPRAADDYGWQQQRLGFPPRA---PYASPASYAPEQERDREPYDA-GRPEYDQRRRDYD 212
Query: 301 -------PAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
+PE P P A + P P + P PL
Sbjct: 213 HPRPDWDRPRRDRTDRPE--PPPGAGHVHRGGPGPPER----DDAPVVPIRPSAPGPLAA 266
Query: 354 HGAT 357
A
Sbjct: 267 QPAP 270
Score = 32.5 bits (74), Expect = 1.2
Identities = 26/124 (20%), Positives = 32/124 (25%), Gaps = 18/124 (14%)
Query: 737 PEPRCGGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSG- 795
PR DYG Q GF P P + E + G PE + R
Sbjct: 157 AWPRAADDYGWQQQRLGFPPRA---PYASPASYAPEQERDREPYDA-GRPEYDQRRRDYD 212
Query: 796 -------PAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
+PE P P A + P P + P PL
Sbjct: 213 HPRPDWDRPRRDRTDRPE--PPPGAGHVHRGGPGPPER----DDAPVVPIRPSAPGPLAA 266
Query: 849 HGAT 852
A
Sbjct: 267 QPAP 270
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 35.1 bits (80), Expect = 0.21
Identities = 15/82 (18%), Positives = 27/82 (32%), Gaps = 4/82 (4%)
Query: 275 SIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE----VVPAPAASVSNSPNPSSPA 330
S +P G P N + P + + + A S+ + +P
Sbjct: 408 QTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPP 467
Query: 331 PQKSSPSSISDNEVHMPPPPLP 352
+ S + +DNE P +P
Sbjct: 468 ENQVSKNKAADNETDAPLSEVP 489
Score = 35.1 bits (80), Expect = 0.21
Identities = 15/82 (18%), Positives = 27/82 (32%), Gaps = 4/82 (4%)
Query: 770 SIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE----VVPAPAASVSNSPNPSSPA 825
S +P G P N + P + + + A S+ + +P
Sbjct: 408 QTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPP 467
Query: 826 PQKSSPSSISDNEVHMPPPPLP 847
+ S + +DNE P +P
Sbjct: 468 ENQVSKNKAADNETDAPLSEVP 489
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 35.2 bits (81), Expect = 0.22
Identities = 14/105 (13%), Positives = 23/105 (21%), Gaps = 11/105 (10%)
Query: 252 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 311
P + GG G P V P +
Sbjct: 359 EPAVTGGGAPGGGVPARVAGAV-------PAPGARAAAAVGASAVPAVTAVTG----AAG 407
Query: 312 VVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 356
AP A+ + + + P +P + +D P
Sbjct: 408 AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAK 452
Score = 35.2 bits (81), Expect = 0.22
Identities = 14/105 (13%), Positives = 23/105 (21%), Gaps = 11/105 (10%)
Query: 747 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 806
P + GG G P V P +
Sbjct: 359 EPAVTGGGAPGGGVPARVAGAV-------PAPGARAAAAVGASAVPAVTAVTG----AAG 407
Query: 807 VVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 851
AP A+ + + + P +P + +D P
Sbjct: 408 AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAK 452
Score = 32.1 bits (73), Expect = 1.9
Identities = 11/64 (17%), Positives = 19/64 (29%)
Query: 287 GPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHM 346
E P P+ PA A AP + +N+ + + +D+
Sbjct: 419 AATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSAS 478
Query: 347 PPPP 350
P
Sbjct: 479 APAS 482
Score = 32.1 bits (73), Expect = 1.9
Identities = 11/64 (17%), Positives = 19/64 (29%)
Query: 782 GPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHM 841
E P P+ PA A AP + +N+ + + +D+
Sbjct: 419 AATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSAS 478
Query: 842 PPPP 845
P
Sbjct: 479 APAS 482
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 34.5 bits (79), Expect = 0.28
Identities = 26/91 (28%), Positives = 35/91 (38%), Gaps = 6/91 (6%)
Query: 267 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 326
P P V S GS VP+ P E P P Y + KP P+P A SP+
Sbjct: 407 PAEPDVVPSPGSASNVPE-VEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSV 465
Query: 327 SSPA-----PQKSSPSSISDNEVHMPPPPLP 352
SS + P + ++ +D P P
Sbjct: 466 SSTSSVPAVPDTAPATAATDAAAPPPANMRP 496
Score = 34.5 bits (79), Expect = 0.28
Identities = 26/91 (28%), Positives = 35/91 (38%), Gaps = 6/91 (6%)
Query: 762 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 821
P P V S GS VP+ P E P P Y + KP P+P A SP+
Sbjct: 407 PAEPDVVPSPGSASNVPE-VEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSV 465
Query: 822 SSPA-----PQKSSPSSISDNEVHMPPPPLP 847
SS + P + ++ +D P P
Sbjct: 466 SSTSSVPAVPDTAPATAATDAAAPPPANMRP 496
Score = 32.2 bits (73), Expect = 1.8
Identities = 27/101 (26%), Positives = 37/101 (36%), Gaps = 15/101 (14%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGG----------PGYPEINPRHPSGPAGAYSEQKPEVVPA 315
P P S+ S VP P N R P P Y + KP P+
Sbjct: 455 PTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMR-PLSPYAVYDDLKPPTSPS 513
Query: 316 PAASVSNSPNPSSPAPQK----SSPSSISDNEVHMPPPPLP 352
PAA V S+ K + P++++D + H P P P
Sbjct: 514 PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRP 554
Score = 32.2 bits (73), Expect = 1.8
Identities = 27/101 (26%), Positives = 37/101 (36%), Gaps = 15/101 (14%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGG----------PGYPEINPRHPSGPAGAYSEQKPEVVPA 810
P P S+ S VP P N R P P Y + KP P+
Sbjct: 455 PTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMR-PLSPYAVYDDLKPPTSPS 513
Query: 811 PAASVSNSPNPSSPAPQK----SSPSSISDNEVHMPPPPLP 847
PAA V S+ K + P++++D + H P P P
Sbjct: 514 PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRP 554
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 34.9 bits (80), Expect = 0.28
Identities = 19/104 (18%), Positives = 25/104 (24%), Gaps = 3/104 (2%)
Query: 252 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 311
A Q G P P P + + + PA A + +
Sbjct: 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDD 495
Query: 312 VVP---APAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
P P S +P AP SI D P
Sbjct: 496 PPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFE 539
Score = 34.9 bits (80), Expect = 0.28
Identities = 19/104 (18%), Positives = 25/104 (24%), Gaps = 3/104 (2%)
Query: 747 APQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPE 806
A Q G P P P + + + PA A + +
Sbjct: 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDD 495
Query: 807 VVP---APAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
P P S +P AP SI D P
Sbjct: 496 PPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFE 539
Score = 34.1 bits (78), Expect = 0.42
Identities = 16/89 (17%), Positives = 24/89 (26%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 324
P P ++ + G G P P + PA A P AA+ +
Sbjct: 422 APARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPA 481
Query: 325 NPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
+ A + E P P
Sbjct: 482 RAAPAAAPAPADDDPPPWEELPPEFASPA 510
Score = 34.1 bits (78), Expect = 0.42
Identities = 16/89 (17%), Positives = 24/89 (26%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 819
P P ++ + G G P P + PA A P AA+ +
Sbjct: 422 APARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPA 481
Query: 820 NPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
+ A + E P P
Sbjct: 482 RAAPAAAPAPADDDPPPWEELPPEFASPA 510
Score = 31.4 bits (71), Expect = 2.7
Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 2/87 (2%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 325
PP P + P A S + P PAPA + + +P
Sbjct: 401 APPAAPAAAPAAAAAARAVAAAPARRS-PAPEALAAARQASARGPGGAPAPAPAPAAAPA 459
Query: 326 PSSPAPQKSSPSSISDNEVHMPPPPLP 352
++ P + P ++ P P
Sbjct: 460 AAAR-PAAAGPRPVAAAAAAAPARAAP 485
Score = 31.4 bits (71), Expect = 2.7
Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 2/87 (2%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 820
PP P + P A S + P PAPA + + +P
Sbjct: 401 APPAAPAAAPAAAAAARAVAAAPARRS-PAPEALAAARQASARGPGGAPAPAPAPAAAPA 459
Query: 821 PSSPAPQKSSPSSISDNEVHMPPPPLP 847
++ P + P ++ P P
Sbjct: 460 AAAR-PAAAGPRPVAAAAAAAPARAAP 485
>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein. This
family represents the phosphoprotein of Paramyxoviridae,
a putative RNA polymerase alpha subunit that may
function in template binding.
Length = 266
Score = 33.3 bits (76), Expect = 0.46
Identities = 12/57 (21%), Positives = 16/57 (28%)
Query: 311 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTV 367
E P S +P + S + + D E P P T K T
Sbjct: 44 EKFELPTISKPTTPKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSDDTP 100
Score = 33.3 bits (76), Expect = 0.46
Identities = 12/57 (21%), Positives = 16/57 (28%)
Query: 806 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTV 862
E P S +P + S + + D E P P T K T
Sbjct: 44 EKFELPTISKPTTPKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSDDTP 100
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 33.3 bits (76), Expect = 0.50
Identities = 22/69 (31%), Positives = 26/69 (37%), Gaps = 6/69 (8%)
Query: 291 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN------PSSPAPQKSSPSSISDNEV 344
P P PS A A + P P PAA + P+ P PA + P D
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100
Query: 345 HMPPPPLPG 353
PP P PG
Sbjct: 101 PPPPAPEPG 109
Score = 33.3 bits (76), Expect = 0.50
Identities = 22/69 (31%), Positives = 26/69 (37%), Gaps = 6/69 (8%)
Query: 786 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN------PSSPAPQKSSPSSISDNEV 839
P P PS A A + P P PAA + P+ P PA + P D
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100
Query: 840 HMPPPPLPG 848
PP P PG
Sbjct: 101 PPPPAPEPG 109
Score = 32.2 bits (73), Expect = 1.3
Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 2/58 (3%)
Query: 301 PAGAYSEQKPEVVPAPAASVSNSPNPS--SPAPQKSSPSSISDNEVHMPPPPLPGHGA 356
PA A ++ P P A+ + +P P P D PPPP +
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAP 91
Score = 32.2 bits (73), Expect = 1.3
Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 2/58 (3%)
Query: 796 PAGAYSEQKPEVVPAPAASVSNSPNPS--SPAPQKSSPSSISDNEVHMPPPPLPGHGA 851
PA A ++ P P A+ + +P P P D PPPP +
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAP 91
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 33.9 bits (77), Expect = 0.51
Identities = 29/120 (24%), Positives = 33/120 (27%), Gaps = 29/120 (24%)
Query: 247 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEI-------NPRHPS 299
GG Y SG Y PP PP + PG P+ RH
Sbjct: 154 GGTYIPAGGTYILASGTYIPPNPPREAPA------------PGLPKTFTSSHGHRHRHAP 201
Query: 300 GPAGAYSEQKPEVV-----PAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGH 354
P + Q P PA P PA Q + N PP GH
Sbjct: 202 KPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQ-----NPAQQPPQTEQGH 256
Score = 33.9 bits (77), Expect = 0.51
Identities = 29/120 (24%), Positives = 33/120 (27%), Gaps = 29/120 (24%)
Query: 742 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEI-------NPRHPS 794
GG Y SG Y PP PP + PG P+ RH
Sbjct: 154 GGTYIPAGGTYILASGTYIPPNPPREAPA------------PGLPKTFTSSHGHRHRHAP 201
Query: 795 GPAGAYSEQKPEVV-----PAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGH 849
P + Q P PA P PA Q + N PP GH
Sbjct: 202 KPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQ-----NPAQQPPQTEQGH 256
>gnl|CDD|105864 PRK12699, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 246
Score = 33.0 bits (75), Expect = 0.58
Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 16/75 (21%)
Query: 328 SPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIF 387
+PAP+ P S ++ + PP G+G YQ GR R +F
Sbjct: 37 TPAPKVQQPMSARPPDLPLIPPS-QGNGGIYQ-------------AAFDYQGRGRYHPLF 82
Query: 388 PDELRGARRRRDTIL 402
D R AR+ DTI+
Sbjct: 83 ED--RRARQIGDTII 95
Score = 33.0 bits (75), Expect = 0.58
Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 16/75 (21%)
Query: 823 SPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEGGDLVTLGGRPRSVEIF 882
+PAP+ P S ++ + PP G+G YQ GR R +F
Sbjct: 37 TPAPKVQQPMSARPPDLPLIPPS-QGNGGIYQ-------------AAFDYQGRGRYHPLF 82
Query: 883 PDELRGARRRRDTIL 897
D R AR+ DTI+
Sbjct: 83 ED--RRARQIGDTII 95
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 33.1 bits (75), Expect = 0.59
Identities = 22/151 (14%), Positives = 44/151 (29%), Gaps = 20/151 (13%)
Query: 255 LGGGFGSGEYG----PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP 310
L G SG Y P ++ E P P +S
Sbjct: 37 LFIGGSSGLYLLTESKQANEPETLQPKNQ--------TENGETAADLPPKPEERWSY--I 86
Query: 311 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEE 370
E + A +++ PS+ + + ++ + + A K +
Sbjct: 87 EELEAREVLINDPEEPSNGGGVE-ESAQLTAEQRQLLEQMQADMRA-----AEKVLATAP 140
Query: 371 GGDLVTLGGRPRSVEIFPDELRGARRRRDTI 401
V + R ++ E P + R A ++ +
Sbjct: 141 SEQTVAVEARKQTAEKKPQKARTAEAQKTPV 171
Score = 33.1 bits (75), Expect = 0.59
Identities = 22/151 (14%), Positives = 44/151 (29%), Gaps = 20/151 (13%)
Query: 750 LGGGFGSGEYG----PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKP 805
L G SG Y P ++ E P P +S
Sbjct: 37 LFIGGSSGLYLLTESKQANEPETLQPKNQ--------TENGETAADLPPKPEERWSY--I 86
Query: 806 EVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEE 865
E + A +++ PS+ + + ++ + + A K +
Sbjct: 87 EELEAREVLINDPEEPSNGGGVE-ESAQLTAEQRQLLEQMQADMRA-----AEKVLATAP 140
Query: 866 GGDLVTLGGRPRSVEIFPDELRGARRRRDTI 896
V + R ++ E P + R A ++ +
Sbjct: 141 SEQTVAVEARKQTAEKKPQKARTAEAQKTPV 171
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 33.2 bits (76), Expect = 0.80
Identities = 24/99 (24%), Positives = 28/99 (28%), Gaps = 9/99 (9%)
Query: 265 GPPPPPPHVNSIGS---EYGVPDFGGPGYPEINPRHPSGPAGAYSEQ----KPEVVPAPA 317
P PP V G + P G P P P+ P G Q P PA
Sbjct: 171 QLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQP 230
Query: 318 ASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 356
P P Q P ++ PPP P
Sbjct: 231 PLPPQLPQQPPPLQQPQFPGLS--QQMPPPPPQPPQQQQ 267
Score = 33.2 bits (76), Expect = 0.80
Identities = 24/99 (24%), Positives = 28/99 (28%), Gaps = 9/99 (9%)
Query: 760 GPPPPPPHVNSIGS---EYGVPDFGGPGYPEINPRHPSGPAGAYSEQ----KPEVVPAPA 812
P PP V G + P G P P P+ P G Q P PA
Sbjct: 171 QLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQP 230
Query: 813 ASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGA 851
P P Q P ++ PPP P
Sbjct: 231 PLPPQLPQQPPPLQQPQFPGLS--QQMPPPPPQPPQQQQ 267
Score = 31.7 bits (72), Expect = 2.2
Identities = 23/111 (20%), Positives = 27/111 (24%), Gaps = 22/111 (19%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 324
P P + + G P+ PGYP+ HP P PA P
Sbjct: 179 VLPQGMPPRQAAFPQQGPPEQP-PGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237
Query: 325 NPSSPAPQKSSP----------------SSISDNEVHMPPP-----PLPGH 354
P Q P PPP P PG
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGL 288
Score = 31.7 bits (72), Expect = 2.2
Identities = 23/111 (20%), Positives = 27/111 (24%), Gaps = 22/111 (19%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 819
P P + + G P+ PGYP+ HP P PA P
Sbjct: 179 VLPQGMPPRQAAFPQQGPPEQP-PGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237
Query: 820 NPSSPAPQKSSP----------------SSISDNEVHMPPP-----PLPGH 849
P Q P PPP P PG
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGL 288
>gnl|CDD|222041 pfam13312, DUF4081, Domain of unknown function (DUF4081). This
domain is often found N-terminal to the GNAT
acetyltransferase domain, pfam00583 and FR47, pfam08445.
Length = 107
Score = 31.0 bits (71), Expect = 0.88
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 371 GGDLVTLGGRPRSVEIFPDELRGARRRRDTIL 402
G +LV + P ++ F D R RR +I+
Sbjct: 20 GANLVPVAADPAALRAFADRARRRGRRCSSIV 51
Score = 31.0 bits (71), Expect = 0.88
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 866 GGDLVTLGGRPRSVEIFPDELRGARRRRDTIL 897
G +LV + P ++ F D R RR +I+
Sbjct: 20 GANLVPVAADPAALRAFADRARRRGRRCSSIV 51
>gnl|CDD|99944 cd05512, Bromo_brd1_like, Bromodomain; brd1_like subfamily. BRD1 is
a mammalian gene which encodes for a nuclear protein
assumed to be a transcriptional regulator. BRD1 has been
implicated with brain development and susceptibility to
schizophrenia and bipolar affective disorder.
Bromodomains are 110 amino acid long domains that are
found in many chromatin associated proteins.
Bromodomains can interact specifically with acetylated
lysine.
Length = 98
Score = 30.4 bits (69), Expect = 1.1
Identities = 13/40 (32%), Positives = 26/40 (65%), Gaps = 5/40 (12%)
Query: 157 IKDDENKFDMLVRNCVAHDGK-----RAPIQLVDQHGCVV 191
++D E F++++ NC+A++ K RA ++L DQ G ++
Sbjct: 59 LEDFEADFNLIINNCLAYNAKDTIFYRAAVRLRDQGGAIL 98
Score = 30.4 bits (69), Expect = 1.1
Identities = 13/40 (32%), Positives = 26/40 (65%), Gaps = 5/40 (12%)
Query: 652 IKDDENKFDMLVRNCVAHDGK-----RAPIQLVDQHGCVV 686
++D E F++++ NC+A++ K RA ++L DQ G ++
Sbjct: 59 LEDFEADFNLIINNCLAYNAKDTIFYRAAVRLRDQGGAIL 98
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 32.4 bits (74), Expect = 1.3
Identities = 18/105 (17%), Positives = 26/105 (24%), Gaps = 3/105 (2%)
Query: 267 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 326
P + P P P A + P PA A + + P
Sbjct: 393 PVAQAAAAPAPAA--APAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV-ALAP 449
Query: 327 SSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 371
+ PA +I P + T +EEG
Sbjct: 450 APPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
Score = 32.4 bits (74), Expect = 1.3
Identities = 18/105 (17%), Positives = 26/105 (24%), Gaps = 3/105 (2%)
Query: 762 PPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNP 821
P + P P P A + P PA A + + P
Sbjct: 393 PVAQAAAAPAPAA--APAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV-ALAP 449
Query: 822 SSPAPQKSSPSSISDNEVHMPPPPLPGHGATYQTVKRKGTVSEEG 866
+ PA +I P + T +EEG
Sbjct: 450 APPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 31.8 bits (72), Expect = 1.6
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 2/53 (3%)
Query: 297 HPSGPAGAYS--EQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMP 347
A S E P PAP SV + N + QK++PS + P
Sbjct: 105 TTLSCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSP 157
Score = 31.8 bits (72), Expect = 1.6
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 2/53 (3%)
Query: 792 HPSGPAGAYS--EQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMP 842
A S E P PAP SV + N + QK++PS + P
Sbjct: 105 TTLSCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSP 157
>gnl|CDD|221718 pfam12695, Abhydrolase_5, Alpha/beta hydrolase family. This family
contains a diverse range of alpha/beta hydrolase
enzymes.
Length = 145
Score = 30.8 bits (70), Expect = 1.7
Identities = 23/96 (23%), Positives = 35/96 (36%), Gaps = 9/96 (9%)
Query: 444 VVSQKSIDSEHICISLPSFVAGLAMLLA-----VLVVACLVAAFLFVRVRSLDAKGMPTT 498
V++ +D E I + S G+A+LLA V L A + L +P
Sbjct: 49 VLADAPLDPERIVLVGHSLGGGVALLLAARDPRVKAAVVLAAGDPPDALDDLAKLTVPVL 108
Query: 499 YMHGGFD----PAEFVKVTRYYSDSHCVHLKPGTGH 530
+HG D P E + + + G GH
Sbjct: 109 IIHGTRDGVVPPEEAEALAAALPGPAELVVIEGAGH 144
>gnl|CDD|223014 PHA03231, PHA03231, glycoprotein BALF4; Provisional.
Length = 829
Score = 32.2 bits (74), Expect = 1.9
Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 10/49 (20%)
Query: 461 SFVAGL---------AMLLAVLVVACLVAAFL-FVRVRSLDAKGMPTTY 499
S V+G+ + + +LV+A LVA FL + RVR L M Y
Sbjct: 691 SIVSGVISFLKNPFGGLAIGLLVIAVLVAVFLAYRRVRRLRQNPMKMLY 739
Score = 32.2 bits (74), Expect = 1.9
Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 10/49 (20%)
Query: 956 SFVAGL---------AMLLAVLVVACLVAAFL-FVRVRSLDAKGMPTTY 994
S V+G+ + + +LV+A LVA FL + RVR L M Y
Sbjct: 691 SIVSGVISFLKNPFGGLAIGLLVIAVLVAVFLAYRRVRRLRQNPMKMLY 739
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 31.3 bits (71), Expect = 2.2
Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 8/65 (12%)
Query: 297 HPSGPAGAYSEQKPEVVPAPAASVSN-SPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHG 355
P P P PAP+A S+ P P+SP P K+ SS PP P G
Sbjct: 154 QPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS-------HPPLKSPMAG 206
Query: 356 ATYQT 360
Y++
Sbjct: 207 TFYRS 211
Score = 31.3 bits (71), Expect = 2.2
Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 8/65 (12%)
Query: 792 HPSGPAGAYSEQKPEVVPAPAASVSN-SPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHG 850
P P P PAP+A S+ P P+SP P K+ SS PP P G
Sbjct: 154 QPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS-------HPPLKSPMAG 206
Query: 851 ATYQT 855
Y++
Sbjct: 207 TFYRS 211
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 31.6 bits (72), Expect = 2.2
Identities = 27/109 (24%), Positives = 36/109 (33%), Gaps = 10/109 (9%)
Query: 286 GGPGYPEINPRHPSGPAGAYSEQKPEVVPA-PAASVSNSPNPSSPAPQKSSPSSISDNEV 344
P P+ PA P V PA PA + P P+ P ++P + +
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA-RSADP 440
Query: 345 HMPPPPLPGHGATYQTVKRK----GTVSEEGGDLVTLGGRPRSVEI-FP 388
A VK K G E+G L G +EI FP
Sbjct: 441 AAAASAGDRWRAFVAFVKGKKPALGASLEQGSPLGVSAGL---LEIGFP 486
Score = 31.6 bits (72), Expect = 2.2
Identities = 27/109 (24%), Positives = 36/109 (33%), Gaps = 10/109 (9%)
Query: 781 GGPGYPEINPRHPSGPAGAYSEQKPEVVPA-PAASVSNSPNPSSPAPQKSSPSSISDNEV 839
P P+ PA P V PA PA + P P+ P ++P + +
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA-RSADP 440
Query: 840 HMPPPPLPGHGATYQTVKRK----GTVSEEGGDLVTLGGRPRSVEI-FP 883
A VK K G E+G L G +EI FP
Sbjct: 441 AAAASAGDRWRAFVAFVKGKKPALGASLEQGSPLGVSAGL---LEIGFP 486
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 30.2 bits (68), Expect = 2.4
Identities = 17/61 (27%), Positives = 19/61 (31%), Gaps = 7/61 (11%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 325
PP PP V P P P S PA + PA A S+ P
Sbjct: 84 PPAPPEPVT-------PPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQ 136
Query: 326 P 326
P
Sbjct: 137 P 137
Score = 30.2 bits (68), Expect = 2.4
Identities = 17/61 (27%), Positives = 19/61 (31%), Gaps = 7/61 (11%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 820
PP PP V P P P S PA + PA A S+ P
Sbjct: 84 PPAPPEPVT-------PPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQ 136
Query: 821 P 821
P
Sbjct: 137 P 137
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 31.8 bits (72), Expect = 2.5
Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 15/99 (15%)
Query: 261 SGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASV 320
G P N + G H S P PAP+ ++
Sbjct: 75 VGIPKNSVPQSPNNKGKPSFFGDQRGRSIPSSHQSTHTSAPG-----------PAPSGNI 123
Query: 321 SNSPNPSSPAPQK----SSPSSISDNEVHMPPPPLPGHG 355
S S S P Q PS++S ++ H H
Sbjct: 124 SQSQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSSHSHS 162
Score = 31.8 bits (72), Expect = 2.5
Identities = 20/99 (20%), Positives = 28/99 (28%), Gaps = 15/99 (15%)
Query: 756 SGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASV 815
G P N + G H S P PAP+ ++
Sbjct: 75 VGIPKNSVPQSPNNKGKPSFFGDQRGRSIPSSHQSTHTSAPG-----------PAPSGNI 123
Query: 816 SNSPNPSSPAPQK----SSPSSISDNEVHMPPPPLPGHG 850
S S S P Q PS++S ++ H H
Sbjct: 124 SQSQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSSHSHS 162
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.5 bits (72), Expect = 2.8
Identities = 9/60 (15%), Positives = 14/60 (23%)
Query: 294 NPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
P P P + + P +P P + + S N P
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSP 576
Score = 31.5 bits (72), Expect = 2.8
Identities = 9/60 (15%), Positives = 14/60 (23%)
Query: 789 NPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
P P P + + P +P P + + S N P
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSP 576
Score = 31.1 bits (71), Expect = 3.9
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 286 GGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVH 345
P P P P+ ++Q P+P + ++ P P P + SP + N +
Sbjct: 377 PTPAPNPSPPPAPIQPSAPKTKQAATT-PSPPPAKASPPIPVPAEPTEPSP-TPPANAAN 434
Query: 346 MPPP 349
PP
Sbjct: 435 APPS 438
Score = 31.1 bits (71), Expect = 3.9
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 781 GGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVH 840
P P P P+ ++Q P+P + ++ P P P + SP + N +
Sbjct: 377 PTPAPNPSPPPAPIQPSAPKTKQAATT-PSPPPAKASPPIPVPAEPTEPSP-TPPANAAN 434
Query: 841 MPPP 844
PP
Sbjct: 435 APPS 438
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 30.8 bits (69), Expect = 3.0
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 3/51 (5%)
Query: 291 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISD 341
P + P SGP G+ P P P P P P + P+S SD
Sbjct: 158 PAVRPGDLSGP-GSRPTSGPVAGGLPGGGAPAPPTPPRPGP--TDPASESD 205
Score = 30.8 bits (69), Expect = 3.0
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 3/51 (5%)
Query: 786 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISD 836
P + P SGP G+ P P P P P P + P+S SD
Sbjct: 158 PAVRPGDLSGP-GSRPTSGPVAGGLPGGGAPAPPTPPRPGP--TDPASESD 205
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 31.1 bits (70), Expect = 3.4
Identities = 26/93 (27%), Positives = 33/93 (35%), Gaps = 9/93 (9%)
Query: 264 YGPPPPPPHVNSIGSEYGVPDFGGPGYPE--INPRHPSGPAGAYSEQKPEVVPAPA-ASV 320
P PPP +Y P P P+ NP + A S PE P +
Sbjct: 139 QQPQPPPQVPQQ--QQYQSP----PQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYP 192
Query: 321 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
N P PSS A Q + + + PP P P
Sbjct: 193 PNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPY 225
Score = 31.1 bits (70), Expect = 3.4
Identities = 26/93 (27%), Positives = 33/93 (35%), Gaps = 9/93 (9%)
Query: 759 YGPPPPPPHVNSIGSEYGVPDFGGPGYPE--INPRHPSGPAGAYSEQKPEVVPAPA-ASV 815
P PPP +Y P P P+ NP + A S PE P +
Sbjct: 139 QQPQPPPQVPQQ--QQYQSP----PQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYP 192
Query: 816 SNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
N P PSS A Q + + + PP P P
Sbjct: 193 PNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPY 225
>gnl|CDD|234399 TIGR03923, T7SS_EccE, type VII secretion protein EccE. This model
represents the transmembrane protein EccB of the
actinobacterial flavor of type VII secretion systems.
Species such as Mycobacterium tuberculosis have several
instances of this system per genome, designated EccE1,
EccE2, etc. This model represents a conserved core
region, and many members have 200 or more additional
C-terminal residues [Protein fate, Protein and peptide
secretion and trafficking].
Length = 341
Score = 30.9 bits (70), Expect = 3.5
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 463 VAGLAMLLAVLVVACLVAAFLFVRVR 488
LA L VA +V LFVR R
Sbjct: 21 ATRLAGAWVALGVAVVVVLLLFVRWR 46
Score = 30.9 bits (70), Expect = 3.5
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 958 VAGLAMLLAVLVVACLVAAFLFVRVR 983
LA L VA +V LFVR R
Sbjct: 21 ATRLAGAWVALGVAVVVVLLLFVRWR 46
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 30.8 bits (70), Expect = 3.9
Identities = 12/62 (19%), Positives = 18/62 (29%)
Query: 291 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 350
+P+H P A ++ + V P A V P P + P
Sbjct: 84 ARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQ 143
Query: 351 LP 352
P
Sbjct: 144 QP 145
Score = 30.8 bits (70), Expect = 3.9
Identities = 12/62 (19%), Positives = 18/62 (29%)
Query: 786 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 845
+P+H P A ++ + V P A V P P + P
Sbjct: 84 ARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQ 143
Query: 846 LP 847
P
Sbjct: 144 QP 145
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 30.6 bits (69), Expect = 4.3
Identities = 20/84 (23%), Positives = 25/84 (29%), Gaps = 6/84 (7%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 325
P PP S P+ P P P G A + + PA +S
Sbjct: 13 RRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSSSA 72
Query: 326 PSSPAPQKSSPSSISDNEVHMPPP 349
P P P + D PPP
Sbjct: 73 PPRP------PLGLDDAPAATPPP 90
Score = 30.6 bits (69), Expect = 4.3
Identities = 20/84 (23%), Positives = 25/84 (29%), Gaps = 6/84 (7%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 820
P PP S P+ P P P G A + + PA +S
Sbjct: 13 RRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSSSA 72
Query: 821 PSSPAPQKSSPSSISDNEVHMPPP 844
P P P + D PPP
Sbjct: 73 PPRP------PLGLDDAPAATPPP 90
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.8 bits (69), Expect = 4.5
Identities = 22/93 (23%), Positives = 30/93 (32%), Gaps = 9/93 (9%)
Query: 266 PPPPPPHVNSI----GSEYGVPDFGGPGYPEINP--RHPSGPAGAYSEQKPEVVPAPAAS 319
P PP + P G P +P +HP G + P P+ S
Sbjct: 418 VPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPP---PSLPTS 474
Query: 320 VSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
+P +S Q + S P PPLP
Sbjct: 475 TPAAPPRASSGSQPPGSALPSSGGCAGPGPPLP 507
Score = 30.8 bits (69), Expect = 4.5
Identities = 22/93 (23%), Positives = 30/93 (32%), Gaps = 9/93 (9%)
Query: 761 PPPPPPHVNSI----GSEYGVPDFGGPGYPEINP--RHPSGPAGAYSEQKPEVVPAPAAS 814
P PP + P G P +P +HP G + P P+ S
Sbjct: 418 VPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPP---PSLPTS 474
Query: 815 VSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
+P +S Q + S P PPLP
Sbjct: 475 TPAAPPRASSGSQPPGSALPSSGGCAGPGPPLP 507
Score = 30.4 bits (68), Expect = 7.0
Identities = 23/91 (25%), Positives = 30/91 (32%), Gaps = 18/91 (19%)
Query: 266 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 325
P P PP S+ P P + RHP + P + A +
Sbjct: 238 PSPHPPLQPQTASQ------QSPQPPAPSSRHPQSS---HHGPGPPMPHALQQGPVFLQH 288
Query: 326 PSS--PAPQKSSPSSISDNEVHMPPPPLPGH 354
PSS P P + S + PP PLP
Sbjct: 289 PSSNPPQPFGLAQSQV-------PPLPLPSQ 312
Score = 30.4 bits (68), Expect = 7.0
Identities = 23/91 (25%), Positives = 30/91 (32%), Gaps = 18/91 (19%)
Query: 761 PPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPN 820
P P PP S+ P P + RHP + P + A +
Sbjct: 238 PSPHPPLQPQTASQ------QSPQPPAPSSRHPQSS---HHGPGPPMPHALQQGPVFLQH 288
Query: 821 PSS--PAPQKSSPSSISDNEVHMPPPPLPGH 849
PSS P P + S + PP PLP
Sbjct: 289 PSSNPPQPFGLAQSQV-------PPLPLPSQ 312
>gnl|CDD|190657 pfam03482, SIC, sic protein. Serotype M1 group A Streptococcus
strains cause epidemic waves of human infections. This
family includes the sic protein an extracellular protein
(streptococcal inhibitor of complement) that inhibits
human complement.
Length = 274
Score = 30.2 bits (67), Expect = 4.5
Identities = 34/122 (27%), Positives = 45/122 (36%), Gaps = 39/122 (31%)
Query: 249 DYGAPQLGGGFGSGEY------GP------PPPPPHVNSIGSEY-GVPDFGGPGYPEINP 295
D P G G+G GP P PP+ ++G+ Y D+GGPG +P
Sbjct: 72 DKTRPPYGEALGTGYEKRDDWGGPGTVATDPYTPPYGGALGTGYEKRDDWGGPGTVATDP 131
Query: 296 RHP--SGPAGAYSEQK-----PEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPP 348
P G G E++ P +P P SPNPS H+P
Sbjct: 132 YTPPYGGALGTGYEKRDDWRGPGHIPKPEN--EQSPNPS-----------------HIPE 172
Query: 349 PP 350
PP
Sbjct: 173 PP 174
Score = 30.2 bits (67), Expect = 4.5
Identities = 34/122 (27%), Positives = 45/122 (36%), Gaps = 39/122 (31%)
Query: 744 DYGAPQLGGGFGSGEY------GP------PPPPPHVNSIGSEY-GVPDFGGPGYPEINP 790
D P G G+G GP P PP+ ++G+ Y D+GGPG +P
Sbjct: 72 DKTRPPYGEALGTGYEKRDDWGGPGTVATDPYTPPYGGALGTGYEKRDDWGGPGTVATDP 131
Query: 791 RHP--SGPAGAYSEQK-----PEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPP 843
P G G E++ P +P P SPNPS H+P
Sbjct: 132 YTPPYGGALGTGYEKRDDWRGPGHIPKPEN--EQSPNPS-----------------HIPE 172
Query: 844 PP 845
PP
Sbjct: 173 PP 174
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 29.5 bits (66), Expect = 4.7
Identities = 19/74 (25%), Positives = 21/74 (28%)
Query: 263 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSN 322
Y P P PP Y P P P P GP Y+ P N
Sbjct: 53 PYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQN 112
Query: 323 SPNPSSPAPQKSSP 336
P P P +P
Sbjct: 113 YPYPPGPGQDPYAP 126
Score = 29.5 bits (66), Expect = 4.7
Identities = 19/74 (25%), Positives = 21/74 (28%)
Query: 758 EYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSN 817
Y P P PP Y P P P P GP Y+ P N
Sbjct: 53 PYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQN 112
Query: 818 SPNPSSPAPQKSSP 831
P P P +P
Sbjct: 113 YPYPPGPGQDPYAP 126
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 30.3 bits (68), Expect = 5.0
Identities = 25/89 (28%), Positives = 29/89 (32%), Gaps = 10/89 (11%)
Query: 265 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 324
G PP GS G D P P PR GPA + P P AS +P
Sbjct: 170 GTLAAPPLGE--GSADGSCD---PALPLSAPR--LGPADVFVPATPRPTPRTTASPETTP 222
Query: 325 NPSSPAPQKSSPSSISDNEVHMPPPPLPG 353
PS+ PS+ P G
Sbjct: 223 TPSTTTS---PPSTTIPAPSTTIAAPQAG 248
Score = 30.3 bits (68), Expect = 5.0
Identities = 25/89 (28%), Positives = 29/89 (32%), Gaps = 10/89 (11%)
Query: 760 GPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSP 819
G PP GS G D P P PR GPA + P P AS +P
Sbjct: 170 GTLAAPPLGE--GSADGSCD---PALPLSAPR--LGPADVFVPATPRPTPRTTASPETTP 222
Query: 820 NPSSPAPQKSSPSSISDNEVHMPPPPLPG 848
PS+ PS+ P G
Sbjct: 223 TPSTTTS---PPSTTIPAPSTTIAAPQAG 248
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 30.4 bits (69), Expect = 5.1
Identities = 15/57 (26%), Positives = 23/57 (40%), Gaps = 2/57 (3%)
Query: 282 VPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSS 338
V G P HP+ P + +KP V A+V+ P S P+ + S+
Sbjct: 181 VAPSKGAKVPATAETHPTPPQKP-ATKKPAVNHHKTATVAVPPAT-SGKPKSGAASA 235
Score = 30.4 bits (69), Expect = 5.1
Identities = 15/57 (26%), Positives = 23/57 (40%), Gaps = 2/57 (3%)
Query: 777 VPDFGGPGYPEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSS 833
V G P HP+ P + +KP V A+V+ P S P+ + S+
Sbjct: 181 VAPSKGAKVPATAETHPTPPQKP-ATKKPAVNHHKTATVAVPPAT-SGKPKSGAASA 235
>gnl|CDD|177646 PHA03418, PHA03418, hypothetical E4 protein; Provisional.
Length = 230
Score = 30.1 bits (67), Expect = 5.5
Identities = 15/29 (51%), Positives = 17/29 (58%)
Query: 243 EPRCGGDYGAPQLGGGFGSGEYGPPPPPP 271
EP GG+ P LG G G+ E PPP PP
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHPPPLPP 178
Score = 30.1 bits (67), Expect = 5.5
Identities = 15/29 (51%), Positives = 17/29 (58%)
Query: 738 EPRCGGDYGAPQLGGGFGSGEYGPPPPPP 766
EP GG+ P LG G G+ E PPP PP
Sbjct: 150 EPPEGGEESQPPLGEGEGAVEGHPPPLPP 178
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 30.1 bits (68), Expect = 5.9
Identities = 10/51 (19%), Positives = 18/51 (35%)
Query: 293 INPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNE 343
P +G E+ P A A + + + SPA ++ + D
Sbjct: 79 DATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLS 129
Score = 30.1 bits (68), Expect = 5.9
Identities = 10/51 (19%), Positives = 18/51 (35%)
Query: 788 INPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNE 838
P +G E+ P A A + + + SPA ++ + D
Sbjct: 79 DATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLS 129
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 29.3 bits (66), Expect = 6.2
Identities = 9/43 (20%), Positives = 15/43 (34%)
Query: 310 PEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 352
P PA++++N P P + + S PP
Sbjct: 56 QTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPE 98
Score = 29.3 bits (66), Expect = 6.2
Identities = 9/43 (20%), Positives = 15/43 (34%)
Query: 805 PEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLP 847
P PA++++N P P + + S PP
Sbjct: 56 QTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPE 98
>gnl|CDD|216018 pfam00606, Glycoprotein_B, Herpesvirus Glycoprotein B. This family
of proteins contains a transmembrane region.
Length = 711
Score = 30.2 bits (69), Expect = 6.8
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 10/49 (20%)
Query: 461 SFVAGL---------AMLLAVLVVACLVAAFL-FVRVRSLDAKGMPTTY 499
S V+G+ +L+ +LV+A LVA FL + RVR L + Y
Sbjct: 652 SIVSGVVSFLKNPFGGLLIILLVLAGLVAVFLAYRRVRRLRQNPIKMLY 700
Score = 30.2 bits (69), Expect = 6.8
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 10/49 (20%)
Query: 956 SFVAGL---------AMLLAVLVVACLVAAFL-FVRVRSLDAKGMPTTY 994
S V+G+ +L+ +LV+A LVA FL + RVR L + Y
Sbjct: 652 SIVSGVVSFLKNPFGGLLIILLVLAGLVAVFLAYRRVRRLRQNPIKMLY 700
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.3 bits (68), Expect = 7.0
Identities = 14/66 (21%), Positives = 19/66 (28%), Gaps = 13/66 (19%)
Query: 291 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 350
PE + P G + P PA S +++P P P P
Sbjct: 395 PEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVA-------------RSAP 441
Query: 351 LPGHGA 356
LP
Sbjct: 442 LPPSPQ 447
Score = 30.3 bits (68), Expect = 7.0
Identities = 14/66 (21%), Positives = 19/66 (28%), Gaps = 13/66 (19%)
Query: 786 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPP 845
PE + P G + P PA S +++P P P P
Sbjct: 395 PEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVA-------------RSAP 441
Query: 846 LPGHGA 851
LP
Sbjct: 442 LPPSPQ 447
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 30.1 bits (67), Expect = 7.1
Identities = 20/58 (34%), Positives = 25/58 (43%), Gaps = 9/58 (15%)
Query: 291 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPP 348
PE PS PA +QKP+V AP A +P+ P Q S E +PP
Sbjct: 182 PETTDPKPSPPA--EDKQKPKVESAPVAEKPKAPSSPPPPKQ-------SAKEPQLPP 230
Score = 30.1 bits (67), Expect = 7.1
Identities = 20/58 (34%), Positives = 25/58 (43%), Gaps = 9/58 (15%)
Query: 786 PEINPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPP 843
PE PS PA +QKP+V AP A +P+ P Q S E +PP
Sbjct: 182 PETTDPKPSPPA--EDKQKPKVESAPVAEKPKAPSSPPPPKQ-------SAKEPQLPP 230
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.2 bits (68), Expect = 7.6
Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 6/66 (9%)
Query: 293 INPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKS------SPSSISDNEVHM 346
+ P PA + V P PA S ++ P K +P + V
Sbjct: 360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAP 419
Query: 347 PPPPLP 352
P P P
Sbjct: 420 PVPHTP 425
Score = 30.2 bits (68), Expect = 7.6
Identities = 15/66 (22%), Positives = 21/66 (31%), Gaps = 6/66 (9%)
Query: 788 INPRHPSGPAGAYSEQKPEVVPAPAASVSNSPNPSSPAPQKS------SPSSISDNEVHM 841
+ P PA + V P PA S ++ P K +P + V
Sbjct: 360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAP 419
Query: 842 PPPPLP 847
P P P
Sbjct: 420 PVPHTP 425
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are encoded
near genes for homologs of PfaB, PfaC, and/or PfaD.
Length = 2582
Score = 30.0 bits (67), Expect = 9.0
Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 288 PGYPEINPRHPS-----GPAGAYSEQKPEVVPAPAASVSNSPNPSSPAP 331
P PE+NP + G +Y + K V S S+PA
Sbjct: 1566 PDLPELNPEDLAECRTLGEIVSYMQSKVPPVADVTVETSTLAAESAPAI 1614
Score = 30.0 bits (67), Expect = 9.0
Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 783 PGYPEINPRHPS-----GPAGAYSEQKPEVVPAPAASVSNSPNPSSPAP 826
P PE+NP + G +Y + K V S S+PA
Sbjct: 1566 PDLPELNPEDLAECRTLGEIVSYMQSKVPPVADVTVETSTLAAESAPAI 1614
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 29.7 bits (67), Expect = 9.2
Identities = 13/67 (19%), Positives = 23/67 (34%), Gaps = 3/67 (4%)
Query: 286 GGPGYPEINPRHPSGPA--GAYSEQKPEVVPAPAAS-VSNSPNPSSPAPQKSSPSSISDN 342
G P+ P + +P A + S +S APQ ++ + +
Sbjct: 366 GDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPP 425
Query: 343 EVHMPPP 349
V + PP
Sbjct: 426 TVSVDPP 432
Score = 29.7 bits (67), Expect = 9.2
Identities = 13/67 (19%), Positives = 23/67 (34%), Gaps = 3/67 (4%)
Query: 781 GGPGYPEINPRHPSGPA--GAYSEQKPEVVPAPAAS-VSNSPNPSSPAPQKSSPSSISDN 837
G P+ P + +P A + S +S APQ ++ + +
Sbjct: 366 GDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPP 425
Query: 838 EVHMPPP 844
V + PP
Sbjct: 426 TVSVDPP 432
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 29.8 bits (67), Expect = 9.8
Identities = 21/112 (18%), Positives = 28/112 (25%), Gaps = 19/112 (16%)
Query: 247 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYS 306
GG G P G ++ P S P P GP
Sbjct: 392 GGAMGQPPYYGQGPQQQF---NGQPLGWPRMS------------MMPTPMGPGGPLRPNG 436
Query: 307 EQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATY 358
V AP+ + N+ P P+ S LP +T
Sbjct: 437 LAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQS----LPLSQDLPQPQSTA 484
Score = 29.8 bits (67), Expect = 9.8
Identities = 21/112 (18%), Positives = 28/112 (25%), Gaps = 19/112 (16%)
Query: 742 GGDYGAPQLGGGFGSGEYGPPPPPPHVNSIGSEYGVPDFGGPGYPEINPRHPSGPAGAYS 801
GG G P G ++ P S P P GP
Sbjct: 392 GGAMGQPPYYGQGPQQQF---NGQPLGWPRMS------------MMPTPMGPGGPLRPNG 436
Query: 802 EQKPEVVPAPAASVSNSPNPSSPAPQKSSPSSISDNEVHMPPPPLPGHGATY 853
V AP+ + N+ P P+ S LP +T
Sbjct: 437 LAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQS----LPLSQDLPQPQSTA 484
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.421
Gapped
Lambda K H
0.267 0.0692 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 51,949,315
Number of extensions: 5143762
Number of successful extensions: 8282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7488
Number of HSP's successfully gapped: 359
Length of query: 1009
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 903
Effective length of database: 6,236,078
Effective search space: 5631178434
Effective search space used: 5631178434
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)