RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10461
(1359 letters)
>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
modification].
Length = 975
Score = 638 bits (1646), Expect = 0.0
Identities = 239/464 (51%), Positives = 306/464 (65%), Gaps = 2/464 (0%)
Query: 896 ARLVGASEIINRVVDDLKDENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLIDGILYAF 955
A++ G + ++++ DE E YRKM + + S LG D RLEE+L D IL AF
Sbjct: 514 AKMGGDPRVSRKILEYYSDEPEPYRKMNAGLVSRIFSRLGRLGFDERLEERLYDSILNAF 573
Query: 956 QEQTTEDVVMLNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRIAV 1015
QEQ T ++L F T++ L R KP+L I TIL L +K VR +AADL+ +A
Sbjct: 574 QEQDTTVGLILPCFSTVLVSLEFRGKPHLSMIVSTILKLLRSKPPDVRIRAADLMGSLAK 633
Query: 1016 VMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRL 1075
V+K C E K + LG +LYE LGE+YPEVLGSIL A+ +I +V M PPI +LP L
Sbjct: 634 VLKACGETKELAKLGNILYENLGEDYPEVLGSILKAICSIYSVHRFRSMQPPISGILPSL 693
Query: 1076 TPILKNRHEKVQENCIDLVGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNT 1135
TPIL+N+H+KV N I LVG I PEY+ REWMRICFEL++ LK+ K IRR T
Sbjct: 694 TPILRNKHQKVVANTIALVGTICMNSPEYIGVREWMRICFELVDSLKSWNKEIRRNATET 753
Query: 1136 FGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPE 1195
FG I++AIGP DVL LLNNLKVQERQ RVCT+VAI+IVAE C PF+VLP LM++Y PE
Sbjct: 754 FGCISRAIGPQDVLDILLNNLKVQERQQRVCTSVAISIVAEYCGPFSVLPTLMSDYETPE 813
Query: 1196 LNVQNGVLKALSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTACATIKHMALGVY 1255
NVQNGVLKA+ F+FEYIG+ DY+Y++TPLLEDAL DRD VHRQTA I+H+ L
Sbjct: 814 ANVQNGVLKAMCFMFEYIGQASLDYVYSITPLLEDALTDRDPVHRQTAMNVIRHLVLNCP 873
Query: 1256 GFGCEDALTHLLNYVWPNIFETSPHLVQAFMDAVEGLRVALGPVRILQYVLQGLFHPARK 1315
G G EDA HLLN +WPNI E SPH++Q+F + +E LG +++YV QGLFHP+
Sbjct: 874 GTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGSGAMMKYVQQGLFHPSST 933
Query: 1316 VRDVYWKIYNSLYIGGQDALISAYPRIQNDMKNVYLRYELDYVL 1359
VR YW +YN +Y+ DA++ YP ++ N L L +
Sbjct: 934 VRKRYWTVYNIMYVFDSDAMVPCYPVEEDL--NPELARTLHICI 975
Score = 281 bits (721), Expect = 6e-79
Identities = 110/198 (55%), Positives = 129/198 (65%)
Query: 407 AKYMDNQPKGNLPFLKPEDAQYFDKLLVDVDEDSLSPEEAKERKIMKLLLKIKNGTPPMR 466
D +L F K ED +YF D L E ER + +LLLK+KNG MR
Sbjct: 126 KALTDFHGYADLGFFKVEDLKYFADDEKDFFMPLLEDREGDERDVYRLLLKVKNGGKRMR 185
Query: 467 KAALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVH 526
LR +TDKA FGA +FN++LP+LMS LEDQERHL+VK+IDR+LY LDDL PYVH
Sbjct: 186 MEGLRILTDKAVNFGAAAVFNKVLPMLMSRELEDQERHLVVKLIDRLLYGLDDLKVPYVH 245
Query: 527 KILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAF 586
KILVV+ PLLIDED R GREII NL GL +S+MRPDI + DEYVRN T RA
Sbjct: 246 KILVVVGPLLIDEDLKRRCMGREIILNLVYRCGLGFSVSSMRPDITSKDEYVRNVTGRAV 305
Query: 587 AVVASALGIPSLLPFLKA 604
VVA ALG+ LLPFL+A
Sbjct: 306 GVVADALGVEELLPFLEA 323
Score = 33.4 bits (76), Expect = 1.2
Identities = 37/200 (18%), Positives = 62/200 (31%), Gaps = 43/200 (21%)
Query: 458 IKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLMS---PTLEDQERHLLVKVIDRIL 514
I + +R R + A G ++LP L + + RH +++ +I
Sbjct: 290 ITSKDEYVRNVTGRAVGVVADALGV----EELLPFLEALCGSRKSWEARHTGIRIAQQIC 345
Query: 515 YKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAG--------------- 559
L ++ +L I LL D + R++ +S LA+ G
Sbjct: 346 ELLGRSRLSHLGPLLKCISKLLKDRSRFVRIDTANALSYLAELVGPYGIEQFDEVLCPLW 405
Query: 560 ----------LATMISTMRPDIDNID-EYVRNTTARAFAVVASAL------GIPSLLPFL 602
L + + M I + EY + T +V LL
Sbjct: 406 EGASQHRGKELVSFLKAMGFIIPLMSPEYACHDTREHMEIVFREFKSPDEEMKKDLLVVE 465
Query: 603 ----KAGGVTPAATRWDETP 618
K G TP R +P
Sbjct: 466 RICDKVGTDTPWKLRDQVSP 485
Score = 32.2 bits (73), Expect = 2.3
Identities = 31/148 (20%), Positives = 59/148 (39%), Gaps = 7/148 (4%)
Query: 469 ALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYV--- 525
A+ I R P + ILP L P L ++ + ++ I ++ + Y+
Sbjct: 669 AICSIYSVHRFRSMQPPISGILPSLT-PILRNKHQKVVANTIA-LVGTICMNSPEYIGVR 726
Query: 526 --HKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTA 583
+I + L + R E +++A G ++ + ++ + R T+
Sbjct: 727 EWMRICFELVDSLKSWNKEIRRNATETFGCISRAIGPQDVLDILLNNLKVQERQQRVCTS 786
Query: 584 RAFAVVASALGIPSLLPFLKAGGVTPAA 611
A ++VA G S+LP L + TP A
Sbjct: 787 VAISIVAEYCGPFSVLPTLMSDYETPEA 814
Score = 31.1 bits (70), Expect = 5.0
Identities = 31/137 (22%), Positives = 57/137 (41%), Gaps = 10/137 (7%)
Query: 485 LFNQILPLLMSP--TLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYY 542
F+ +LP LMS T E ++ ++K + + + YV+ I ++E L D D
Sbjct: 798 PFS-VLPTLMSDYETPEANVQNGVLKAMCFMFEYIGQASLDYVYSITPLLEDALTDRDPV 856
Query: 543 ARVEGREIISNLA-KAAGLA---TMISTMRPDIDNIDE---YVRNTTARAFAVVASALGI 595
R +I +L G I + NI E +V + A+ LG
Sbjct: 857 HRQTAMNVIRHLVLNCPGTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGS 916
Query: 596 PSLLPFLKAGGVTPAAT 612
+++ +++ G P++T
Sbjct: 917 GAMMKYVQQGLFHPSST 933
Score = 30.3 bits (68), Expect = 8.5
Identities = 48/266 (18%), Positives = 103/266 (38%), Gaps = 29/266 (10%)
Query: 905 INRVVDDLKDENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLIDGILYAFQEQTTEDVV 964
+ R++ +K+ ++ R + + N GAA + +++ L+ L ++Q VV
Sbjct: 170 VYRLLLKVKNGGKRMRMEGLRILTDKAVNFGAAAVFNKVLPMLMSREL---EDQERHLVV 226
Query: 965 MLNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRIAVVMKTCQEEK 1024
L ++ L PY+ +I + L ++ K R ++I +
Sbjct: 227 KL--IDRLLYGLDDLKVPYVHKILVVVGPLLIDEDLKRRCMGREIILNLVYRCG------ 278
Query: 1025 LMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHE 1084
+G + + + V A+ + + +G+ ++ P ++ L + RH
Sbjct: 279 -LGFSVSSMRPDITSKDEYVRNVTGRAVGVVADALGVEELLPFLEALC-GSRKSWEARH- 335
Query: 1085 KVQENCIDLVGRIADRGPEYVSAREWMRICFELLE----LLKAHKKAIRRATVNTFGYIA 1140
I + +I + + R + LL+ LLK + +R T N Y+A
Sbjct: 336 ----TGIRIAQQICE-----LLGRSRLSHLGPLLKCISKLLKDRSRFVRIDTANALSYLA 386
Query: 1141 KAIGPH--DVLATLLNNLKVQERQNR 1164
+ +GP+ + +L L Q+R
Sbjct: 387 ELVGPYGIEQFDEVLCPLWEGASQHR 412
>gnl|CDD|220070 pfam08920, SF3b1, Splicing factor 3B subunit 1. This family
consists of several eukaryotic splicing factor 3B
subunit 1 proteins, which associate with p14 through a
C-terminus beta-strand that interacts with beta-3 of the
p14 RNA recognition motif (RRM) beta-sheet, which is in
turn connected to an alpha-helix by a loop that makes
extensive contacts with both the shorter C-terminal
helix and RRM of p14. This subunit is required for 'A'
splicing complex assembly (formed by the stable binding
of U2 snRNP to the branchpoint sequence in pre-mRNA) and
'E' splicing complex assembly.
Length = 142
Score = 119 bits (299), Expect = 5e-31
Identities = 52/114 (45%), Positives = 63/114 (55%), Gaps = 13/114 (11%)
Query: 713 SSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMA 772
K+RSRWD+TP +GA+TP A T TP M G TP GN+
Sbjct: 5 KKKKRSRWDQTPVPEAAGAVTPEAPTKK--------RSRWDQTPQM--EGQTPIGNE-GG 53
Query: 773 MATPTPGHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPP-GYKVLQPPA 825
MATP MTPEQ +R EIDERNRPLTD+EL+ + P GYK+L+PP
Sbjct: 54 MATPMTPSSV-MTPEQYIPFRIGTEIDERNRPLTDEELDELLPSEGYKILEPPP 106
Score = 115 bits (289), Expect = 1e-29
Identities = 50/81 (61%), Positives = 58/81 (71%), Gaps = 4/81 (4%)
Query: 330 AMTPEQIQAYRWEREIDERNRPLTDDELEAMFPP-GYKVLQPPAGYIPIRTPARKLTATP 388
MTPEQ +R EIDERNRPLTD+EL+ + P GYK+L+PP GY PIRTPARKL ATP
Sbjct: 63 VMTPEQYIPFRIGTEIDERNRPLTDEELDELLPSEGYKILEPPPGYEPIRTPARKLLATP 122
Query: 389 TPIAGTPTGFFIQQEDKTAKY 409
TPI G GF IQ+ED+
Sbjct: 123 TPITG---GFMIQEEDRGLAK 140
Score = 47.8 bits (114), Expect = 3e-06
Identities = 23/36 (63%), Positives = 25/36 (69%), Gaps = 3/36 (8%)
Query: 1 YIPIRTPARKLTATPTPIAGTPTGFFIQQEDKTAKY 36
Y PIRTPARKL ATPTPI G GF IQ+ED+
Sbjct: 108 YEPIRTPARKLLATPTPITG---GFMIQEEDRGLAK 140
Score = 40.1 bits (94), Expect = 0.001
Identities = 20/49 (40%), Positives = 21/49 (42%), Gaps = 3/49 (6%)
Query: 613 RWDETPGHPKPGAETPGA-TPSTRLWDATPGH--ATPGAATPGRETPSH 658
RWD+TP GA TP A T WD TP TP G TP
Sbjct: 11 RWDQTPVPEAAGAVTPEAPTKKRSRWDQTPQMEGQTPIGNEGGMATPMT 59
Score = 38.2 bits (89), Expect = 0.007
Identities = 24/68 (35%), Positives = 30/68 (44%), Gaps = 9/68 (13%)
Query: 626 ETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSS 685
TP WD TP GA TP T +R+RWD+TP+ E +TP +
Sbjct: 1 NTPEKKKKRSRWDQTPVPEAAGAVTPEAPTK---------KRSRWDQTPQMEGQTPIGNE 51
Query: 686 GWAETPKT 693
G TP T
Sbjct: 52 GGMATPMT 59
>gnl|CDD|219152 pfam06728, PIG-U, GPI transamidase subunit PIG-U. Many eukaryotic
proteins are anchored to the cell surface via
glycosylphosphatidylinositol (GPI), which is
posttranslationally attached to the carboxyl-terminus by
GPI transamidase. The mammalian GPI transamidase is a
complex of at least four subunits, GPI8, GAA1, PIG-S,
and PIG-T. PIG-U is thought to represent a fifth subunit
in this complex and may be involved in the recognition
of either the GPI attachment signal or the lipid portion
of GPI.
Length = 373
Score = 67.7 bits (166), Expect = 1e-11
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 813 MFPPGYKVLQPP--AACFFIGCSMFAPTVWHLWIYTRSANANFYFGVTLAF 861
+F +K ++ P + + +P +HLWIY S NANF+F +TL +
Sbjct: 323 LFRHLFKYMRYPFLVLLTLLVALLLSPIFYHLWIYLGSGNANFFFAITLVY 373
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 52.7 bits (127), Expect = 1e-06
Identities = 31/218 (14%), Positives = 45/218 (20%), Gaps = 10/218 (4%)
Query: 572 DNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTPAATRWDETPGHPKPGAETPGAT 631
+ + + + V A P G PA++ E P A
Sbjct: 568 EVLVTALAEELGGDWQVEAVVGPAPGAAGG--EGPPAPASSGPPEEAARPAAPAAPAAPA 625
Query: 632 PSTRLWDATPGHATPGAATPGRETPSH-----DKAQSSIRRNRWDETPKTERETP-GHSS 685
+A P + A K P
Sbjct: 626 APAP--AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
Query: 686 GWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPS 745
A AG P + + + PQA + AA P
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPP 743
Query: 746 TPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAA 783
P P + P A A P P +
Sbjct: 744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
Score = 36.5 bits (85), Expect = 0.12
Identities = 21/145 (14%), Positives = 30/145 (20%), Gaps = 22/145 (15%)
Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSI 665
+ A P A P A P+ A A A P
Sbjct: 384 RLGVAGGAGAPAAAAPSAAAAAPAAAPA----PAAAAPAAAAAPAPAAAPQPAPAPAP-- 437
Query: 666 RRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
+ A + P P ++ + P
Sbjct: 438 ----------APAPPSPAGNAPAGGAPSPPPAAAP---SAQPAPAPAAAPEPTA---APA 481
Query: 726 ATPSGAMTPSAATPGGMTPSTPITP 750
P A P+AA P+ P
Sbjct: 482 PAPPAAPAPAAAPAAPAAPAAPAGA 506
Score = 32.3 bits (74), Expect = 2.1
Identities = 25/154 (16%), Positives = 33/154 (21%), Gaps = 22/154 (14%)
Query: 638 DATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAG 697
+A G A A G P P E P + A AG
Sbjct: 585 EAVVGPAPGAAGGEGPPAP-------------ASSGPPEEAARPAAPAAPAAPAAPAPAG 631
Query: 698 PGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPL 757
P S+ A+ ++ G + P
Sbjct: 632 A-------AAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA--KAGGAAPAAPP 682
Query: 758 MTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
P+ P A A P P A Q
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
Score = 30.3 bits (69), Expect = 8.9
Identities = 21/110 (19%), Positives = 26/110 (23%), Gaps = 5/110 (4%)
Query: 685 SGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTP 744
+G A P P ++ + P A P A P+ A
Sbjct: 388 AGGAGAPAAAAPSAAAA--APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPA 445
Query: 745 STPITPHVGSTPLMTPSGVTPTGNKAMAMAT---PTPGHLAAMTPEQIQA 791
S P P A A P P AA P A
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPA 495
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 51.3 bits (123), Expect = 4e-06
Identities = 33/174 (18%), Positives = 57/174 (32%), Gaps = 28/174 (16%)
Query: 610 AATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGR-----ETPSHDKAQSS 664
A+ W+ +PG + ++P R +P G A + S + + SS
Sbjct: 271 EASGWNGPS--SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSS 328
Query: 665 IRRNRWDETPKTERETPGHS----------SGWAETPKTDRAGPGGDLIQETPTPGGVSS 714
+ E+ + +PG S A+ + G +
Sbjct: 329 TSSS--SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386
Query: 715 KRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGS------TPLMTPSG 762
+RR+R +A A + P G +P+ S PL+TPSG
Sbjct: 387 RRRARAAVAGRARRRDA---TGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSG 437
Score = 36.3 bits (84), Expect = 0.16
Identities = 30/177 (16%), Positives = 46/177 (25%), Gaps = 24/177 (13%)
Query: 607 VTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIR 666
+ + G+ TP S P P A+ P PS S +
Sbjct: 87 TPTWSLSTLAPASPAREGSPTPPGPSS----PDPPPPTPPPASPP----PSPAPDLSEML 138
Query: 667 RNRWDETPKTERETPGHSSGWAETPKTDRAGPGG----DLIQETPTPGGVSSKRRSRWDE 722
R P P + A + +ET
Sbjct: 139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS----------- 187
Query: 723 TPQATPSGAMTPSAATPGGMTPSTPI-TPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
+P A P + P+AA+P S+PI P S G + ++
Sbjct: 188 SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSES 244
Score = 33.6 bits (77), Expect = 0.91
Identities = 28/178 (15%), Positives = 45/178 (25%), Gaps = 17/178 (9%)
Query: 607 VTPAATRWDETPGHP----KPGAETPGATPSTRLWDATPGHATPGAATPGRET--PSHDK 660
R +P P A P + A+ A PGR +
Sbjct: 178 SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS--ASSPAPAPGRSAADDAGAS 235
Query: 661 AQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRW 720
+ S E E P P G + + PG SS
Sbjct: 236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNG--PSSRPGPASS------ 287
Query: 721 DETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
+ S + +PS+ G S + S+ + S + + + A
Sbjct: 288 -SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPG 344
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif first
identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the mammalian
armadillo homolog beta-catenin, the junctional plaque
protein plakoglobin, the adenomatous polyposis coli (APC)
tumor suppressor protein, and a number of other proteins.
ARM has been implicated in mediating protein-protein
interactions, but no common features among the target
proteins recognized by the ARM repeats have been
identified; related to the HEAT domain; three consecutive
copies of the repeat are represented by this alignment
model.
Length = 120
Score = 45.0 bits (107), Expect = 2e-05
Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 4/104 (3%)
Query: 1035 EYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDL--LPRLTPILKNRHEKVQENCID 1092
L V AL + G + + LP L +LK+ E+V + +
Sbjct: 14 SLLSSSDENVQREAAWALSNLS--AGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALW 71
Query: 1093 LVGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNTF 1136
+ +A + +L+ LL + + I++
Sbjct: 72 ALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGAL 115
Score = 31.9 bits (73), Expect = 0.66
Identities = 18/100 (18%), Positives = 35/100 (35%), Gaps = 17/100 (17%)
Query: 1070 DLLPRLTPILKNRHEKVQENCIDLVGRIADRGPE----YVSAREWMRICFE-LLELLKAH 1124
LP L +L + E VQ + ++ + V A L++LLK+
Sbjct: 7 GGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAG-----GLPALVQLLKSE 61
Query: 1125 KKAIRRATVNTFGYIA-------KAIGPHDVLATLLNNLK 1157
+ + +A + +A + + L+N L
Sbjct: 62 DEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLD 101
Score = 31.1 bits (71), Expect = 1.4
Identities = 11/69 (15%), Positives = 27/69 (39%), Gaps = 4/69 (5%)
Query: 990 TILWRLNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVV--LYEYLGEEYPEVLGS 1047
++ L ++ +V + A + +A + + ++ G V L L ++ +
Sbjct: 53 ALVQLLKSEDEEVVKAALWALRNLAAGPEDNK--LIVLEAGGVPKLVNLLDSSNEDIQKN 110
Query: 1048 ILGALKAIV 1056
GAL +
Sbjct: 111 ATGALSNLA 119
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 44.5 bits (105), Expect = 5e-04
Identities = 37/184 (20%), Positives = 45/184 (24%), Gaps = 11/184 (5%)
Query: 600 PFLKAGGVTPAATRWDETPGHPKPGAETPGAT-----PSTRLWDATPGHATPGAATPGRE 654
P A P R + G P P P + A P P E
Sbjct: 2592 PPQSARPRAPVDDR-GDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650
Query: 655 TPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSS 714
P D A + R R + G L P P
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
Query: 715 KRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMA 774
+ TP P G A+P P+ P P V + P TP G
Sbjct: 2711 APHALVSATP--LPPGPAAARQASPA--LPAAPAPPAVPAGP-ATPGGPARPARPPTTAG 2765
Query: 775 TPTP 778
P P
Sbjct: 2766 PPAP 2769
Score = 38.0 bits (88), Expect = 0.052
Identities = 56/228 (24%), Positives = 68/228 (29%), Gaps = 16/228 (7%)
Query: 609 PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
PAA P P P P T R DA P A P A R P S +
Sbjct: 2561 PAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLP-- 2618
Query: 669 RWDETPKTERETPGHSSGWAETPKTDR-AGPGGDLIQETPTPGGVSSKRRSRWDETP--- 724
+T + P S E P + ++ P PG VS RR+R
Sbjct: 2619 --PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQA 2676
Query: 725 QATPSGAMTPSA-ATPGGMT-----PSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTP 778
+ P +A T G +T P P TP L + P G A A+P
Sbjct: 2677 SSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHAL-VSATPLPPGPAAARQASPAL 2735
Query: 779 GHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA 826
A P RP T A PP PP
Sbjct: 2736 P-AAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
Score = 36.1 bits (83), Expect = 0.16
Identities = 50/234 (21%), Positives = 67/234 (28%), Gaps = 22/234 (9%)
Query: 605 GGVTPAATRWDETPG-HPKPGAETPGATPSTRLWDA----TPGHATPGAATPGRETPSHD 659
P R P P+ A A P +W A P A P P P+ D
Sbjct: 265 ADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAP-LALPAPPDPPPPAPAGD 323
Query: 660 KAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSR 719
+ + R + G+ PK R E + G KR S
Sbjct: 324 AEEEDDEDGAMEVVSPLPRPRQHYPLGF---PKRRRPTWTPPSSLEDLSAGRHHPKRASL 380
Query: 720 WDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
++ A TP A PGG + P P S P P+ V + A P+
Sbjct: 381 PTRKRRSARHAA-TPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE 439
Query: 780 HLAAMTPEQIQAYRWEREIDERNRPLTD-------DELEAMFPPGYKVLQPPAA 826
+ P + E D D L PP +PP A
Sbjct: 440 PGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPP-----EPPGA 488
Score = 34.1 bits (78), Expect = 0.69
Identities = 33/211 (15%), Positives = 51/211 (24%), Gaps = 12/211 (5%)
Query: 609 PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
PA P A A+P+ A P G ATPG ++
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGPARPARPPTTA---- 2764
Query: 669 RWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATP 728
P ++G + + P+P + + P
Sbjct: 2765 ----GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
Query: 729 SGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQ 788
+ + P P+ P P P + G G P +
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880
Query: 789 IQAYRWEREIDERNR---PLTDDELEAMFPP 816
R R R+ L D+ E P
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
Score = 33.4 bits (76), Expect = 1.2
Identities = 42/211 (19%), Positives = 56/211 (26%), Gaps = 32/211 (15%)
Query: 609 PAATRWDETPGHPKPGAETPG-----ATPSTRLWDATPGHATPGAATPGRETPSHDKAQS 663
P D P P P A P P P A + P R AQ+
Sbjct: 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP--APGRVSRPRRARRLGRAAQA 2676
Query: 664 SIRRNRWDETPKTERETPGHSSGWAETPKTDR--AGPGGDLIQETPTPGGVSSKRRSR-- 719
S R + R T G + A+ P L+ TP P G ++ R++
Sbjct: 2677 SSPPQRP--RRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA 2734
Query: 720 --WDETPQATPSGAMT---------------PSAATPGGMTPSTP--ITPHVGSTPLMTP 760
P A P+G T P A P + P L
Sbjct: 2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794
Query: 761 SGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
P+ AA+ P A
Sbjct: 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
Score = 30.7 bits (69), Expect = 7.9
Identities = 44/247 (17%), Positives = 55/247 (22%), Gaps = 35/247 (14%)
Query: 581 TTARAFAVVASALGIPSLLPFLKAGGVTPAATRWDETPGHPK-PGAETPGATPSTRLWDA 639
A + AL P + AG TP P P A P A P+
Sbjct: 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA----G 2778
Query: 640 TPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPG 699
P T A + S + S WD + A P AGP
Sbjct: 2779 PPRRLTRPAVASL--SESRESLPSP-----WDPADPPAAVLAPAA---ALPPAASPAGP- 2827
Query: 700 GDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPIT--PHVGSTPL 757
P P T P G PS G + P + P S
Sbjct: 2828 ------LPPPTSAQ--------PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA- 2872
Query: 758 MTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPG 817
P +A P + P + P
Sbjct: 2873 --AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
Query: 818 YKVLQPP 824
PP
Sbjct: 2931 PPPPPPP 2937
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 42.4 bits (99), Expect = 0.002
Identities = 29/87 (33%), Positives = 35/87 (40%), Gaps = 14/87 (16%)
Query: 697 GPGGDLIQETPTPGGVSSKRRSRWDETPQAT-PSGAMTPSAATPGGMTPS---TPITPHV 752
GP PTPG PQ T P+ MTPS+A P PS +P P
Sbjct: 388 GPASGGAATIPTPGTQG----------PQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437
Query: 753 GSTPLMTPSGVTPTGNKAMAMATPTPG 779
+ P SG+ P M A+P PG
Sbjct: 438 DAPPAPPRSGIPPRPAPRMPEASPVPG 464
Score = 32.7 bits (74), Expect = 1.5
Identities = 26/107 (24%), Positives = 34/107 (31%), Gaps = 14/107 (13%)
Query: 673 TPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAM 732
TP T+ G + TP + A P P+P R WD+ P A P +
Sbjct: 399 TPGTQG-PQGTAPAAGMTPSS--AAPATPAPSAAPSP-------RVPWDDAPPAPPRSGI 448
Query: 733 TPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
P A P P + + S PT A TP
Sbjct: 449 PPRPAPRMPEASPVPGAPDSVA----SASDAPPTLGDPSDTAEHTPS 491
Score = 32.3 bits (73), Expect = 2.0
Identities = 24/90 (26%), Positives = 27/90 (30%), Gaps = 7/90 (7%)
Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRL-WDATPGHATPGAATPGRETPSHDKAQSS 664
G PAA TP P P A PS R+ WD P A P + P R P +A
Sbjct: 407 GTAPAAGM---TPSSAAPATPAPSAAPSPRVPWDDAPP-APPRSGIPPRPAPRMPEASPV 462
Query: 665 IRRNRWDETPKTERETPGHSSGWAETPKTD 694
D P T
Sbjct: 463 P--GAPDSVASASDAPPTLGDPSDTAEHTP 490
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 40.4 bits (94), Expect = 0.006
Identities = 31/156 (19%), Positives = 47/156 (30%), Gaps = 17/156 (10%)
Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
P P AT + RL +T T G T S+ + Q+
Sbjct: 186 HPTYPSQVTPQSQPATQGHQT-ATANQRL------SSTEPVGTQGTTTSSNPEPQTEPPP 238
Query: 668 NRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQAT 727
++ + + + P +S G G + Q TP S++R TP T
Sbjct: 239 SQRGPS-GSPQHPPSTTS-----QDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292
Query: 728 P----SGAMTPSAATPGGMTPSTPITPHVGSTPLMT 759
+G TP S P + G T
Sbjct: 293 TKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 35.9 bits (83), Expect = 0.007
Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 1042 PEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRI 1097
EV + AL A+ G + P + +LLP L P+LK+ ++V+E +GRI
Sbjct: 1 WEVREAAALALGALAGG-GPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 34.3 bits (79), Expect = 0.023
Identities = 11/48 (22%), Positives = 23/48 (47%)
Query: 966 LNGFGTIVNQLGKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRI 1013
G + + ++P +P++ +L L + +VR+ AA + RI
Sbjct: 8 ALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 28.2 bits (63), Expect = 3.7
Identities = 12/53 (22%), Positives = 20/53 (37%)
Query: 1198 VQNGVLKALSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTACATIKHM 1250
V+ AL L E+ + + + P L L D D R+ A + +
Sbjct: 3 VREAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 121
Score = 37.1 bits (86), Expect = 0.010
Identities = 32/155 (20%), Positives = 40/155 (25%), Gaps = 42/155 (27%)
Query: 617 TPGHPKPGAETP---GATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDET 673
TP G++TP TP T A + R TP+ A S
Sbjct: 4 TPAWGASGSKTPAWGSRTPGTAAGGAPTA----RGGSGSR-TPAWGGAGSR--------- 49
Query: 674 PKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMT 733
TP +T G + G S S A G
Sbjct: 50 ------TPAWGGAGPTGSRTPAWGGA--------SAWGNKSSEGSAS---SWAAGPGG-A 91
Query: 734 PSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGN 768
A TPG + P TP G G+
Sbjct: 92 YGAPTPGYGGTPSAYGPA-------TPGGGAMAGS 119
Score = 31.7 bits (72), Expect = 0.73
Identities = 22/111 (19%), Positives = 32/111 (28%), Gaps = 19/111 (17%)
Query: 688 AETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPS-T 746
TP +G +TP G + + SG+ TP+ G TP+
Sbjct: 2 GRTPAWGASG------SKTPAWGS-RTPGTAAGGAPTARGGSGSRTPAWGGAGSRTPAWG 54
Query: 747 PITPHVGSTP-----------LMTPSGVTPTGNKAMAMATPTPGHLAAMTP 786
P TP S + A PTPG+ +
Sbjct: 55 GAGPTGSRTPAWGGASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSA 105
>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 112
Score = 37.1 bits (86), Expect = 0.010
Identities = 22/67 (32%), Positives = 24/67 (35%), Gaps = 10/67 (14%)
Query: 600 PFLKAGGVTPA--ATRWDETP----GHPKPGAETPGATPSTRLWDATPGH----ATPGAA 649
P G TPA TP GA TPGA P+ TPG TPG +
Sbjct: 46 PAFGDGSRTPAYGGESNGRTPAYGGSGSYLGASTPGAAPTPAKSAPTPGAWGGAPTPGYS 105
Query: 650 TPGRETP 656
TP
Sbjct: 106 ADSAPTP 112
Score = 30.9 bits (70), Expect = 1.5
Identities = 31/146 (21%), Positives = 41/146 (28%), Gaps = 37/146 (25%)
Query: 624 GAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETP---KTERET 680
G+ TP ++ TP G+ TP TP +T
Sbjct: 1 GSRTPAY--NSSAGSRTPAWGQGGSRTP---AYGGAG----------GRTPAWGGAGNKT 45
Query: 681 PGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPG 740
P G + TP GG+ TP GG + GA TP AA
Sbjct: 46 PAFGDG-SRTPAY-----GGESNGRTPAYGG------------SGSYL-GASTPGAAPTP 86
Query: 741 GMTPSTPITPHVGSTPLMTPSGVTPT 766
+ TP TP +
Sbjct: 87 AKSAPTPGAWGGAPTPGYSADSAPTP 112
>gnl|CDD|227493 COG5164, SPT5, Transcription elongation factor [Transcription].
Length = 607
Score = 40.0 bits (93), Expect = 0.010
Identities = 32/174 (18%), Positives = 44/174 (25%), Gaps = 20/174 (11%)
Query: 628 PGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTER-ETPGHSSG 686
S + TP +AT G A T S S R W + PKT PG G
Sbjct: 421 KPLRYSEAIGSKTPNYATGGIAAGAAATSS---GLSGYRTPGWKDGPKTPALNDPGIDWG 477
Query: 687 WAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWD--ETPQATPSGAMTPS----AATPG 740
E + L + + W T A+ + P
Sbjct: 478 -DEKDNWYKGSLIHALGKGLALEDYRDGLFMTAWKGQATS-ASEISFVQPRWNNLVCVLD 535
Query: 741 GMTPSTP-----ITPHVGSTPLM-TPSGVT--PTGNKAMAMATPTPGHLAAMTP 786
G T I G T + T V P + + +
Sbjct: 536 GEKKGTCGILNGINGDWGGTVIRSTKGSVVHWPMNDNSKKIYDGGASAWGNQDD 589
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 39.8 bits (92), Expect = 0.011
Identities = 42/191 (21%), Positives = 56/191 (29%), Gaps = 20/191 (10%)
Query: 609 PAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRN 668
P T P+ E+P +T + + P T T T S +Q
Sbjct: 543 PTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGI-- 600
Query: 669 RWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATP 728
P + TP +S T GG+ I E TP S+ S P
Sbjct: 601 -----PSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE-TPSVPSTTHVSTLSPGPGPGT 654
Query: 729 ----SGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGV-----TPTGNKAMAMA--TPT 777
SG S + G T P+ +T PSG T T A + T
Sbjct: 655 TSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKET 714
Query: 778 PGHLAAMTPEQ 788
G M
Sbjct: 715 SGS-TLMASTS 724
Score = 34.8 bits (79), Expect = 0.41
Identities = 36/187 (19%), Positives = 54/187 (28%), Gaps = 23/187 (12%)
Query: 594 GIPSLL---PFLKAGGVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAAT 650
GIPS P + TP T T G + E + PST H + +
Sbjct: 599 GIPSSSHSTPRSNSTSTTPLLTSAHPTGG--ENITEETPSVPSTT-------HVSTLSPG 649
Query: 651 PGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPG 710
PG T S + +R+ PG P + P Q+T P
Sbjct: 650 PGPGTTSQVSGPGNSSTSRY----------PGEVHVTEGMPNPNATSPSAPSGQKTAVPT 699
Query: 711 GVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKA 770
S+ ++ + + S M ++ T P T S + P
Sbjct: 700 VTSTGGKANSTTK-ETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFT 758
Query: 771 MAMATPT 777
T
Sbjct: 759 SPPVTTK 765
Score = 34.8 bits (79), Expect = 0.41
Identities = 28/123 (22%), Positives = 34/123 (27%), Gaps = 25/123 (20%)
Query: 617 TPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKT 676
T G P P A +P A + T A + +ET T
Sbjct: 676 TEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGS--------------TLMA 721
Query: 677 ERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET-PQATPSGAMTPS 735
+ + TP T P SSK R RW T P T A P
Sbjct: 722 STSPHTNEGAFRTTPYN----------ATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPV 771
Query: 736 AAT 738
T
Sbjct: 772 PPT 774
Score = 31.3 bits (70), Expect = 4.1
Identities = 35/166 (21%), Positives = 53/166 (31%), Gaps = 11/166 (6%)
Query: 617 TPGHPKPGAETPGATPS---TRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDET 673
T H +P T T T A P + TP T S +S T
Sbjct: 437 TLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSAT 496
Query: 674 PKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMT 733
P TP ++ A +P T + + TP GV++ T + P+G +
Sbjct: 497 PNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTT--------TATSPPTGTTS 548
Query: 734 PSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
AT +T +P+ PS +T +P
Sbjct: 549 VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPT 594
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 39.7 bits (92), Expect = 0.012
Identities = 41/174 (23%), Positives = 58/174 (33%), Gaps = 8/174 (4%)
Query: 606 GVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSH--DKAQS 663
A PG +P A PG R A PG A P AA PG TP +
Sbjct: 729 AAPGRARPPAAAPGRARPPAAAPGRA---RPPAAAPGRARPPAAAPGAPTPQPPPQAPPA 785
Query: 664 SIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET 723
+R R TP+ + S + GP ++++ T G + +
Sbjct: 786 PQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAA 845
Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
+ + TPS PG T + V P++ P V A A T
Sbjct: 846 LERQAAAGPTPS---PGSGTSDKIVQAPVFYPPVLQPIQVMRQLGSVRAAAAST 896
Score = 35.8 bits (82), Expect = 0.22
Identities = 43/210 (20%), Positives = 59/210 (28%), Gaps = 35/210 (16%)
Query: 609 PAATRWDETPGHPKPG----AETPGATPST-RLWDATPGHA-TPGAATPGRETPSHDKAQ 662
PA+T P PG P +P+T +L + P +A TP +TP Q
Sbjct: 553 PASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQ 612
Query: 663 SSIRRN----RWDETPKTERETPGHSSGWAETPKTDRAGPGGD----------------- 701
S I +W P R P P
Sbjct: 613 SHIPETSAPRQW---PMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQI 669
Query: 702 -LIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTP 760
I P+P G ++ +W P A TP P P +T P
Sbjct: 670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTP-MRPPAA--PPGRAQRPAAATGRARP 726
Query: 761 SGVTPTGNKAMAMATPTPGHLAAMTPEQIQ 790
P + A A P A P + +
Sbjct: 727 PAAAPGRARPPAAA-PGRARPPAAAPGRAR 755
Score = 35.4 bits (81), Expect = 0.24
Identities = 44/192 (22%), Positives = 54/192 (28%), Gaps = 26/192 (13%)
Query: 608 TPAATRWDETPGHPKPGAETP--GATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSI 665
P P PG AT R A PG A P AA PGR P
Sbjct: 696 PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPA------- 748
Query: 666 RRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
P R A A G + P + ++R R TPQ
Sbjct: 749 ------AAPGRARPPAA-----APGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQ 797
Query: 726 ATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMT------PSGVTPTGNKAMAMATPTPG 779
P T P T + L PS P + A A PTP
Sbjct: 798 PPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPS 857
Query: 780 HLAAMTPEQIQA 791
+ + + +QA
Sbjct: 858 PGSGTSDKIVQA 869
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 38.8 bits (90), Expect = 0.019
Identities = 30/173 (17%), Positives = 42/173 (24%), Gaps = 28/173 (16%)
Query: 621 PKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERET 680
GA S AT TP A S P T
Sbjct: 123 GNAGASASITKTSNGSDAATTSSTAN---TPAAAKVLKANAAS---------APNTT-TG 169
Query: 681 PGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPG 740
G ++ A T P P +++ +A S A+
Sbjct: 170 VGSAATTAAISATTATTPT----TTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYN 225
Query: 741 GMTPSTPITPHVGSTPLMTPSGVTPTGNKAM-----AMATPTPGHLAAMTPEQ 788
M + P S P+ AMA+P L M+P +
Sbjct: 226 QM--GSLGVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASP----LNNMSPMR 272
Score = 35.4 bits (81), Expect = 0.18
Identities = 12/76 (15%), Positives = 24/76 (31%), Gaps = 3/76 (3%)
Query: 705 ETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVT 764
+ G +S ++ T S A TP+AA + + P+ + + + +
Sbjct: 121 QLGNAGASASITKTSNGSDAATTSSTANTPAAAK--VLKANAASAPNTTTG-VGSAATTA 177
Query: 765 PTGNKAMAMATPTPGH 780
T T
Sbjct: 178 AISATTATTPTTTQKK 193
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 37.8 bits (88), Expect = 0.038
Identities = 42/232 (18%), Positives = 66/232 (28%), Gaps = 27/232 (11%)
Query: 607 VTPAATRW--DETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSS 664
V PA+T+ +ET P +P + + + + A P + S+ S
Sbjct: 247 VWPASTQSPREETSAEP-LAPASPSSRRPSTAQE-EQIACSSPQAEPEQGVQSYVPQSSD 304
Query: 665 IRRNRWDETPKTERETPG-------HSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRR 717
R + P P + P+ G GG R
Sbjct: 305 SRPSC---FPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPR 361
Query: 718 SRWDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
R + Q+ PS + S+ TP L+ P+ V A PT
Sbjct: 362 GRPSGSGQSPPSNSPLLSS------LADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPT 415
Query: 778 PGH---LAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA 826
PG E + + D+ + A P Y + PP
Sbjct: 416 PGSSLVPQPDEQELGPSVMALLDRDQG----ILALILAPIPGLYDLSGPPNL 463
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 36.0 bits (82), Expect = 0.057
Identities = 33/173 (19%), Positives = 51/173 (29%), Gaps = 9/173 (5%)
Query: 621 PKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERET 680
P G PG S + P P + P + P + + P+
Sbjct: 2 PVGGGSVPGPLQSKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQM------QVPQNVSLH 55
Query: 681 PGHSSGWA--ETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAAT 738
H + E K + G TP P P P +
Sbjct: 56 AAHDAAQKAVEAAKNQKQGLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSP 115
Query: 739 PGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMA-TPTPGHLAAMTPEQIQ 790
++ T ++P G P++T V P + +M TP G + P Q Q
Sbjct: 116 ASQLSLVTTVSPGSGLAPVLTQQQVPPQQPQQPSMVPTPALGGVQPPQPSQQQ 168
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 36.6 bits (85), Expect = 0.082
Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 5/117 (4%)
Query: 136 AKKKVAAAEWEKEADVNIEAQIRDIQNRKKDVSEGGGEEKQKRVGLGESGYFDSDIYDGG 195
+K+A AE EK ++ +I + + E E + K GE G + + +
Sbjct: 20 EDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAK 79
Query: 196 ---GKFEGYVKSIADDDFDYQASFNQNKRS--GYTAPAALLNDIAQSEKDYDPFADR 247
+F Y++ D + + + + S GY P +I + ++Y P
Sbjct: 80 AEAAEFRAYLRGGDDALAEERKALSTGTDSDGGYLVPEEFSTEIIELLREYSPLRQL 136
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 36.3 bits (84), Expect = 0.14
Identities = 28/134 (20%), Positives = 35/134 (26%), Gaps = 14/134 (10%)
Query: 620 HPKPGAETPGATPSTRLWDATPGHATPGA--ATPGRETPSHDKAQSSIRRNRWDETPKTE 677
P+PGA W P A A+P P + R E
Sbjct: 152 RPEPGAWPR--AADDYGWQQQRLGFPPRAPYASPASYAPEQE---------RDREPYDAG 200
Query: 678 RETPGHSSGWAETPKTDRAGPGGDLIQET-PTPGGVSSKRRSRWDETPQATPSGAMTPSA 736
R + P+ D P D P PG R P + PSA
Sbjct: 201 RPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSA 260
Query: 737 ATPGGMTPSTPITP 750
P P+ P
Sbjct: 261 PGPLAAQPAPAPGP 274
Score = 31.3 bits (71), Expect = 4.3
Identities = 21/133 (15%), Positives = 29/133 (21%), Gaps = 17/133 (12%)
Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
W + P A P +D+ +
Sbjct: 161 AADDYGWQQQRLGFPPRAPYASPASYAP-------EQERDREPYDAGRPEYDQRRRDYDH 213
Query: 668 NR--WDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQ 725
R WD + + P G R GPG + P R
Sbjct: 214 PRPDWDRPRRDRTDRPEPPPGAGHVH---RGGPGPPERDDAPVV-----PIRPSAPGPLA 265
Query: 726 ATPSGAMTPSAAT 738
A P+ A P T
Sbjct: 266 AQPAPAPGPGEPT 278
>gnl|CDD|202817 pfam03937, Sdh5, Flavinator of succinate dehydrogenase. This
family of uncharacterized proteins. Based on personal
observation it was previously annotated in Pfam as being
a divergent TPR repeat but structural evidence has
indicated this is not true.This family is now found to
be a highly conserved mitochondrial protein, Sdh5. Both
yeast and human Sdh5 interact with the catalytic subunit
of the succinate dehydrogenase (SDH) complex, a
component of both the electron transport chain and the
tricarboxylic acid cycle. Sdh5 is required for
SDH-dependent respiration and for Sdh1 flavination
(incorporation of the flavin adenine dinucleotide
cofactor). Mutational inactivation of Sdh5 confers tumor
susceptibility in humans.
Length = 51
Score = 32.0 bits (74), Expect = 0.14
Identities = 10/39 (25%), Positives = 20/39 (51%), Gaps = 10/39 (25%)
Query: 259 YRAIRRRM-----IISPERVDPFAEGHLAAMTPEQIQAY 292
+R+ RR M ++ FA+ HL ++ E++ A+
Sbjct: 1 WRSWRRGMLELDLLLGR-----FADAHLDTLSEEELDAF 34
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 34.4 bits (79), Expect = 0.46
Identities = 31/188 (16%), Positives = 44/188 (23%), Gaps = 24/188 (12%)
Query: 604 AGGVTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQS 663
GGV P PGA A ++ A P A P ++
Sbjct: 369 GGGVPARVAG-----AVPAPGARAAAAVGAS----AVPAVTAVTGAAGAALAP-----KA 414
Query: 664 SIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDET 723
+ P ++ + A + D
Sbjct: 415 AAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDER-----DAQ 469
Query: 724 PQATPSGAMTPSAATPGGMT--PSTPITPHVGSTPLMTPSGVTPTG---NKAMAMATPTP 778
P A A P++ P P+ +TP P P A A A P
Sbjct: 470 PPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPA 529
Query: 779 GHLAAMTP 786
TP
Sbjct: 530 PEARPPTP 537
Score = 30.2 bits (68), Expect = 9.2
Identities = 13/109 (11%), Positives = 21/109 (19%), Gaps = 7/109 (6%)
Query: 680 TPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATP 739
T G + G + A P A+ P AA
Sbjct: 363 TGGGAPGGGVPARVAGAVPAP-----GARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA 417
Query: 740 GGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQ 788
+ P P T + + +A +
Sbjct: 418 --AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCD 464
>gnl|CDD|221752 pfam12752, SUZ, SUZ domain. The SUZ domain is a conserved
RNA-binding domain found in eukaryotes and enriched in
positively charged amino acids. It was first
characterized in the C.elegans protein Szy-20 where it
has been shown to bind RNA and allow their localization
to the centrosome. Warning- the domain has a
compositionally biased character.
Length = 57
Score = 30.0 bits (68), Expect = 0.79
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 246 DRRQKTVAEKEDEYRAIRRR 265
DR+ K++ E+E EY R R
Sbjct: 32 DRKAKSLEEREAEYAEARER 51
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 33.5 bits (76), Expect = 0.86
Identities = 22/103 (21%), Positives = 35/103 (33%), Gaps = 16/103 (15%)
Query: 700 GDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPITP--------- 750
G+ TP G ++ + + +PS + PS + P G S P TP
Sbjct: 564 GNPGSTNSTPTGSAASSNTTFSSD---SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTS 620
Query: 751 ----HVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQI 789
H+GS S + + A+P A T +
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSV 663
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 33.4 bits (76), Expect = 0.92
Identities = 34/112 (30%), Positives = 43/112 (38%), Gaps = 14/112 (12%)
Query: 672 ETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQAT--PS 729
P E G S A R GP + TP P R +P+ T PS
Sbjct: 174 APPLGEGSADG-SCDPALPLSAPRLGPADVFVPATPRP-------TPRTTASPETTPTPS 225
Query: 730 GAMTPSAATPGGMTPSTPITPHVGSTPLM--TPSGVTPTGNKAM-AMATPTP 778
+P + T + T P G+TP TP+ TP G +A A ATP P
Sbjct: 226 TTTSPPSTTIPAPST-TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAP 276
Score = 30.7 bits (69), Expect = 6.2
Identities = 16/66 (24%), Positives = 21/66 (31%), Gaps = 2/66 (3%)
Query: 608 TPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRR 667
TP + P P T A P G TP TPG A +
Sbjct: 221 TPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG--TPAPPTPGGGEAPPANATPAPEA 278
Query: 668 NRWDET 673
+R++ T
Sbjct: 279 SRYELT 284
>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein. This family
is found in Trypanosome species, and appears to be one
of two invariant surface glycoproteins, ISG65 and ISG75.
that are found in the mammalian stage of the parasitic
protozoan. the sequence suggests the two families are
polypeptides with N-terminal signal sequences,
hydrophilic extracellular domains, single trans-membrane
alpha-helices and short cytoplasmic domains. they are
both expressed in the bloodstream form but not in the
midgut stage. Both polypeptides are distributed over the
entire surface of the parasite.
Length = 286
Score = 32.9 bits (75), Expect = 0.93
Identities = 27/157 (17%), Positives = 52/157 (33%), Gaps = 27/157 (17%)
Query: 71 DVGSRTYAEVMREQLVRGEEKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGRWDQTSD 130
+ SR+ ++ L ++K++++ +KAKD +A+P +A +K
Sbjct: 84 KLESRSDSK-----LSDSDKKKIKEIC-EKAKDKIDEALPEAKKAGREAEKLA------- 130
Query: 131 GDVTPAKKKVAAAEWEKEADVNIEAQIRDIQN---RKKDVSEGGGEEKQKRVGLGES--G 185
+V A K A W + + + + N K E + ++ G
Sbjct: 131 EEVKEAAAK--ALGWALDEGTSNSTGLSKVLNWHCGGKKPGEN------AKSSPSQNCDG 182
Query: 186 YFDSDIYDGGGKFEGYVKSI-ADDDFDYQASFNQNKR 221
Y GG+ D F + K
Sbjct: 183 IAFKRHYSDGGRNVIDCSDTGVDRVFYTDVTSEAMKS 219
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats bind
alpha-catenin and N-cadherin.
Length = 41
Score = 29.3 bits (67), Expect = 0.98
Identities = 7/28 (25%), Positives = 14/28 (50%)
Query: 1072 LPRLTPILKNRHEKVQENCIDLVGRIAD 1099
LP L +LK+ E+V + + ++
Sbjct: 14 LPALVELLKSEDEEVVKEAAWALSNLSS 41
>gnl|CDD|220165 pfam09290, AcetDehyd-dimer, Prokaryotic acetaldehyde dehydrogenase,
dimerisation. Members of this family are found in
prokaryotic acetaldehyde dehydrogenase (acylating), and
adopt a structure consisting of an
alpha-beta-alpha-beta(3) core. They mediate dimerisation
of the protein.
Length = 137
Score = 31.7 bits (73), Expect = 1.0
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 10/51 (19%)
Query: 543 ARVEGREIISNLA-KAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASA 592
A V+ EI++++A K+AG T R NIDE+ TTARA V A
Sbjct: 17 APVDYAEIVASIASKSAGPGT-----R---ANIDEFTE-TTARAIEKVGGA 58
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 33.3 bits (76), Expect = 1.2
Identities = 38/195 (19%), Positives = 56/195 (28%), Gaps = 20/195 (10%)
Query: 642 GHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGD 701
G A P A P A ++ P P + A + A P
Sbjct: 370 GGAGPATAAAA---PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA-- 424
Query: 702 LIQETPTPGGVSSKRRSRWDE---TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLM 758
+ +P P +++ R++ P P+ A P+AA P+ + P
Sbjct: 425 --RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPAR 482
Query: 759 TPSGVTPTGNKAMAMATP----TPGHLAAMTPEQIQAYRWEREIDERNRPLT---DDELE 811
P P P A+ P Q A + P T DD E
Sbjct: 483 AAPAAAP---APADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFE 539
Query: 812 AMFPPGYKVLQPPAA 826
+ P P AA
Sbjct: 540 TLAPAPAAAPAPRAA 554
Score = 30.2 bits (68), Expect = 8.5
Identities = 29/186 (15%), Positives = 40/186 (21%), Gaps = 11/186 (5%)
Query: 607 VTPAATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIR 666
PAA P P A P+ A P +P E + + S+
Sbjct: 385 PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAA-PARRSPAPEALAAARQASARG 443
Query: 667 RNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSSKRRSR------W 720
P A P P P ++ W
Sbjct: 444 PGGAPAPAPAPAAAPAA----AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499
Query: 721 DETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGH 780
+E P S A A P G + P + A
Sbjct: 500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEP 559
Query: 781 LAAMTP 786
+ A P
Sbjct: 560 VVAPRP 565
>gnl|CDD|239573 cd03493, SQR_QFR_TM, Succinate:quinone oxidoreductase (SQR) and
Quinol:fumarate reductase (QFR) family, transmembrane
subunits; SQR catalyzes the oxidation of succinate to
fumarate coupled to the reduction of quinone to quinol,
while QFR catalyzes the reverse reaction. SQR, also
called succinate dehydrogenase or Complex II, is part of
the citric acid cycle and the aerobic respiratory chain,
while QFR is involved in anaerobic respiration with
fumarate as the terminal electron acceptor. SQRs may
reduce either high or low potential quinones while QFRs
oxidize only low potential quinols. SQR and QFR share a
common subunit arrangement, composed of a flavoprotein
catalytic subunit, an iron-sulfur protein and one or two
hydrophobic transmembrane subunits. The structural
arrangement allows efficient electron transfer between
the catalytic subunit, through iron-sulfur centers, and
the transmembrane subunit(s) containing the electron
donor/acceptor (quinol or quinone). The reversible
reduction of quinone is an essential feature of
respiration, allowing the transfer of electrons between
respiratory complexes. SQRs and QFRs can be classified
into five types (A-E) according to the number of their
hydrophobic subunits and heme groups. This
classification is consistent with the characteristics
and phylogeny of the catalytic and iron-sulfur subunits.
Type E proteins, e.g. non-classical archael SQRs,
contain atypical transmembrane subunits and are not
included in this hierarchy. The heme and quinone binding
sites reside in the transmembrane subunits. Although
succinate oxidation and fumarate reduction are carried
out by separate enzymes in most organisms, some
bifunctional enzymes that exhibit both SQR and QFR
activities exist.
Length = 98
Score = 30.7 bits (70), Expect = 1.2
Identities = 10/46 (21%), Positives = 16/46 (34%)
Query: 831 GCSMFAPTVWHLWIYTRSANANFYFGVTLAFATSQIFLLTDLLFAY 876
G ++ HL + F +AF +S + L LL
Sbjct: 7 GVALLLFLPLHLLGLLALLGGPYAFAEVVAFLSSPLGKLLYLLLLL 52
>gnl|CDD|184128 PRK13541, PRK13541, cytochrome c biogenesis protein CcmA;
Provisional.
Length = 195
Score = 32.2 bits (73), Expect = 1.4
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 11/76 (14%)
Query: 851 ANFYFGVTLAFATSQIFLLTDLLFAYLKRDYTLENGIQKTIKGKPARLVGASE---IINR 907
+ Y +A F L DLL ++ Y+L +G+QK + ARL+ +++
Sbjct: 95 SEIYNSAETLYAAIHYFKLHDLLD---EKCYSLSSGMQKIV--AIARLIACQSDLWLLDE 149
Query: 908 VVDDLKDENEQYRKMV 923
V +L EN R ++
Sbjct: 150 VETNLSKEN---RDLL 162
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.4 bits (74), Expect = 1.6
Identities = 20/73 (27%), Positives = 25/73 (34%), Gaps = 1/73 (1%)
Query: 720 WDETPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPG 779
WD +PQA P A P T + P TP+G +P N A P
Sbjct: 49 WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTG-SPAANPPATFALPAGP 107
Query: 780 HLAAMTPEQIQAY 792
+ E Q Y
Sbjct: 108 AGPTIQTEPGQLY 120
Score = 30.9 bits (70), Expect = 4.9
Identities = 17/79 (21%), Positives = 24/79 (30%), Gaps = 10/79 (12%)
Query: 723 TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVT----------PTGNKAMA 772
TP +P+ + A P G T T P+ P VT P +A+
Sbjct: 88 TPTGSPAANPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQ 147
Query: 773 MATPTPGHLAAMTPEQIQA 791
G A+ Q
Sbjct: 148 QLQQRYGAPASGQLPSQQQ 166
>gnl|CDD|236615 PRK09709, PRK09709, exonuclease VIII; Reviewed.
Length = 877
Score = 32.6 bits (74), Expect = 1.9
Identities = 26/106 (24%), Positives = 38/106 (35%), Gaps = 11/106 (10%)
Query: 672 ETPKTERETPGHSSGWAET--PKTDRAGPGGDLI------QETPTPGGVSSKRRSRWDET 723
E P P H + + +TD A P +++ P P V+ + + +E
Sbjct: 341 EAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEK 400
Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNK 769
PQ PSG AT + P H + PL S V K
Sbjct: 401 PQ--PSGTTADEQATAETVEPDA-TEHHQDTQPLDAQSQVNSVDAK 443
>gnl|CDD|213664 TIGR01909, C_GCAxxG_C_C, C_GCAxxG_C_C family probable redox protein.
This model represents a putative redox-active protein of
about 140 residues, with four perfectly conserved Cys
residues. It includes a CGAXXG motif. Most members are
found within one or two loci of transporter or
oxidoreductase genes. A member from Geobacter
sulfurreducens, located in a molybdenum transporter
operon, has a TAT (twin-arginine translocation) signal
sequence for Sec-independent transport across the plasma
membrane, a hallmark of bound prosthetic groups such as
FeS clusters.
Length = 120
Score = 30.6 bits (69), Expect = 1.9
Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 11/79 (13%)
Query: 1046 GSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQE-----NCIDLVGRIADR 1100
G I GAL V +G+ T + + + +EK +E NC DL +
Sbjct: 38 GHICGALTGAVMALGLYFGTEGDGEQVNNTYALCAELYEKFREKNGSPNCGDLNPKKDLV 97
Query: 1101 GPEYVSAREWMRICFELLE 1119
E R C E++
Sbjct: 98 SKEQ------RRKCIEIVS 110
>gnl|CDD|233516 TIGR01661, ELAV_HUD_SF, ELAV/HuD family splicing factor. This
model describes the ELAV/HuD subfamily of splicing
factors found in metazoa. HuD stands for the human
paraneoplastic encephalomyelitis antigen D of which
there are 4 variants in human. ELAV stnds for the
Drosophila Embryonic lethal abnormal visual protein.
ELAV-like splicing factors are also known in human as
HuB (ELAV-like protein 2), HuC (ELAV-like protein 3,
Paraneoplastic cerebellar degeneration-associated
antigen) and HuR (ELAV-like protein 1). These genes are
most closely related to the sex-lethal subfamily of
splicing factors found in Dipteran insects (TIGR01659).
These proteins contain 3 RNA-recognition motifs (rrm:
pfam00076).
Length = 352
Score = 31.8 bits (72), Expect = 2.3
Identities = 27/116 (23%), Positives = 45/116 (38%), Gaps = 9/116 (7%)
Query: 723 TPQATPSGAMTPSAATPGGMTPS-----TPITPHVGSTPLMTPSGVTPTGNKAMAMATPT 777
T T A PS++ G+ P T V + ++T +G+ P + A A P+
Sbjct: 160 TEPITVKFANNPSSSNSKGLLSQLEAVQNPQTTRVPLSTILTAAGIGPM-HHAAARFRPS 218
Query: 778 PG-HLAAMTPEQIQAYRWEREIDERNRPLTDDELEAMFPPGYKVLQPPAA--CFFI 830
G A + +Q Q ++ +R P D A G ++ A C F+
Sbjct: 219 AGDFTAVLAHQQQQHAVAQQHAAQRASPPATDGQTAGLAAGAQIAASDGAGYCIFV 274
>gnl|CDD|173356 PTZ00061, PTZ00061, DNA-directed RNA polymerase; Provisional.
Length = 205
Score = 31.3 bits (71), Expect = 2.7
Identities = 29/117 (24%), Positives = 48/117 (41%), Gaps = 19/117 (16%)
Query: 209 DFDYQASFNQNKRSGYTAPAALLNDIAQSEKDYDPFADRRQKT-------VAEKEDEYRA 261
F + N+ RS A+ D + + FAD +KT + EK +E
Sbjct: 37 TFKERFEENERLRSRMLMVASHKTD--PTNRIIVYFADETKKTGVKPIRELTEKMEE-HD 93
Query: 262 IRRRMIISPERVDPFAEGHLAAMTPEQIQAYRWE---------REIDERNRPLTDDE 309
I+R ++++ + PFA+ + P I E E+ ++ PLTDDE
Sbjct: 94 IQRAILVTQNVLTPFAKDAILEAAPRHIIENFLETELLVNITKHELVPKHIPLTDDE 150
>gnl|CDD|227484 COG5155, ESP1, Separase, a protease involved in sister chromatid
separation [Cell division and chromosome partitioning /
Posttranslational modification, protein turnover,
chaperones].
Length = 1622
Score = 31.9 bits (72), Expect = 3.0
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 4/60 (6%)
Query: 1171 IAIVAETCS--PFTVLPALMNE--YRVPELNVQNGVLKALSFLFEYIGEMGKDYIYAVTP 1226
I ++ E+ P+ LP L N+ YRVP L L + + G G+ + Y + P
Sbjct: 1389 ILVLDESTHEFPWESLPVLRNKSVYRVPSLEYLENALSSFHYQLPLQGASGRSFSYILNP 1448
>gnl|CDD|223648 COG0575, CdsA, CDP-diglyceride synthetase [Lipid metabolism].
Length = 265
Score = 31.1 bits (71), Expect = 3.0
Identities = 10/57 (17%), Positives = 18/57 (31%), Gaps = 7/57 (12%)
Query: 825 AACFFIGCSMFAPTVWHLWIYTRSANANFYFGVTLAFATSQIFLLTDLLFAYLKRDY 881
A + + + L + N + + L L DL +Y+KR
Sbjct: 179 ALGAVLVAVLVIFLLSSLIL-------NIWTLLILGLLLVLTSQLGDLFESYIKRLL 228
>gnl|CDD|105710 PRK12514, PRK12514, RNA polymerase sigma factor; Provisional.
Length = 179
Score = 30.9 bits (70), Expect = 3.1
Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 8/73 (10%)
Query: 1072 LPRLTPILKNRHEKVQENCIDLVGRIAD--RGPE--YVSAREWMRI--CFELLELLKAHK 1125
+ R I + R K ID +AD GPE ++ E RI C E EL K
Sbjct: 78 IARNHAIDRLRARKAVAVDIDEAHDLADPSPGPEAEVIAGDEGQRIDACLE--ELEKDRA 135
Query: 1126 KAIRRATVNTFGY 1138
A+RRA + Y
Sbjct: 136 AAVRRAYLEGLSY 148
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.4 bits (71), Expect = 3.4
Identities = 20/79 (25%), Positives = 27/79 (34%), Gaps = 1/79 (1%)
Query: 723 TPQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLA 782
TP S A SAA + +P +T +PS P+ A
Sbjct: 98 TPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGSERE 157
Query: 783 AMTPEQIQAYRWEREIDER 801
E+I +ERE ER
Sbjct: 158 TTI-EEIMEMGYEREEVER 175
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 31.3 bits (71), Expect = 3.6
Identities = 17/79 (21%), Positives = 27/79 (34%), Gaps = 8/79 (10%)
Query: 653 RETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGV 712
R + R + + +R + W ++P + GD E GG+
Sbjct: 31 RTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDSPWRTVSRAPGDETPEKADHGGI 90
Query: 713 SSK--------RRSRWDET 723
S K RR R +ET
Sbjct: 91 SGKSFIDPEVLRRQRAEET 109
>gnl|CDD|236227 PRK08300, PRK08300, acetaldehyde dehydrogenase; Validated.
Length = 302
Score = 31.3 bits (72), Expect = 3.6
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 10/52 (19%)
Query: 542 YARVEGREIISNLA-KAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASA 592
A V EI++++A K+AG T R NIDE+ TT+RA V A
Sbjct: 144 VAPVHYAEIVASIASKSAGPGT-----R---ANIDEFTE-TTSRAIEKVGGA 186
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 31.4 bits (71), Expect = 3.7
Identities = 49/232 (21%), Positives = 71/232 (30%), Gaps = 36/232 (15%)
Query: 551 ISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTPA 610
+SNL A +A M R + E + TTA + IPS K
Sbjct: 279 VSNLQVAELMACMAKNRRLSYCKVVEVIAETTAPLTPMEELLAKIPSQRVPPKE----SD 334
Query: 611 ATRWDETPGHPKPGAETPGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRW 670
A + P KP + P E P KA + +
Sbjct: 335 AADGPK-PVPTKPVTPEAPSPPI-------------------EEEPPQPKAVVPRPLSPY 374
Query: 671 ----DETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPGGVSS--KRRSRWDETP 724
D P T P S + K+ A P+PG S+ + E
Sbjct: 375 TAYEDLKPPTS-PIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAK 433
Query: 725 QATPSGAMTPSAATPGGMTPSTPI-TPHVG-STPLMTPSGVTPTGNKAMAMA 774
+ P ++P A P++P T G S + + S V + A A A
Sbjct: 434 KTRP---LSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATA 482
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.5 bits (71), Expect = 3.8
Identities = 61/284 (21%), Positives = 95/284 (33%), Gaps = 52/284 (18%)
Query: 540 DYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLL 599
++Y GR+ + + + + P E VR T + ++ + +
Sbjct: 153 NHYISTSGRQRVKSYTPKP--RGEVKSSSPT--QTREKVRKYTPSPLGRMVNSYAPSTFM 208
Query: 600 PFLKAGGVTPAATRWD-ETPG-----HPKPGAETPGATPSTRLWDATPGHATPGAATPGR 653
++ G+TP T D E P T G T T L T TP T
Sbjct: 209 TMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDN--TPTFLTREV 266
Query: 654 ETPSHDKAQSSIRRNRWDETPKTERETPGHSSGWAETPKTDRAGPGGDLIQETPTPG--- 710
ET +S + +N + E + + W K + P G +++ TP
Sbjct: 267 ETDLLTSPRSVVEKNTLTTPRRVESNSSTNH--WGLVGKNNLTTPQGTVLEHTPATSEGQ 324
Query: 711 -GVSSKRRSRWDETPQATP--------SGAMTP----SAATPGGM------TPSTPITPH 751
+S S ET +T S P ++AT G+ PSTP TP
Sbjct: 325 VTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATFRGLEKNPSTAPSTPATPR 384
Query: 752 VGSTPLMT--------PSGVTPTGNKAMAMATPTPGHLAAMTPE 787
V + P+ PT TP+P A+ PE
Sbjct: 385 VRAVLTTQVHHCVVVKPAPAVPT--------TPSPSLTTALFPE 420
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.8 bits (70), Expect = 4.2
Identities = 17/84 (20%), Positives = 33/84 (39%), Gaps = 1/84 (1%)
Query: 89 EEKEVRKKLQDKAKDGTLKA-VPTNGEAKAAPKKRGRWDQTSDGDVTPAKKKVAAAEWEK 147
+E+E K LKA + +AK +++G + D +K+ + ++
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104
Query: 148 EADVNIEAQIRDIQNRKKDVSEGG 171
E+D+N A + KD
Sbjct: 105 ESDLNNAADLFGGPAPTKDTDLES 128
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 30.6 bits (68), Expect = 4.6
Identities = 22/102 (21%), Positives = 29/102 (28%), Gaps = 2/102 (1%)
Query: 690 TPKTDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTPIT 749
P T A + T T V+ + T T AT G ST +T
Sbjct: 66 APITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVT 125
Query: 750 PHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQA 791
+V + T S T N PT + A
Sbjct: 126 SNVTTRSSSTTSATTRITNA--TTLAPTLSSKGTSNATKTTA 165
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 31.0 bits (69), Expect = 4.7
Identities = 34/166 (20%), Positives = 48/166 (28%), Gaps = 18/166 (10%)
Query: 639 ATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGHSSG------WAETPK 692
A PG P A E P D SS + +E P + SG A P
Sbjct: 3 APPGKTGPAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPA 62
Query: 693 TDRAGPGGDLIQETPTPGGVSSKRRSRWDETPQATPSGAMTPSAATPGGMTPSTP---IT 749
+ G P G + ++ E + TP T +T
Sbjct: 63 KESPQKGA-----PPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAK 117
Query: 750 PHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAMTPEQIQAYRWE 795
P ++ + S VTP G P P +Q + E
Sbjct: 118 PLGKNSQVRPASTVTP-GPSGKGANLPCPQKAG---SAAVQVGKQE 159
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.3 bits (70), Expect = 5.1
Identities = 27/104 (25%), Positives = 47/104 (45%), Gaps = 13/104 (12%)
Query: 83 EQLVRGEEK----EVRKKLQD--KAKDGTLKAVPTNGEAKAAPKK---RGRWDQTSDGDV 133
++ + EEK E +KK ++ KA + KA +A AA KK + + + +
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
Query: 134 TPAKKKVAAAEWEKEADVNIEAQIRDIQNRKKDVSEGGGEEKQK 177
A + AAE + EA + + +K D ++ EEK+K
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEE----AKKKADAAKKKAEEKKK 1392
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 29.8 bits (67), Expect = 5.1
Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 3/88 (3%)
Query: 90 EKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGRWDQTSDGDVTPAKKKVAAAEWEKEA 149
EKE ++ + + K + G A G D + D A +W K+A
Sbjct: 47 EKERAEREEAREKKAKISPNALRG---GATAGHGEEDTDDEEDEEDFATPSAVPQWGKKA 103
Query: 150 DVNIEAQIRDIQNRKKDVSEGGGEEKQK 177
IR + ++ + E +++ +
Sbjct: 104 RKRQRKVIRKLLEAEEQLREDQYDDEDE 131
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.9 bits (70), Expect = 6.3
Identities = 15/75 (20%), Positives = 24/75 (32%), Gaps = 4/75 (5%)
Query: 724 PQATPSGAMTPSAATPGGMTPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAA 783
PS A S + + P P P+G PT + A P A
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAP----QSATQPAGTPPTVSVDPPAAVPVNPPSTA 443
Query: 784 MTPEQIQAYRWEREI 798
+ ++ E++I
Sbjct: 444 PQAVRPAQFKEEKKI 458
>gnl|CDD|217439 pfam03229, Alpha_GJ, Alphavirus glycoprotein J.
Length = 126
Score = 29.3 bits (65), Expect = 6.6
Identities = 21/85 (24%), Positives = 29/85 (34%), Gaps = 17/85 (20%)
Query: 550 IISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLLPFLKAGGVTP 609
++++L A LAT TTA A A+A+ + A +
Sbjct: 15 LVTDLTPKAALAT-----------------GTTATVGAANATAVATATGANATSAAAGST 57
Query: 610 AATRWDETPGHPKPGAETPGATPST 634
A P P ETPG P T
Sbjct: 58 IAATAGAPPTDSTPAGETPGPFPPT 82
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 30.5 bits (69), Expect = 7.1
Identities = 20/61 (32%), Positives = 27/61 (44%), Gaps = 7/61 (11%)
Query: 77 YAEVMREQLVRGEEKEVRKKLQDKAKDGTLKAVPTNGEAKAAPKKRGR-WDQT--SDGDV 133
YA+ R + E V+ K + AK K + E AA K G+ W +T SD D
Sbjct: 336 YAKTARRMALLAEAARVKAKKEKLAK----KRKQVSKEEAAAIKAAGKAWYKTMISDSDY 391
Query: 134 T 134
T
Sbjct: 392 T 392
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple HEAT
repeats.
Length = 88
Score = 28.1 bits (63), Expect = 7.3
Identities = 25/90 (27%), Positives = 37/90 (41%), Gaps = 18/90 (20%)
Query: 1068 IKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGPEYVSAREWMRICFELLELLKAHKKA 1127
++ LL L L + +V+ +G + D PE + A LLELLK
Sbjct: 1 LEALLEAL---LSDPDPEVRAAAARALGELGD--PEALPA---------LLELLKDPDPE 46
Query: 1128 IRRATVNTFGYIAKAIGPHDVLATLLNNLK 1157
+RRA G +G + L LL L+
Sbjct: 47 VRRAAAEALG----KLGDPEALPALLELLQ 72
>gnl|CDD|235677 PRK06029, PRK06029, 3-octaprenyl-4-hydroxybenzoate carboxy-lyase;
Provisional.
Length = 185
Score = 29.5 bits (67), Expect = 7.7
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 15/70 (21%)
Query: 447 KERKIMKLLLKIKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLM-----SPTLEDQ 501
KER+ +L+L ++ P+ LR +T K E GA I+P + TLED
Sbjct: 113 KERR--RLVLCVRE--TPLHLGHLRNMT-KLAEMGA-----IIMPPVPAFYHRPQTLEDM 162
Query: 502 ERHLLVKVID 511
+ +V+D
Sbjct: 163 VDQTVGRVLD 172
>gnl|CDD|177460 PHA02678, PHA02678, hypothetical protein; Provisional.
Length = 89
Score = 28.4 bits (63), Expect = 7.7
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 4/53 (7%)
Query: 734 PSAATPGGM--TPSTPITPHVGSTPLMTPSGVTPTGNKAMAMATPTPGHLAAM 784
P P G+ P+ P GSTP M P +A+ + PT G LAA+
Sbjct: 13 PGTRFPEGLFDRQGRPLGP--GSTPTMEYEDSVPRDARALLLVNPTAGQLAAV 63
>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
resistance protein A). [Transport and binding proteins,
Other].
Length = 334
Score = 30.1 bits (68), Expect = 7.8
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 173 EEKQKRVGLGESGYFDSDIYDGGGKFEGYVKSIA 206
E + K V +G+ SD+Y FEG V I+
Sbjct: 249 ETQLKNVRIGQPVTIRSDLYGSDVVFEGKVTGIS 282
>gnl|CDD|202816 pfam03936, Terpene_synth_C, Terpene synthase family, metal binding
domain. It has been suggested that this gene family be
designated tps (for terpene synthase). It has been split
into six subgroups on the basis of phylogeny, called
tpsa-tpsf. tpsa includes vetispiridiene synthase, 5-epi-
aristolochene synthase, and (+)-delta-cadinene synthase.
tpsb includes (-)-limonene synthase. tpsc includes
kaurene synthase A. tpsd includes taxadiene synthase,
pinene synthase, and myrcene synthase. tpse includes
kaurene synthase B. tpsf includes linalool synthase.
Length = 270
Score = 30.0 bits (68), Expect = 8.3
Identities = 23/103 (22%), Positives = 37/103 (35%), Gaps = 19/103 (18%)
Query: 850 NANFYFGVTLAFATSQIFL---LTDLLFAYLKRDYTLENGIQKTIKGKPARLVGASEIIN 906
N G L S + + + F +LK +LV S +I
Sbjct: 174 NGRVSIGAPLLLLLSFLGMGDSAGEEPFEWLK---------------SRPKLVKLSSLIL 218
Query: 907 RVVDDLKD-ENEQYRKMVMESIEKTMSNLGAADIDSRLEEQLI 948
R+++D+ E E R V SIE M G + ++ Q +
Sbjct: 219 RLLNDIASYEKEMARGEVANSIECYMKEYGVTEEEALEHVQKM 261
>gnl|CDD|236386 PRK09129, PRK09129, NADH dehydrogenase subunit G; Validated.
Length = 776
Score = 30.3 bits (69), Expect = 8.5
Identities = 27/83 (32%), Positives = 33/83 (39%), Gaps = 31/83 (37%)
Query: 1089 NCIDL--VGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPH 1146
N IDL VG + + Y SAR W EL RR K++ PH
Sbjct: 191 NMIDLCPVGALTSKPFRY-SARTW--------EL-------SRR----------KSVSPH 224
Query: 1147 DVLATLLNNLKVQERQNRVCTTV 1169
D L + NL V + NRV V
Sbjct: 225 DSLGS---NLVVHVKNNRVMRVV 244
>gnl|CDD|226587 COG4102, COG4102, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 418
Score = 30.2 bits (68), Expect = 9.7
Identities = 25/123 (20%), Positives = 35/123 (28%), Gaps = 17/123 (13%)
Query: 628 PGATPSTRLWDATPGHATPGAATPGRETPSHDKAQSSIRRNRWDETPKTERETPGH-SSG 686
P RL+ A ATP R SH Q + +T PGH +G
Sbjct: 94 PSLPNFQRLFQAGEAAVVHAVATPYRLR-SHFDGQ---------DVLETGSAGPGHLETG 143
Query: 687 W-----AETPKTDRAGPGGDLIQETPTPGGVSSKRRS-RWDETPQATPSGAMTPSAATPG 740
W P G L P + K + W +P + +
Sbjct: 144 WLNRLLEALPAGAAFTTAGGLSVGPTAPLVLRGKAPALSWAPSPLFLSAARLAMLLNDLY 203
Query: 741 GMT 743
G+
Sbjct: 204 GVL 206
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.135 0.402
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 72,877,320
Number of extensions: 7599583
Number of successful extensions: 8484
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8311
Number of HSP's successfully gapped: 149
Length of query: 1359
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1250
Effective length of database: 6,103,016
Effective search space: 7628770000
Effective search space used: 7628770000
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)