RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14745
(854 letters)
>gnl|CDD|111745 pfam02891, zf-MIZ, MIZ/SP-RING zinc finger. This domain has SUMO
(small ubiquitin-like modifier) ligase activity and is
involved in DNA repair and chromosome organisation.
Length = 50
Score = 98.5 bits (246), Expect = 3e-25
Identities = 30/50 (60%), Positives = 39/50 (78%)
Query: 177 VSLKCPITFKKITLPARGHDCKHIQCFDLESYLQLNCERGSWRCPVCNKP 226
VSLKCPI++ +I++P RG CKHIQCFDL S+L+ N +W CPVC+KP
Sbjct: 1 VSLKCPISYLRISIPVRGRFCKHIQCFDLLSFLESNERTPTWNCPVCDKP 50
>gnl|CDD|192832 pfam11789, zf-Nse, Zinc-finger of the MIZ type in Nse subunit.
Nse1 and Nse2 are novel non-SMC subunits of the fission
yeast Smc5-6 DNA repair complex. This family is the
zinc-finger domain similar to the MIZ type of
zinc-finger.
Length = 57
Score = 39.2 bits (92), Expect = 3e-04
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 176 KVSLKCPITFKKITLPARGHDCKHIQCFDLESYLQLNCERGSWRCPV--CN 224
+SL CP+T + P C H+ F+ ++ L + + +CPV C+
Sbjct: 9 TISLTCPLTLQPFEEPVTSKKCNHV--FEKDAILSMLRRNKTVKCPVAGCS 57
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 43.2 bits (102), Expect = 7e-04
Identities = 55/362 (15%), Positives = 89/362 (24%), Gaps = 53/362 (14%)
Query: 484 PLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPA 543
P PG T S + ST +P + PT P
Sbjct: 71 PPPGPGTEAPANESRS---TPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP 127
Query: 544 INPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLN 603
+P P M S G + +G + D +
Sbjct: 128 PSP---------------APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
Query: 604 AMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPS 663
A+ S E+ P +P P TP P P SS + + + +
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232
Query: 664 DLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPT 723
+ ++ + G G N P P+ L + + PSS ++ P
Sbjct: 233 GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPR 292
Query: 724 TNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIP 783
+P +PG PS P +SS+ ++SSS+ S+ +
Sbjct: 293 ----------------ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESS 336
Query: 784 SDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGP 843
P PP +
Sbjct: 337 RGAAVSPGPSPSRSPSPSR-------------------PPPPADPSSPRKRPRPSRAPSS 377
Query: 844 TT 845
Sbjct: 378 PA 379
Score = 37.5 bits (87), Expect = 0.037
Identities = 41/261 (15%), Positives = 72/261 (27%), Gaps = 12/261 (4%)
Query: 588 GNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQD 647
G+ G D++ + + + P T P N S P S +
Sbjct: 38 GSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAP 97
Query: 648 TQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSY-LDPPDLA 706
+ + + S P P + M PP A
Sbjct: 98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPS-------PAPDLSEMLRPVGSPGPPPAA 150
Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMP----HTPHTPHTPHTPHTPGGNGPPSVPPSS 762
+PP++G+S ++ P P +P P TP P P S
Sbjct: 151 SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRS 210
Query: 763 STQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLD 822
S + + + + + ++ + G G N P P+ L + +
Sbjct: 211 SPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW 270
Query: 823 PPDLATPPSSGSSNGNNPGGP 843
PSS ++ P
Sbjct: 271 EASGWNGPSSRPGPASSSSSP 291
Score = 34.0 bits (78), Expect = 0.46
Identities = 42/246 (17%), Positives = 67/246 (27%), Gaps = 28/246 (11%)
Query: 482 SSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIK 541
+ + + A S P + + SSP PP S P S
Sbjct: 159 PAAVASDAASSRQAALPLSSPEETAR-APSSPPAEPPPSTPPAAASPRPPRRSSPISA-- 215
Query: 542 PAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNS-GDFPG--DNS 598
A +P P A G +++G S S + SSG G N+ P + P +
Sbjct: 216 SASSPAP---APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEA 272
Query: 599 LESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVST 658
+ +P +P +PG PS P +SS+ ++SSS+ S+
Sbjct: 273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
Query: 659 DDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNN 718
+ P PP +
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSR-------------------PPPPADPSSPRKRPRPSR 373
Query: 719 PGGPTT 724
Sbjct: 374 APSSPA 379
Score = 29.8 bits (67), Expect = 8.8
Identities = 36/198 (18%), Positives = 55/198 (27%), Gaps = 26/198 (13%)
Query: 481 HSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRD--VKPTFSDIKPTFS 538
+ P PP TP ++ S P +S S S P GR S + S
Sbjct: 184 RAPSSPPAEPPPSTPPAAASPRPPR-RSSPISASASSPAPAPGRSAADDAGASSSDSSSS 242
Query: 539 DIK--------------PAINPHPQSIASGSMMNN---------NSGPPSYSMSMNRSSG 575
+ PA P I S N +S P S
Sbjct: 243 ESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
Query: 576 GNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPP 635
G P +S ++S S ++ +S +P +P P P
Sbjct: 303 PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
Query: 636 SVPPSSSTQQQDTQTSSS 653
S P + + ++
Sbjct: 363 SSPRKRPRPSRAPSSPAA 380
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 42.2 bits (99), Expect = 0.001
Identities = 72/433 (16%), Positives = 114/433 (26%), Gaps = 46/433 (10%)
Query: 468 QQNEMRNNLTYQQHSSPLPGNPT-PPLTPASSTSVPPFVSPNSTSSPFISP-PHSVSGRD 525
R + Q P + P PA + +PP P SP +
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
Query: 526 VKPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFP 585
+P ++ ++ G +S P R + G+ D P
Sbjct: 2643 PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
Query: 586 SGGNSGDFPGDNSLESL-----NAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPS 640
+ + + + A + S +P P P P P TPGG P+ PP+
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT 2762
Query: 641 SSTQQQDT---------------QTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNL 685
++ +S + S + +PS +
Sbjct: 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822
Query: 686 LPDNVVDPMELLSYLDPPDLATPPSSGSSNGNN--PGG--------------PTTNDDI- 728
P + P PP PP G + PGG P
Sbjct: 2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882
Query: 729 ---LALEQMPHTPHT----PHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDD 781
LA + + + P P P P PP P Q
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
Query: 782 IPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPG 841
+ + A G L+P V P + P A S+ G++
Sbjct: 2943 LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
Query: 842 GPTTNDDILALFE 854
++ LAL E
Sbjct: 3003 RVSSWASSLALHE 3015
Score = 39.9 bits (93), Expect = 0.008
Identities = 55/321 (17%), Positives = 89/321 (27%), Gaps = 34/321 (10%)
Query: 486 PGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPAIN 545
P P PP PA+ P ++ + +S S S D + + + + PA +
Sbjct: 2766 PPAPAPPAAPAAGP--PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823
Query: 546 PHPQSIASGSMMNNNSGPPSYSMSMNRSSGGN---GGGFNDFPSGGNSGDFPGDNSLESL 602
P S PP + GG+ GG P + P + +
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883
Query: 603 NAMEK-SLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDI 661
+ + ++S P P P P PP P Q +
Sbjct: 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
Query: 662 PSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGG 721
+ A G L+P V P + P A S+ G++
Sbjct: 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
Query: 722 PTTNDDILALEQMPHTP-----HTPHTPHT-----------------------PHTPGGN 753
++ LAL + P T P P P +
Sbjct: 3004 VSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPH 3063
Query: 754 GPPSVPPSSSTQQQDTQTSSS 774
P + P +T + + S S
Sbjct: 3064 DPFAHEPDPATPEAGARESPS 3084
Score = 31.1 bits (70), Expect = 4.0
Identities = 20/82 (24%), Positives = 27/82 (32%), Gaps = 10/82 (12%)
Query: 685 LLPDNVVDPMELLSYLDPP---DLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTP 741
L D+ DP L PP D + PP + P P + + + P P
Sbjct: 2543 LASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPR---PSEPA----VTSRARRPDAPPQS 2595
Query: 742 HTPHTPHTPGGNGPPSVPPSSS 763
P P G+ PPS
Sbjct: 2596 ARPRAPVDDRGDPRGPAPPSPL 2617
Score = 30.3 bits (68), Expect = 5.9
Identities = 20/86 (23%), Positives = 27/86 (31%), Gaps = 3/86 (3%)
Query: 337 PNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFG--PNSAPSSRGSSVRQSTPSFPSSSGP 394
P G + PG P+ + P P AP P +S+ +S S PS P
Sbjct: 2745 PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
Query: 395 NSNANGGSNHQYFGAGNQFP-GGVPP 419
P G +PP
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPP 2830
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 38.6 bits (90), Expect = 0.015
Identities = 27/151 (17%), Positives = 48/151 (31%), Gaps = 13/151 (8%)
Query: 626 PHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNL 685
P TP PP+ ++ +QTS+S ++ D + P E
Sbjct: 645 PKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVP------EAALASGSAP 698
Query: 686 LPDNVVDPMELLSYLDPPDLA----TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTP 741
P V DP + + + P++A P ++ N + +N ++ A+EQ
Sbjct: 699 APPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQV 758
Query: 742 HTPHTPHTPGGNGPPSVPPSSSTQQQDTQTS 772
SS Q +
Sbjct: 759 QAEAQSPAS---TTALTQTSSEVQDTELNLV 786
Score = 31.6 bits (72), Expect = 2.3
Identities = 45/284 (15%), Positives = 71/284 (25%), Gaps = 50/284 (17%)
Query: 595 GDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPH-TPGGNGPPSVPPSSSTQQQDTQTSSS 653
G + SL+A ++ EQ+ T + T + S ++ D ++
Sbjct: 471 GFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAED 530
Query: 654 NNVSTDDIPSDLNFDPA-----AVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATP 708
S D D A + L D A
Sbjct: 531 TLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSAN--------VQSAQS 582
Query: 709 PSSGSSNGNNPGGPTT---------NDDIL---------------ALEQMPHTPHTPHTP 744
+ + + + +DDIL AL
Sbjct: 583 AAEAQPSSQSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLDALSPKEGDGKKSSAD 642
Query: 745 HTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENL 804
P TP PP+ ++ +QTS+S ++ D + P E
Sbjct: 643 RKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVP------EAALASGS 696
Query: 805 NLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDD 848
P V DP D P P S+N N
Sbjct: 697 APAPPPVPDPY------DRPPWEEAPEVASANDGPNNAAEGNLS 734
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 38.1 bits (89), Expect = 0.015
Identities = 16/104 (15%), Positives = 31/104 (29%), Gaps = 1/104 (0%)
Query: 676 EGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMP 735
+ E+ ++ N + + P S S + P + + +P
Sbjct: 154 LDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLP 213
Query: 736 HTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVST 779
P + + P +P PS PP + +ST
Sbjct: 214 PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPP-PPVQQVPPLST 256
Score = 35.0 bits (81), Expect = 0.12
Identities = 17/93 (18%), Positives = 23/93 (24%), Gaps = 3/93 (3%)
Query: 464 FTGFQQNEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSG 523
G S LP P+ + S +P+ P PP V
Sbjct: 191 SPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQ 250
Query: 524 RDVKPTFSDIKPTFSDIKPAINPH---PQSIAS 553
T P+ S I +IA
Sbjct: 251 VPPLSTAKPTPPSASATPAPIGGITLDDDAIAK 283
Score = 32.3 bits (74), Expect = 0.86
Identities = 15/88 (17%), Positives = 24/88 (27%), Gaps = 1/88 (1%)
Query: 571 NRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPG 630
N G + + PS S + +P P + + P +P
Sbjct: 170 NSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPE 229
Query: 631 GNGPPSVPPSSSTQQQDTQTSSSNNVST 658
PS PP + +ST
Sbjct: 230 SPTNPSPPPGPAAPPP-PPVQQVPPLST 256
Score = 31.2 bits (71), Expect = 2.4
Identities = 18/85 (21%), Positives = 23/85 (27%)
Query: 293 SPGSMTLPTMSNWEMGQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHH 352
S P S P P S +P ES S P G + P P Q
Sbjct: 193 GVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVP 252
Query: 353 QYPSQQPGSGPNGAPFGPNSAPSSR 377
+ +P A P +
Sbjct: 253 PLSTAKPTPPSASATPAPIGGITLD 277
Score = 29.7 bits (67), Expect = 7.1
Identities = 32/134 (23%), Positives = 43/134 (32%), Gaps = 6/134 (4%)
Query: 718 NPGGPTT-NDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSS-STQQQDTQTSSSN 775
NPG P D+ + P P + + PPS P S SS +
Sbjct: 149 NPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPS 208
Query: 776 NVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATP-PSSGS 834
+ S PS D N + P P + + P A P P S S
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPES---PTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSAS 265
Query: 835 SNGNNPGGPTTNDD 848
+ GG T +DD
Sbjct: 266 ATPAPIGGITLDDD 279
>gnl|CDD|206094 pfam13923, zf-C3HC4_2, Zinc finger, C3HC4 type (RING finger).
Length = 45
Score = 33.6 bits (77), Expect = 0.019
Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 6/49 (12%)
Query: 178 SLKCPITFKKITLPARGHDCKHIQCFD-LESYLQLNCERGSWRCPVCNK 225
L+CPI + P C H+ C + + YL + +CP+C
Sbjct: 2 ELECPICLDLLRDPVVLTPCGHVFCRECILRYL-----KKKSKCPICRT 45
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 35.8 bits (83), Expect = 0.073
Identities = 16/54 (29%), Positives = 18/54 (33%), Gaps = 3/54 (5%)
Query: 353 QYPSQQPGSGPNG---APFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSN 403
P + P G F S SSR SS S SS NS S+
Sbjct: 134 AIPLRFSPGLPKGFYIEGFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSS 187
>gnl|CDD|238093 cd00162, RING, RING-finger (Really Interesting New Gene) domain, a
specialized type of Zn-finger of 40 to 60 residues that
binds two atoms of zinc; defined by the 'cross-brace'
motif C-X2-C-X(9-39)-C-X(1-3)-
H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved
in mediating protein-protein interactions; identified in
a proteins with a wide range of functions such as viral
replication, signal transduction, and development; has
two variants, the C3HC4-type and a C3H2C3-type (RING-H2
finger), which have different cysteine/histidine
pattern; a subset of RINGs are associated with B-Boxes
(C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Length = 45
Score = 32.0 bits (73), Expect = 0.081
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 5/48 (10%)
Query: 180 KCPITFKKITLPARGHDCKHIQCFD-LESYLQLNCERGSWRCPVCNKP 226
+CPI ++ P C H+ C ++ +L + G CP+C P
Sbjct: 1 ECPICLEEFREPVVLLPCGHVFCRSCIDKWL----KSGKNTCPLCRTP 44
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 36.2 bits (83), Expect = 0.083
Identities = 71/378 (18%), Positives = 115/378 (30%), Gaps = 30/378 (7%)
Query: 468 QQNEMRNNLTYQQHSSPLPGNPTPPLTP-ASSTSVPPFVSPNSTSSPFISPPHSVSGRDV 526
Q NE ++ + QQ G P+ + P A+ P +P++ + P P +
Sbjct: 158 QDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQ---- 213
Query: 527 KPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPS 586
P +P+ + A + HPQ + S PP + ++ S + P
Sbjct: 214 -PAPQPQQPSPLSLISAPSLHPQRLPS-------PHPPLQPQTASQQSPQPPAPSSRHPQ 265
Query: 587 GGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQ 646
+ G P M +L + H P P + P +P S Q
Sbjct: 266 SSHHGPGP---------PMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPH 316
Query: 647 DTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLA 706
S + + P + PA + ++ P + + S+ PP L
Sbjct: 317 SHTPPSQSALQPQQPPREQPLPPAPSM-------PHIKPPPTTPIPQLPNQSHKHPPHLQ 369
Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPS-SSTQ 765
P N P P P + H P P + P+ PP + +Q
Sbjct: 370 GPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQ 429
Query: 766 QQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPD 825
+ S+ + P F G LP + S P
Sbjct: 430 SLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPP 489
Query: 826 LATPPSSGSSNGNNPGGP 843
+ PSSG G P P
Sbjct: 490 GSALPSSGGCAGPGPPLP 507
>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 780
Score = 35.5 bits (81), Expect = 0.14
Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 7/82 (8%)
Query: 707 TPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPH-----TPHTPHTPGGNGPPSVPPS 761
TPP+ NG NP P T + + P TP T P TP T GN S +
Sbjct: 669 TPPTGEGDNGENPTTPPTGEG--NNGENPTTPPTGEGNNGGNPTTPSTDEGNNAGSGQTT 726
Query: 762 SSTQQQDTQTSSSNNVSTDDIP 783
+ Q T+ S N D+P
Sbjct: 727 TDNQNSKETTTVSENKEERDLP 748
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 33.6 bits (77), Expect = 0.45
Identities = 31/175 (17%), Positives = 51/175 (29%), Gaps = 8/175 (4%)
Query: 497 SSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPAINPHPQSIASGSM 556
+ F + +S++S + + K + + + ++
Sbjct: 71 QDNNDKKFSTIDSSTSDSNNIIDFI----YKNLPQTNINQLLTKNKYDDNYSLTTLIQNL 126
Query: 557 MNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHT 616
N NS Y N N N S N D ++ N S + P T
Sbjct: 127 FNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPST 186
Query: 617 PHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAA 671
+ P P + P S T Q + SS +N S D D D +
Sbjct: 187 SNKQPNSPKPTQPNQSNSQ--PASDDTANQKS--SSKDNQSMSDSALDSILDQYS 237
Score = 31.3 bits (71), Expect = 2.9
Identities = 29/162 (17%), Positives = 53/162 (32%), Gaps = 4/162 (2%)
Query: 634 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDP 693
S SS++ + N+ +I L + +NL L ++ D
Sbjct: 77 KFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD- 135
Query: 694 MELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHT-PHTPHTPHTPGG 752
E + + +S SS N+ ++ D ++ P + +T P T +
Sbjct: 136 YEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPK 195
Query: 753 NGPPSVPPSSSTQQQDTQ--TSSSNNVSTDDIPSDLNFDPAA 792
P+ S +SS +N S D D D +
Sbjct: 196 PTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYS 237
>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
Length = 609
Score = 33.4 bits (76), Expect = 0.52
Identities = 33/179 (18%), Positives = 62/179 (34%), Gaps = 35/179 (19%)
Query: 592 DFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTS 651
+E LN E S + + T P++P G + P S+ + + ++++S
Sbjct: 449 GLSLKQVIERLNVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSS 508
Query: 652 SSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSS 711
S + D S + + N++ + PM L D + P S
Sbjct: 509 SVSFSQGDSNRSHYS------------DETNIS----DYSYPMADLDLEDEEPMEDHPHS 552
Query: 712 GSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQ 770
S +N T+ + ++ PP++ PSSS +QQ+
Sbjct: 553 PQSISSNNSMSRTSR------ALQNSQR-------------RRPPTMFPSSSERQQNNA 592
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 33.2 bits (76), Expect = 0.57
Identities = 16/81 (19%), Positives = 21/81 (25%), Gaps = 7/81 (8%)
Query: 340 YSMPGRPSFQGHHQYPSQQP-----GSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGP 394
Y P + G G G G + P + +GP
Sbjct: 174 YKFFLSCGGVRGGPRPERAGYGGGGGGGGGGGGGGGSGPGPPPPGFKSSFPPPYGPGAGP 233
Query: 395 NSNANGGSNHQYFGAGNQFPG 415
+S GS G G PG
Sbjct: 234 SS--GYGSGGTRSGQGGWGPG 252
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 33.0 bits (75), Expect = 0.78
Identities = 35/182 (19%), Positives = 57/182 (31%), Gaps = 26/182 (14%)
Query: 470 NEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPT 529
+EM++ LT + + L G P SVP SPN+ P GR + +
Sbjct: 59 DEMKDFLTDRSNQPHLVGIPKN--------SVPQ--SPNNKGKPSFFGDQR--GRSIPSS 106
Query: 530 FSDIKPTFSDIKPAINP-HPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGG 588
+ P+ N Q + +G P ++S ++ G G +
Sbjct: 107 HQSTHTSAPGPAPSGNISQSQKRSQPRSQWTKAGHPPSTVSSSQRHGQQGSSHSHSGDHH 166
Query: 589 NSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGG----NGPPSVPPSSSTQ 644
+ + + E +P P N PPS+ SS
Sbjct: 167 SKEICRSKSPRDL---------EVNVQSPERPPDSLVLLPSSQHSVQNFPPSLMSKSSAM 217
Query: 645 QQ 646
QQ
Sbjct: 218 QQ 219
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 32.7 bits (75), Expect = 0.81
Identities = 28/165 (16%), Positives = 46/165 (27%), Gaps = 22/165 (13%)
Query: 610 SEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDP 669
S + H S PP S T Q + N ++P +++
Sbjct: 94 SPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPGNIS--- 150
Query: 670 AAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDIL 729
D Q Q +N N L T P++ + + T
Sbjct: 151 ----DALSQQQGQVNAASQNAQGN--------TSTLPTAPATVAPSKGAKVPATAE---- 194
Query: 730 ALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSS 774
P P P T +VPP++S + + S+
Sbjct: 195 ---THPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236
Score = 29.7 bits (67), Expect = 7.2
Identities = 15/78 (19%), Positives = 23/78 (29%), Gaps = 11/78 (14%)
Query: 710 SSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDT 769
SS S+GN N+ H S PP S T Q
Sbjct: 84 SSSLSSGNQSSPSAANNT-----------SDGHDASGVKNTAPPQDISAPPISPTPTQAA 132
Query: 770 QTSSSNNVSTDDIPSDLN 787
+ N ++P +++
Sbjct: 133 PPQTPNGQQRIELPGNIS 150
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 32.5 bits (73), Expect = 0.97
Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 3/84 (3%)
Query: 483 SPLPGNPTPPLTP-ASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDI--KPTFSD 539
+P +P P L+P S P +S + +PF +PP + + K T + K T
Sbjct: 408 NPFASSPVPSLSPKVPKISAPALISDITKKAPFKNPPQPLKVFNKKTTTKTVLKKITPVK 467
Query: 540 IKPAINPHPQSIASGSMMNNNSGP 563
I P + P + + + N P
Sbjct: 468 IAPKLAELPATKPQETAIGENKAP 491
>gnl|CDD|222279 pfam13639, zf-RING_2, Ring finger domain.
Length = 46
Score = 28.9 bits (65), Expect = 1.2
Identities = 13/49 (26%), Positives = 19/49 (38%), Gaps = 8/49 (16%)
Query: 181 CPITFKKITLPARGH--DCKHIQCFD-LESYLQLNCERGSWRCPVCNKP 226
CPI + C H+ + L+ +L R S CP+C P
Sbjct: 3 CPICLDEFEPGEEVVVLPCGHVFHKECLDKWL-----RSSNTCPLCRAP 46
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 32.4 bits (74), Expect = 1.2
Identities = 24/99 (24%), Positives = 32/99 (32%), Gaps = 17/99 (17%)
Query: 318 DMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFGPNSAPSSR 377
MN+ P + SS S S F P S +P S +
Sbjct: 128 PMNTSEPLVPGHSS--FSDSPSRSASPSRKFS---------PSSTIQQSPQLTPSNKPAS 176
Query: 378 GSSVRQSTPSFPSS-----SGPNSNANGGSNHQYFGAGN 411
SS QS PS+ SS S N + S +G+
Sbjct: 177 PSSSYQS-PSYSSSLGPVNSSGNRSNLRSSPWALRSSGD 214
Score = 31.3 bits (71), Expect = 2.2
Identities = 25/135 (18%), Positives = 38/135 (28%), Gaps = 8/135 (5%)
Query: 462 LCFTGFQQNEMRNNLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSV 521
L + Q + Q + +PP + S+ SS SP S
Sbjct: 91 LVLSERQLGLLGVKAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSA 150
Query: 522 SGRDVKPTFSDIKPTFSDIKPAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGF 581
S K + S + P+ P AS S + S +N S +
Sbjct: 151 SPSR-KFSPSSTIQQSPQLTPSNKP-----ASPSSSYQSPSYSSSLGPVNSSGNRSNLRS 204
Query: 582 NDFPSGGNSGDFPGD 596
+ P S D
Sbjct: 205 S--PWALRSSGDKKD 217
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 31.2 bits (70), Expect = 2.0
Identities = 21/68 (30%), Positives = 26/68 (38%), Gaps = 5/68 (7%)
Query: 702 PPDLATP-PSSGSSNGNNPGGPTTNDDILALEQ----MPHTPHTPHTPHTPHTPGGNGPP 756
P D+ATP PS + DD EQ H P + T T N P
Sbjct: 116 PNDVATPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPT 175
Query: 757 SVPPSSST 764
+ PP SS+
Sbjct: 176 ATPPQSSS 183
>gnl|CDD|166894 PRK00294, hscB, co-chaperone HscB; Provisional.
Length = 173
Score = 30.6 bits (69), Expect = 2.1
Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 7/42 (16%)
Query: 18 PPFRLEHNLAVSNHVFQLKTTVH------QTLMWRSDLE-LQ 52
PP R + LA+S H L+ TVH Q + R +LE LQ
Sbjct: 67 PPRRARYLLALSGHEVPLEVTVHDPEFLLQQMQLREELEELQ 108
>gnl|CDD|235088 PRK02943, PRK02943, SecA regulator SecM; Provisional.
Length = 167
Score = 30.3 bits (69), Expect = 2.2
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 599 LESLNAM---EKSLSEQMPHTPHTPHTPHTPHTPG 630
L++LNA+ E + E + T + TP + G
Sbjct: 116 LDTLNALLTQEGTPPEIIRRTSYAHFTPQASFSTG 150
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 31.2 bits (70), Expect = 2.5
Identities = 30/159 (18%), Positives = 46/159 (28%), Gaps = 4/159 (2%)
Query: 637 VPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMEL 696
P ++ T + TS VS PS + G N L V
Sbjct: 167 EPKTAVTTSK--TTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGT 224
Query: 697 LSYLDPPDLATPPSS--GSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNG 754
+ +P PP S G S TT+ D TP
Sbjct: 225 TTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPH 284
Query: 755 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAV 793
+ PP ++ +Q+ + + +T S + P V
Sbjct: 285 STATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGV 323
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 31.3 bits (71), Expect = 2.5
Identities = 20/109 (18%), Positives = 31/109 (28%), Gaps = 19/109 (17%)
Query: 696 LLSYLDP-PDLATPPSSGSSNGNNPGGPTTNDDILA----LEQ--------------MPH 736
+L +LD PD P + ++DIL LEQ P
Sbjct: 1 VLEFLDSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPG 60
Query: 737 TPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSD 785
TP P P P + T++S + + +
Sbjct: 61 TPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEE 109
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 30.5 bits (69), Expect = 2.6
Identities = 25/135 (18%), Positives = 43/135 (31%), Gaps = 8/135 (5%)
Query: 635 PSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPM 694
P++ ++ + D + S DD + + G G G P + D
Sbjct: 17 PALAAQAAAEYPDERYLDEEG-SGDDDEFI-DDEMDDEYSGSGSGA-----TPSDDEDSE 69
Query: 695 ELLSYLDPPDLATPPSSGSSNGNNPGGPT-TNDDILALEQMPHTPHTPHTPHTPHTPGGN 753
+ + PP L T SS S++ T T+ + +P T T
Sbjct: 70 PVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTE 129
Query: 754 GPPSVPPSSSTQQQD 768
P S++T
Sbjct: 130 TPTEGGSSAATDPSK 144
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 31.1 bits (70), Expect = 2.7
Identities = 12/69 (17%), Positives = 24/69 (34%), Gaps = 1/69 (1%)
Query: 710 SSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDT 769
+SG+S ++ + + T HT +P G + + + ST Q
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPT-HTTSNLTSPFGTGTDNSTNANGTESTTQASQ 83
Query: 770 QTSSSNNVS 778
S ++
Sbjct: 84 PHSHETTIT 92
>gnl|CDD|219420 pfam07466, DUF1517, Protein of unknown function (DUF1517). This
family consists of several hypothetical glycine rich
plant and bacterial proteins of around 300 residues in
length. The function of this family is unknown.
Length = 280
Score = 30.8 bits (70), Expect = 2.7
Identities = 18/43 (41%), Positives = 20/43 (46%), Gaps = 1/43 (2%)
Query: 552 ASGSMMNNNS-GPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDF 593
ASG + S PS S S RSS GGG+ P GG F
Sbjct: 3 ASGGRIGGGSFRAPSRSSSSPRSSSPGGGGYYGSPGGGFGFPF 45
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 30.1 bits (68), Expect = 3.0
Identities = 9/54 (16%), Positives = 15/54 (27%)
Query: 483 SPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPT 536
P +T + + P V+ P + R+VK T
Sbjct: 68 GNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPVSSVT 121
>gnl|CDD|221634 pfam12548, DUF3740, Sulfatase protein. This domain family is found
in eukaryotes, and is typically between 144 and 173
amino acids in length. The family is found in
association with pfam00884.
Length = 144
Score = 29.7 bits (67), Expect = 3.2
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 134 LQGLLRKRLLTADHCIAKIKRNFNNSNSSSMSSSDS-IEQ--TALKVSLKCPIT 184
L LL+ R L H + KR+ + M S +S I +++V+ +C I
Sbjct: 32 LLDLLQSRNLLKRHEEHRSKRDTEEYSGEGMLSDNSNIASVPQSIQVTHRCFIL 85
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 30.8 bits (70), Expect = 3.3
Identities = 24/86 (27%), Positives = 31/86 (36%), Gaps = 11/86 (12%)
Query: 469 QNEMRN--NLTYQQHSSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDV 526
NE RN LT Q +PL G PL V P T +P +PP SG +
Sbjct: 25 MNEARNLRALTETQ--TPLLGGENTPLHETDFDGVTPRKQQIQTPNPLATPPFR-SGNGI 81
Query: 527 KPTFSDIKPTFSDI------KPAINP 546
T + + K +IN
Sbjct: 82 GATPLRGGSGYGPLRTPNRDKLSIND 107
>gnl|CDD|227914 COG5627, MMS21, DNA repair protein MMS21 [DNA replication,
recombination, and repair].
Length = 275
Score = 30.4 bits (68), Expect = 3.3
Identities = 22/86 (25%), Positives = 31/86 (36%), Gaps = 13/86 (15%)
Query: 163 SMSSSDSIEQTALKV-------SLKCPITFKKITLPARGHDCKHIQCFDL-ESYLQLNCE 214
M S + E K+ S +CPIT P C H DL LQ+ C
Sbjct: 167 VMKSPNPEEGLVDKILIHQELLSNRCPITLNPDFYPILSSKCNHKPEMDLINKKLQVECT 226
Query: 215 RGSWRCPV--CNKPAQLEGLEVDQYM 238
R CP C++ ++ D +
Sbjct: 227 R---VCPRLICSQKEVVDPYVCDHIL 249
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.5 bits (69), Expect = 3.5
Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 8/98 (8%)
Query: 346 PSFQGHHQYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSNHQ 405
Q Q P Q ++ P+ ++ +T + P A
Sbjct: 61 QLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP--------AGPAGPTI 112
Query: 406 YFGAGNQFPGGVPPNSNQYQEIKECDQKAQISMTYQMH 443
G +P VP Q DQ AQ Q+
Sbjct: 113 QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQ 150
>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein. This family
includes an extracellular region from the envelope
glycoprotein of Ebola and Marburg viruses. This region
is also produced as a separate transcript that gives
rise to a non-structural, secreted glycoprotein, which
is produced in large amounts and has an unknown
function. Processing of this protein may be involved in
viral pathogenicity.
Length = 364
Score = 30.6 bits (69), Expect = 3.8
Identities = 14/58 (24%), Positives = 20/58 (34%)
Query: 606 EKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDIPS 663
LS T H+ N P +++T T +S +N ST IP
Sbjct: 290 STCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTTISTNNTSKHNFSTQSIPL 347
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 30.9 bits (70), Expect = 4.0
Identities = 22/133 (16%), Positives = 31/133 (23%), Gaps = 11/133 (8%)
Query: 294 PGSMTLPTMSNWEM-GQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMPGRPSFQGHH 352
P + +S E+ Q P P + P + P + +
Sbjct: 146 PQTPAQKMLSLEEVEAQLQQRQQAPQ----LPQPPQQVLPQGMPPRQAAFPQQGPPEQPP 201
Query: 353 QYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGPNSNANGGSNHQYFGAGNQ 412
YP G P APS + P P P +
Sbjct: 202 GYPQPPQGHPEQVQPQQFLPAPSQ-----APAQPPLPPQL-PQQPPPLQQPQFPGLSQQM 255
Query: 413 FPGGVPPNSNQYQ 425
P P Q Q
Sbjct: 256 PPPPPQPPQQQQQ 268
Score = 29.7 bits (67), Expect = 7.8
Identities = 27/158 (17%), Positives = 40/158 (25%), Gaps = 15/158 (9%)
Query: 284 RGMCAGKAMSPGSMTLPTMSNWEMGQSMSPYIPPDMNSKSPFMHIESSAILSFPNGYSMP 343
+ + G + G P P+ F+ S A P +P
Sbjct: 178 QVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237
Query: 344 GRPSFQGHHQYP--SQQPGSGPNGAP------------FGPNSAPSSRGSSVRQSTPSFP 389
+P Q+P SQQ P P P + P+ + P
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297
Query: 390 SSSGPNSNANGGSNHQYFGAGNQFPGGVPPNSNQYQEI 427
P G QF + S Q +E
Sbjct: 298 PPQQPQLLPLVQQPQG-QQRGPQFREQLVQLSQQQREA 334
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 30.3 bits (68), Expect = 4.8
Identities = 25/95 (26%), Positives = 37/95 (38%), Gaps = 10/95 (10%)
Query: 485 LPGNPTPPLTPASSTSVPPFVSP-NSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKPA 543
L NP P L + ++ S +S+ +P I PP SV+ S PT KP+
Sbjct: 256 LSTNPNPTLWFSQNSKSFEMNSEFSSSMTPSILPP-SVT--------SSFLPTPMSYKPS 306
Query: 544 INPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNG 578
I+ + + S N N S+ NG
Sbjct: 307 ISLPSDNPSIPSFTVNGVRNWEAGAFSNNSNSSNG 341
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 30.4 bits (68), Expect = 4.8
Identities = 21/87 (24%), Positives = 30/87 (34%)
Query: 576 GNGGGFNDFPSGGNSGDFPGDNSLESLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPP 635
GN G N P+G + +S + S S H P TP +P + P
Sbjct: 564 GNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPA 623
Query: 636 SVPPSSSTQQQDTQTSSSNNVSTDDIP 662
S S ST ++S + P
Sbjct: 624 SHLGSPSTTPSSPESSIKVASTETASP 650
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 30.1 bits (67), Expect = 5.7
Identities = 55/305 (18%), Positives = 88/305 (28%), Gaps = 40/305 (13%)
Query: 482 SSPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIK 541
+S P +P + + P S + SP V G T + P
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIG----VTTTATSPPTGTTS 548
Query: 542 PAINPHPQSIASGSMMNNNSGPPSYSMSMNRSSGGNGGGFNDFPSGGNSGDFPGDNSLES 601
PQ + S +NN + P S +S +G P
Sbjct: 549 VPNATSPQ-VTEESPVNNTNTPVVTSAPSVLTSA-------VTTGQHGTGSSPT------ 594
Query: 602 LNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDDI 661
+ TP + T TP + P ++T + S +
Sbjct: 595 -----SQQPGIPSSSHSTPRSNSTSTTPLLTS--AHPTGGENITEETPSVPSTTHVSTLS 647
Query: 662 PSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSSGSSNGNNPGG 721
P P G G + + P V ++ P AT PS+ S
Sbjct: 648 PGP---GPGTTSQVSGPGNSSTSRYPGEVH-----VTEGMPNPNATSPSAPSGQKTAVPT 699
Query: 722 PTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSV-------PPSSSTQQQDTQTSSS 774
T+ + T +PHT G + PPS+S++ + T +S
Sbjct: 700 VTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTS 759
Query: 775 NNVST 779
V+T
Sbjct: 760 PPVTT 764
>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
Length = 732
Score = 29.9 bits (67), Expect = 6.4
Identities = 17/84 (20%), Positives = 26/84 (30%), Gaps = 17/84 (20%)
Query: 721 GPTTNDDILALEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTD 780
+ Q P P P P P P V P + D ++ + NV
Sbjct: 611 ASGEDVTQAEQPQQPQQPQQPQQPQQPQQP-------VSPVIN----DKKSDAGVNVPAG 659
Query: 781 DIPSDLNFDPAAVIDGEGQGQENL 804
I +L P E + ++ L
Sbjct: 660 GIEQELKMKP------EEEMEQQL 677
>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182.
Length = 238
Score = 29.5 bits (66), Expect = 6.5
Identities = 30/107 (28%), Positives = 36/107 (33%), Gaps = 5/107 (4%)
Query: 336 FPNGYSMPGRPSF-QGHHQYPSQQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSFPSSSGP 394
FP+G S P F Q + G GP G G N +S S Q S P
Sbjct: 16 FPSGGSSVNSPPFSQNNLPNNLGGGGGGPGGGGGGNNPNLASLSSLTSQGLGKILSGLQP 75
Query: 395 NSNANGGSNHQYFGAGNQFPGGVPPNSNQYQEIKECDQKAQISMTYQ 441
NGG GAG P G + + +AQ T Q
Sbjct: 76 PPLGNGGG----SGAGGPGPVGGGGGPGVAPNNIQPNAQAQQPSTQQ 118
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 29.8 bits (67), Expect = 6.6
Identities = 18/75 (24%), Positives = 24/75 (32%)
Query: 483 SPLPGNPTPPLTPASSTSVPPFVSPNSTSSPFISPPHSVSGRDVKPTFSDIKPTFSDIKP 542
PP P T+ PP V P + P P S D KP ++ P
Sbjct: 390 KAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449
Query: 543 AINPHPQSIASGSMM 557
IA G ++
Sbjct: 450 PKEEEKALIADGDVL 464
>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein. This family includes the
hamartin protein which is thought to function as a
tumour suppressor. The hamartin protein interacts with
the tuberin protein pfam03542. Tuberous sclerosis
complex (TSC) is an autosomal dominant disorder and is
characterized by the presence of hamartomas in many
organs, such as brain, skin, heart, lung, and kidney. It
is caused by mutation either TSC1 or TSC2 tumour
suppressor gene. TSC1 encodes a protein, hamartin,
containing two coiled-coil regions, which have been
shown to mediate binding to tuberin. The TSC2 gene codes
for tuberin pfam03542. These two proteins function
within the same pathway(s) regulating cell cycle, cell
growth, adhesion, and vesicular trafficking.
Length = 667
Score = 29.9 bits (67), Expect = 7.2
Identities = 38/176 (21%), Positives = 56/176 (31%), Gaps = 32/176 (18%)
Query: 601 SLNAMEKSLSEQMPHTPHTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSSNNVSTDD 660
SL+ E S SE + P++ S+P ++ST +S +T
Sbjct: 262 SLDPTETS-SEDGYSFSRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNT-- 318
Query: 661 IPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDPMELLSYLDPPDLATPPSS-GSSNGNNP 719
FD AA + D++ P L ATPPSS G S
Sbjct: 319 -----TFDKAASSGTK-----------DSLWSPSSLCGM------ATPPSSIGMSPLILS 356
Query: 720 GGPTTNDDILALEQMPHTP-HTPHTPHTPHTPGGNGPPSVPPSSSTQQQDTQTSSS 774
P+ + P T P + TP + PP + +S S
Sbjct: 357 LSPSHLS-----GRAPGTTGSGKGEPASESTPSTSPPPPGLADDIVRAIFATSSRS 407
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 29.5 bits (66), Expect = 9.4
Identities = 32/184 (17%), Positives = 50/184 (27%), Gaps = 3/184 (1%)
Query: 634 PPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDPAAVIDGEGQGQENLNLLPDNVVDP 693
P SS D ++ + + P L + D L + VD
Sbjct: 302 PLLDLTGSSLWSDDEESGDEDGDGSGFAPEPLIKTDSRSNDTLVDLGRGGGSLKLDSVDA 361
Query: 694 MELLSYLDPPDLATPPSSGSSNGNNPGGPTTNDDILALEQMPHTPHTPHTPHTPHTPGGN 753
SYL P L+ P+SG + + + + P G
Sbjct: 362 PGTSSYLFEPGLSPSPNSGKEMPGILTTENLDLPLASTDSTEMDPEDKRGGAVKINNSGI 421
Query: 754 --GPPSVPPSSSTQQQDTQTSSSNNVSTDDIPSDLNFDP-AAVIDGEGQGQENLNLLPDN 810
P + ++ SS D PS L VID + P +
Sbjct: 422 LAWGLKTPGLAVNDERSIAVSSDGITDVLDPPSPLRLHSSDKVIDSVSPPSKRRVSAPAS 481
Query: 811 VVDP 814
+D
Sbjct: 482 RLDD 485
>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
Length = 420
Score = 29.4 bits (66), Expect = 9.8
Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)
Query: 337 PNGYSMPGRPSFQGHHQYPSQQPGSGPNGAPFGPNSAP----SSRGSSVRQSTPSFPSSS 392
+ P ++ G S + GS + F +S+ SS G+S ST ++ S S
Sbjct: 311 GHDSRGPYNANYGGQFNSRSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRS 370
Query: 393 GPNSNANGGSNHQYFGA 409
+S+ GGS Q G
Sbjct: 371 TFDSSTGGGS--QSGGG 385
>gnl|CDD|220662 pfam10265, DUF2217, Uncharacterized conserved protein (DUF2217).
This is a family of conserved proteins of from 500 - 600
residues found from worms to humans. Its function is not
known.
Length = 515
Score = 29.3 bits (66), Expect = 9.8
Identities = 21/69 (30%), Positives = 29/69 (42%), Gaps = 5/69 (7%)
Query: 357 QQPGSGPNGAPFGPNSAPSSRGSSVRQSTPSF--PSSSGPNSNANGGSNHQYFGAGNQFP 414
+ P S PN G +S SS+ S S S +SS S AN GS + AG + P
Sbjct: 69 RSPSSKPNDTLSGASSKLSSKHSGSSHSLASVSDRNSSSSGSCANSGS---WEAAGMEEP 125
Query: 415 GGVPPNSNQ 423
+ +
Sbjct: 126 INTTDTTPE 134
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.129 0.395
Gapped
Lambda K H
0.267 0.0641 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 42,491,098
Number of extensions: 3993594
Number of successful extensions: 4031
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3758
Number of HSP's successfully gapped: 208
Length of query: 854
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 749
Effective length of database: 6,280,432
Effective search space: 4704043568
Effective search space used: 4704043568
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.2 bits)