RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1605
(718 letters)
>gnl|CDD|215784 pfam00200, Disintegrin, Disintegrin.
Length = 76
Score = 116 bits (294), Expect = 2e-31
Identities = 45/78 (57%), Positives = 53/78 (67%), Gaps = 2/78 (2%)
Query: 86 EDGEECDCGLEDSCKNACCNATTCMLNVNATCATGSCCNLETCQPHTAGRQCRAADRECD 145
E+GEECDCG + C+N CC+ATTC L A CATG CC+ C+ AG CR A ECD
Sbjct: 1 EEGEECDCGSPEECQNPCCDATTCKLKPGAQCATGPCCD--QCKFKPAGTVCRPASGECD 58
Query: 146 LPEFCTGDSEFCPSDVFK 163
LPE+CTG S CP DV+K
Sbjct: 59 LPEYCTGQSAECPPDVYK 76
>gnl|CDD|214490 smart00050, DISIN, Homologues of snake disintegrins. Snake
disintegrins inhibit the binding of ligands to integrin
receptors. They contain a 'RGD' sequence, identical to
the recognition site of many adhesion proteins.
Molecules containing both disintegrin and
metalloprotease domains are known as ADAMs.
Length = 75
Score = 105 bits (264), Expect = 2e-27
Identities = 37/77 (48%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 86 EDGEECDCGLEDSCKNACCNATTCMLNVNATCATGSCCNLETCQPHTAGRQCRAADRECD 145
E+GEECDCG C + CC+ TC L A CA+G CC + C+ AG CR + ECD
Sbjct: 1 EEGEECDCGSPKECTDPCCDPATCKLKPGAQCASGPCC--DNCKFKPAGTLCRPSVDECD 58
Query: 146 LPEFCTGDSEFCPSDVF 162
LPE+C G S CP D +
Sbjct: 59 LPEYCNGTSADCPPDPY 75
>gnl|CDD|219879 pfam08516, ADAM_CR, ADAM cysteine-rich. ADAMs are
membrane-anchored proteases that proteolytically modify
cell surface and extracellular matrix (ECM) in order to
alter cell behaviour. It has been shown that the
cysteine-rich domain of ADAM13 regulates the protein's
metalloprotease activity.
Length = 117
Score = 78.5 bits (194), Expect = 2e-17
Identities = 31/72 (43%), Positives = 41/72 (56%), Gaps = 5/72 (6%)
Query: 165 DGETCKGGEAFCYEGSCRTHSDQCLLLWGPSASSSDKRCF-DLNTSGNRHGNCGYYKPNM 223
DG C+ + +CY G C T +QC L+GP A + C+ +LN+ G+R GNCG K N
Sbjct: 1 DGHPCQNNQGYCYNGRCPTRDNQCQDLFGPGAKVAPDSCYEELNSKGDRFGNCG--KENG 58
Query: 224 TYAKCEED--KC 233
TY CE KC
Sbjct: 59 TYIPCEPQDVKC 70
>gnl|CDD|214743 smart00608, ACR, ADAM Cysteine-Rich Domain.
Length = 137
Score = 77.8 bits (192), Expect = 6e-17
Identities = 31/73 (42%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Query: 164 MDGETCKGGEAFCYEGSCRTHSDQCLLLWGPSASSSDKRCFD-LNTSGNRHGNCGYYKPN 222
DG C G+ +CY G C T +QC L+GP A + C++ LNT G+R GNCG + N
Sbjct: 1 QDGTPCDNGQGYCYNGRCPTRDNQCQALFGPGAKVAPDSCYEELNTKGDRFGNCG--REN 58
Query: 223 MTYAKCEED--KC 233
TY C + KC
Sbjct: 59 GTYIPCAPEDVKC 71
>gnl|CDD|239797 cd04269, ZnMc_adamalysin_II_like, Zinc-dependent metalloprotease;
adamalysin_II_like subfamily. Adamalysin II is a snake
venom zinc endopeptidase. This subfamily contains other
snake venom metalloproteinases, as well as
membrane-anchored metalloproteases belonging to the ADAM
family. ADAMs (A Disintegrin And Metalloprotease) are
glycoproteins, which play roles in cell signaling, cell
fusion, and cell-cell interactions.
Length = 194
Score = 55.7 bits (135), Expect = 7e-09
Identities = 17/25 (68%), Positives = 18/25 (72%)
Query: 47 GHNLGLEHDTTECTCPSDRCIMAPS 71
GHNLG+EHD CTC CIMAPS
Sbjct: 140 GHNLGMEHDDGGCTCGRSTCIMAPS 164
>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
metalloprotease. The members of this family are enzymes
that cleave peptides. These proteases require zinc for
catalysis. Members of this family are also known as
adamalysins. Most members of this family are snake venom
endopeptidases, but there are also some mammalian
proteins such as human ADAM8 and fertilin. Fertilin and
closely related proteins appear to not have some active
site residues and may not be active enzymes.
Length = 198
Score = 50.3 bits (121), Expect = 5e-07
Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 47 GHNLGLEHDTT-ECTCPSDRCIMAP-----SSRLF 75
GHNLG+ HD CTC CIM P + F
Sbjct: 140 GHNLGMTHDDIDGCTCGGGGCIMNPVASSSPGKKF 174
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.8 bits (103), Expect = 4e-04
Identities = 57/304 (18%), Positives = 89/304 (29%), Gaps = 27/304 (8%)
Query: 285 SVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANK 344
+ LP A + P +P+ + A P P R + T P P
Sbjct: 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPA 2773
Query: 345 GPNTSPYAR---PLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVAT 401
P P R P ++S S + + + D A P + + + LPP +
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
Query: 402 ESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISE 461
P P + P G ++ R +R + PA ++
Sbjct: 2834 AQPTAPPPPPGPP--PPSLPLGGSVAPGGDVRRRPPSRSPA---AKPAAPARPPVRRLAR 2888
Query: 462 PIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRPSQPPP 521
P + T+S +RP P + +P PPP
Sbjct: 2889 PAV-------------SRSTESFALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPP 2934
Query: 522 PAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSAGQAK 581
P P L +PAG + PSG P G+ P P + +
Sbjct: 2935 PPPRPQPPLAPTTDPAGAGE-PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASST 2993
Query: 582 TPGT 585
P T
Sbjct: 2994 PPLT 2997
Score = 43.0 bits (101), Expect = 7e-04
Identities = 58/323 (17%), Positives = 89/323 (27%), Gaps = 64/323 (19%)
Query: 297 PTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELP--PANKGPNTSPYARP 354
P PP P A + A P P + A A +A PP +P PA G P P
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
Query: 355 LISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPV 414
+ P A + T P VA+ S +
Sbjct: 2762 TTAGP------------------PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
Query: 415 SSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDK 474
++ A + + + L PP + P P
Sbjct: 2804 PADPPAAVLAPAAAL------PPAASPAGPLPPPTSAQPTAPPPPPGPPP---------- 2847
Query: 475 VDMVKRTQSMRETDVNFKRPNLPAFGSMR-----AKRPTSLAGSQRPSQPPPPAPSGIQS 529
P+LP GS+ +RP S + + +P+ P P +
Sbjct: 2848 -------------------PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
Query: 530 LPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQG 589
++ A P + P Q Q + P P + + P P
Sbjct: 2889 PAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
Query: 590 LLGPGQGLGY----QNPAVPPGK 608
G G+ G A+ PG+
Sbjct: 2949 PAGAGEPSGAVPQPWLGALVPGR 2971
Score = 39.5 bits (92), Expect = 0.008
Identities = 65/345 (18%), Positives = 101/345 (29%), Gaps = 44/345 (12%)
Query: 292 AAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSA-----------------DFANFSAT 334
P+ P P D + P R AP+ +V A D +
Sbjct: 2552 PPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP 2611
Query: 335 LPPE-LPPANKGPNTSPYA-RPLISSPILSDTTSQTVRELIDAKNAES----TRQTTRPA 388
PP LPP P+ P + P + P + E A R+ R
Sbjct: 2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG 2671
Query: 389 SVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPP 448
A++S+ PP A T+ ++S ++ P ++ V P
Sbjct: 2672 RAAQASS-PPQRPRRRAAR-PTVGSLTSLADPPPPP---------PTPEPAPHALVSATP 2720
Query: 449 ATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPT 508
G A + + S +P V + RP A G P
Sbjct: 2721 LPPGPAAARQASPALPAAPA------PPAVPAGPATPGGPARPARPPTTA-GPPAPAPPA 2773
Query: 509 SLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKT-PSGIE 567
+ A P S +S +P A P+ + + +A A P
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
Query: 568 SLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPA-VPPGKPKT 611
+ P P G G G + PG + + P+ P KP
Sbjct: 2834 AQPTAPPPPPGPPPPSLPLG-GSVAPGGDVRRRPPSRSPAAKPAA 2877
Score = 35.3 bits (81), Expect = 0.15
Identities = 62/339 (18%), Positives = 92/339 (27%), Gaps = 30/339 (8%)
Query: 293 APVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYA 352
AP + P+ P D + P AP P + D S + P + P P
Sbjct: 2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE 2650
Query: 353 RPLISSPILSDTTSQTVRELIDAKNAESTRQTTRP-------ASVAKSSTLPPVATESNA 405
RP + + R L A A S Q R S+ + PP
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPT--- 2707
Query: 406 LPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTR------------RNSKVLTPPATQGS 453
P H + S + P G R S + ++ PP T G
Sbjct: 2708 -PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766
Query: 454 ARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTS-LAG 512
+ P R V S+ E+ + P PA P + L
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAV------ASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
Query: 513 SQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGY 572
+ P+ P PP S + P P G PG +++P+ + P
Sbjct: 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880
Query: 573 QNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKT 611
+ P P P+
Sbjct: 2881 PPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919
>gnl|CDD|239801 cd04273, ZnMc_ADAMTS_like, Zinc-dependent metalloprotease,
ADAMTS_like subgroup. ADAMs (A Disintegrin And
Metalloprotease) are glycoproteins, which play roles in
cell signaling, cell fusion, and cell-cell interactions.
This particular subfamily represents domain
architectures that combine ADAM-like metalloproteinases
with thrombospondin type-1 repeats. ADAMTS (a
disintegrin and metalloproteinase with thrombospondin
motifs) proteinases are inhibited by TIMPs (tissue
inhibitors of metalloproteinases), and they play roles
in coagulation, angiogenesis, development and
progression of arthritis. They hydrolyze the von
Willebrand factor precursor and various components of
the extracellular matrix.
Length = 207
Score = 36.8 bits (86), Expect = 0.019
Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 3/28 (10%)
Query: 47 GHNLGLEHDTTECTCPSDR---CIMAPS 71
GH LG+ HD +C + IM+P+
Sbjct: 149 GHVLGMPHDGDGNSCGPEGKDGHIMSPT 176
>gnl|CDD|239795 cd04267, ZnMc_ADAM_like, Zinc-dependent metalloprotease, ADAM_like
or reprolysin_like subgroup. The adamalysin_like or ADAM
family of metalloproteases contains proteolytic domains
from snake venoms, proteases from the mammalian
reproductive tract, and the tumor necrosis factor alpha
convertase, TACE. ADAMs (A Disintegrin And
Metalloprotease) are glycoproteins, which play roles in
cell signaling, cell fusion, and cell-cell interactions.
Length = 192
Score = 36.6 bits (85), Expect = 0.021
Identities = 12/30 (40%), Positives = 13/30 (43%), Gaps = 4/30 (13%)
Query: 47 GHNLGLEHDTTECTCPS----DRCIMAPSS 72
GHNLG EHD + IMAP
Sbjct: 142 GHNLGAEHDGGDELAFECDGGGNYIMAPVD 171
>gnl|CDD|224825 COG1913, COG1913, Predicted Zn-dependent proteases [General
function prediction only].
Length = 181
Score = 35.8 bits (83), Expect = 0.031
Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 6/30 (20%)
Query: 47 GHNLGLEHDTTECTCPSDRCIMAPSSRLFD 76
GH LGL H CP+ RC+M S+ L D
Sbjct: 133 GHLLGLSH------CPNPRCVMNFSNSLRD 156
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 36.0 bits (83), Expect = 0.081
Identities = 44/253 (17%), Positives = 71/253 (28%), Gaps = 25/253 (9%)
Query: 280 KSSDMSVKLPTRAAPVKPTPPKVP----SDAQITPIRAAPKIPATSVRLSADFANFSATL 335
S SV +K P++ S +P ++ + S A S
Sbjct: 514 SSDAESVVS------IKELRPRIGFINKSPPPKSPPKSRRTLIVALSLASPSTAG-SPRP 566
Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSST 395
P L G P+A + V + K + S++ T P S K +T
Sbjct: 567 KPSLGKFVIGT--DPFAFANTVRLTDNMRGGNGVGSSVKPKGSASSKPLTGPGSDLKPAT 624
Query: 396 LPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSAR 455
L S+ + + SS+ P G + ++ T + +
Sbjct: 625 LNGKTPSSSLVGAARNAGASSKVKIPSGLGGFTSPISLLESALEDVLTSATSTPVKKNDP 684
Query: 456 SLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNF--KRPNLPAFGSMRAKRPTSLAGS 513
L N +K + S + NF P G + K P S A
Sbjct: 685 YLW----------DTNGEKAGGGTESASTTDVFQNFAGLNKKTPVGGPFQPKPPLSRALD 734
Query: 514 QRPSQPPPPAPSG 526
S G
Sbjct: 735 SASSPGGSGGKPG 747
>gnl|CDD|213029 cd11375, Peptidase_M54, Peptidase family M54, also called
archaemetzincins or archaelysins. Peptidase M54
(archaemetzincin or archaelysin) is a zinc-dependent
aminopeptidase that contains the consensus zinc-binding
sequence HEXXHXXGXXH/D and a conserved Met residue at
the active site, and is thus classified as a metzincin.
Archaemetzincins, first identified in archaea, are also
found in bacteria and eukaryotes, including two human
members, archaemetzincin-1 and -2 (AMZ1 and AMZ2). AMZ1
is mainly found in the liver and heart while AMZ2 is
primarily expressed in testis and heart; both have been
reported to degrade synthetic substrates and peptides.
The Peptidase M54 family contains an extended metzincin
concensus sequence of HEXXHXXGX3CX4CXMX17CXXC such that
a second zinc ion is bound to four cysteines, thus
resembling a zinc finger. Phylogenetic analysis of this
family reveals a complex evolutionary process involving
a series of lateral gene transfer, gene loss and genetic
duplication events.
Length = 173
Score = 34.2 bits (79), Expect = 0.096
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 14/42 (33%)
Query: 47 GHNLGLEHDTTECTCPSDRCIMAPSSRLFDS--------PVC 80
GH GL+H CP C+M S+ L ++ PVC
Sbjct: 132 GHLFGLDH------CPYYACVMNFSNSLEETDRKPPYLCPVC 167
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 35.2 bits (81), Expect = 0.15
Identities = 28/143 (19%), Positives = 45/143 (31%), Gaps = 6/143 (4%)
Query: 285 SVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPAT--SVRLSADFANFSATLPPELPPA 342
S + P A P P P+ A + A + A A A + PP
Sbjct: 440 SARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499
Query: 343 NKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATE 402
+ P P + ++++ + A + + +T PA A + ATE
Sbjct: 500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATA-DPDDAFETLAPAPAAAPAPRAAAATE 558
Query: 403 SNALPGSTLHPVSSESNAPKGST 425
P P +S S P
Sbjct: 559 PVVAPRP---PRASASGLPDMFD 578
Score = 29.8 bits (67), Expect = 5.5
Identities = 23/125 (18%), Positives = 33/125 (26%), Gaps = 10/125 (8%)
Query: 297 PTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLI 356
P AQ P AAP +A A + P PA + A P
Sbjct: 374 PATAAAAPVAQPAPAAAAPA--------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPAR 425
Query: 357 SSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSS 416
SP R+ + PA+ ++ P A + P +
Sbjct: 426 RSPAPE--ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA 483
Query: 417 ESNAP 421
A
Sbjct: 484 APAAA 488
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 34.7 bits (80), Expect = 0.17
Identities = 19/106 (17%), Positives = 25/106 (23%), Gaps = 3/106 (2%)
Query: 504 AKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTP 563
A +P SQP P P+ Q + + Q + LP Q P
Sbjct: 129 APKPEPQPPQAPESQPQPQTPA--QKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPP 186
Query: 564 SGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
+ P P G Q P P
Sbjct: 187 RQA-AFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPP 231
Score = 33.2 bits (76), Expect = 0.52
Identities = 22/118 (18%), Positives = 28/118 (23%), Gaps = 4/118 (3%)
Query: 494 PNLPAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQ 553
+ + S A +Q P PP P L Q P + P P Q
Sbjct: 209 GHPEQVQPQQFLPAPSQAPAQPP--LPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQ 266
Query: 554 NHSAGQAKTPSGIESLPGYQNPSAGQAKT--PGTPGQGLLGPGQGLGYQNPAVPPGKP 609
P + GQ P Q L Q G Q +
Sbjct: 267 QQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQL 324
Score = 31.3 bits (71), Expect = 2.4
Identities = 23/101 (22%), Positives = 26/101 (25%), Gaps = 8/101 (7%)
Query: 513 SQRPSQPPPPAPS-GIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPG 571
Q P P PP P Q PGY G + + LP
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQP----PGYPQPPQGHPEQVQPQQFLPA 222
Query: 572 YQNPSAG---QAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
A + P P GL Q P PP P
Sbjct: 223 PSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPP 263
Score = 30.1 bits (68), Expect = 4.8
Identities = 22/111 (19%), Positives = 26/111 (23%), Gaps = 16/111 (14%)
Query: 516 PSQPPPPAPSGIQSLPGYQNPAG-----------QAKTPSGIQSLPGYQNHSAGQAKTPS 564
P Q PP P G P P LP
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL 251
Query: 565 GIESLPGYQNPSAGQAKTPGT----PGQGLLGPGQGLGYQ-NPAVPPGKPK 610
+ P P Q + P P Q P GL N +PP +
Sbjct: 252 SQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQP 302
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 34.1 bits (78), Expect = 0.31
Identities = 50/289 (17%), Positives = 75/289 (25%), Gaps = 44/289 (15%)
Query: 292 AAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNT-SP 350
AP P +V A P A + A + T A P +
Sbjct: 366 GAPGGGVPARVA--------GAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA 417
Query: 351 YARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGST 410
A +P + T DA + ++ A + S + A GS
Sbjct: 418 AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSA 477
Query: 411 LHPVSSE-----------SNAPKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEI 459
P S + AP +T + + + R AR
Sbjct: 478 SAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTP 537
Query: 460 SEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRPSQP 519
+ P +D V R MR + + R A + +P+
Sbjct: 538 AAAAPAARAGGAAAALD-VLRNAGMR----------------VSSDRGARAAAAAKPAAA 580
Query: 520 PPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIES 568
P AP P A Q TP + G A+ ES
Sbjct: 581 PAAAPK-----PAAPRVAVQVPTPRARAATG--DAPPNGAARAEQAAES 622
Score = 29.4 bits (66), Expect = 7.7
Identities = 27/136 (19%), Positives = 37/136 (27%), Gaps = 5/136 (3%)
Query: 275 AAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSAT 334
AA K++ + A P P PP AP + R SAD
Sbjct: 408 AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
Query: 335 LPPELPPANK---GPNTSPYAR--PLISSPILSDTTSQTVRELIDAKNAESTRQTTRPAS 389
P + + P A P + S T V + A A
Sbjct: 468 AQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAP 527
Query: 390 VAKSSTLPPVATESNA 405
A + P A + A
Sbjct: 528 PAPEARPPTPAAAAPA 543
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 32.9 bits (75), Expect = 0.41
Identities = 28/95 (29%), Positives = 34/95 (35%), Gaps = 2/95 (2%)
Query: 260 QRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPA 319
Q+ S E N A T M + A PV P V + A AAPK PA
Sbjct: 97 QKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAA--AAPKAPA 154
Query: 320 TSVRLSADFANFSATLPPELPPANKGPNTSPYARP 354
S A +A P E A K T+ +P
Sbjct: 155 KPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKP 189
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 33.5 bits (76), Expect = 0.54
Identities = 57/268 (21%), Positives = 90/268 (33%), Gaps = 43/268 (16%)
Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSST 395
PP++ P+++GP P A P + +P +T V + +T P AK
Sbjct: 552 PPKVSPSDRGP---PKASPPVMAP---PSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKD 605
Query: 396 LPPVATESNALPGST---------LHPVSSESNAPKGSTLNRITSFMSRNSTRRNSKVLT 446
PP + P S+ + E + ST + SF + R S +
Sbjct: 606 GPPASGPHEKQPPSSAPRDMAPSVVRMFLRERLLEQ-STGPKPKSFWEMRAGRDGSGIQQ 664
Query: 447 PPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRP-NLPAFGSMRAK 505
P+++ + + + P P + + D +P SM
Sbjct: 665 EPSSRRQP-ATQSTPPRP-----------SWLPSVFVLPSVDAGRAQPSEESHLSSMSPT 712
Query: 506 RPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSG 565
+P S ++P P P + P QA PS G++ A QA P
Sbjct: 713 QPIS--HEEQPRYEDPDDPLDLSLHPD------QAPPPSHQAPYSGHEEPQAQQAPYPGY 764
Query: 566 IESLP------GYQNPSAGQAKTPGTPG 587
E P GYQ P A + PG
Sbjct: 765 WEPRPPQAPYLGYQEPQAQGVQVSSYPG 792
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 28.9 bits (65), Expect = 0.56
Identities = 8/26 (30%), Positives = 11/26 (42%), Gaps = 1/26 (3%)
Query: 18 NCHDQGVCNSR-GHCHCHPGFAPPYC 42
C+ +G C G C C G+ C
Sbjct: 6 ICNGRGTCVRPCGKCVCDSGYQGATC 31
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 33.2 bits (75), Expect = 0.63
Identities = 52/337 (15%), Positives = 105/337 (31%), Gaps = 22/337 (6%)
Query: 261 RLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPAT 320
R ++ + ++ K ++ + + T VP+ + P P +
Sbjct: 408 RTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTA 467
Query: 321 SVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAES 380
S+TLP + P ++ + +P A + + TS T ++ D +
Sbjct: 468 DPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSD--TPNA 525
Query: 381 TRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTRR 440
T T V ++T PP T S + P +E + + +TS S ++
Sbjct: 526 TSPTPIVIGVTTTATSPPTGTTS---VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAV 582
Query: 441 NSKVLTPPATQGSARSL--EISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPA 498
+ ++ S + S P +N + T T+ P++P+
Sbjct: 583 TTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEET---PSVPS 639
Query: 499 FGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAG 558
+ P P P + S PG + + + +P N ++
Sbjct: 640 TTHVSTLSPG-----------PGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPN-PNATSP 687
Query: 559 QAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQ 595
A + + T T G L+
Sbjct: 688 SAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTS 724
Score = 30.9 bits (69), Expect = 3.2
Identities = 36/209 (17%), Positives = 60/209 (28%), Gaps = 7/209 (3%)
Query: 243 VAKSTQNHSNSTSGGRGQRLLSSGE--GQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPP 300
V + ST +SG G + L T +S + P +P
Sbjct: 449 VPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVT- 507
Query: 301 KVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPI 360
P+ T + + ATS +AT PP + + SP+
Sbjct: 508 -TPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT---GTTSVPNATSPQVTEESPV 563
Query: 361 LSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNA 420
+ T +A +T Q +S P ++ S ST S
Sbjct: 564 NNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAH 623
Query: 421 PKGSTLNRITSFMSRNSTRRNSKVLTPPA 449
P G + ++T ++ P
Sbjct: 624 PTGGENITEETPSVPSTTHVSTLSPGPGP 652
Score = 30.1 bits (67), Expect = 5.0
Identities = 38/208 (18%), Positives = 70/208 (33%), Gaps = 12/208 (5%)
Query: 244 AKSTQNHSNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVP 303
A +T H +S Q + S + + T + ++ PT + P VP
Sbjct: 581 AVTTGQHGTGSSPTSQQPGIPSS--SHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVP 638
Query: 304 SDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSD 363
S ++ + P P T+ ++S + ++ P E+ PN P +SP
Sbjct: 639 STTHVSTLSPGPG-PGTTSQVSGPGNSSTSRYPGEVHVTEGMPN------PNATSPSAPS 691
Query: 364 TTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKG 423
V + ++ S +ST P + +T + ++
Sbjct: 692 GQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFR---TTPYNATTYLPPSTS 748
Query: 424 STLNRITSFMSRNSTRRNSKVLTPPATQ 451
S L +F S T + + V PP
Sbjct: 749 SKLRPRWTFTSPPVTTKQATVPVPPTQH 776
>gnl|CDD|238124 cd00203, ZnMc, Zinc-dependent metalloprotease. This super-family of
metalloproteases contains two major branches, the
astacin-like proteases and the
adamalysin/reprolysin-like proteases. Both branches have
wide phylogenetic distribution, and contain
sub-families, which are involved in vertebrate
development and disease.
Length = 167
Score = 31.7 bits (72), Expect = 0.71
Identities = 7/23 (30%), Positives = 7/23 (30%)
Query: 47 GHNLGLEHDTTECTCPSDRCIMA 69
GH LG HD I
Sbjct: 105 GHALGFYHDHDRKDRDDYPTIDD 127
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 28.8 bits (65), Expect = 0.76
Identities = 11/28 (39%), Positives = 14/28 (50%), Gaps = 3/28 (10%)
Query: 19 CHDQGVC-NSRG--HCHCHPGFAPPYCE 43
C + G C N+ G C C PG+ CE
Sbjct: 11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 32.7 bits (74), Expect = 0.92
Identities = 33/174 (18%), Positives = 57/174 (32%), Gaps = 12/174 (6%)
Query: 447 PPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMR--ETDVNFKRPNLPAFGSMRA 504
P T + PIP+ LR +++ + + ++ +P G +
Sbjct: 615 IPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPY 674
Query: 505 K-RPTSLAGSQRPS------QPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSA 557
+ PT QPPP AP+ + P G+A+ P+ +
Sbjct: 675 QPSPTGANTMLPIQWAPGTMQPPPRAPT---PMRPPAAPPGRAQRPAAATGRARPPAAAP 731
Query: 558 GQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKT 611
G+A+ P+ + G+A+ P P G P PP P
Sbjct: 732 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA 785
Score = 30.0 bits (67), Expect = 5.7
Identities = 32/117 (27%), Positives = 37/117 (31%), Gaps = 12/117 (10%)
Query: 497 PAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
PA RA+RP + G RP P + PG P A P + +
Sbjct: 707 PAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP--AAAPGRARPPAAAPGRA 764
Query: 557 AGQAKTPSGIESLPGYQNPSAGQ-----AKTPGTPGQGL-----LGPGQGLGYQNPA 603
A P P Q P A Q A TP P Q L P G Q P
Sbjct: 765 RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPT 821
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.0 bits (73), Expect = 1.1
Identities = 26/132 (19%), Positives = 34/132 (25%), Gaps = 4/132 (3%)
Query: 518 QPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQNPSA 577
P P P P Q P QA + + A P +LP
Sbjct: 52 SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111
Query: 578 GQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDDCLNLLSESMAALANVDHNPNDNI 637
Q + PG + QNPA P L L + A A+
Sbjct: 112 IQTE-PGQLYPVQVPVMVT---QNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQS 167
Query: 638 YAVIEESPATSD 649
+ES
Sbjct: 168 AQKNDESQLQQQ 179
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 32.3 bits (74), Expect = 1.1
Identities = 23/113 (20%), Positives = 31/113 (27%), Gaps = 2/113 (1%)
Query: 497 PAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
PA G+ + P + A S P + PA + P PAG A P+ + P
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPG--V 647
Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
A P + AK G PA
Sbjct: 648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
Score = 31.1 bits (71), Expect = 2.8
Identities = 17/117 (14%), Positives = 28/117 (23%), Gaps = 5/117 (4%)
Query: 503 RAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKT 562
+A A P+ P AP+G PA + G +
Sbjct: 672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
Query: 563 PS----GIESLPGYQNPSAG-QAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDD 614
+ P +P A P P+ P + + +D
Sbjct: 732 SPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
>gnl|CDD|191923 pfam07998, Peptidase_M54, Peptidase family M54. This is a family
of metallopeptidases. Two human proteins have been
reported to degrade synthetic substrates and peptides.
Length = 176
Score = 31.3 bits (71), Expect = 1.1
Identities = 11/30 (36%), Positives = 14/30 (46%), Gaps = 6/30 (20%)
Query: 47 GHNLGLEHDTTECTCPSDRCIMAPSSRLFD 76
GH GL H C + C+M S+ L D
Sbjct: 135 GHTYGLSH------CNNTDCVMNFSNSLKD 158
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 32.5 bits (74), Expect = 1.1
Identities = 51/357 (14%), Positives = 90/357 (25%), Gaps = 10/357 (2%)
Query: 255 SGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAA 314
SG +GQ + S E ++A P + + S + A
Sbjct: 37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLA 96
Query: 315 PKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELID 374
P PA + + PP PPA+ P+ +P ++
Sbjct: 97 PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
Query: 375 AKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMS 434
A A A + P + + P + P + + A
Sbjct: 157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPI--- 213
Query: 435 RNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRP 494
S +S P + S+ + + + T
Sbjct: 214 --SASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWE 271
Query: 495 NLPAFGSMRAKRPTSLAGSQRPSQPP--PPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGY 552
G P S + S R P P +P + + + + + S
Sbjct: 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331
Query: 553 QNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKP 609
+ S+ A G P +P + P +PA G+P
Sbjct: 332 SSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK---RPRPSRAPSSPAASAGRP 385
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 32.3 bits (73), Expect = 1.1
Identities = 29/153 (18%), Positives = 48/153 (31%), Gaps = 18/153 (11%)
Query: 273 LLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIP----ATSVRLSADF 328
++AA + + AAPV+ P+ + A P A S F
Sbjct: 108 MIAAMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLF 167
Query: 329 ANFSATLPPELPPANKGPNTSPYARPLISSPILSDT-TSQTVRELIDAKNAESTRQTTRP 387
A+F T P P +P P I++ + + +Q E +D +
Sbjct: 168 ADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDLDDA---- 223
Query: 388 ASVAKSSTLPPVATESNALPGSTLHPVSSESNA 420
P ALP + P + + A
Sbjct: 224 ---------LPQILPPAALPPIVVAPAAPAALA 247
Score = 29.2 bits (65), Expect = 8.4
Identities = 27/155 (17%), Positives = 49/155 (31%), Gaps = 1/155 (0%)
Query: 275 AAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSAT 334
A +T +S + AP PT P P A + A V +A+ +
Sbjct: 54 ALETARSDTPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA 113
Query: 335 LPPELPPANKGPNTSPYARPLISSPILSDTT-SQTVRELIDAKNAESTRQTTRPASVAKS 393
L + + P +P I SP + VR ++A S A +
Sbjct: 114 LRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTT 173
Query: 394 STLPPVATESNALPGSTLHPVSSESNAPKGSTLNR 428
+ +P ++ + P + + A +
Sbjct: 174 APVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQD 208
>gnl|CDD|237325 PRK13267, PRK13267, archaemetzincin-like protein; Reviewed.
Length = 179
Score = 31.1 bits (71), Expect = 1.1
Identities = 11/22 (50%), Positives = 13/22 (59%), Gaps = 6/22 (27%)
Query: 47 GHNLGLEHDTTECTCPSDRCIM 68
GH LGLEH C + RC+M
Sbjct: 134 GHTLGLEH------CDNPRCVM 149
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 32.3 bits (73), Expect = 1.4
Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 6/51 (11%)
Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSA 557
P G + PSQPPPP P PG K S + + G+ +
Sbjct: 12 PPPPPGFEPPSQPPPPPP------PGVNVKKRSRKQLSIVGDILGHSGNPI 56
>gnl|CDD|222320 pfam13688, Peptidase_M84, Metallo-peptidase family M84.
Length = 193
Score = 31.2 bits (71), Expect = 1.4
Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 12/38 (31%)
Query: 47 GHNLGLEHD------TTEC-----TCPSD-RCIMAPSS 72
GHN G HD + C TCP+ R IM PSS
Sbjct: 148 GHNFGAVHDCDSSTESQCCPLSSSTCPAGGRYIMNPSS 185
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 31.7 bits (72), Expect = 1.5
Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 2/94 (2%)
Query: 305 DAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDT 364
+A + P+ A T+ S + + P+ A P P P+
Sbjct: 357 EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416
Query: 365 TSQTVRELIDAKNAESTRQTTRPASVAKSSTLPP 398
+ V ++ A + P T P
Sbjct: 417 VAPPVPHTPES--APKLTRAAIPVDEKPKYTPPA 448
>gnl|CDD|200169 TIGR02232, myxo_disulf_rpt, Myxococcus cysteine-rich repeat. This
model represents a sequence region shared between
several proteins of Myxococcus xanthus DK 1622 and some
eukaryotic proteins that include human pappalysin-1
(SP|Q13219). The region of about 40 amino acids contains
several conserved Cys residues presumed to form
disulfide bonds. The region appears in up to 13 repeats
in Myxococcus.
Length = 38
Score = 28.1 bits (63), Expect = 1.5
Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 3/30 (10%)
Query: 77 SPVCGNGFVEDGEECDCG---LEDSCKNAC 103
+P CG+G +E GEECD G D C C
Sbjct: 1 APTCGDGIIEPGEECDDGNTTSGDGCSATC 30
>gnl|CDD|215474 PLN02877, PLN02877, alpha-amylase/limit dextrinase.
Length = 970
Score = 31.7 bits (72), Expect = 1.5
Identities = 17/98 (17%), Positives = 28/98 (28%), Gaps = 6/98 (6%)
Query: 283 DMSVKLPTRAAPVKP----TPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPE 338
+ P+ A + P P+ S A + R ATS+ A P
Sbjct: 1 MALLLTPSPAVHLAPAPCVARPRASSAAFVLRSRWLRSRGATSISFVAHRGRNFRVRIPS 60
Query: 339 LPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAK 376
S + S TS +L+ ++
Sbjct: 61 ATSLAMATRPSSASSFFTLEL--SVPTSLFQDDLLYSR 96
>gnl|CDD|221242 pfam11816, DUF3337, Domain of unknown function (DUF3337). This
family of proteins are functionally uncharacterized.
This family is only found in eukaryotes. This presumed
domain is typically between 285 to 342 amino acids in
length.
Length = 320
Score = 31.0 bits (70), Expect = 1.9
Identities = 30/159 (18%), Positives = 43/159 (27%), Gaps = 13/159 (8%)
Query: 280 KSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPA-----------TSVRLSADF 328
K S SV L + P S+ + + P +S F
Sbjct: 13 KDSSGSVTLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKLSSSLFGKKF 72
Query: 329 ANFSATLPP-ELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRP 387
S+ L P S A L SS D L+ A P
Sbjct: 73 RMSSSLLKKCGAASTEGKPQKSEKAIDLKSSKAEKDPEINLGGLLLRALLEYWKELKCNP 132
Query: 388 ASVAKSSTLPPVATESNALPGSTLHPVS-SESNAPKGST 425
+ S+ LP + E+ L + SE + G
Sbjct: 133 RVLVFSTFLPSLDNETPYLKLPPDTTIIISEESPDLGGG 171
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 31.1 bits (70), Expect = 2.1
Identities = 27/110 (24%), Positives = 33/110 (30%), Gaps = 13/110 (11%)
Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTP--------SGIQSLPGYQNHSAG 558
P Q +Q P P P Q YQ+P Q + S Q Y S
Sbjct: 128 PQPPPAQQPQAQQPQPPPQVPQQ-QQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPY 186
Query: 559 QAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGL--GYQNPAVPP 606
Q ++ E LP + P Q GP Q Y P P
Sbjct: 187 QPQSYPPNEPLPSSMAMQ--PPYSGAPPSQQFYGPPQPSPYMYGGPGGRP 234
>gnl|CDD|132975 cd06644, STKc_STK10_LOK, Catalytic domain of the Protein
Serine/Threonine Kinase, STK10 or Lymphocyte-oriented
kinase. Serine/threonine kinases (STKs), STK10
subfamily, catalytic (c) domain. STKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. The
STK10 subfamily is part of a larger superfamily that
includes the catalytic domains of other protein STKs,
protein tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. Other names for STK10 include
lymphocyte-oriented kinase (LOK) and Xenopus polo-like
kinase kinase 1 (xPlkk1). STK10 is highly expressed in
lymphocytes and is responsible in regulating leukocyte
function associated antigen (LFA-1)-mediated lymphocyte
adhesion. It plays a role in regulating the CD28
responsive element in T cells, and may also function as
a regulator of polo-like kinase 1 (Plk1), a protein
which is overexpressed in multiple tumor types.
Length = 292
Score = 31.2 bits (70), Expect = 2.1
Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 14/85 (16%)
Query: 303 PSDAQITPIRAAPKIP-------ATSVRLSADFANFSATLPPELPPANKGPNTSPYARPL 355
P ++ P+R KI + + S +F +F T +K P T P A L
Sbjct: 213 PPHHELNPMRVLLKIAKSEPPTLSQPSKWSMEFRDFLKT------ALDKHPETRPSAAQL 266
Query: 356 ISSPILSDTTS-QTVRELIDAKNAE 379
+ P +S TS + +REL+ AE
Sbjct: 267 LEHPFVSSVTSNRPLRELVAEAKAE 291
>gnl|CDD|222468 pfam13948, DUF4215, Domain of unknown function (DUF4215). The
function of this family is unknown.
Length = 47
Score = 27.8 bits (62), Expect = 2.1
Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 3/27 (11%)
Query: 78 PVCGNGFVEDGEECDCG---LEDSCKN 101
+CG+G + E+CD G D C N
Sbjct: 19 SICGDGIIVGNEQCDDGNNVSFDGCYN 45
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 30.9 bits (70), Expect = 2.7
Identities = 25/144 (17%), Positives = 34/144 (23%), Gaps = 11/144 (7%)
Query: 493 RPNLPAFGS--MRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLP 550
P A G + P Q P P + PG P P
Sbjct: 389 SPMGGAMGQPPYYGQGPQQQFNGQ-PLGWPRMSMMPTPMGPGGP-LRPNGLAPMNAVRAP 446
Query: 551 GYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPK 610
+A Q + P YQ+ Q G + L + P K
Sbjct: 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
Query: 611 TYDDCLNLLSESMAALANVDHNPN 634
+L E + L P
Sbjct: 507 ------QVLGERLFPLV-EAIEPA 523
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is
found in the aflatoxin regulatory protein (AflR) which
is involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 30.3 bits (68), Expect = 3.0
Identities = 33/127 (25%), Positives = 50/127 (39%), Gaps = 9/127 (7%)
Query: 293 APVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYA 352
+ P PP + I P A P P + S F++ S PEL + PN S
Sbjct: 23 SSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQSPPAEPELWGSILSPNASNQD 82
Query: 353 RPLISSPILSDT------TSQTVRELIDAKNAESTRQTTRPASVAK---SSTLPPVATES 403
+ +SS + +T S + L D +A+ + T SVA SS + + S
Sbjct: 83 QGDLSSLLSVNTDFGQLFASLSPSPLFDGNDADLHAEATGELSVADLEVSSPMQDLFLTS 142
Query: 404 NALPGST 410
P S+
Sbjct: 143 ALSPPSS 149
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.7 bits (69), Expect = 3.2
Identities = 28/98 (28%), Positives = 36/98 (36%), Gaps = 11/98 (11%)
Query: 494 PNLPAFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQ 553
P P R S A + P Q PP A S LP P+ QA P + S
Sbjct: 408 PAKPEAPGARPAELPSPASAPTPEQQPPVARSA--PLP----PSPQASAPRNVASGKPGV 461
Query: 554 NHSAGQAK----TPSGIESLPGYQNPSAGQAKTPGTPG 587
+ + Q K T +G P Q S+ A+T G
Sbjct: 462 DLGSWQGKFMNFTRNGSRKQP-VQASSSDAAQTGVFEG 498
>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
domai.
Length = 46
Score = 27.3 bits (61), Expect = 3.4
Identities = 9/31 (29%), Positives = 11/31 (35%), Gaps = 1/31 (3%)
Query: 14 DCPFNCHDQGVCNSR-GHCHCHPGFAPPYCE 43
DC G C+ G C C P C+
Sbjct: 2 DCDPGGSASGTCDPDTGQCECKPNVTGRRCD 32
>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V).
This family is like pfam00008 but has 8 conserved
cysteines instead of six.
Length = 49
Score = 27.3 bits (61), Expect = 3.6
Identities = 9/31 (29%), Positives = 14/31 (45%), Gaps = 5/31 (16%)
Query: 18 NCHDQG----VCNSR-GHCHCHPGFAPPYCE 43
+C+ G C+ G C C PG +C+
Sbjct: 2 DCNPHGSLSDTCDPETGQCLCKPGVTGRHCD 32
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.5 bits (69), Expect = 3.7
Identities = 20/114 (17%), Positives = 27/114 (23%), Gaps = 6/114 (5%)
Query: 495 NLPAFGSMRAKRPTSLAGSQRPSQPPPP---APSGIQSLPGY--QNPAGQAKTPSGIQSL 549
A G K+ +Q + P P A S S Q A Q+ T
Sbjct: 366 GDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTP- 424
Query: 550 PGYQNHSAGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPA 603
P + + + K LGP Q A
Sbjct: 425 PTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 30.2 bits (68), Expect = 4.0
Identities = 30/174 (17%), Positives = 52/174 (29%), Gaps = 12/174 (6%)
Query: 251 SNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITP 310
S+S S G T ++ + +R+ P+P S +
Sbjct: 148 SSSKSASIDSSPTVIGPRPRSFSELNLTDRT--PAKVRSSRSELGAPSPSGGTSCPS-SS 204
Query: 311 IRAAPKIPATSVRLSADFANFSATLPPELPPANK--GPNTSPYARPLIS-SPILSDTTSQ 367
I + +R SA A L P ++ +SP AR + SP S +
Sbjct: 205 GGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSSSAKSPFKSSIQRK 264
Query: 368 TVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAP 421
+ L S P +KSS + + + ++ N
Sbjct: 265 ATKALSKLSLRAS------PKDTSKSSKSEVAPPKKSEAKVPSSSKKWTDGNVS 312
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 29.8 bits (67), Expect = 5.1
Identities = 18/123 (14%), Positives = 40/123 (32%), Gaps = 1/123 (0%)
Query: 251 SNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITP 310
+++ S + + +TH S + +A + P P
Sbjct: 160 ADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGTPAQP 219
Query: 311 IRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVR 370
+ + + + +A P + P P A P++S+P+ S Q++
Sbjct: 220 LTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQP-LPTAAAPVLSAPLGSHEWQQSLS 278
Query: 371 ELI 373
+ I
Sbjct: 279 QHI 281
>gnl|CDD|200561 cd10935, CE4_WalW, Putative catalytic domain of
lipopolysaccharide biosynthesis protein WalW and its
bacterial homologs. This family corresponds to a group
of uncharacterized lipopolysaccharide biosynthesis
protein WalW found in bacteria. Although their
biochemical properties remain to be determined, members
of this family is composed of a seven-stranded barrel
with detectable sequence similarity to the six-stranded
barrel rhizobial NodB-like proteins, which remove
N-linked or O-linked acetyl groups from cell wall
polysaccharides and belong to the larger carbohydrate
esterase 4 (CE4) superfamily.
Length = 295
Score = 29.9 bits (68), Expect = 5.1
Identities = 13/43 (30%), Positives = 15/43 (34%), Gaps = 4/43 (9%)
Query: 11 PVADCPFNCHDQGVCNSRGHC----HCHPGFAPPYCEYPGGHN 49
PVA P RG H HP PP+ E +N
Sbjct: 52 PVASDPAAVAILARALDRGRAEIGAHLHPWVTPPFDELTDPYN 94
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 30.2 bits (68), Expect = 5.2
Identities = 17/88 (19%), Positives = 27/88 (30%), Gaps = 9/88 (10%)
Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPGKPKTYDDCL 616
G+ T G + P P Q PA PP P +
Sbjct: 899 TGRVATAPGGD--AASAPPPGAGPPAPPQAVPPPRTT------QPPAAPPRGPDVPPAAV 950
Query: 617 NLLSESMAALANVDHNPNDNIY-AVIEE 643
L E++A L + + + Y A ++
Sbjct: 951 AELRETLADLRSAQRSGDFTAYGAALDR 978
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 29.4 bits (66), Expect = 5.3
Identities = 28/92 (30%), Positives = 35/92 (38%), Gaps = 33/92 (35%)
Query: 472 EDKVDMVKRTQSMRETDVNF--------KRPNLPAFGSMRAKRPTSLAGSQRPSQPPPPA 523
DKV+ V++ +V F KRP LP +P QPP P
Sbjct: 102 GDKVEEVRK-------EVEFFNNYLTDAKRPALP---------------EIKPLQPPKPD 139
Query: 524 PSGI-QSLPGYQNPAGQAKTPSGI--QSLPGY 552
P G+ LPGY QA P G + GY
Sbjct: 140 PGGLAPGLPGYPPQTPQALMPYGQPRPPMMGY 171
>gnl|CDD|215078 PLN00150, PLN00150, potassium ion transporter family protein;
Provisional.
Length = 779
Score = 29.8 bits (67), Expect = 5.4
Identities = 26/139 (18%), Positives = 42/139 (30%), Gaps = 18/139 (12%)
Query: 317 IPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISSPILSDTTSQTVRELIDAK 376
I S R +D + +A+ PE S Q+ ++
Sbjct: 639 IEIESFREQSDLESMAASWTPEELMGEGNSVGSGL----------FTQYDQSDINFATSQ 688
Query: 377 NAESTRQTTRPASVAKSSTLPPVATESNA-------LPGSTLHPVSSESNAPKGSTLNRI 429
+ S S T VA + + G+ +S K +N I
Sbjct: 689 EWKRPSSQEDSVSGHSSDTQDEVAFLNKCKEAGVVYILGNNDVKARKDSGFFKKVIINYI 748
Query: 430 TSFMSRNSTRRNSKVLTPP 448
+F+ R S R + VL P
Sbjct: 749 YTFLRRIS-RDSEVVLNIP 766
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 30.1 bits (68), Expect = 5.5
Identities = 22/90 (24%), Positives = 33/90 (36%), Gaps = 13/90 (14%)
Query: 289 PTRAAPVKPTPPKVPSDAQI---TPIRAAPKIPATSVRLSADFANFSATLPPELPPANKG 345
AA P P+ + P +AAP+ A VR++ P + A
Sbjct: 428 APAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP---------EPAVASAAPA 478
Query: 346 PNTSPYARPLISSPILSDTTSQTVRELIDA 375
P +P A L + D TV++L A
Sbjct: 479 PAAAPAAARLTPTE-EGDVWHATVQQLAAA 507
Score = 29.3 bits (66), Expect = 7.8
Identities = 22/112 (19%), Positives = 35/112 (31%), Gaps = 9/112 (8%)
Query: 289 PTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNT 348
A P P+ + A AAP A + A +A+ P P A
Sbjct: 373 AAPAEKKTPARPEAAAPA------AAPVAQAAAAPAPAAAPAAAASAPAAPPAA---APP 423
Query: 349 SPYARPLISSPILSDTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVA 400
+P A P ++P + + L A A++ +T A
Sbjct: 424 APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASA 475
>gnl|CDD|215613 PLN03174, PLN03174, Chalcone-flavanone isomerase-related;
Provisional.
Length = 278
Score = 29.4 bits (66), Expect = 5.8
Identities = 18/73 (24%), Positives = 28/73 (38%), Gaps = 5/73 (6%)
Query: 299 PPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGPNTSPYARPLISS 358
PP+ PS AA + +A A +A+ P P N + P +S
Sbjct: 13 PPRAPSFFAAAAAVAA-----AAAAAAAAAAAIAASRNPPHPFLQNALNFHNSSSPPWAS 67
Query: 359 PILSDTTSQTVRE 371
L+D + +V E
Sbjct: 68 ISLADPSPPSVVE 80
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.0 bits (67), Expect = 5.8
Identities = 85/393 (21%), Positives = 127/393 (32%), Gaps = 55/393 (13%)
Query: 249 NHSNSTSGGRGQRLLSSGEGQNYNLLAAKTHKSSD-MSVKLPTRA-APVKPTPPKVPSDA 306
+ NS S G G+ GEG++ + + SSD + R+ +P P+P SD+
Sbjct: 108 SRPNSPSEGEGE---GEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDS 164
Query: 307 ------QITPIRAAPKIPATSVRLSADFANF----SATLPPELPPANKGPNTSPYARP-- 354
Q+ + P I A A + +PP+ P P P
Sbjct: 165 DSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPL 224
Query: 355 -LISSPILSDTTSQT-----VRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNALPG 408
LIS+P L + + ++ + ++R + PP+ P
Sbjct: 225 SLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPV 284
Query: 409 STLHPVSS----ESNAPKGSTLNRITSFMSRNSTRRNSKVLT-----------PPATQGS 453
HP S+ A + S +S S+ PPA
Sbjct: 285 FLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP 344
Query: 454 ARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGS 513
+ PIP + ++ + P PA K +SL
Sbjct: 345 HIKPPPTTPIPQLPNQSHK---HPPHLQGPSPFPQMPSNLPPPPAL-----KPLSSLPTH 396
Query: 514 QRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPSGIESLPGYQ 573
PS PPP QS P Q+ Q + QSLP + SG+ S P Q
Sbjct: 397 HPPSAHPPPLQLMPQSQP-LQSVPAQPPVLTQSQSLPPKASTHP-----HSGLHSGPP-Q 449
Query: 574 NPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPP 606
+P A T G +GP L PA PP
Sbjct: 450 SPFAQHPFTSGGLPA--IGPPPSLPTSTPAAPP 480
>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like
domain; laminins are the major noncollagenous
components of basement membranes that mediate cell
adhesion, growth migration, and differentiation; the
laminin-type epidermal growth factor-like module occurs
in tandem arrays; the domain contains 4 disulfide bonds
(loops a-d) the first three resemble epidermal growth
factor (EGF); the number of copies of this domain in
the different forms of laminins is highly variable
ranging from 3 up to 22 copies.
Length = 50
Score = 26.5 bits (59), Expect = 6.1
Identities = 8/31 (25%), Positives = 12/31 (38%), Gaps = 5/31 (16%)
Query: 18 NCHDQG----VCNSR-GHCHCHPGFAPPYCE 43
+C+ G C+ G C C P C+
Sbjct: 3 DCNGHGSLSGQCDPGTGQCECKPNTTGRRCD 33
>gnl|CDD|219938 pfam08618, Opi1, Transcription factor Opi1. Opi1 is a leucine
zipper containing yeast transcription factor that
negatively regulates phospholipid biosynthesis. It
represses the expression of several UAS(INO) cis acting
element containing genes and its activity is mediated by
phosphorylations catalyzed by protein kinase A, protein
kinase C and casein kinase II.
Length = 387
Score = 29.7 bits (66), Expect = 6.3
Identities = 24/168 (14%), Positives = 47/168 (27%), Gaps = 17/168 (10%)
Query: 409 STLHPVSSESNA-PKGSTLNRITSFMSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNN 467
+ PV+S A + + + + R T P+ S + N
Sbjct: 40 RSAIPVASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNG 99
Query: 468 -----------LRDNEDKVDMVKRTQSMRETDVNFKRPNLPAFGSMRAKRPTSLAGSQRP 516
+ D + + ++ DV + + A S G+
Sbjct: 100 APPLIQKSSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGA--- 156
Query: 517 SQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHSAGQAKTPS 564
P + +LPGY + + S + P + G + P
Sbjct: 157 --PVRTSALSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQ 202
>gnl|CDD|218745 pfam05783, DLIC, Dynein light intermediate chain (DLIC). This
family consists of several eukaryotic dynein light
intermediate chain proteins. The light intermediate
chains (LICs) of cytoplasmic dynein consist of multiple
isoforms, which undergo post-translational modification
to produce a large number of species. DLIC1 is known to
be involved in assembly, organisation, and function of
centrosomes and mitotic spindles when bound to
pericentrin. DLIC2 is a subunit of cytoplasmic dynein 2
that may play a role in maintaining Golgi organisation
by binding cytoplasmic dynein 2 to its Golgi-associated
cargo.
Length = 490
Score = 29.8 bits (67), Expect = 6.4
Identities = 30/118 (25%), Positives = 44/118 (37%), Gaps = 15/118 (12%)
Query: 501 SMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSL----PGYQNHS 556
S+ AK+P A R + P +P G P A S + S+ P + +
Sbjct: 359 SLLAKQP---ATPTRGVESPARSPGGSPRTTNRSGPRNVASV-SPMTSVKKIDPNMKPGA 414
Query: 557 AGQAKTPSGIESLPGYQNPSAGQAKTPGTPGQGLLGPGQGLGYQNPAVPPG-KPKTYD 613
A + + SL + + +PG+PG G G G Q A G KP D
Sbjct: 415 ASEGVLANFFNSL------LSKKTGSPGSPGGGGSPAGTGTNTQGTAKKSGQKPVLTD 466
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 29.7 bits (66), Expect = 7.1
Identities = 51/246 (20%), Positives = 77/246 (31%), Gaps = 32/246 (13%)
Query: 276 AKTHKSSDMSV-----------KLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRL 324
AK HK S + K P P +P PK P AQ +PK+P
Sbjct: 564 AKEHKPSKIPTLSKKPEFPKDPKHPKD--PEEPKKPKRPRSAQRPTRPKSPKLPELLDIP 621
Query: 325 SADFANFSATLP-----------PELPPANKGPNTS-PYARPLISSPILSDTTSQTVREL 372
+ S P PE P K + P P P + +
Sbjct: 622 KSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSP--KPPFDPKFKEKFYDDY 679
Query: 373 IDAKNAESTRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSF 432
+DA +TT + S L E+ P +T P+ + + I
Sbjct: 680 LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
Query: 433 MSRNSTRRNSKVLTPPATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFK 492
+ + + TPP + E PL ++ E K + + E D K
Sbjct: 740 DAEQPD--DIEFFTPPEEE-RTFFHETPADTPLPDILAEEFKEEDI--HAETGEPDEAMK 794
Query: 493 RPNLPA 498
RP+ P+
Sbjct: 795 RPDSPS 800
Score = 29.7 bits (66), Expect = 8.1
Identities = 29/140 (20%), Positives = 47/140 (33%), Gaps = 21/140 (15%)
Query: 270 NYNLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFA 329
+Y AAK+ ++ V + + +K T P+ P TP PK+P
Sbjct: 678 DYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE-------- 729
Query: 330 NFSATLPPELPPANKGPNTSPYARPLISSPI----------LSDTTSQTVREL-IDAKNA 378
P P + P+ + P L D ++ +E I A+
Sbjct: 730 --EFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETG 787
Query: 379 ESTRQTTRPASVAKSSTLPP 398
E RP S ++ PP
Sbjct: 788 EPDEAMKRPDSPSEHEDKPP 807
>gnl|CDD|235722 PRK06164, PRK06164, acyl-CoA synthetase; Validated.
Length = 540
Score = 29.3 bits (66), Expect = 7.4
Identities = 12/50 (24%), Positives = 15/50 (30%)
Query: 507 PTSLAGSQRPSQPPPPAPSGIQSLPGYQNPAGQAKTPSGIQSLPGYQNHS 556
P A Q + P P P+ + T SG S P H
Sbjct: 153 PAPGARVQLFALPDPAPPAAAGERAADPDAGALLFTTSGTTSGPKLVLHR 202
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 29.2 bits (65), Expect = 7.5
Identities = 21/132 (15%), Positives = 42/132 (31%), Gaps = 8/132 (6%)
Query: 336 PPELPPANKGPNTSPYARPLISSPILSDTTSQT-VRELIDAKNAESTRQTTRPASVAKSS 394
PP + P + +R + +P ++ + E +A+ + A +
Sbjct: 237 PPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRAGTP 296
Query: 395 TLPPVATESNA--LPGSTLHPVSSESNAPKGSTLNRITS-----FMSRNSTRRNSKVLTP 447
+E +A + P S S +G NR S S + R +
Sbjct: 297 DDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYRRRRGRGV 356
Query: 448 PATQGSARSLEI 459
P + +R +
Sbjct: 357 PGRRSESRQDTV 368
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.5 bits (67), Expect = 7.9
Identities = 19/77 (24%), Positives = 26/77 (33%)
Query: 287 KLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFANFSATLPPELPPANKGP 346
++ +AP PTP PS +APK + S A S +P P P
Sbjct: 367 EIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSP 426
Query: 347 NTSPYARPLISSPILSD 363
A S L +
Sbjct: 427 TPPANAANAPPSLNLEE 443
>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
(MRP). This model describes multi drug
resistance-associated protein (MRP) in eukaryotes. The
multidrug resistance-associated protein is an integral
membrane protein that causes multidrug resistance when
overexpressed in mammalian cells. It belongs to ABC
transporter superfamily. The protein topology and
function was experimentally demonstrated by epitope
tagging and immunofluorescence. Insertion of tags in the
critical regions associated with drug efflux, abrogated
its function. The C-terminal domain seem to highly
conserved [Transport and binding proteins, Other].
Length = 1522
Score = 29.5 bits (66), Expect = 8.7
Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 4/60 (6%)
Query: 351 YARPLISSPILS----DTTSQTVRELIDAKNAESTRQTTRPASVAKSSTLPPVATESNAL 406
Y +PL S + S DT+ V L++ E + +P S P S+ L
Sbjct: 226 YRQPLEESDLWSLNKEDTSEMVVPVLVENWKKECKKTRKQPVSAVYGKKDPSKPKGSSQL 285
>gnl|CDD|219253 pfam06986, TraN, Type-1V conjugative transfer system mating pair
stabilisation. TraN is a large cysteine-rich outer
membrane protein involved in the mating-pair
stabilisation (adhesin) component of the F-type
conjugative plasmid transfer system. TraN is believed to
interact with the core type IV secretion system
apparatus through the TraV protein.
Length = 243
Score = 28.9 bits (65), Expect = 8.9
Identities = 18/74 (24%), Positives = 22/74 (29%), Gaps = 6/74 (8%)
Query: 118 ATGSCCNLETCQPHTAGRQCRAADRECDLPE---FCTGDSEF--CPSDVFKMDGETCKGG 172
T C + TC + C C C S C V C GG
Sbjct: 33 RTYQCQSNNTCAALESQPGCSFTRSTCLEGSDEGVCLQWSVTYDCEQRVCSTTQYVC-GG 91
Query: 173 EAFCYEGSCRTHSD 186
+ FC +G C T
Sbjct: 92 DVFCLDGDCLTPES 105
>gnl|CDD|233595 TIGR01845, outer_NodT, efflux transporter, outer membrane factor
(OMF) lipoprotein, NodT family. Members of This model
comprise a subfamily of the Outer Membrane Factor (TCDB
1.B.17) porins. OMF proteins operate in conjunction with
a primary transporter of the RND, MFS, ABC, or PET
systems, and a MFP (membrane fusion protein) to tranport
substrates across membranes. The complex thus formed
allows transport (export) of various solutes (heavy
metal cations; drugs, oligosaccharides, proteins, etc.)
across the two envelopes of the Gram-negative bacterial
cell envelope in a single energy-coupled step. Current
data suggest that the OMF (and not the MFP) is largely
responsible for the formation of both the trans-outer
membrane and trans-periplasmic channels. The roles
played by the MFP have yet to be determined [Cellular
processes, Detoxification, Transport and binding
proteins, Porins].
Length = 454
Score = 29.3 bits (66), Expect = 9.1
Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 1/59 (1%)
Query: 272 NLLAAKTHKSSDMSVKLPTRAAPVKPTPPKVPSDAQITPIRAAPKIPATSVRLSADFAN 330
N LAA K + R + PP +P +R P I A RL+A A
Sbjct: 236 NALAALLGKGPSRGLA-IARPLLLDQLPPDLPLSLPSDLLRRRPDIRAAERRLAAANAQ 293
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 28.7 bits (64), Expect = 9.9
Identities = 30/159 (18%), Positives = 49/159 (30%), Gaps = 9/159 (5%)
Query: 380 STRQTTRPASVAKSSTLPPVATESNALPGSTLHPVSSESNAPKGSTLNRITSFMSRNSTR 439
S T+R + + + L T VSS NA L R + + S +
Sbjct: 6 SLSPTSRKNHLFE-----DKDGGAPQLDHYT--SVSSSKNAKSEKNLKRHSIIIPALSWK 58
Query: 440 RNSKVLTPP--ATQGSARSLEISEPIPLNNLRDNEDKVDMVKRTQSMRETDVNFKRPNLP 497
R + ++ S +L S + + E+ V R + N P
Sbjct: 59 RLVASGSKKKKTSKKSTANLASSSYQKIVIQLNRENLKKSVPRPSDTTLSCANLSSLREP 118
Query: 498 AFGSMRAKRPTSLAGSQRPSQPPPPAPSGIQSLPGYQNP 536
+ G A P+ L+G P I +P
Sbjct: 119 SPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITSTGGSP 157
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.128 0.390
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,079,005
Number of extensions: 3115737
Number of successful extensions: 2660
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2457
Number of HSP's successfully gapped: 190
Length of query: 718
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 614
Effective length of database: 6,324,786
Effective search space: 3883418604
Effective search space used: 3883418604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)