RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6980
(760 letters)
>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
Provisional.
Length = 450
Score = 376 bits (966), Expect = e-123
Identities = 156/301 (51%), Positives = 198/301 (65%), Gaps = 22/301 (7%)
Query: 256 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKY 315
+ HEE TK N A IE G+YE+ TWY SPFP EY KL+ CEFCLK+
Sbjct: 159 REHEEFTKVK-----------NIATIELGRYEIDTWYFSPFPPEYNDCSKLYFCEFCLKF 207
Query: 316 TKSKAVLERHRDKCAWRHPPATEIYRK----ENLSVFEVDGNQNKFYCQNLCLLAKLFLD 371
K K L+RH KC +HPP EIYR E LS+FEVDG +NK YCQNLC LAKLFLD
Sbjct: 208 MKRKEQLQRHMKKCDLKHPPGDEIYRHPTRQEGLSMFEVDGKKNKVYCQNLCYLAKLFLD 267
Query: 372 HKTLYYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFL 431
HKTLYYDV+ FLFYVL + DD+GCH+VGYFSKEKH ++ YN++CI+TLP YQRKGYG+FL
Sbjct: 268 HKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEEDYNLACILTLPPYQRKGYGKFL 327
Query: 432 IDFSYLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTG 491
I FSY LSK+EG+ GTPE+PLSDLG VSY YW VLLE L + I I ++ T
Sbjct: 328 IAFSYELSKREGKVGTPERPLSDLGLVSYRGYWTRVLLEILKKHKGN--ISIKELSDMTA 385
Query: 492 LYHHDVAETLELLGMLRTKHGDSSEPCIVINWAIVDAHMKRLEQSKTRIKIDPECLRWTP 551
+ D+ TL+ L +++ + G + I + +++ H+K + +++DP L WTP
Sbjct: 386 IKAEDIVSTLQSLNLIQYRKG---QHVICADPKVLEEHLK--AAGRGGLEVDPSKLIWTP 440
Query: 552 L 552
Sbjct: 441 Y 441
Score = 47.1 bits (112), Expect = 3e-05
Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 11/46 (23%)
Query: 34 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYA 79
+ HEE TK N A IE G+YE+ TWY SPFP EY
Sbjct: 159 REHEEFTKVK-----------NIATIELGRYEIDTWYFSPFPPEYN 193
>gnl|CDD|216741 pfam01853, MOZ_SAS, MOZ/SAS family. This region of these proteins
has been suggested to be homologous to
acetyltransferases.
Length = 189
Score = 333 bits (857), Expect = e-110
Identities = 113/183 (61%), Positives = 139/183 (75%)
Query: 331 WRHPPATEIYRKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQN 390
RHPP EIYRK N+SVFEVDG + K YCQNLCLLAKLFLDHKTLYYDV+PFLFY+LT+
Sbjct: 1 LRHPPGNEIYRKGNISVFEVDGRKQKLYCQNLCLLAKLFLDHKTLYYDVDPFLFYILTET 60
Query: 391 DDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEK 450
D+ GCH+VGYFSKEK + YN++CI+TLP YQRKGYG+ LIDFSY LS+KEG+ G+PEK
Sbjct: 61 DETGCHIVGYFSKEKESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGKIGSPEK 120
Query: 451 PLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTK 510
PLSDLG +SY +YW LLE L R++ I I ++ TG+ D+ TL+ L M++
Sbjct: 121 PLSDLGLLSYRSYWSQTLLELLLEHRDEHKITIKELSKMTGITVEDIISTLQSLNMIKYY 180
Query: 511 HGD 513
G
Sbjct: 181 KGQ 183
>gnl|CDD|215642 PLN03238, PLN03238, probable histone acetyltransferase MYST;
Provisional.
Length = 290
Score = 332 bits (854), Expect = e-109
Identities = 147/298 (49%), Positives = 192/298 (64%), Gaps = 22/298 (7%)
Query: 258 HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKYTK 317
HEE TK N IE G+YE+ TWY SP+P+ YA KL++CE+CLKY +
Sbjct: 11 HEETTKVK-----------NIEMIELGKYEMDTWYYSPYPEPYASCTKLYICEYCLKYMR 59
Query: 318 SKAVLERHRDKCAWRHPPATEIYR---KENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKT 374
K L RH KC R PP IY + LSVFEVDG + K YCQNLCLLAKLFLDHKT
Sbjct: 60 KKKSLLRHLAKCDIRQPPGGGIYGAVTEGPLSVFEVDGKKAKVYCQNLCLLAKLFLDHKT 119
Query: 375 LYYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
LYYDV+PFLFYV+T+ DD G H+VGYFSKEK + YN++CI+TLP YQRKGYG+FLI F
Sbjct: 120 LYYDVDPFLFYVMTEVDDHGSHIVGYFSKEKVSAEDYNLACILTLPPYQRKGYGKFLISF 179
Query: 435 SYLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYH 494
+Y LSK+EG+ GTPE+PLSDLG+VS+ +YW VLLE L ++ + I + TG+
Sbjct: 180 AYELSKREGKVGTPERPLSDLGKVSFRSYWTRVLLEQLRDVKGD--VSIKDLSLATGIRG 237
Query: 495 HDVAETLELLGMLRTKHGDSSEPCIVINWAIVDAHMKRLEQSKTRIKIDPECLRWTPL 552
D+ TL+ L +++ G + I ++ ++D H + + I++D W PL
Sbjct: 238 EDIVSTLQSLNLIKYWKG---QHVIHVDQRVLDEHWAKFAHQRV-IEVDCL--HWQPL 289
Score = 43.7 bits (103), Expect = 2e-04
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 11/49 (22%)
Query: 36 HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARNTQV 84
HEE TK N IE G+YE+ TWY SP+P+ YA T++
Sbjct: 11 HEETTKVK-----------NIEMIELGKYEMDTWYYSPYPEPYASCTKL 48
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 327 bits (840), Expect = e-105
Identities = 140/269 (52%), Positives = 192/269 (71%), Gaps = 6/269 (2%)
Query: 281 IEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKYTKSKAVLERHRDKCAWRHPPATEIY 340
I+ G YE++ WY SP+P+E++ L +++CEFCLKY S+ L RHR KC+ +HPP EIY
Sbjct: 133 IKLGNYEIEPWYFSPYPEEFSDLDIVYICEFCLKYYGSQTSLVRHRKKCSLQHPPGNEIY 192
Query: 341 RKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDDKGCHLVGY 400
R + +S FE+DG + + YC+NLCLL+KLFLDHKTLYYDV+PFLFYVLT+ D GCHLVGY
Sbjct: 193 RDKYISFFEIDGRKQRLYCRNLCLLSKLFLDHKTLYYDVDPFLFYVLTERGDTGCHLVGY 252
Query: 401 FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSDLGRVSY 460
FSKEK +Q YN++CI+TLP YQR+GYG+ LIDFSYLLS+KEG+ G+PEKPLSDLG +SY
Sbjct: 253 FSKEKESEQDYNLACILTLPPYQRRGYGKLLIDFSYLLSQKEGKVGSPEKPLSDLGLLSY 312
Query: 461 HAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTKHGDSSEPCIV 520
AYW ++ + L + + I+++ +TG+ DV TLE L +LR + I
Sbjct: 313 RAYWSEIVAKLLLKMDKEIT-DINEISKETGMSTDDVIHTLEALNILRE---YKGQYIIS 368
Query: 521 INWAIVDAHMKRLEQSKTRIKIDPECLRW 549
+N + +++ SK R +I+P+ L W
Sbjct: 369 LNSDKLHNYLR--LWSKKRRRINPDLLLW 395
Score = 29.4 bits (66), Expect = 7.8
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 59 IEFGQYEVKTWYSSPFPQEYARNTQV 84
I+ G YE++ WY SP+P+E++ V
Sbjct: 133 IKLGNYEIEPWYFSPYPEEFSDLDIV 158
>gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional.
Length = 552
Score = 275 bits (705), Expect = 1e-83
Identities = 126/271 (46%), Positives = 162/271 (59%), Gaps = 28/271 (10%)
Query: 256 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKY 315
HEE T+ + + G++ + TWY SP P EY + L CE+CL +
Sbjct: 241 LDHEETTRLRTI-----------GRVRIGKFILDTWYFSPLPDEYQNVDTLHFCEYCLDF 289
Query: 316 TKSKAVLERHRDKCAWRHPPATEIYRKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTL 375
+ L RH +C RHPP EIYRK+N+SVFE+DG + Y +NLC LAKLFLDHKTL
Sbjct: 290 FCFEDELIRHLSRCQLRHPPGNEIYRKDNISVFEIDGALTRGYAENLCYLAKLFLDHKTL 349
Query: 376 YYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFS 435
YDVEPFLFY++T+ D++GCH+VGYFSKEK YN++CI+TLP YQRKGYG+ L+D S
Sbjct: 350 QYDVEPFLFYIVTEVDEEGCHIVGYFSKEKVSLLHYNLACILTLPCYQRKGYGKLLVDLS 409
Query: 436 YLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLIC------------- 482
Y LS KEG+ G PE+PLSDLGR Y+ +W + EYL Q IC
Sbjct: 410 YKLSLKEGKWGHPERPLSDLGRAIYNNWWAHRISEYLLEYFKQNKICERGGSKQPLQVSN 469
Query: 483 ----IDQMCADTGLYHHDVAETLELLGMLRT 509
ID + TG+ DV LE G++R
Sbjct: 470 YWKFIDNVVRSTGIRREDVIRILEENGIMRN 500
Score = 33.5 bits (76), Expect = 0.52
Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 11/45 (24%)
Query: 34 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEY 78
HEE T+ + + G++ + TWY SP P EY
Sbjct: 241 LDHEETTRLRTI-----------GRVRIGKFILDTWYFSPLPDEY 274
>gnl|CDD|178777 PLN03239, PLN03239, histone acetyltransferase; Provisional.
Length = 351
Score = 261 bits (667), Expect = 2e-80
Identities = 139/333 (41%), Positives = 194/333 (58%), Gaps = 30/333 (9%)
Query: 229 VSEDVVSKLLATDLAPGVTRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYEV 288
V EDVV+ + A +L D L + HEE TK N A +E G Y++
Sbjct: 38 VGEDVVATIAAPELDEHEGLDDAAL--KEHEEVTKVK-----------NVAFLELGPYQM 84
Query: 289 KTWYSSPFPQEYAR----LPKLFLCEFCLKYTKSKAVLERHRDKC---AWRHPPATEIYR 341
TWY SP P+E + + L++CEF + K+ L R + K RHPP EIYR
Sbjct: 85 DTWYFSPLPKELFKAGGFIDVLYVCEFSFGFFARKSELLRFQAKELPKERRHPPGNEIYR 144
Query: 342 KENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDDKGCHLVGYF 401
+L++FEVDG + + YCQNLC +AKLFLDHKTLY+DV+PFLFYVL + D++G H VGY+
Sbjct: 145 CGDLAMFEVDGFEERIYCQNLCYIAKLFLDHKTLYFDVDPFLFYVLCEVDERGFHPVGYY 204
Query: 402 SKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSDLGRVSYH 461
SKEK+ YN++CI+T P +QRKGYGRFLI FSY LSKKE + G+PEKP+SDLG+ +Y
Sbjct: 205 SKEKYSDVGYNLACILTFPAHQRKGYGRFLIAFSYELSKKEEKVGSPEKPMSDLGQQAYI 264
Query: 462 AYWKSVLLEY-LDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTKHGDSSEPCIV 520
YW S ++++ L+ N + I + T + D+ L LG+L+ +G +
Sbjct: 265 PYWGSTIVDFLLNHSGNDSSLSIMDIAKKTSIMAEDIVFALNQLGILKFING-------I 317
Query: 521 INWAIVDAHMKRLEQSK--TRIKIDPECLRWTP 551
A ++ L + ++DP L WTP
Sbjct: 318 YFIAAEKGLLEELAEKHPVKEPRVDPSKLHWTP 350
Score = 41.6 bits (97), Expect = 0.001
Identities = 25/76 (32%), Positives = 33/76 (43%), Gaps = 12/76 (15%)
Query: 36 HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARNTQVVSKLLATDLAP 95
HEE TK N A +E G Y++ TWY SP P+E + + L + +
Sbjct: 65 HEEVTKVK-----------NVAFLELGPYQMDTWYFSPLPKELFKAGGFIDVLYVCEFSF 113
Query: 96 GV-TRKDIDLYKQAHE 110
G RK L QA E
Sbjct: 114 GFFARKSELLRFQAKE 129
>gnl|CDD|214584 smart00249, PHD, PHD zinc finger. The plant homeodomain (PHD)
finger is a C4HC3 zinc-finger-like motif found in
nuclear proteins thought to be involved in epigenetics
and chromatin-mediated transcriptional regulation. The
PHD finger binds two zinc ions using the so-called
'cross-brace' motif and is thus structurally related to
the RING finger and the FYVE finger. It is not yet known
if PHD fingers have a common molecular function. Several
reports suggest that it can function as a
protein-protein interacton domain and it was recently
demonstrated that the PHD finger of p300 can cooperate
with the adjacent BROMO domain in nucleosome binding in
vitro. Other reports suggesting that the PHD finger is a
ubiquitin ligase have been refuted as these domains were
RING fingers misidentified as PHD fingers.
Length = 47
Score = 40.3 bits (94), Expect = 9e-05
Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 141 LIKCCACNVYYHIICLQPPLERRLK-VPWKCTSC 173
L++C C+ +YH CL PPL W C C
Sbjct: 14 LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47
>gnl|CDD|173926 cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes
that characteristically catalyze the transfer of an acyl
group to a substrate. NAT (N-Acyltransferase) is a
large superfamily of enzymes that mostly catalyze the
transfer of an acyl group to a substrate and are
implicated in a variety of functions, ranging from
bacterial antibiotic resistance to circadian rhythms in
mammals. Members include GCN5-related
N-Acetyltransferases (GNAT) such as Aminoglycoside
N-acetyltransferases, Histone N-acetyltransferase (HAT)
enzymes, and Serotonin N-acetyltransferase, which
catalyze the transfer of an acetyl group to a substrate.
The kinetic mechanism of most GNATs involves the ordered
formation of a ternary complex: the reaction begins with
Acetyl Coenzyme A (AcCoA) binding, followed by binding
of substrate, then direct transfer of the acetyl group
from AcCoA to the substrate, followed by product and
subsequent CoA release. Other family members include
Arginine/ornithine N-succinyltransferase, Myristoyl-CoA:
protein N-myristoyltransferase, and
Acyl-homoserinelactone synthase which have a similar
catalytic mechanism but differ in types of acyl groups
transferred. Leucyl/phenylalanyl-tRNA-protein
transferase and FemXAB nonribosomal peptidyltransferases
which catalyze similar peptidyltransferase reactions are
also included.
Length = 65
Score = 38.8 bits (91), Expect = 5e-04
Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 6/53 (11%)
Query: 384 FYVLTQNDDKGCHLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
F V + + +VG+ S + + + LP+Y+ KG G L++
Sbjct: 1 FLVAEDDGE----IVGFASLSPDGSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49
>gnl|CDD|201356 pfam00628, PHD, PHD-finger. PHD folds into an interleaved type of
Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 51
Score = 35.5 bits (82), Expect = 0.004
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 141 LIKCCACNVYYHIICLQPPLER--RLKVPWKCTSCESG 176
L+ C C+ ++H+ CL PPLE + W C C+
Sbjct: 14 LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECKPK 51
>gnl|CDD|222185 pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain. This
domain catalyzes N-acetyltransferase reactions.
Length = 79
Score = 35.2 bits (82), Expect = 0.011
Identities = 10/54 (18%), Positives = 19/54 (35%), Gaps = 9/54 (16%)
Query: 383 LFYVLTQNDDKGCHLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
F V + +VG+ + + P+Y+ +GYG L+
Sbjct: 4 HFLVAYDDG----EIVGFLRLRPIGEGAY---IGGVAVDPEYRGQGYGSKLLRH 50
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 38.3 bits (89), Expect = 0.019
Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 3/97 (3%)
Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNT 670
L+L AS ++ + + +T A +TT E+ TEE D + ++
Sbjct: 13 LTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAP 72
Query: 671 TASETTASETPC---TEELDVMTPSSSLNESGAGVPV 704
++SET + P T+ + + + PV
Sbjct: 73 SSSETAETSDPTSEATDTTTSEARTVTPAATETSKPV 109
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 37.9 bits (88), Expect = 0.027
Identities = 22/134 (16%), Positives = 50/134 (37%), Gaps = 11/134 (8%)
Query: 587 EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
+ +D+ E + K+G ++ + + AA P P +K + + +
Sbjct: 1315 DDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKP-----PAAAKKRGPATVQSGQKLLTE 1369
Query: 647 ETPCTEELDVMTPSS-----RKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAG 701
E + + +P R +K + + A+ TE + ++ SSS +
Sbjct: 1370 MLKPAEAIGI-SPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEID 1428
Query: 702 VPVVKKRRKFRRKK 715
V + ++ RK+
Sbjct: 1429 VSAKPRPQRANRKQ 1442
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 35.1 bits (81), Expect = 0.12
Identities = 13/101 (12%), Positives = 30/101 (29%), Gaps = 5/101 (4%)
Query: 550 TPLVSHIVNPYK-TLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRK 608
+ ++ I++ + T K +P S + + E E E + T K +
Sbjct: 2 SLILCMILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKED 61
Query: 609 RKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETP 649
+ ++ + KK K+ +
Sbjct: 62 ----KNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98
Score = 29.3 bits (66), Expect = 9.5
Identities = 20/102 (19%), Positives = 35/102 (34%), Gaps = 5/102 (4%)
Query: 635 SKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSS 694
S T S+ + L + SE +N A+ +T + + + S
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72
Query: 695 LNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNP 736
+ +K+ + G+ TPKK KK+ P P
Sbjct: 73 KKKKKK-----EKKEPKSEGETKLGFKTPKKSKKTKKKPPKP 109
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 34.8 bits (79), Expect = 0.17
Identities = 38/197 (19%), Positives = 68/197 (34%), Gaps = 29/197 (14%)
Query: 561 KTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK---------- 610
K T + + P++ + + T TT AET+E VV + K
Sbjct: 156 KETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWALSV 215
Query: 611 ---------LSLDTDAASPVV---EVTPKKTRKESESKNTTASETTASETPCTEELDVMT 658
+S + ++S + ++ K+T + K +E A+E + T
Sbjct: 216 KYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENT 275
Query: 659 PSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNT 718
++ +E K TT + TA + P PS++ N + NNT
Sbjct: 276 NTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNT-------NTNTNTNNT 328
Query: 719 GWDTPKKKKKSTNITPN 735
TP K + +
Sbjct: 329 NTSTPSKNTNTNTNSNT 345
Score = 30.2 bits (67), Expect = 4.4
Identities = 25/118 (21%), Positives = 50/118 (42%), Gaps = 15/118 (12%)
Query: 561 KTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASP 620
KT + K ++ + T T TTEK+ E T+++T K T+AA P
Sbjct: 257 KTEAPAAEKQAAPVVKENTNTNTATTEKK-ETTTQQQTAPKAP-----------TEAAKP 304
Query: 621 VVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTAS 678
+ ++ + T + T + TP + T ++ + + NT A++ +++
Sbjct: 305 APAPSTNTNANKTNTNTNTNTNNTNTSTPSK---NTNTNTNSNTNTNSNTNANQGSSN 359
>gnl|CDD|237470 PRK13688, PRK13688, hypothetical protein; Provisional.
Length = 156
Score = 33.4 bits (77), Expect = 0.17
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 419 LPQYQRKGYGRFLIDF 434
LP+YQ +GYG L+DF
Sbjct: 88 LPKYQNRGYGEMLVDF 103
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 34.7 bits (79), Expect = 0.19
Identities = 26/96 (27%), Positives = 40/96 (41%), Gaps = 4/96 (4%)
Query: 551 PLVSHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK 610
P + IV+ +T + + S+ N E AT T E E E K+T+ + K
Sbjct: 44 PGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEP----KQTETKEQTK 99
Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
+ A+ VE P + S N T+S T A+
Sbjct: 100 TPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPAT 135
Score = 29.7 bits (66), Expect = 7.0
Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%)
Query: 618 ASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTA 677
A+P P K++E+K T +T + P ++++ S N T+S T A
Sbjct: 77 ATPENTTEPTVEPKQTETKEQT--KTPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPA 134
Query: 678 S 678
+
Sbjct: 135 T 135
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.9 bits (78), Expect = 0.36
Identities = 32/155 (20%), Positives = 49/155 (31%), Gaps = 7/155 (4%)
Query: 587 EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
E E+E++ EE K GR RK + + K+ + E + K + E
Sbjct: 402 EGEEESDEEENEEPSKKNVGR-RKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEEL- 459
Query: 647 ETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAG--VPV 704
E +++ + K SE E E T S +
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE---ENPWLKTTSSVGKSAKKQDSKKKS 516
Query: 705 VKKRRKFRRKKNNTGWDTPKKKKKSTNITPNPVLK 739
K K K + KKKKK +I + L
Sbjct: 517 SSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLI 551
Score = 32.0 bits (73), Expect = 1.4
Identities = 34/172 (19%), Positives = 61/172 (35%), Gaps = 19/172 (11%)
Query: 565 KESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLS-LDTDAASPVVE 623
+E ++ S NV E EKE E++ ++ + K ++ D VE
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468
Query: 624 VTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCT 683
K K SE E E + T SS +S K + ++++
Sbjct: 469 KVANKLLKRSEKAQKEEEEEELDEENPWLK----TTSSVGKSAKKQDSKKKSSSKLD--- 521
Query: 684 EELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPN 735
++ S A V V KK++K +K+ D ++ S + +
Sbjct: 522 --------KAANKISKAAVKVKKKKKK---EKSIDLDDDLIDEEDSIKLDVD 562
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 33.6 bits (76), Expect = 0.37
Identities = 24/142 (16%), Positives = 52/142 (36%), Gaps = 11/142 (7%)
Query: 580 TTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVT---------PKKTR 630
T T + K ++ ++ + V K R +++ A S + +K
Sbjct: 21 ITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAP 80
Query: 631 KESESKNTTASETTASETPCTEE--LDVMTPSSRKESESKNTTASETTASETPCTEELDV 688
+E K + + S+ E+ L++ + K+ +S+ TT S T ++ +
Sbjct: 81 NTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTTPPSTNT 140
Query: 689 MTPSSSLNESGAGVPVVKKRRK 710
P S P +K+ +
Sbjct: 141 PQPMQSTKSDTPQSPTIKQAQT 162
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 33.1 bits (74), Expect = 0.61
Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 17/188 (9%)
Query: 558 NPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDA 617
N K+ ++ ++ P+S V E T E +A +E KK K+ +++ S + A
Sbjct: 219 NKKKSESEATAAPASVEQV-VEQPKVVTEEPHQQAAPQE----KKNKKNKRKSESENVPA 273
Query: 618 AS-----PVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTA 672
AS PVVE TP + E++ KN + + SE E + P S+K + N
Sbjct: 274 ASETPVEPVVETTPPAS--ENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDF 331
Query: 673 SE-TTASETPCTEELDVMTPSSSLNESGAGV--PVVKKRRKFRRKKNNTGWDTPKKKKKS 729
+ TA E P E + TP++ + E V VV+K +N KKK +S
Sbjct: 332 LDFVTAKEEPKDEPAE--TPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSES 389
Query: 730 TNITPNPV 737
+T PV
Sbjct: 390 EKVTEQPV 397
>gnl|CDD|223532 COG0456, RimI, Acetyltransferases [General function prediction
only].
Length = 177
Score = 31.9 bits (72), Expect = 0.66
Identities = 13/78 (16%), Positives = 26/78 (33%), Gaps = 8/78 (10%)
Query: 376 YYDVEPFLFYVLTQN---DDKGCHLVGYFSKEKH-----CQQKYNVSCIMTLPQYQRKGY 427
P L V +VG+ + ++ + P+Y+ +G
Sbjct: 49 DLTQAPELLLVAETGGLDGLLDGKVVGFLLVRVVDGRPSADHEGHIYNLAVDPEYRGRGI 108
Query: 428 GRFLIDFSYLLSKKEGQR 445
GR L+D + ++ G
Sbjct: 109 GRALLDEALERLRERGLA 126
>gnl|CDD|240069 cd04718, BAH_plant_2, BAH, or Bromo Adjacent Homology domain,
plant-specific sub-family with unknown function. BAH
domains are found in a variety of proteins playing roles
in transcriptional silencing and the remodeling of
chromatin. It is assumed that in most or all of these
instances the BAH domain mediates protein-protein
interactions.
Length = 148
Score = 31.4 bits (71), Expect = 0.82
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 151 YHIICLQPPLERRLKVPWKCTSCE---SGTTEPSPAKTSPGRARASLTTS 197
+H+ CL+PPL+ + W C CE SG + + A L +
Sbjct: 2 FHLCCLRPPLKEVPEGDWICPFCEVEKSGQSAMPQLPPTSRSACEKLLSG 51
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 32.6 bits (74), Expect = 0.87
Identities = 31/164 (18%), Positives = 61/164 (37%), Gaps = 30/164 (18%)
Query: 579 ETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNT 638
+ + K+ ++T E K K RK K + D + KK R E +S +
Sbjct: 724 AEKDSLSAPKKQTSKTASEKSSSKGK--RKHKNDEEADK------IESKKQRLEEKSSSC 775
Query: 639 TASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNES 698
+ S +++ SRK S +K EE+ + +PSS L+ S
Sbjct: 776 SPSSSSSHHHSS------SNKESRKSSRNKE--------------EEM-LPSPSSPLSSS 814
Query: 699 GAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNPVLKQQK 742
++R RR+++ + P + + + + + +
Sbjct: 815 SPKPEHPSRKRP-RRQEDTSSSSGPFSASSTKSSSKSSSTSKHR 857
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 32.5 bits (74), Expect = 0.99
Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 12/105 (11%)
Query: 578 AETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKN 637
A T T T E KT+ +T V KT ++++
Sbjct: 759 ASGTGTSITTSG-TFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKT--VTQTQT 815
Query: 638 TTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPC 682
TT+S + T +P+ S + T+ S+TT C
Sbjct: 816 TTSSPSPTQTT---------SPTQTSTSTTTTTSPSQTTTGGGIC 851
>gnl|CDD|216007 pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family. This
family contains proteins with N-acetyltransferase
functions such as Elp3-related proteins.
Length = 80
Score = 29.5 bits (67), Expect = 1.2
Identities = 10/41 (24%), Positives = 17/41 (41%), Gaps = 2/41 (4%)
Query: 396 HLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
LVG+ S + + P+Y+ KG G L++
Sbjct: 6 ELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGIGTALLEA 46
>gnl|CDD|221891 pfam13007, LZ_Tnp_IS66, Transposase C of IS166 homeodomain. This
is a leucine-zipper-like or homeodomain-like region of
transposase TnpC of insertion element IS66.
Length = 77
Score = 29.2 bits (66), Expect = 1.3
Identities = 10/36 (27%), Positives = 12/36 (33%), Gaps = 1/36 (2%)
Query: 575 NVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK 610
+ E A E E EA E +R RK
Sbjct: 32 LEELEADAAAL-EAEAEAAAAPEQPAAPPRRRPGRK 66
>gnl|CDD|176116 cd08425, PBP2_CynR, The C-terminal substrate-binding domain of the
LysR-type transcriptional regulator CynR, contains the
type 2 periplasmic binding fold. CynR is a LysR-like
transcriptional regulator of the cyn operon, which
encodes genes that allow cyanate to be used as a sole
source of nitrogen. The operon includes three genes in
the following order: cynT (cyanate permease), cynS
(cyanase), and cynX (a protein of unknown function).
CynR negatively regulates its own expression
independently of cyanate. CynR binds to DNA and induces
bending of DNA in the presence or absence of cyanate,
but the amount of bending is decreased by cyanate. The
CynR of LysR-type transcriptional regulator family is
composed of two functional domains joined by a linker
helix involved in oligomerization: an N-terminal HTH
(helix-turn-helix) domain, which is responsible for the
DNA-binding specificity, and a C-terminal
substrate-binding domain, which is structurally
homologous to the type 2 periplasmic binding proteins
(PBP2). The PBP2 are responsible for the uptake of a
variety of substrates such as phosphate, sulfate,
polysaccharides, lysine/arginine/ornithine, and
histidine. The PBP2 bind their ligand in the cleft
between these domains in a manner resembling a Venus
flytrap. After binding their specific ligand with high
affinity, they can interact with a cognate membrane
transport complex comprised of two integral membrane
domains and two cytoplasmically located ATPase domains.
This interaction triggers the ligand translocation
across the cytoplasmic membrane energized by ATP
hydrolysis.
Length = 197
Score = 30.8 bits (70), Expect = 1.6
Identities = 26/89 (29%), Positives = 33/89 (37%), Gaps = 31/89 (34%)
Query: 88 LLATDLAPGVTRKDIDLYKQAHE-------EATKATPLL------PLAVPEQINPAAIEF 134
LL+ D A TR+ ID Y Q EA + +L LA I P AI
Sbjct: 104 LLSPDFA---TRQHIDRYFQKQGIKPRIAIEANSISAVLEVVRRGRLAT---ILPDAIAR 157
Query: 135 GQYEVCLIKCCACNVYYHIICLQPPLERR 163
Q +C + L+PPL R
Sbjct: 158 EQPGLCAVA------------LEPPLPGR 174
Score = 29.6 bits (67), Expect = 5.2
Identities = 29/110 (26%), Positives = 40/110 (36%), Gaps = 27/110 (24%)
Query: 211 AAKGTPQKRGKRA-SVDMMVSEDVVSKLLATDLAPGVTRKDIDLYKQAHE-------EAT 262
P + + A ++D + +E + LL+ D A TR+ ID Y Q EA
Sbjct: 79 VGATHPLAQRRTALTLDDLAAEPLA--LLSPDFA---TRQHIDRYFQKQGIKPRIAIEAN 133
Query: 263 KATPLL------PLAVPEQINPAAIEFGQ---YEVKTWYSSPFPQEYARL 303
+ +L LA I P AI Q V P P A L
Sbjct: 134 SISAVLEVVRRGRLAT---ILPDAIAREQPGLCAVAL--EPPLPGRTAAL 178
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.3 bits (71), Expect = 2.0
Identities = 24/174 (13%), Positives = 54/174 (31%), Gaps = 37/174 (21%)
Query: 575 NVDAETTATETTEKEDEAETEEETVVKKTKR-----GRKRKLSLDTDAASPVVEVTPKKT 629
N++++ + E +++ + + + ++ D A P +
Sbjct: 128 NLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTS 187
Query: 630 RKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVM 689
K+ S T + S+ S +T ++++ + + +
Sbjct: 188 NKQPNSPKPTQPNQSNSQ-----------------PASDDTANQKSSSKD---NQSMSDS 227
Query: 690 TPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKK-KKSTNITPNPVLKQQK 742
S L++ KK + + KK K T+ T NP L Q
Sbjct: 228 ALDSILDQYSED-----------AKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270
Score = 30.1 bits (68), Expect = 5.8
Identities = 9/68 (13%), Positives = 26/68 (38%)
Query: 631 KESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMT 690
+ K++TA T+ ++ SS+ ++ N S ++ + +
Sbjct: 27 ADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTS 86
Query: 691 PSSSLNES 698
S+++ +
Sbjct: 87 DSNNIIDF 94
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 31.3 bits (71), Expect = 2.2
Identities = 20/108 (18%), Positives = 32/108 (29%), Gaps = 5/108 (4%)
Query: 554 SHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKR----GRKR 609
+ + +E K NV + + T D + E KK K+ +KR
Sbjct: 230 GSTDDEAEKKRQEVKKKLKINNVSLDDD-STETPASDYYDVSEMVKFKKPKKKKKKKKKR 288
Query: 610 KLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVM 657
+ LD D P E + + S EE +
Sbjct: 289 RKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDD 336
>gnl|CDD|222340 pfam13718, GNAT_acetyltr_2, GNAT acetyltransferase 2. This domain
has N-acetyltransferase activity. It has a GCN5-related
N-acetyltransferase (GNAT) fold.
Length = 169
Score = 30.1 bits (69), Expect = 2.8
Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 21/76 (27%)
Query: 413 VSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSD-LGRVSY------HAYWK 465
+ I P YQR GYG L++ EG D LG VS+ +W+
Sbjct: 91 IVRIAVHPDYQRMGYGSRLLELLEQYYIPEGV---------DYLG-VSFGATPELLKFWQ 140
Query: 466 S---VLLEYLDTIRNQ 478
V + YL RN
Sbjct: 141 KNGFVPV-YLGQTRNA 155
>gnl|CDD|236539 PRK09490, metH, B12-dependent methionine synthase; Provisional.
Length = 1229
Score = 30.9 bits (71), Expect = 3.1
Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 4/61 (6%)
Query: 233 VVSKLLATDLAPGV---TRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYEVK 289
VVS LL+ + TR + + ++ H PLL L N I++ Y
Sbjct: 871 VVSSLLSDEQRDAYVAETRAEYEKVREQHARKKPRKPLLTLEAARA-NRFKIDWEAYTPP 929
Query: 290 T 290
Sbjct: 930 K 930
Score = 30.9 bits (71), Expect = 3.1
Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 4/58 (6%)
Query: 84 VVSKLLATDLAPGV---TRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYE 138
VVS LL+ + TR + + ++ H PLL L N I++ Y
Sbjct: 871 VVSSLLSDEQRDAYVAETRAEYEKVREQHARKKPRKPLLTLEAARA-NRFKIDWEAYT 927
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 29.2 bits (66), Expect = 3.3
Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 587 EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKN 637
EKE EAE +E+ V TK+GR+ K + ++ + KK +K+ E KN
Sbjct: 40 EKEKEAEPDEDGWVTVTKKGRRPKTA-RKESVAAKAAEKEKKKKKKKELKN 89
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 30.8 bits (69), Expect = 3.3
Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 6/158 (3%)
Query: 579 ETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNT 638
E+ + TT K E T E + + + + SP V K+T E
Sbjct: 377 ESHSKNTTGKNPEVCTREYKLDFFEEIDTQNLAKMYPSTPSPDVPGKSKETETFVEVSRQ 436
Query: 639 TASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNES 698
T +E T+ CT + + +S + ++A E + L S + ++
Sbjct: 437 TGTEQTSPNCVCTASVTDLGGPDNIKSITGLSSAKEAL------MKSLLDTQADSVVTQT 490
Query: 699 GAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNP 736
+ + K+N + + K T+ P
Sbjct: 491 VSKNNEAFNMSLYNLKRNEETYQDKNSRDKKTDNQAGP 528
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 30.7 bits (69), Expect = 4.0
Identities = 34/131 (25%), Positives = 55/131 (41%), Gaps = 16/131 (12%)
Query: 611 LSLDTDAASPVVEV---TPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESES 667
+S D +S +E P+ T E+ + A E T S T + +TP++ K + S
Sbjct: 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRTAK---ITPTTPKNNYS 131
Query: 668 KNTTASETTASETPCTEELDVMTPSSSLNE--SGAGVPVVKKRRKFRRKKNNTGWDTPKK 725
+E +TP TPS +LN S +G VK R + + T +
Sbjct: 132 PTAAGTERVKEDTPA-------TPSRALNHYISTSGRQRVKSYTPKPRGEVKSSSPT-QT 183
Query: 726 KKKSTNITPNP 736
++K TP+P
Sbjct: 184 REKVRKYTPSP 194
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 30.7 bits (69), Expect = 4.1
Identities = 34/174 (19%), Positives = 50/174 (28%), Gaps = 21/174 (12%)
Query: 550 TPLVSHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKR 609
K + S P + E E E + E V T+ +R
Sbjct: 345 HESDPKSRENSKPASIYGSVPDLIRHTPLEDV--EEYEPLFPEDESEIAVKPPTEESSRR 402
Query: 610 KLSLDTDAAS-PVVEVTPKKTRKESESKNTTASETTASETPCTE---------------E 653
S V E +P + + + ASETP E E
Sbjct: 403 PEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETSRSSSEVSLDPHQSE 462
Query: 654 LDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAGVPVVKK 707
L +R E SK S + P E +++T + E + P V K
Sbjct: 463 LKSEKKKARPE-VSKQRFPSRDVWEDAP--ESQELVTTEETPEEVKSSSPGVTK 513
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 30.3 bits (69), Expect = 4.4
Identities = 18/103 (17%), Positives = 33/103 (32%), Gaps = 3/103 (2%)
Query: 597 ETVVKKTKRGRKRKLSLDT--DAASPVVEVTPKKTRKESESKNTTASETTASETPCTEEL 654
E K GR KL+L++ +AS + P + T + P
Sbjct: 495 EQAFAKV-LGRSIKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPP 553
Query: 655 DVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNE 697
+++ S + ++++ E T SS E
Sbjct: 554 PPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 30.2 bits (68), Expect = 4.5
Identities = 17/62 (27%), Positives = 24/62 (38%), Gaps = 9/62 (14%)
Query: 621 VVEVTPK-KTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASE 679
VVE T + T + TT TT E T ++ + ++ T TT S
Sbjct: 707 VVEETERPDTTTSEDPTTTTTPTTTGPEET--------TETAEPTTTTEEPTEETTTGSS 758
Query: 680 TP 681
TP
Sbjct: 759 TP 760
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 29.9 bits (66), Expect = 4.9
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 1/68 (1%)
Query: 638 TTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNE 697
T++ +TAS T V TPS S N + + TT S P T + T ++++
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTS-APITTTAILSTNTTTVTS 83
Query: 698 SGAGVPVV 705
+G V V
Sbjct: 84 TGTTVTPV 91
Score = 29.1 bits (64), Expect = 8.8
Identities = 36/156 (23%), Positives = 53/156 (33%), Gaps = 14/156 (8%)
Query: 564 TKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAA---SP 620
T S +S GNV T T + T + T + T L T+ S
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84
Query: 621 VVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASET 680
VTP T + + N T T + T T + S TT S +T S T
Sbjct: 85 GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVT-----SNVTTRSSSTTSAT 139
Query: 681 PCTEELDVMTP------SSSLNESGAGVPVVKKRRK 710
+ P +S+ ++ A +P V R+
Sbjct: 140 TRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQ 175
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 30.1 bits (69), Expect = 5.1
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 11/44 (25%)
Query: 190 ARASLTTSFRERKKQLKQQRLAAK-----------GTPQKRGKR 222
+ ++ + ERK++L+ L A+ G + G
Sbjct: 64 LKQAIEAALEERKEELEAAALNARLAAETIDVTLPGRRIELGSL 107
>gnl|CDD|212096 cd01022, GH57N_like, N-terminal catalytic domain of heat stable
retaining glycoside hydrolase family 57. Glycoside
hydrolase family 57(GH57) is a chiefly prokaryotic
family with the majority of thermostable enzymes coming
from extremophiles (many of these are archaeal
hyperthermophiles), which exhibit the enzyme
specificities of alpha-amylase (EC 3.2.1.1),
4-alpha-glucanotransferase (EC 2.4.1.25),
amylopullulanase (EC 3.2.1.1/41), and
alpha-galactosidase (EC 3.2.1.22). This family also
includes many hypothetical proteins with uncharacterized
activity and specificity. GH57s cleave alpha-glycosidic
bonds by employing a retaining mechanism, which involves
a glycosyl-enzyme intermediate, allowing
transglycosylation.
Length = 313
Score = 29.7 bits (67), Expect = 5.4
Identities = 16/80 (20%), Positives = 24/80 (30%), Gaps = 11/80 (13%)
Query: 217 QKRGKRASVDM--------MVSEDVVSKLLATDLAPGVTRKDIDLYKQAHEEATKAT--- 265
G V + +++ VV K + V +L E
Sbjct: 43 VDEGPDPKVALTISGVLLEQLADPVVQKGFTSRYNDNVLDALKELVDTGQVELLGCGYTH 102
Query: 266 PLLPLAVPEQINPAAIEFGQ 285
LPL P++ A IE G
Sbjct: 103 AYLPLLGPKEDVRAQIEAGL 122
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 30.1 bits (68), Expect = 5.6
Identities = 17/67 (25%), Positives = 23/67 (34%), Gaps = 5/67 (7%)
Query: 560 YKTLTKES--SKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKR-GRKRKLSLDTD 616
Y TL E +K S G+ DAE + A E V K+ +LD
Sbjct: 874 YATLAAELKENKDSKEGDADAEVAEVDAA--PGGAVDHEPPVAKQEGNAEGGDGAALDAA 931
Query: 617 AASPVVE 623
A +
Sbjct: 932 AGRLMTR 938
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.0 bits (65), Expect = 5.8
Identities = 13/59 (22%), Positives = 23/59 (38%)
Query: 578 AETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESK 636
++ ET E+ + +E+ KT+ K+K + T K K S+ K
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 28.6 bits (64), Expect = 8.2
Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 19/80 (23%)
Query: 567 SSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTP 626
S + E +T+ TE+ED ET+ ++K +KR P
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETK-----TESKEKKKR--------------EVP 130
Query: 627 KKTRKESESKNTTASETTAS 646
K ++ + K +
Sbjct: 131 KPKTEKEKPKTEPKKPKPSK 150
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 29.8 bits (67), Expect = 7.1
Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 7/80 (8%)
Query: 559 PYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAA 618
P + + + SK G + + E+ ++E ++ +K R + K+ D+ A
Sbjct: 411 PAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAG 470
Query: 619 SPVVEVTPKKTRKESESKNT 638
KK +S+ N
Sbjct: 471 -------GKKESVKSQEDNN 483
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 29.6 bits (66), Expect = 7.6
Identities = 18/65 (27%), Positives = 33/65 (50%), Gaps = 3/65 (4%)
Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNT 670
+ L + SP T + + ES+S +T +E +E P +E +P+S S S ++
Sbjct: 11 IILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSES---PSPNSNSSSSSSSS 67
Query: 671 TASET 675
++S T
Sbjct: 68 SSSIT 72
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 29.4 bits (66), Expect = 8.1
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 6/82 (7%)
Query: 595 EEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASET---PCT 651
EEE + K K K S A+P + +P ++E K ++ E AS+ P +
Sbjct: 189 EEEEDIGKFK---DYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245
Query: 652 EELDVMTPSSRKESESKNTTAS 673
+ +P +RK +E N S
Sbjct: 246 GDRIFASPLARKLAEDNNVPLS 267
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 29.3 bits (65), Expect = 8.8
Identities = 21/124 (16%), Positives = 41/124 (33%), Gaps = 9/124 (7%)
Query: 613 LDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTA 672
+DT PV VTP +S + +++ + PSS + T A
Sbjct: 293 IDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAV---------IPSSVTTQSATTTQA 343
Query: 673 SETTASETPCTEELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNI 732
S S + + +L + + ++++TG T +T++
Sbjct: 344 SAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSL 403
Query: 733 TPNP 736
P
Sbjct: 404 PAAP 407
>gnl|CDD|201184 pfam00368, HMG-CoA_red, Hydroxymethylglutaryl-coenzyme A reductase.
The HMG-CoA reductases catalyze the conversion of
HMG-CoA to mevalonate, which is the rate-limiting step
in the synthesis of isoprenoids like cholesterol.
Probably because of the critical role of this enzyme in
cholesterol homeostasis, mammalian HMG-CoA reductase is
heavily regulated at the transcriptional, translational,
and post-translational levels.
Length = 370
Score = 29.0 bits (66), Expect = 9.8
Identities = 14/59 (23%), Positives = 25/59 (42%), Gaps = 6/59 (10%)
Query: 183 AKTSPGRARASLTTSFRERKKQLKQQRLAAKGTPQKRGKRASVDMMVSEDVVSKLLATD 241
+ PG +L+ + KK +A + RGK + ++ +VV K+L T
Sbjct: 180 EEEFPGMELLALSGNLCTDKKP------SAINWIEGRGKSVVAEAVIPAEVVEKVLKTT 232
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.129 0.381
Gapped
Lambda K H
0.267 0.0598 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,137,507
Number of extensions: 3543102
Number of successful extensions: 2750
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2673
Number of HSP's successfully gapped: 115
Length of query: 760
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 656
Effective length of database: 6,324,786
Effective search space: 4149059616
Effective search space used: 4149059616
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 63 (28.3 bits)