RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6756
(1055 letters)
>gnl|CDD|217735 pfam03798, TRAM_LAG1_CLN8, TLC domain.
Length = 198
Score = 172 bits (437), Expect = 6e-49
Identities = 56/159 (35%), Positives = 84/159 (52%), Gaps = 6/159 (3%)
Query: 834 KFCESSWRCIYYFFSFGFGLVCLWDKEWLWNMDTCWVNYPHQSVPSDVW---WYYMISLS 890
KF E ++Y S GL L + W WN W YP + + +YY+I L
Sbjct: 1 KFNERGVSLVHYLVSGILGLYVLLSEPWFWNPTWLWYPYPDRILGGTSALVKFYYLIQLG 60
Query: 891 FY-YSLAVSQFFDVKRKDFWQMFLHHICTICLLSFSWICNLTRIGTLVLLVHDCADIFLE 949
++ + L V F++ +RK +M LHHI T+ L+SFS+ N TR+G LVLL+H+ +D FL
Sbjct: 61 YFLHDLLVLLFYEKRRKW--EMLLHHIITLLLISFSYFYNFTRVGLLVLLLHELSDPFLH 118
Query: 950 AAKMAKYAKFDKTCEILFLAFTFLWLFTRNYIFPFWIIR 988
K Y K+C++LF+ F + R + PF I+
Sbjct: 119 LRKFLNYLGKKKSCDVLFVLFLVSFFLVRLVLGPFLILL 157
Score = 101 bits (253), Expect = 3e-24
Identities = 34/97 (35%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Query: 719 IGTLVLLVHDCADIFLEAAKMAKYAKFDKTCEILFLAFTFLWLFTRNYIFPFWIIRSTAL 778
+G LVLL+H+ +D FL K Y K+C++LF+ F + R + PF I+ S
Sbjct: 102 VGLLVLLLHELSDPFLHLRKFLNYLGKKKSCDVLFVLFLVSFFLVRLVLGPFLILLSILD 161
Query: 779 DAPKIANTM-FPAYYIFNGLLILLFILHLFWTRLIMK 814
+A F YYIF LL++L +L+++W LI++
Sbjct: 162 EARLDIPGGPFNLYYIFLALLLVLQLLNIYWFYLIIR 198
>gnl|CDD|214789 smart00724, TLC, TRAM, LAG1 and CLN8 homology domains. Protein
domain with at least 5 transmembrane alpha-helices.
Lag1p and Lac1p are essential for acyl-CoA-dependent
ceramide synthesis, TRAM is a subunit of the translocon
and the CLN8 gene is mutated in Northern epilepsy
syndrome. The family may possess multiple functions such
as lipid trafficking, metabolism, or sensing. Trh
homologues possess additional homeobox domains.
Length = 205
Score = 158 bits (403), Expect = 2e-44
Identities = 58/159 (36%), Positives = 89/159 (55%), Gaps = 7/159 (4%)
Query: 834 KFCESSWRCIYYFFSFGFGLVCLWDKEWLWNMDTCWVNYPHQSVPSDVWWYYMISLSFYY 893
KF ESS R + Y S GL L+ + WL + YP Q + +YY+ SL ++
Sbjct: 2 KFNESSNRLVSYLHSVIAGLYALYSEPWLSDPK---SLYPIQGMSPLAKFYYLFSLGYFI 58
Query: 894 SLAVSQ--FFDVKRKDFWQMFLHHICTICLLSFSWICNLTRIGTLVLLVHDCADIFLEAA 951
V+ F D+KRKDF +M +HHI T+ L+S S++ N TR+G L+LL+H+ +D FL
Sbjct: 59 HDLVALLLFQDLKRKDFKEMLVHHIATLLLISLSYVLNFTRLGLLLLLLHELSDPFLHLR 118
Query: 952 KMAKYAKFDKT--CEILFLAFTFLWLFTRNYIFPFWIIR 988
K+ YA K+ ++ F+ F ++ R +FPF I+
Sbjct: 119 KLLNYAGRKKSLLYDVNFVLFAVVFFVFRLILFPFLILT 157
Score = 88.9 bits (221), Expect = 6e-20
Identities = 30/105 (28%), Positives = 55/105 (52%), Gaps = 3/105 (2%)
Query: 719 IGTLVLLVHDCADIFLEAAKMAKYAKFDKT--CEILFLAFTFLWLFTRNYIFPFWIIRST 776
+G L+LL+H+ +D FL K+ YA K+ ++ F+ F ++ R +FPF I+ T
Sbjct: 100 LGLLLLLLHELSDPFLHLRKLLNYAGRKKSLLYDVNFVLFAVVFFVFRLILFPFLILTVT 159
Query: 777 A-LDAPKIANTMFPAYYIFNGLLILLFILHLFWTRLIMKIAVQYF 820
+ Y +F LL+ L +L+++W LI+++A +
Sbjct: 160 VHYAQAESGLFPPLLYLLFLLLLLCLQLLNIYWFFLILRMARKLL 204
>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
chain-associating membrane) superfamily, longevity
assurance factor [Intracellular trafficking and
secretion].
Length = 395
Score = 119 bits (299), Expect = 1e-28
Identities = 54/227 (23%), Positives = 98/227 (43%), Gaps = 17/227 (7%)
Query: 763 TRNYIFPF----WIIRSTALDAPKIANTMFPAYYIFNGLLILLFILHLFWTRLIMKIAVQ 818
TRNY+ F + I T I + F +Y+ + F++ + +++ V
Sbjct: 90 TRNYLHRFIFLSYQIPDTNQYGKGILDLCFVLFYMIFFTFLREFLMDVVIRPFGLELNV- 148
Query: 819 YFNAGEAQGKPSVLVKFCESSWRCIYYFFSFGFGLVCLWDKE-WLWNMDTCWVNYPHQSV 877
+ +FCE + YY S FGL + W +N + YP
Sbjct: 149 --------RSEKKIKRFCEQMYAIFYYGVSGPFGLYVMRSSPLWFFNTKALYETYPVFYN 200
Query: 878 PSDVWWYYMISLSFYY--SLAVSQFFDVKRKDFWQMFLHHICTICLLSFSWICNLTRIGT 935
P +Y+I +F+ + + + RKDF ++ HHI T+ L+ S++ + TR+G
Sbjct: 201 PFLFKAFYLIQAAFWAQQACILVLQLEKPRKDFKELVFHHIVTLLLIWLSYVFHFTRMGL 260
Query: 936 LVLLVHDCADIFLEAAKMAKYAKFDKTCEILFLAFTFLWLFTRNYIF 982
+ + D +D FL +K Y +F F F+W+++R+Y+
Sbjct: 261 AIYITMDVSDFFLSLSKTLNYLN-SVLATFIFGIFVFIWIYSRHYLN 306
Score = 57.5 bits (139), Expect = 2e-08
Identities = 27/117 (23%), Positives = 52/117 (44%), Gaps = 10/117 (8%)
Query: 719 IGTLVLLVHDCADIFLEAAKMAKYAKFDKTCEILFLAFTFLWLFTRNYIF---------P 769
+G + + D +D FL +K Y +F F F+W+++R+Y+
Sbjct: 258 MGLAIYITMDVSDFFLSLSKTLNYLN-SVLATFIFGIFVFIWIYSRHYLNLKILWSVLTA 316
Query: 770 FWIIRSTALDAPKIANTMFPAYYIFNGLLILLFILHLFWTRLIMKIAVQYFNAGEAQ 826
F I + LD + + I LL L +++++W LI+++A + GE +
Sbjct: 317 FRTIGNFVLDVATQQYKCWISLPIVFILLQALQLVNIYWLFLIVRVAYRVIWEGELK 373
>gnl|CDD|197585 smart00225, BTB, Broad-Complex, Tramtrack and Bric a brac. Domain
in Broad-Complex, Tramtrack and Bric a brac. Also known
as POZ (poxvirus and zinc finger) domain. Known to be a
protein-protein interaction motif found at the N-termini
of several C2H2-type transcription factors as well as
Shaw-type potassium channels. Known structure reveals a
tightly intertwined dimer formed via interactions
between N-terminal strand and helix structures. However
in a subset of BTB/POZ domains, these two secondary
structures appear to be missing. Be aware SMART predicts
BTB/POZ domains without the beta1- and alpha1-secondary
structures.
Length = 97
Score = 51.5 bits (124), Expect = 5e-08
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 2/68 (2%)
Query: 330 DISMICERGEILHAHRLVLASASPLIKRLLEEARFPLGSPVYIQFPDIKVFHMKTILHFL 389
D++++ G+ HAH+ VLA+ SP K L I D+ + +L+FL
Sbjct: 1 DVTLVVG-GKKFHAHKAVLAAHSPYFKALFSSDFKESDKSE-IYLDDVSPEDFRALLNFL 58
Query: 390 YTGQASVP 397
YTG+ +P
Sbjct: 59 YTGKLDLP 66
Score = 49.2 bits (118), Expect = 3e-07
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 420 DISMICERGEILHAHRLVLASASPLIKRLLEEARFPLGSPVYIQFPDIKVFHMKTILHFL 479
D++++ G+ HAH+ VLA+ SP K L I D+ + +L+FL
Sbjct: 1 DVTLVVG-GKKFHAHKAVLAAHSPYFKALFSSDFKESDKSE-IYLDDVSPEDFRALLNFL 58
Query: 480 YTG 482
YTG
Sbjct: 59 YTG 61
>gnl|CDD|216043 pfam00651, BTB, BTB/POZ domain. The BTB (for BR-C, ttk and bab) or
POZ (for Pox virus and Zinc finger) domain is present
near the N-terminus of a fraction of zinc finger
(pfam00096) proteins and in proteins that contain the
pfam01344 motif such as Kelch and a family of pox virus
proteins. The BTB/POZ domain mediates homomeric
dimerisation and in some instances heteromeric
dimerisation. The structure of the dimerised PLZF
BTB/POZ domain has been solved and consists of a tightly
intertwined homodimer. The central scaffolding of the
protein is made up of a cluster of alpha-helices flanked
by short beta-sheets at both the top and bottom of the
molecule. POZ domains from several zinc finger proteins
have been shown to mediate transcriptional repression
and to interact with components of histone deacetylase
co-repressor complexes including N-CoR and SMRT. The POZ
or BTB domain is also known as BR-C/Ttk or ZiN.
Length = 101
Score = 49.9 bits (120), Expect = 2e-07
Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 6/75 (8%)
Query: 323 FDSENCTDISMICERGEILHAHRLVLASASPLIKRLLEEARFPLGSPVYIQFPDIKVFHM 382
++ D++++ + HAH+ VLA+ SP K L + ++ + F
Sbjct: 5 RENGELCDVTLVVG-DKEFHAHKAVLAACSPYFKALFTG---NKEVEITLEDVSPEDFEA 60
Query: 383 KTILHFLYTGQASVP 397
+L F+YTG+ +
Sbjct: 61 --LLEFIYTGKLEIT 73
Score = 49.2 bits (118), Expect = 3e-07
Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 6/74 (8%)
Query: 413 FDSENCTDISMICERGEILHAHRLVLASASPLIKRLLEEARFPLGSPVYIQFPDIKVFHM 472
++ D++++ + HAH+ VLA+ SP K L + ++ + F
Sbjct: 5 RENGELCDVTLVVG-DKEFHAHKAVLAACSPYFKALFTG---NKEVEITLEDVSPEDFEA 60
Query: 473 KTILHFLYTGQAYM 486
+L F+YTG+ +
Sbjct: 61 --LLEFIYTGKLEI 72
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 38.1 bits (89), Expect = 4e-04
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 657 NLKTHLRVHTGEKPFACRLCVAMFKQ 682
NL+ H+R HTGEKP+ C +C F
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 33.6 bits (77), Expect = 0.016
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 643 YRCTYCGKQFGMSWNLKTHLRVH 665
YRC CGK F L+ H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 33.5 bits (77), Expect = 0.018
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 644 RCTYCGKQFGMSWNLKTHLRVH 665
+C CGK F NLK HLR H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 31.1 bits (70), Expect = 0.13
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 643 YRCTYCGKQFGMSWNLKTHLRVH 665
++C CGK F LK HLR H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 29.1 bits (65), Expect = 0.64
Identities = 10/24 (41%), Positives = 11/24 (45%)
Query: 671 FACRLCVAMFKQKAHLLKHLCSVH 694
F C LC F K L +HL H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 35.4 bits (81), Expect = 0.20
Identities = 29/133 (21%), Positives = 45/133 (33%), Gaps = 26/133 (19%)
Query: 128 PGFHNSPAQNPPF--IPFSPSYIDEISFRHYRSPHLLSSSNPPYLMDNPSKTPPHPGVLL 185
P N+P NPP P+S + SN P PS H
Sbjct: 437 PPNSNTPYSNPPNSNPPYSNLPYSNTPY-----------SNAPLSNAPPSSAKDHH---- 481
Query: 186 PNSVEEIVVKYR----PPSEEGSPIQRPASPRYGGIGN---NDAPWGPWSLCQTGGDSST 238
S ++R P + + Q A+ +G GN N P+ GG+++T
Sbjct: 482 --SAYHAAYQHRAANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAAT 539
Query: 239 PDLSKGSCRRKCN 251
G +R+ +
Sbjct: 540 TADPNGIAKREDH 552
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 33.3 bits (76), Expect = 0.21
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 2/57 (3%)
Query: 643 YRCTYCGKQFGMSWNLKTHLRVHTGEKPFACRLCVAMFKQKAHLLKHLCSVHRNVIS 699
Y C C F S +LK H+R K C +C F+ L H+C H +S
Sbjct: 74 YVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHVCKKHNICVS 128
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 35.1 bits (80), Expect = 0.25
Identities = 20/103 (19%), Positives = 33/103 (32%), Gaps = 4/103 (3%)
Query: 114 APENYVVMPHRKRRPGFHNSPAQNPPFIPFSPSYIDEISFRHYRSPHLLSSSNPPYLMDN 173
APE Y + +N P Q P + Y + + + P+ + P
Sbjct: 376 APEGYPQQSQYAQPAVQYNEPLQQP--VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPY 433
Query: 174 PSKTPPHPGVLLPNSVEEIVVKYRPPS--EEGSPIQRPASPRY 214
+ P P EE + P S + Q+PA+
Sbjct: 434 YAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEP 476
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 33.9 bits (77), Expect = 0.42
Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 2/105 (1%)
Query: 619 NSTNEDMISKSSCTTSNNVGSAREYRCTYCGKQFGMSWNLKTHLRVHTGEKPFACRL--C 676
+S N + S T + + R C C F +L H+R HTGEKP C C
Sbjct: 10 SSNNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGC 69
Query: 677 VAMFKQKAHLLKHLCSVHRNVISSVNDDGNSSHFNCCFCSMTIGT 721
F + L +HL + H N + S+ S++ +
Sbjct: 70 DKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLSSSS 114
>gnl|CDD|220533 pfam10034, Dpy19, Q-cell neuroblast polarisation. Dyp-19, formerly
known as DUF2211, is a transmembrane domain family that
is required to orient the neuroblast cells, QR and QL
accurately on the anterior-posterior axis: QL and QR are
born in the same anterior-posterior position, but
polarise and migrate left-right asymmetrically, QL
migrating towards the posterior and QR migrating towards
the anterior. It is also required, with unc-40, to
express mab-5 correctly in the Q cell descendants. The
Dpy-19 protein derives from the C. elegans DUMPY mutant.
Length = 637
Score = 33.9 bits (78), Expect = 0.47
Identities = 32/144 (22%), Positives = 51/144 (35%), Gaps = 18/144 (12%)
Query: 885 YMISLSFYYSLAVSQFF--DVKRKDFWQMFLHHICTICLLSFSWICN---LTRIGTLVLL 939
M+ LS S V+ +++ + L + L S ++ L + +L
Sbjct: 254 SMLLLSLLLSSLVAILLIIALRKNVPCGLLLPRLLKWVLQSLKFLVLTLALKLLLKKILG 313
Query: 940 VHDCADIF-LEAAKMAK--YAKFDK---TC--EILFLAFTFLWLFTRNYIFPFWI----- 986
D A IF L +K Y FD TC E FL T+ + P +I
Sbjct: 314 EKDDAHIFDLLKSKFGLQSYRDFDTLLYTCAPEFDFLTKETFLRLTKTLLLPIYILVLVI 373
Query: 987 IRRKKSIEIWSYLNLELLHQKVGD 1010
I ++W L+ L Q+
Sbjct: 374 ILISALQDVWRNLSRNSLKQRKEK 397
>gnl|CDD|236581 PRK09598, PRK09598, lipid A phosphoethanolamine transferase;
Reviewed.
Length = 522
Score = 33.2 bits (76), Expect = 0.80
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 5/84 (5%)
Query: 743 AKFDKTCEILFLAFTFLWLFTRNYIFPFWIIRSTALDAPKIANTMFPAYYIFNGLL-ILL 801
KF K L + L Y FP + + + + NGLL +LL
Sbjct: 7 LKFLKPLSCLQAGLLYSLLNGVLYHFPLFAYVYKESNQVSFIAMLVVLLFCVNGLLFLLL 66
Query: 802 FILHLFWTRLIMKIAVQYFNAGEA 825
+L +R +M+++ F+ +
Sbjct: 67 GLL----SRRLMRLSAIVFSLLNS 86
>gnl|CDD|223501 COG0424, Maf, Nucleotide-binding protein implicated in inhibition
of septum formation [Cell division and chromosome
partitioning].
Length = 193
Score = 32.1 bits (74), Expect = 0.85
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 345 RLVLASASPLIKRLLEEARFP 365
RL+LAS+SP + LLE+ P
Sbjct: 4 RLILASSSPRRRELLEQLGIP 24
Score = 32.1 bits (74), Expect = 0.85
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 435 RLVLASASPLIKRLLEEARFP 455
RL+LAS+SP + LLE+ P
Sbjct: 4 RLILASSSPRRRELLEQLGIP 24
>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
Staphylococcus aureus plasmid proteins with no
characterized function.
Length = 229
Score = 31.9 bits (72), Expect = 1.3
Identities = 13/60 (21%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Query: 57 MRSTSAEKIATRLQQEIENQALFYRKRKSRYMDYYNSNTIDDVEEQRKLQEKVQQQTAPE 116
M S + ++Q+ +EN+ L K++++ + Y I +++E + ++EK + E
Sbjct: 23 MNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKY----IAEIKENQDIREKELKAIKSE 78
>gnl|CDD|178901 PRK00148, PRK00148, Maf-like protein; Reviewed.
Length = 194
Score = 31.3 bits (72), Expect = 1.5
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 345 RLVLASASPLIKRLLEEARFP 365
RLVLASASP +LL A P
Sbjct: 2 RLVLASASPARLKLLRLAGIP 22
Score = 31.3 bits (72), Expect = 1.5
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 435 RLVLASASPLIKRLLEEARFP 455
RLVLASASP +LL A P
Sbjct: 2 RLVLASASPARLKLLRLAGIP 22
>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
wall biogenesis [Cell envelope biogenesis, outer
membrane].
Length = 439
Score = 32.2 bits (73), Expect = 1.6
Identities = 24/117 (20%), Positives = 35/117 (29%), Gaps = 7/117 (5%)
Query: 751 ILFLAFTFLWLFTRNYIFPFWIIRSTALDAPKIANTMFPAYYIFNGLLILLFILHLFWTR 810
+L L T L+L + I +I + L A +AN + A+ LLILL L
Sbjct: 314 LLPLLLTLLFLL--SIIPSLLLILLSGLLALLLANVILAAF-----LLILLIPPLLALFA 366
Query: 811 LIMKIAVQYFNAGEAQGKPSVLVKFCESSWRCIYYFFSFGFGLVCLWDKEWLWNMDT 867
L+ G V+ + L K W
Sbjct: 367 LLQLGVFLVTTKGLRLIAKLVVYLLVILIIFLPLVIWPLILILNLTLLKTVSWRTLN 423
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 32.1 bits (73), Expect = 1.9
Identities = 22/102 (21%), Positives = 31/102 (30%), Gaps = 5/102 (4%)
Query: 157 RSPHLLSSSNPPYLMDNPSKTPPHPGVLLPNSVEEIVVKYRPPSEEGSPIQRPASPRYGG 216
S +SS+ +PS +P PG S E S +S G
Sbjct: 279 SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRG 338
Query: 217 IGNNDAPWGPWSLCQTGGDSSTPDLSKGSCRRK-CNSKQRPR 257
GP S ++ S P + S RK + P
Sbjct: 339 AA---VSPGP-SPSRSPSPSRPPPPADPSSPRKRPRPSRAPS 376
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 31.8 bits (73), Expect = 2.0
Identities = 14/64 (21%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 54 EEVMRST-----SAEKIATRLQQEIENQALFYRKRKSRYMDYYNSNTIDDVEEQRKLQEK 108
EV+R E+ + +E + +K + + + EEQRKL+EK
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEK 314
Query: 109 VQQQ 112
+++
Sbjct: 315 ERKK 318
>gnl|CDD|202278 pfam02545, Maf, Maf-like protein. Maf is a putative inhibitor of
septum formation in eukaryotes, bacteria, and archaea.
Length = 193
Score = 31.1 bits (71), Expect = 2.2
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 345 RLVLASASPLIKRLLEEARFPL 366
L+LAS SP K LLE+ P
Sbjct: 2 PLILASTSPRRKELLEDLGIPF 23
Score = 31.1 bits (71), Expect = 2.2
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 435 RLVLASASPLIKRLLEEARFPL 456
L+LAS SP K LLE+ P
Sbjct: 2 PLILASTSPRRKELLEDLGIPF 23
>gnl|CDD|219819 pfam08395, 7tm_7, 7tm Chemosensory receptor. This family includes
a number of gustatory and odorant receptors mainly from
insect species such as A. gambiae and D. melanogaster.
They are classified as G-protein-coupled receptors
(GPCRs), or seven-transmembrane receptors. They show
high sequence divergence, consistent with an ancient
origin for the family.
Length = 370
Score = 31.6 bits (72), Expect = 2.3
Identities = 26/251 (10%), Positives = 72/251 (28%), Gaps = 19/251 (7%)
Query: 751 ILFLAFTFLWLFTRNYIFPFWIIRSTALDAPKIANTMFPAYYIFNGLLILLFILHLFWTR 810
I+ L LF ++I + R +AL + N + ++++ + LF R
Sbjct: 38 IILFLLVLLVLFLISFIRILYFFRRSALLE-VVYNIQLVLGLLT---VLVILLSLLFQRR 93
Query: 811 LIMKIAVQYFNAGEAQGKPSVLVKFCESSWRCIYYFFSFGFGLVCLWDKEWLWNMDTCWV 870
+ ++ + + + S + LV + L +
Sbjct: 94 RLARLLNELL-------RLDRRLLRLGSRRNFRRFNRLLLLLLVLI---ILLLLVSLLGY 143
Query: 871 NYPHQSVPSDVWWYYMISLSFYYSLAVSQFFDVKR--KDFWQMFLHHICTICLLSFSWIC 928
Q + + +L + Q+F + ++ + +
Sbjct: 144 LLLSQYFLLVLLLLLYLLPYLVLNLLILQYFLLVLLIYRLLRLLNRQLKQLLRELRGLRE 203
Query: 929 NLTRIGTLVLLVHDCADIFLEAAKMAKYAKFDKTCEILFLAFTFLWLFTRNYIFPFWIIR 988
+ +L + ++ + + + + + + L L+ F + +W+
Sbjct: 204 SGQLRECCLLRLDKLLRLYRRLSDLVR--ELNDCFGLQ-LLLILLYNFLNITVLIYWLYS 260
Query: 989 RKKSIEIWSYL 999
L
Sbjct: 261 ILLLGSSGFTL 271
>gnl|CDD|223374 COG0297, GlgA, Glycogen synthase [Carbohydrate transport and
metabolism].
Length = 487
Score = 31.5 bits (72), Expect = 2.8
Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 23/112 (20%)
Query: 334 ICERGEILHAH--RLVLASASPLIKRLLEEARFPLGSPVYIQFPDIK-VFHMKTILHFLY 390
I +I+HAH + L A +K+ Y I VF TI + Y
Sbjct: 127 ISWLPDIVHAHDWQTGLLPA--YLKQR------------YRSGYIIPTVF---TIHNLAY 169
Query: 391 TGQASVPYKLHHPSLADEVRQWFDSENCTDISMICERGEILHAHRLVLASAS 442
G + Y L L E F E IS + +G + +A + S +
Sbjct: 170 QGLFRLQY-LEELGLPFEAYASFGLEFYGQISFL--KGGLYYADAVTTVSPT 218
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.6 bits (71), Expect = 2.8
Identities = 29/108 (26%), Positives = 32/108 (29%), Gaps = 8/108 (7%)
Query: 110 QQQTAPENYVVMPHRKRRPGFHNSPAQNPPFIPFSPSYIDEISFRHYRSPHLL----SSS 165
Q Q+ P PH G SP PF I +P S S
Sbjct: 427 QSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGS 486
Query: 166 NPPYLMDNPSKTPPHPGVLLPNSVEEIVVKYRPPSEEGSPIQRPASPR 213
PP S PG LP I +K P E P P PR
Sbjct: 487 QPPGSALPSSGGCAGPGPPLP----PIQIKEEPLDEAEEPESPPPPPR 530
>gnl|CDD|238310 cd00555, Maf, Nucleotide binding protein Maf. Maf has been
implicated in inhibition of septum formation in
eukaryotes, bacteria and archaea, but homologs in
B.subtilis and S.cerevisiae are nonessential for cell
division. Maf has been predicted to be a nucleotide- or
nucleic acid-binding protein with structural similarity
to the hypoxanthine/xanthine NTP pyrophosphatase Ham1
from Methanococcus jannaschii, RNase H from Escherichia
coli, and some other nucleotide or RNA-binding proteins.
Length = 180
Score = 30.5 bits (70), Expect = 3.1
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 346 LVLASASPLIKRLLEEARFP 365
L+LASASP + LLE+ P
Sbjct: 1 LILASASPRRRELLEQLGIP 20
Score = 30.5 bits (70), Expect = 3.1
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 436 LVLASASPLIKRLLEEARFP 455
L+LASASP + LLE+ P
Sbjct: 1 LILASASPRRRELLEQLGIP 20
>gnl|CDD|214424 MTH00115, ND6, NADH dehydrogenase subunit 6; Provisional.
Length = 174
Score = 30.1 bits (69), Expect = 3.5
Identities = 20/92 (21%), Positives = 30/92 (32%), Gaps = 18/92 (19%)
Query: 834 KFCE--SSWRCIYYFFSFGFGLVCLWD-KEWLWNMDTCWVNYPHQSVPSDVWWYYMISLS 890
+ E W + Y F +V + LW M+ V V D +++ L
Sbjct: 77 PYPEAWGDWGVVGYVVGFVLVVVVVGVVVGGLWGMEGVGV------VTVDSGGLFVVRLD 130
Query: 891 FYYSLAVSQFFDVKRKDFWQMFLHHICTICLL 922
F VS F+ L +C LL
Sbjct: 131 F---SGVSLFYSSG----GGGLL--VCGWGLL 153
>gnl|CDD|220811 pfam10565, NMDAR2_C, N-methyl D-aspartate receptor 2B3 C-terminus.
This domain is found at the C-terminus of many
NMDA-receptor proteins, many of which also carry the
Ligated ion-channel family pfam00060 further upstream as
well as the ANF_receptor family pfam01094. This region
is predicted to be a large extra-cellular domain of the
NMDA receptor proteins, being highly hydrophilic, and is
thought to be integrally involved in the function of the
receptor. The region also carries a number of potential
N-glycosylation sites.
Length = 660
Score = 30.5 bits (69), Expect = 5.7
Identities = 27/124 (21%), Positives = 43/124 (34%), Gaps = 35/124 (28%)
Query: 118 YVVMPHRKRRPGFHNSPAQNPPFIPFSPSYIDEIS-----------------FRHYRSPH 160
Y + H++ + + NPP++P + D IS H H
Sbjct: 92 YDMSEHKR-----SLAYSDNPPYLPEDNMFSDYISEVERTFGNLHLKDSNNYQDHGLHHH 146
Query: 161 LLSSSNPPYL---------MDNPSKTPPHPGVLLPNSVEEIVVKYRPPSEEGSPIQRPAS 211
L+ +NP L P T P L SV+ + + +PP SP + +
Sbjct: 147 TLNEANPNSLGSASSLDGGYTAPIFTTQ-PRALWKKSVDTL--RQQPPPPSSSP-KSGLA 202
Query: 212 PRYG 215
YG
Sbjct: 203 DAYG 206
>gnl|CDD|235009 PRK02141, PRK02141, Maf-like protein; Reviewed.
Length = 207
Score = 29.7 bits (67), Expect = 6.5
Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 4/33 (12%)
Query: 345 RLVLASASPLIKRLLEEARFPLGSPVYIQFPDI 377
RL+LAS+S + LLE R P PDI
Sbjct: 10 RLILASSSRYRRELLERLRLPFDVVS----PDI 38
Score = 29.7 bits (67), Expect = 6.5
Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 4/33 (12%)
Query: 435 RLVLASASPLIKRLLEEARFPLGSPVYIQFPDI 467
RL+LAS+S + LLE R P PDI
Sbjct: 10 RLILASSSRYRRELLERLRLPFDVVS----PDI 38
>gnl|CDD|218464 pfam05142, DUF702, Domain of unknown function (DUF702). Members of
this family are found in various putative zinc finger
proteins.
Length = 155
Score = 29.3 bits (66), Expect = 6.8
Identities = 18/108 (16%), Positives = 40/108 (37%), Gaps = 17/108 (15%)
Query: 4 ITASRRRRRSSSTPVNLSLSNPEVHNESKSPENSHKPLR--KSSSLVESSSEEEVMRSTS 61
+ A++RR R + ++ + + +P + L SSL + + ++
Sbjct: 44 VPAAKRRERQQQLA---AATSNAASSAASAPSKRPRDLLSNAQSSLSSTRT---PSGTSP 97
Query: 62 AEKIATRLQQEIENQALFYRKRKSRYMDYYNSNTIDDVEEQRKLQEKV 109
+ E+ + A+F R S ++DD + + Q V
Sbjct: 98 QGTGTSSFPPEVSSPAVFRCVRVS---------SVDDGDAEYAYQTTV 136
>gnl|CDD|200429 TIGR04178, exo_archaeo, exosortase/archaeosortase family protein.
This model represents the most conserved region of the
multitransmembrane protein family of exosortases and
archaeosortases. The region includes nearly invariant
motifs at the ends of three predicted transmembrane
helices on the extracytoplasmic face: a Cys (often
Cys-Xaa-Gly), Asn-Xaa-Xaa-Arg, and His. This model is
much broader than the bacterial exosortase model
(TIGR02602), and has in intended scope similar to (or
broader than) pfam09721.
Length = 97
Score = 28.3 bits (64), Expect = 6.9
Identities = 10/56 (17%), Positives = 22/56 (39%), Gaps = 6/56 (10%)
Query: 886 MISLSFYYSLAVSQFFDVKRKDFWQMFLHHICTICLLSFSWICNLTRIGTLVLLVH 941
+ L L + F + + L + + + ++ N+ RI L+LL +
Sbjct: 12 SLILLLALGLLFAYLF---PRSLRRKLLLLLLGVPI---IYLANILRIVLLILLGY 61
>gnl|CDD|224089 COG1167, ARO8, Transcriptional regulators containing a DNA-binding
HTH domain and an aminotransferase domain (MocR family)
and their eukaryotic orthologs [Transcription / Amino
acid transport and metabolism].
Length = 459
Score = 30.0 bits (68), Expect = 7.1
Identities = 14/63 (22%), Positives = 18/63 (28%), Gaps = 2/63 (3%)
Query: 498 NPNIWLPPNITWSDLEPNDKIQYADHRHLFYPLPMALGMLLLRFFLEKYWFAPIGASVGI 557
P+ L P Y L Y G+ LR + Y A G S
Sbjct: 98 LPDPSLFPLEALRRALARVLRNYGASLALQYGPTA--GLPELREAIAAYLLARRGISCEP 155
Query: 558 KNI 560
+ I
Sbjct: 156 EQI 158
>gnl|CDD|131878 TIGR02831, spo_II_M, stage II sporulation protein M. A comparative
genome analysis of all sequenced genomes of shows a
number of proteins conserved strictly among the
endospore-forming subset of the Firmicutes. This
predicted integral membrane protein is designated stage
II sporulation protein M [Cellular processes,
Sporulation and germination].
Length = 200
Score = 29.2 bits (66), Expect = 8.0
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 15/62 (24%)
Query: 889 LSFYYSLAVSQFFDVKRKD--------FWQMFLHHICTICLLSFSWICNLTRIGTLVLLV 940
L FY +S FF F Q ++I + L+ WI L+ IG ++L+
Sbjct: 43 LYFY----LSNFFGQLSGGNSASYGDIFRQSLFNNIKYVGLM---WILGLSIIGLPIILI 95
Query: 941 HD 942
D
Sbjct: 96 LD 97
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 30.0 bits (67), Expect = 9.0
Identities = 26/134 (19%), Positives = 53/134 (39%), Gaps = 13/134 (9%)
Query: 9 RRRRSSSTPVNLSLSNPEVHNESKSP----ENSHKPLRKSSSLVESSSEEEVMRSTSAEK 64
RR S P NPE+ + + N++ RK ++ ++ S ++
Sbjct: 484 RRESKSQEPSGGESPNPELPANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVP 543
Query: 65 IATRLQ--QEIENQALFYRKRKSRYM------DYYNSNTID-DVEEQRKLQEKVQQQTAP 115
+ + Q E + + K + M + NS+ I + Q ++Q KVQ+ +A
Sbjct: 544 LPVSSEPPQHKEGPVITFYSEKMKGMKELLLAEKLNSSAIKLQLTAQSQVQMKVQKHSAT 603
Query: 116 ENYVVMPHRKRRPG 129
+Y + ++ R
Sbjct: 604 IDYTLSFIKRSRKD 617
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.3 bits (66), Expect = 9.2
Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 16/86 (18%)
Query: 28 HNESKSPENSHKPLRKSSSLVESSSEEEVMRSTSAEKIATRLQQ-EIENQALFYRKRKSR 86
+K HK S+ S + R+ S E+ RLQ+ E++ +KR+
Sbjct: 143 QQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKRE-- 200
Query: 87 YMDYYNSNTIDDVEEQRKLQEKVQQQ 112
EE+RK ++K Q++
Sbjct: 201 -------------EERRKQRKKQQEE 213
>gnl|CDD|213034 cd11726, ADDz_ATRX, ADDz domain of ATRX (alpha-thalassemia/mental
retardation, X-linked). ADDz_ATRX is a PHD-like zinc
finger domain of ATRX, which belongs to the SNF2 family
of chromatin remodeling proteins. ATRX is a large
chromatin-associated nuclear protein with two domains,
ADDz_ATRX at the N-terminus, followed by a C-terminal
ATPase/helicase domain. The ADDz_ATRX domain recognizes
a specific methylated histone, and this interaction is
required for heterochromatin localization of the ATRX
protein. Missense mutations in either of the two ATRX
domains lead to the X-linked alpha-thalassemia and
mental retardation syndrome; however the mutations in
the ADDz_ATRX domain produce a more severe disease
phenotype that may also relate to disturbing unknown
functions or interaction sites of this domain. The ADDz
domain is also present in chromatin-associated proteins
cytosine-5-methyltransferase 3 (Dnmt3); it is a PHD-like
zinc finger motif that contains two parts, a C2-C2 and a
PHD-like zinc finger. PHD zinc finger domains have been
identified in more than 40 proteins that are mainly
involved in chromatin mediated transcriptional control;
the classical PHD zinc finger has a C4-H-C3 motif that
spans about 50-80 amino acids. In ADDz, the conserved
histidine residue of the PHD finger is replaced by a
cysteine, and an additional zinc finger C2-C2 like motif
is located about twenty residues upstream of the C4-C-C3
motif.
Length = 127
Score = 28.4 bits (64), Expect = 9.4
Identities = 20/108 (18%), Positives = 29/108 (26%), Gaps = 39/108 (36%)
Query: 641 REYRCTYCGKQFGMS-WNLKTH--LRVHT--------GEKPFA----------------- 672
R CT CGKQ N+ H L+V F+
Sbjct: 7 RRVHCTACGKQINPQEKNVHRHPVLKVLICKSCHKFYNSGDFSKDEDGSEEYCRWCGEGG 66
Query: 673 ----CRLCVAMFKQKAHLLKHLCSVHRNVISSVNDDGNSSHFNCCFCS 716
C C F +K + ++L + I + C C
Sbjct: 67 ELICCDSCPNAFCKKC-IKRNLGREELSEIED------EDKWKCFVCD 107
>gnl|CDD|220704 pfam10343, DUF2419, Protein of unknown function (DUF2419). This is
a family of conserved proteins found from plants to
humans. The function is not known. A few members are
annotated as being cobyrinic acid a,c-diamide synthetase
but this could not be confirmed.
Length = 282
Score = 29.6 bits (67), Expect = 9.4
Identities = 19/90 (21%), Positives = 31/90 (34%), Gaps = 27/90 (30%)
Query: 317 DEVRQWFDSENCTDISMICERGEILHAHRLVLA----------------SASPLIKRLLE 360
+E+R F S +I ++ ER L VL SA+ L+ ++E
Sbjct: 75 EELRHIFRSATDEEIPLLEERLRCLREAGRVLLEKFDGSFVNCIKSANKSAAALVNLVVE 134
Query: 361 EARFPLGSPVYIQFPDIKVFHMKTILHFLY 390
FP F D + + + Y
Sbjct: 135 N--FP-------CFRDEATYKGRRV--RFY 153
Score = 29.6 bits (67), Expect = 9.4
Identities = 19/90 (21%), Positives = 31/90 (34%), Gaps = 27/90 (30%)
Query: 407 DEVRQWFDSENCTDISMICERGEILHAHRLVLA----------------SASPLIKRLLE 450
+E+R F S +I ++ ER L VL SA+ L+ ++E
Sbjct: 75 EELRHIFRSATDEEIPLLEERLRCLREAGRVLLEKFDGSFVNCIKSANKSAAALVNLVVE 134
Query: 451 EARFPLGSPVYIQFPDIKVFHMKTILHFLY 480
FP F D + + + Y
Sbjct: 135 N--FP-------CFRDEATYKGRRV--RFY 153
>gnl|CDD|238698 cd01407, SIR2-fam, SIR2 family of proteins includes silent
information regulator 2 (Sir2) enzymes which catalyze
NAD+-dependent protein/histone deacetylation, where the
acetyl group from the lysine epsilon-amino group is
transferred to the ADP-ribose moiety of NAD+, producing
nicotinamide and the novel metabolite
O-acetyl-ADP-ribose. Sir2 proteins, also known as
sirtuins, are found in all eukaryotes and many archaea
and prokaryotes and have been shown to regulate gene
silencing, DNA repair, metabolic enzymes, and life span.
The most-studied function, gene silencing, involves the
inactivation of chromosome domains containing key
regulatory genes by packaging them into a specialized
chromatin structure that is inaccessible to DNA-binding
proteins. The oligomerization state of Sir2 appears to
be organism-dependent, sometimes occurring as a monomer
and sometimes as a multimer.
Length = 218
Score = 29.1 bits (66), Expect = 9.7
Identities = 12/44 (27%), Positives = 17/44 (38%), Gaps = 4/44 (9%)
Query: 638 GSAREYRCTYCGKQFGMSWNLKTHLRVHTGEKPFACRLCVAMFK 681
GS RCT CGK++ R E+ C C + +
Sbjct: 105 GSLFRVRCTKCGKEYPRDELQADIDR----EEVPRCPKCGGLLR 144
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.136 0.438
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 54,206,022
Number of extensions: 5347467
Number of successful extensions: 5459
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5424
Number of HSP's successfully gapped: 79
Length of query: 1055
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 948
Effective length of database: 6,191,724
Effective search space: 5869754352
Effective search space used: 5869754352
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (28.4 bits)