RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15988
(718 letters)
>gnl|CDD|219609 pfam07856, Orai-1, Mediator of CRAC channel activity. ORAI-1 is a
protein homologue of Drosophila Orai and human Orai1,
Orai2 and Orai3. ORAI-1 GFP reporters are co- expressed
with STIM-1 (ER CA(2+) sensors) in the gonad and
intestine. The protein has four predicted transmembrane
domains with a highly conserved region between TM2 ad
TM3. This conserved domain is thought to function in
channel regulation. ORAI1- related proteins are required
for the production of the calcium channel, CRAC, along
with STIM1-related proteins.
Length = 175
Score = 231 bits (591), Expect = 5e-72
Identities = 89/167 (53%), Positives = 117/167 (70%)
Query: 271 DSEGLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTV 330
E L R L +SRA+LKA++ TS LL+GFAMVA+VELQ ++P PE L I FAV T +
Sbjct: 4 RLEDLEQRHLELSRAQLKASSRTSALLAGFAMVALVELQFSDPEVYPEGLLIAFAVVTAL 63
Query: 331 LVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGLFLFLV 390
LVAVH+ ALMIST +LPNIE ++ + + L ESPH R + EL W FST +G+ LFLV
Sbjct: 64 LVAVHLLALMISTCILPNIECVTFVHNYNLTPESPHRRFERYCELDWRFSTRMGIPLFLV 123
Query: 391 EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 437
E+A+L WVKF+D AAI T I++PVLI+FV F HF+ ++ +K
Sbjct: 124 ELALLAWVKFYDSGPVAAIIVTAILVPVLIVFVAFTQHFWGELLSHK 170
Score = 208 bits (532), Expect = 2e-63
Identities = 89/215 (41%), Positives = 117/215 (54%), Gaps = 48/215 (22%)
Query: 480 DSEGLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTV 539
E L R L +SRA+LKA++ TS LL+GFAMVA+VELQ ++P PE L I FAV T +
Sbjct: 4 RLEDLEQRHLELSRAQLKASSRTSALLAGFAMVALVELQFSDPEVYPEGLLIAFAVVTAL 63
Query: 540 LVAVHIFALMISTYLLPGNDVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMIS 599
LVAVH+ ALMIST +LP
Sbjct: 64 LVAVHLLALMISTCILP------------------------------------------- 80
Query: 600 TYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWD 659
NIE ++ + + L ESPH R + EL W FST +G+ LFLVE+A+L WVKF+D
Sbjct: 81 -----NIECVTFVHNYNLTPESPHRRFERYCELDWRFSTRMGIPLFLVELALLAWVKFYD 135
Query: 660 YSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 694
AAI T I++PVLI+FV F HF+ ++ +K
Sbjct: 136 SGPVAAIIVTAILVPVLIVFVAFTQHFWGELLSHK 170
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 44.8 bits (106), Expect = 2e-04
Identities = 26/144 (18%), Positives = 45/144 (31%), Gaps = 18/144 (12%)
Query: 138 SRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSR--------------YPTTPTSGGHN 183
+ + S P TP+T + + TPS + + ++G +
Sbjct: 455 TVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNTH 514
Query: 184 PSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
S+ + P+ P S+S G +G++ + S P SP L
Sbjct: 515 --KHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEH--SVPLV 570
Query: 244 MNTDHHPRLYKNHSRRDLGHHYEE 267
D H R R E+
Sbjct: 571 RVFDIHLRASTTKGRHSTPSTNEK 594
>gnl|CDD|224213 COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy
production and conversion].
Length = 346
Score = 35.7 bits (83), Expect = 0.079
Identities = 27/180 (15%), Positives = 64/180 (35%), Gaps = 33/180 (18%)
Query: 527 EWLFIMFAVCTTVLVAVHIFALMISTYLL-----PGNDVAMVELQINEPTNVPEWLFIMF 581
+ L FA+ + + + ++ L + +LL + A +I + F++F
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLLLKTEGALQERARKLARIAALLTLV--GFLLF 217
Query: 582 AVCTTVLVAVHIFALMIST---YLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFST 638
V T + + S +L + + L + L +G + AF
Sbjct: 218 GVWVTPGLDGFAASRWFSLPWLFLFSLLPVVGALLAVLLKR-------KGRGKEGLAFVL 270
Query: 639 VLGLFLFLVE-VAILCW---------VKFWD-----YSLRAAIAATIIVIPVLIIFVVFV 683
LFL + + I + + WD +L + ++ +P+++ + ++
Sbjct: 271 TSLLFLLAILGLGISLFPYVLPSSVSLTIWDAASSPLTLSVMLVVALVFLPIVLAYTIWS 330
Score = 31.4 bits (72), Expect = 1.5
Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 25/111 (22%)
Query: 318 EWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESP-HERMRGFVELA 376
+ L FA+ + + + ++ L + +LL L +E ER R
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLL-------------LKTEGALQERAR------ 200
Query: 377 WAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 427
+ + L + + WV L A+ +P L +F + V
Sbjct: 201 -KLARIAALLTLVGFLLFGVWVTPG---LDGFAASRWFSLPWLFLFSLLPV 247
Score = 31.4 bits (72), Expect = 1.5
Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 25/111 (22%)
Query: 575 EWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESP-HERMRGFVELA 633
+ L FA+ + + + ++ L + +LL L +E ER R
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLL-------------LKTEGALQERAR------ 200
Query: 634 WAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 684
+ + L + + WV L A+ +P L +F + V
Sbjct: 201 -KLARIAALLTLVGFLLFGVWVTPG---LDGFAASRWFSLPWLFLFSLLPV 247
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 36.0 bits (83), Expect = 0.083
Identities = 47/237 (19%), Positives = 77/237 (32%), Gaps = 32/237 (13%)
Query: 25 DNIKKEQLDTLTKLELEHNLNPI--------RLRTSSHSSVKRRLLTRTSSQASLTSRTG 76
D I E ++ K L H + + R S+ + TSS +SL S G
Sbjct: 53 DIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSP--SATSSTSSLNSNDG 110
Query: 77 RSDLRPRS-----GLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQ-- 129
P S R ++ P G+P +K+ S THP
Sbjct: 111 DQ-FSPASDSLSFNPSSTQSRKDSGP---GDGSPVQKRKNPLLPSSSTHGTHPPIVFTDN 166
Query: 130 ---SVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSY 186
+ SR + S +P S S YP+ + H+ +
Sbjct: 167 NGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFS-YPSIRKNSRHSSNS 225
Query: 187 SSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
S +S T + K + S+G+ + +++P + S + PYI P
Sbjct: 226 MPSFPHSS----TAVLLKR---HSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYP 275
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 35.7 bits (82), Expect = 0.094
Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 5/87 (5%)
Query: 141 QTASKRGALSHPRTPTTLLSKQGSHPLTPS--SRYPTTPTSGGHNPSYSSSGSEASIEPR 198
AS S ++ S P TP+ + T+P G + S +++G+E++ +
Sbjct: 23 TYASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQAS 82
Query: 199 TPTVSKSPLGRTFSAGSHMSLPSLSPL 225
P ++ + T S +S+P +
Sbjct: 83 QPHSHETTITCT---KSLISVPYYKSV 106
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 33.7 bits (77), Expect = 0.18
Identities = 23/120 (19%), Positives = 37/120 (30%), Gaps = 13/120 (10%)
Query: 128 TQSVQSLR--DVSRTQTASKRGAL--SHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHN 183
++S +S +SR + + + K Y P N
Sbjct: 52 SESSESSSSEAISRRDHPPRYESYRDKSREKLEDVSHKHQDMGYYKGPPYSGYPFLMLPN 111
Query: 184 PSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
P Y +GS + + P + V H L+PL + +H P P LP
Sbjct: 112 P-YLPNGSLSPLPPSSNKV----PVVQPPHHVH----PLTPLITYSNEHFSPGTPPPHLP 162
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 33.5 bits (76), Expect = 0.21
Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 2/99 (2%)
Query: 91 LRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALS 150
L + P T + T SRT T + ++ R SRTQT +
Sbjct: 79 LETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEK--KNTRTTSRTQTPPTTSTAA 136
Query: 151 HPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSS 189
T TT S G P T S + ++ T+ H + ++
Sbjct: 137 VQTTTTTNTSSTGKEPTTTSVQPRSSATTQSHEETSQAN 175
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 34.4 bits (78), Expect = 0.22
Identities = 38/206 (18%), Positives = 64/206 (31%), Gaps = 32/206 (15%)
Query: 49 LRTSSHSSVKRRLLTRTSSQASLTSRTGRS------DLRPRSGLERPILRVNTSPYPA-T 101
+ T+ T +++ + + TG + D P S + P
Sbjct: 449 VPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSA---TPNATSPTPA 505
Query: 102 KGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSK 161
TP+ T + + T P V + T T S A S T + ++
Sbjct: 506 VTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNN 565
Query: 162 QGSHPLT-PSSRYPTTPTSGGHNPSYSSSGSEASIEPRT--------------------P 200
+ +T S + T+G H SS S+ P + P
Sbjct: 566 TNTPVVTSAPSVLTSAVTTGQHGTG-SSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHP 624
Query: 201 TVSKSPLGRTFSAGSHMSLPSLSPLP 226
T ++ T S S + +LSP P
Sbjct: 625 TGGENITEETPSVPSTTHVSTLSPGP 650
Score = 31.7 bits (71), Expect = 1.5
Identities = 39/231 (16%), Positives = 72/231 (31%), Gaps = 27/231 (11%)
Query: 34 TLTKLELEHNLNPIRLRTSSHSSVKRRLLTRTSSQASLTSRTGRSDLRPRSGLERPILRV 93
+ + +P S ++ ++T S + TG+ ++P +
Sbjct: 543 PTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPS 602
Query: 94 NTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPR 153
++ P + T + +T T SV S VS G S
Sbjct: 603 SSHSTPRSNSTSTTPLLTSAHPTGGENITE---ETPSVPSTTHVSTLSPGPGPGTTSQVS 659
Query: 154 TPTTLLSKQGSHPLTPSSRYP--TTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTF 211
P + +SRYP T G NP+ +S + + + PTV+ +
Sbjct: 660 GPGN----------SSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709
Query: 212 --------SAGSHMSLPS----LSPLPCHQCQHSPPTWSPYILPMNTDHHP 250
+ + S + P + + PP+ S + P T P
Sbjct: 710 TTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSP 760
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 33.5 bits (77), Expect = 0.26
Identities = 21/96 (21%), Positives = 31/96 (32%), Gaps = 6/96 (6%)
Query: 117 LSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTT 176
L + V P + ++ + L +V ++ L TP+ TP
Sbjct: 62 LEKRVRTP-ASIKTSKRLIEVPE----AEESLLDSYTTPSDK-GGMLRILSTPELPKRKR 115
Query: 177 PTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFS 212
S S S S AS P + SP FS
Sbjct: 116 SFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFS 151
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.9 bits (77), Expect = 0.34
Identities = 31/151 (20%), Positives = 41/151 (27%), Gaps = 6/151 (3%)
Query: 89 PILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGA 148
P ++V A P S P + Q L +S +R
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238
Query: 149 LSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLG 208
HP S+Q P PSSR+P + G P + + P G
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298
Query: 209 RTFSAGSHMSLPSLSPLPCHQCQHSPPTWSP 239
S PLP HS S
Sbjct: 299 LAQSQV------PPLPLPSQAQPHSHTPPSQ 323
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 33.9 bits (77), Expect = 0.35
Identities = 27/147 (18%), Positives = 65/147 (44%), Gaps = 6/147 (4%)
Query: 49 LRTSSHSSVKRRLLTRTSSQASLTSRTGRSD-LRPRSGLERPILRVNTSPYPATKGT--P 105
+R+ H S++ + A R+G D +R SG++ + +P P +GT
Sbjct: 604 MRSGVHDSLRGGVHDSIRGGAHDGVRSGAHDGVRKSSGVKVEGMGNARAPPP-NQGTHDG 662
Query: 106 HFTKKSFTSADLSRTVTHPYS--RTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQG 163
+F++ +S +S + Y + ++ Q D + K +S + ++ +G
Sbjct: 663 NFSRGKDSSDHVSSYKPYAYDSRKAKNYQDTFDRNDMHGYMKNSPISTIVPSSHMMHTKG 722
Query: 164 SHPLTPSSRYPTTPTSGGHNPSYSSSG 190
+ + + +Y GG++ +++++G
Sbjct: 723 NTGMNVNCKYLGGAAYGGYSNNHAATG 749
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 32.7 bits (75), Expect = 0.62
Identities = 15/108 (13%), Positives = 40/108 (37%), Gaps = 1/108 (0%)
Query: 321 FIMFAVCTTVLVAVHIFALMISTYLLPNI-EAISKLQSTKLVSESPHERMRGFVELAWAF 379
++FA+ + +L L++ L + + L++E + RG ++
Sbjct: 71 LLLFALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSA 130
Query: 380 STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 427
LG L + +L W + +++ +L+ + ++
Sbjct: 131 GFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLALLLLFLLRLLL 178
Score = 32.7 bits (75), Expect = 0.62
Identities = 15/108 (13%), Positives = 40/108 (37%), Gaps = 1/108 (0%)
Query: 578 FIMFAVCTTVLVAVHIFALMISTYLLPNI-EAISKLQSTKLVSESPHERMRGFVELAWAF 636
++FA+ + +L L++ L + + L++E + RG ++
Sbjct: 71 LLLFALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSA 130
Query: 637 STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 684
LG L + +L W + +++ +L+ + ++
Sbjct: 131 GFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLALLLLFLLRLLL 178
>gnl|CDD|216908 pfam02159, Oest_recep, Oestrogen receptor.
Length = 139
Score = 31.4 bits (71), Expect = 0.71
Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 8/75 (10%)
Query: 169 PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCH 228
P Y + P YSS+ P V P S G SL S+ P P
Sbjct: 15 PEGAYDFGAPTETSAPVYSSAS--LGYYPAPEDVHGPP-----SLGGLQSLGSVPPSPLM 67
Query: 229 QCQHSPPTWSPYILP 243
S P SP++ P
Sbjct: 68 -FLPSSPQLSPFLHP 81
>gnl|CDD|223333 COG0255, RpmC, Ribosomal protein L29 [Translation, ribosomal
structure and biogenesis].
Length = 69
Score = 29.6 bits (67), Expect = 0.74
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 11 SDLRHKCDERLQT-IDNIKKEQLDTLTKLELEHNLNPIRLRTSSHSSVKR---RLLT 63
+LR K E L+ + +KKE + +L NP R+R V+R R+LT
Sbjct: 6 KELREKSVEELEEELRELKKELFNLRFQLATGQLENPHRIRE-----VRRDIARILT 57
>gnl|CDD|216998 pfam02366, PMT, Dolichyl-phosphate-mannose-protein
mannosyltransferase. This is a family of
Dolichyl-phosphate-mannose-protein mannosyltransferase
proteins EC:2.4.1.109. These proteins are responsible
for O-linked glycosylation of proteins, they catalyze
the reaction:- Dolichyl phosphate D-mannose + protein
<=> dolichyl phosphate + O-D-mannosyl-protein. Also in
this family is the Drosophila rotated abdomen protein
which is a putative mannosyltransferase. This family
appears to be distantly related to pfam02516 (A Bateman
pers. obs.).
Length = 244
Score = 31.9 bits (73), Expect = 0.82
Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 12/73 (16%)
Query: 371 GFVELAWAFSTVL-GLFLFLVEVAILCWV---KFWDYSLR--------AAIAATIIVIPV 418
+ L A ST GLF L + W+ D SL A +I+IP
Sbjct: 170 TGIALGLAVSTKGVGLFTVLPVGLLTIWLLWQLLGDLSLLYKSIWKHLFARLLFLILIPW 229
Query: 419 LIIFVVFVVHFYH 431
+ F VHF+
Sbjct: 230 ALYLAQFYVHFWL 242
Score = 31.9 bits (73), Expect = 0.82
Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 12/73 (16%)
Query: 628 GFVELAWAFSTVL-GLFLFLVEVAILCWV---KFWDYSLR--------AAIAATIIVIPV 675
+ L A ST GLF L + W+ D SL A +I+IP
Sbjct: 170 TGIALGLAVSTKGVGLFTVLPVGLLTIWLLWQLLGDLSLLYKSIWKHLFARLLFLILIPW 229
Query: 676 LIIFVVFVVHFYH 688
+ F VHF+
Sbjct: 230 ALYLAQFYVHFWL 242
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 32.0 bits (72), Expect = 1.0
Identities = 39/173 (22%), Positives = 54/173 (31%), Gaps = 29/173 (16%)
Query: 63 TRTSSQASLTSRTGRSDLRPRSGL----ERPILRVNTSPY---PATKGTPHFTKKSFTSA 115
T SL S D PR RP++ ++ T HF
Sbjct: 108 TMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSGISASPHPTPSPLHFPTSPILPQ 167
Query: 116 DLSRTVTHPYSR-----------TQSVQSLRDVSRTQTA----SKRGALSHPRTPTTLLS 160
S H R + VQ + D S Q S +G + + PT +L+
Sbjct: 168 QPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLA 227
Query: 161 KQGSHPLT-------PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSP 206
P+ P ++ PTT T GG S + S S P V P
Sbjct: 228 PPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTSPANRFVGLGP 280
>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1. NBP1 is a
nuclear protein which has been shown in Saccharomyces
cerevisiae to be essential for the G2/M transition of
the cell cycle.
Length = 313
Score = 31.8 bits (72), Expect = 1.1
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 6/60 (10%)
Query: 3 TAGPVKSSS-DLRHKCDERLQTIDNI-----KKEQLDTLTKLELEHNLNPIRLRTSSHSS 56
T+ P++ S+ R + + LQ DN K + + LT+ E E +L+PIR+ S +SS
Sbjct: 254 TSSPIRKSNRQGRERPLQPLQKGDNFYAKYPKIPKTERLTEKESEESLSPIRIDYSKYSS 313
>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease.
Length = 425
Score = 31.9 bits (73), Expect = 1.3
Identities = 20/172 (11%), Positives = 55/172 (31%), Gaps = 10/172 (5%)
Query: 524 NVPEWLFIMFAVCTTVLVAVHIFALMIST-YLLPGNDVAMVELQINEPTNVPEWLFIMFA 582
+VP+ LFI + + + V+I L + + + W I+
Sbjct: 218 DVPKALFIGLLIVGVLYLLVNILFLGVVPDDEIAKLSNLPSVAALLFEAVGGPWGAIIVV 277
Query: 583 VCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGL 642
+ + + + ++++ + A + + + L S +L L
Sbjct: 278 ILLALSLLGAVNTAIVASSRVLEALARDGVLPKFFAKVNKFGSPVRALILTAILSLILLL 337
Query: 643 FLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 694
L A Y+ +++A ++ L++ + ++ +
Sbjct: 338 LFLLSGAA---------YNALLSLSAVGYLLVYLLLIIGLLILRKKRPDLPR 380
>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
The C-terminal domain of the transcription elongation
factor protein Spt5 is necessary for binding to Spt4 to
form the functional complex that regulates early
transcription elongation by RNA polymerase II. The
complex may be involved in pre-mRNA processing through
its association with mRNA capping enzymes. This CTD
domain carries a regular nonapeptide repeat that can be
present in up to 18 copies, as in S. pombe. The repeat
has a characteristic TPA motif.
Length = 112
Score = 30.1 bits (68), Expect = 1.3
Identities = 20/86 (23%), Positives = 25/86 (29%), Gaps = 9/86 (10%)
Query: 153 RTPT-TLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTF 211
RTP + R P +G P++ RTP GRT
Sbjct: 14 RTPAWGQGGSRTPAYGGAGGRTPAWGGAGNKTPAFGDGS-------RTPAYGGESNGRTP 66
Query: 212 SAGSHMSLPSLS-PLPCHQCQHSPPT 236
+ G S S P S PT
Sbjct: 67 AYGGSGSYLGASTPGAAPTPAKSAPT 92
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 31.8 bits (71), Expect = 1.4
Identities = 20/87 (22%), Positives = 29/87 (33%), Gaps = 11/87 (12%)
Query: 154 TPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIE------PRTPTVSKSPL 207
P ++G+ P+TP P +G S S++ E T PL
Sbjct: 60 APAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPL 119
Query: 208 G-----RTFSAGSHMSLPSLSPLPCHQ 229
G R S + + LPC Q
Sbjct: 120 GKNSQVRPASTVTPGPSGKGANLPCPQ 146
>gnl|CDD|225579 COG3037, SgaT, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 481
Score = 31.5 bits (72), Expect = 1.6
Identities = 33/187 (17%), Positives = 63/187 (33%), Gaps = 48/187 (25%)
Query: 521 EPTNVPEWLFIMF--AVCTTVLVAVHIFALMISTYLLPGNDVAMVELQINEPTNVPEWLF 578
E +P+WL I V T + + F ++ + G D + N +
Sbjct: 219 EDLKLPKWLSIFRDSIVAITATMMILFFGAILLSLGAAGIDNVQLMAS-GGQAN-----W 272
Query: 579 IMFAVCTTVLVAVHIFALM----------------ISTYLLPNIEAISKLQSTKLVSESP 622
++ + + A +F ++ IS L+PN A+ L + +P
Sbjct: 273 FVYIIQQGLTFAAGVFIILQGVRMFVAELVPAFKGISEKLVPN--AVPALDCAAVYPFAP 330
Query: 623 HERMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVF 682
+ + GF+ + L + + IL + I++IP F
Sbjct: 331 NAVLIGFL-----SGFIGQL----IGMGIL-------GGFSSPNTGGILIIPG------F 368
Query: 683 VVHFYHN 689
V HF+
Sbjct: 369 VPHFFSG 375
>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein. This
family consists of several Gammaherpesvirus capsid
proteins. The exact function of this family is unknown.
Length = 148
Score = 30.2 bits (68), Expect = 1.7
Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 7/87 (8%)
Query: 122 THPYSRTQSVQSLRDVSRTQTASKRGALS-HPRTPTTLLSKQGSHPLTPSSRYPTTPTSG 180
H R + +Q+LR ++S ALS + + + S + +SG
Sbjct: 65 MHGIRRKKHLQALRGAGPQTSSSIGSALSASSSSASGV----PGGANQLSGSSGSALSSG 120
Query: 181 GHN--PSYSSSGSEASIEPRTPTVSKS 205
+ S S SGS A P+ SK
Sbjct: 121 PGSLSSSSSLSGSGAGAGDTAPSSSKK 147
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 31.4 bits (71), Expect = 1.8
Identities = 16/75 (21%), Positives = 21/75 (28%), Gaps = 12/75 (16%)
Query: 166 PLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPL 225
P + + P G P S ++ P S P S
Sbjct: 163 QQNPPPQAQSAPQVSGLYPEESPYQPQSY----PPNEPLPS--------SMAMQPPYSGA 210
Query: 226 PCHQCQHSPPTWSPY 240
P Q + PP SPY
Sbjct: 211 PPSQQFYGPPQPSPY 225
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 31.2 bits (71), Expect = 2.0
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 8/83 (9%)
Query: 169 PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCH 228
SS PT S H S SSS S++S T S P R SA S + PS+ PLP
Sbjct: 1 SSSSNPT-SRSQSHASSSSSSSSQSSPPSST---SPRPRRRKPSASSLLHTPSILPLP-- 54
Query: 229 QCQHSPPTWSPYILPMNTDHHPR 251
+ S P+ LP
Sbjct: 55 --KLSSPSPPSVTLPPAATTQTP 75
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 31.4 bits (71), Expect = 2.1
Identities = 24/125 (19%), Positives = 34/125 (27%), Gaps = 9/125 (7%)
Query: 65 TSSQASLTSRTGRSDLRPRSGLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHP 124
SS L+ R L RP R+++ + +SR
Sbjct: 502 LSSIPPLSPRQSIITLPT---PSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAG 558
Query: 125 YSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNP 184
S +SV + R L R P L K S + S +PT P
Sbjct: 559 LSFNRSVSDIE-----GERIGRYNLLPTRIPA-LPFKAESTTSSRRSSSLPSPTGVIGFP 612
Query: 185 SYSSS 189
Sbjct: 613 GSVPR 617
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.5 bits (71), Expect = 2.1
Identities = 34/143 (23%), Positives = 57/143 (39%), Gaps = 16/143 (11%)
Query: 69 ASLTSRTGRSDLRPR-SGLERPILRVNTSP------YPATKGTPHFTKKSFTSADLSRTV 121
A +T T +++ P +G ER +P Y +T G KS+T R
Sbjct: 119 AKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRV--KSYTPK--PRGE 174
Query: 122 THPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGG 181
S TQ+ R+ R T S G + + P+T ++ SH +TP + + +
Sbjct: 175 VKSSSPTQT----REKVRKYTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITAT 230
Query: 182 HNPSYSSSGSEASIEPRTPTVSK 204
++ S+ + TPT K
Sbjct: 231 -YKMLETNPSKRTAGKTTPTPLK 252
>gnl|CDD|218244 pfam04754, Transposase_31, Putative transposase, YhgA-like. This
family of putative transposases includes the YhgA
sequence from Escherichia coli and several prokaryotic
homologues.
Length = 203
Score = 30.6 bits (70), Expect = 2.3
Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 9/42 (21%)
Query: 414 IVIPVLIIFVVFVVHFYHNVVKYKCKTSISDMKELEDIKKKL 455
+VIP++ FYH + ++ D+ E ++ K L
Sbjct: 110 LVIPIV---------FYHGKESWPAPRNLLDLFEDPELAKYL 142
Score = 30.6 bits (70), Expect = 2.3
Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 9/42 (21%)
Query: 671 IVIPVLIIFVVFVVHFYHNVVKYKCKTSISDMKELEDIKKKL 712
+VIP++ FYH + ++ D+ E ++ K L
Sbjct: 110 LVIPIV---------FYHGKESWPAPRNLLDLFEDPELAKYL 142
>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
Caenorhabditis elegans Sre proteins are candidate
chemosensory receptors. There are four main recognised
groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
(this family), Sra pfam02117 and Srb pfam02175 comprise
the Sra group. All of the above receptors are thought to
be G protein-coupled seven transmembrane domain
proteins. The existence of several different
chemosensory receptors underlies the fact that in spite
of having only 20-30 chemosensory neurones, C. elegans
detects hundreds of different chemicals, with the
ability to discern individual chemicals among
combinations.
Length = 363
Score = 30.8 bits (70), Expect = 2.4
Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 3/67 (4%)
Query: 371 GFVELAWAFSTVLGLFLFLVEVAILCW-VKFWDYSLRAAIAATIIVI--PVLIIFVVFVV 427
GF++ + FS V GL VE +K ++ R I +I+I + I +F +
Sbjct: 126 GFLQWHYMFSVVFGLLAIAVERCFASVFIKDYEKKSRIWIPVILIIISQFLAISSALFFL 185
Query: 428 HFYHNVV 434
V+
Sbjct: 186 FNKLPVL 192
Score = 30.8 bits (70), Expect = 2.4
Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 3/67 (4%)
Query: 628 GFVELAWAFSTVLGLFLFLVEVAILCW-VKFWDYSLRAAIAATIIVI--PVLIIFVVFVV 684
GF++ + FS V GL VE +K ++ R I +I+I + I +F +
Sbjct: 126 GFLQWHYMFSVVFGLLAIAVERCFASVFIKDYEKKSRIWIPVILIIISQFLAISSALFFL 185
Query: 685 HFYHNVV 691
V+
Sbjct: 186 FNKLPVL 192
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 31.2 bits (70), Expect = 2.5
Identities = 47/220 (21%), Positives = 65/220 (29%), Gaps = 55/220 (25%)
Query: 88 RPILRVNTSPYPATKGTPHFTKKSFTSAD-------LSRTVTHPYSRTQSVQSLRDVSRT 140
+P A +P ++ D LS VT P + V R RT
Sbjct: 539 KPPDSFANVAAAALYNSPFCMPQTPQPQDAPEQTEALSLVVT-PKKKRHKVTDTRITPRT 597
Query: 141 ---------QTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGS 191
G P P++L + G HP +P T NPS S
Sbjct: 598 VSRILALRDAVGPAAGTHHQPLHPSSLSASMGFHPPPFRHPFPLPLTVAIPNPSLHQSEV 657
Query: 192 EASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILPMN-TDHHP 250
++P H+ PS SP + + SP LP T HP
Sbjct: 658 FMGYPFQSP---------------HLGAPSGSPPG--KDRDSPD------LPRPTTSLHP 694
Query: 251 RLYKNHSRRDLGHHYEELPEDSEGLVWRRLHMSRAKLKAT 290
+L H HH P+ L +S K +
Sbjct: 695 KLLSAH------HHPGSSPDG--------LSLSLIKSECG 720
>gnl|CDD|165093 PHA02724, PHA02724, hydrophobic IMV membrane protein; Provisional.
Length = 53
Score = 27.6 bits (61), Expect = 2.8
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 389 LVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVV 434
L+ + I C V +++++ + II I+F+ F+ HF H+ +
Sbjct: 8 LLLLVITCCVLLFNFTISSKTKIDIIFAVQTIVFIWFIFHFVHSAI 53
Score = 27.6 bits (61), Expect = 2.8
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 646 LVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVV 691
L+ + I C V +++++ + II I+F+ F+ HF H+ +
Sbjct: 8 LLLLVITCCVLLFNFTISSKTKIDIIFAVQTIVFIWFIFHFVHSAI 53
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 30.8 bits (70), Expect = 3.4
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 544 HIFALMISTYLLPGNDVAMVELQINEPTN 572
HI L++ Y L N +A+VE EP N
Sbjct: 62 HIMKLLLEKYKLNNNKIAIVEHSCGEPQN 90
>gnl|CDD|212032 cd10322, SLC5sbd, Solute carrier 5 family, sodium/glucose
transporters and related proteins; solute-binding
domain. This family represents the solute-binding
domain of SLC5 proteins (also called the sodium/glucose
cotransporter family or solute sodium symporter family)
that co-transport Na+ with sugars, amino acids,
inorganic ions or vitamins. Family members include: the
human glucose (SGLT1, 2, 4, 5), chiro-inositol (SGLT5),
myo-inositol (SMIT), choline (CHT), iodide (NIS),
multivitamin (SMVT), and monocarboxylate (SMCT)
cotransporters, as well as Vibrio parahaemolyticus
glucose/galactose (vSGLT), and Escherichia coli proline
(PutP) and pantothenate (PutF) cotransporters. Vibrio
parahaemolyticus Na(+)/galactose cotransporter (vSGLT)
has 13 transmembrane helices (TMs): TM-1, an inverted
topology repeat: TMs1-5 and TMs6-10, and TMs 11-12 (TMs
numbered to conform to the solute carrier 6 family
Aquifex aeolicus LeuT). One member of this family, human
SGLT3, has been characterized as a glucose sensor and
not a transporter. Members of this family are important
in human physiology and disease.
Length = 455
Score = 30.6 bits (70), Expect = 3.6
Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 7/62 (11%)
Query: 274 GLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVA 333
GL W+R KA A S L+ G + + L + V V V
Sbjct: 401 GLFWKRAT------KAGAIAS-LIVGLIVTLVWLLLGLASPFGIDPAIPGLLVSLIVFVV 453
Query: 334 VH 335
V
Sbjct: 454 VS 455
Score = 30.6 bits (70), Expect = 3.6
Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 7/62 (11%)
Query: 483 GLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVA 542
GL W+R KA A S L+ G + + L + V V V
Sbjct: 401 GLFWKRAT------KAGAIAS-LIVGLIVTLVWLLLGLASPFGIDPAIPGLLVSLIVFVV 453
Query: 543 VH 544
V
Sbjct: 454 VS 455
>gnl|CDD|219051 pfam06479, Ribonuc_2-5A, Ribonuclease 2-5A. This domain is a
endoribonuclease. Specifically it cleaves an intron from
Hac1 mRNA in humans, which causes it to be much more
efficiently translated.
Length = 127
Score = 29.1 bits (66), Expect = 3.9
Identities = 14/28 (50%), Positives = 15/28 (53%), Gaps = 7/28 (25%)
Query: 251 RLYKNHSRRDL-------GHHYEELPED 271
R YK S RDL HHY ELPE+
Sbjct: 56 RKYKGDSVRDLLRAIRNKKHHYRELPEE 83
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 29.9 bits (67), Expect = 4.1
Identities = 29/190 (15%), Positives = 51/190 (26%), Gaps = 27/190 (14%)
Query: 11 SDLRHKCDE---RLQTIDNIKK--EQLDTLTKLELEHNLNPIRLRTSSHSSVKRRLLTRT 65
S++ K + L+ + +I K LD+ T E+E L ++ LL
Sbjct: 29 SEIDAKVCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLL---------KEIRELLLKAI 79
Query: 66 SSQASLTSRTGRSDLRPRSGLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPY 125
Q R L R ++ A + +
Sbjct: 80 YIQKE----KSDLADRAEKLLRRHRKLLDDRI--AKRPHEKVAAR--IENCHDAVSRLER 131
Query: 126 SRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPS 185
+ S R+ + A S T L ++ H L S ++
Sbjct: 132 NSYSSA-----ARRSSGEHRSAASSQGSRHTKLKKRKNIHNLKRRSPELSSKREVSFTLE 186
Query: 186 YSSSGSEASI 195
S A+
Sbjct: 187 SPSVPDTATR 196
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 30.6 bits (68), Expect = 4.4
Identities = 40/163 (24%), Positives = 65/163 (39%), Gaps = 10/163 (6%)
Query: 46 PIRLRTSSHSSVKRRLLTRTSSQASLTSRTGRSDLRPRSGLERPIL-RVNTSPYPATKGT 104
P+R R+ + + R +++ S +S +S + S P S R ++ SP P +
Sbjct: 1795 PVRRRSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRA- 1853
Query: 105 PHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGS 164
P +S + R +P+ +S D S TA L+ P +G
Sbjct: 1854 PVDRSRSGRRRERDRPSANPFRWAPRQRSRADHSPDGTAPGDAPLNLEDGP-----GRGR 1908
Query: 165 HPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPL 207
TPSS T P+ G S + +E S P ++ SPL
Sbjct: 1909 PIWTPSSA-TTLPSRSGPEDSVDETETEDSAPP--ARLAPSPL 1948
>gnl|CDD|224945 COG2034, COG2034, Predicted membrane protein [Function unknown].
Length = 85
Score = 27.6 bits (62), Expect = 6.0
Identities = 7/55 (12%), Positives = 20/55 (36%)
Query: 368 RMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIF 422
+ +A +G L + + + + F + +++ P+ I+F
Sbjct: 2 IRSEDLIIAGLILIFIGFLLIFLGIVLPAFSPFAESGSSVEYGGVVMIGPIPIVF 56
Score = 27.6 bits (62), Expect = 6.0
Identities = 7/55 (12%), Positives = 20/55 (36%)
Query: 625 RMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIF 679
+ +A +G L + + + + F + +++ P+ I+F
Sbjct: 2 IRSEDLIIAGLILIFIGFLLIFLGIVLPAFSPFAESGSSVEYGGVVMIGPIPIVF 56
>gnl|CDD|227005 COG4658, RnfD, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfD [Energy production and conversion].
Length = 338
Score = 29.7 bits (67), Expect = 6.1
Identities = 43/186 (23%), Positives = 74/186 (39%), Gaps = 13/186 (6%)
Query: 494 AKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTY 553
KL+ + T+ L G A++ + L + P P W+ ++ V++ H++ +
Sbjct: 61 LKLRKKSVTAFLPDGSALLTGLLLAVVIPPLAPWWMTVIGTFF-AVVIGKHLYGGLGQNP 119
Query: 554 LLPGNDVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQ 613
P V V L I+ P + WL + A+ IF+ + L I+AI+ Q
Sbjct: 120 FNPAM-VGYVVLLISFPVQMTTWLPPISALQEPADSYSLIFSSL--VQLRTGIDAIT--Q 174
Query: 614 STKLVSESPHERMRGFVELAWAFSTVLGLF-------LFLVEVAILCWVKFWDYSLRAAI 666
+T L R F+ A FS L FL+ L WVK + + +
Sbjct: 175 ATPLDDFKLSLREGDFILQAPIFSGKLAGAGWEWVNLAFLLGGLFLLWVKAIRWHIPVSF 234
Query: 667 AATIIV 672
T+ +
Sbjct: 235 LVTLFL 240
>gnl|CDD|223701 COG0628, yhhT, Predicted permease, member of the PurR regulon
[General function prediction only].
Length = 355
Score = 29.5 bits (67), Expect = 6.6
Identities = 28/158 (17%), Positives = 64/158 (40%), Gaps = 19/158 (12%)
Query: 285 AKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTY 344
AKL + ++ A++ ++ ++ L + + +++V++ + L++ +
Sbjct: 112 AKLLESLPLLGERLPVSVDALLSALLSILQSILGSLLSLLSSLLSLIVSL-LLVLVLLFF 170
Query: 345 LLPNIEAISKLQSTKLVSESPHERMRGFVELAWA--FSTVLGLFLFLVEVAILCWVKFW- 401
LL + E + + + KL+ +R R + A + G L + V IL +
Sbjct: 171 LLLDGERLRR-KLIKLLPRKLRKRARRILSEVNATLSGYLRGQVLVALIVGILTGIGLLI 229
Query: 402 ---DYSLRAAIAA-----------TIIVIPVLIIFVVF 425
Y+L + A I +IP +II ++
Sbjct: 230 LGVPYALLLGLLAGLLSLIPYIGPVIGLIPAVIIALLQ 267
>gnl|CDD|223654 COG0581, PstA, ABC-type phosphate transport system, permease
component [Inorganic ion transport and metabolism].
Length = 292
Score = 29.1 bits (66), Expect = 7.5
Identities = 33/162 (20%), Positives = 62/162 (38%), Gaps = 33/162 (20%)
Query: 531 IMFAVCTTVLVAVHIFALMISTYLLPGNDVAMVELQ--INEPTNVPEW----------LF 578
V VA+ ++ T L+ G + L+ P+ E L+
Sbjct: 19 ATGLVFLAAAVALLPLLWILWTILIRGIPALSLNLEFFTGSPSPPGEGGGIGPAIVGTLY 78
Query: 579 IMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAF-S 637
++ +L+ V + + YL E K + TK++ R +++ + S
Sbjct: 79 LILL---AILIGVPL-GIGAGIYL---AEYAKKSRLTKVI--------RFAIDILASVPS 123
Query: 638 TVLGLFLFLVEVAILCWVKFWDYSLRA-AIAATIIVIPVLII 678
V GLF V L + +S A A+A ++++PV+I
Sbjct: 124 IVYGLFGLGFFVVTL----GFGFSALAGALALALLMLPVVIR 161
>gnl|CDD|119394 cd06261, TM_PBP2, Transmembrane subunit (TM) found in Periplasmic
Binding Protein (PBP)-dependent ATP-Binding Cassette
(ABC) transporters which generally bind type 2 PBPs.
These types of transporters consist of a PBP, two TMs,
and two cytoplasmic ABC ATPase subunits, and are mainly
involved in importing solutes from the environment. The
solute is captured by the PBP which delivers it to a
gated translocation pathway formed by the two TMs. The
two ABCs bind and hydrolyze ATP and drive the transport
reaction. For these transporters the ABCs and TMs are on
independent polypeptide chains. These systems transport
a diverse range of substrates. Most are specific for a
single substrate or a group of related substrates;
however some transporters are more promiscuous,
transporting structurally diverse substrates such as the
histidine/lysine and arginine transporter in
Enterobacteriaceae. In the latter case, this is achieved
through binding different PBPs with different
specificities to the TMs. For other promiscuous
transporters such as the multiple-sugar transporter Msm
of Streptococcus mutans, the PBP has a wide substrate
specificity. These transporters include the
maltose-maltodextrin, phosphate and sulfate
transporters, among others.
Length = 190
Score = 28.8 bits (65), Expect = 7.5
Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)
Query: 369 MRGFVELAWAF-STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFV 423
+R ++L + S VLGL L L+ +L W L A I A +++ P +
Sbjct: 37 LRRIIDLLLSLPSLVLGLLLVLLFGVLLGWGILPGLGLPALILALLLIAPFARLIR 92
Score = 28.8 bits (65), Expect = 7.5
Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)
Query: 626 MRGFVELAWAF-STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFV 680
+R ++L + S VLGL L L+ +L W L A I A +++ P +
Sbjct: 37 LRRIIDLLLSLPSLVLGLLLVLLFGVLLGWGILPGLGLPALILALLLIAPFARLIR 92
>gnl|CDD|219702 pfam08022, FAD_binding_8, FAD-binding domain.
Length = 103
Score = 27.7 bits (62), Expect = 7.8
Identities = 8/30 (26%), Positives = 13/30 (43%), Gaps = 2/30 (6%)
Query: 201 TVSKSPLGRTFSAGSHM--SLPSLSPLPCH 228
VSK + G ++ + P +S L H
Sbjct: 20 IVSKPKKPFKYKPGQYIFINFPPISFLQSH 49
>gnl|CDD|224205 COG1286, CvpA, Uncharacterized membrane protein, required for
colicin V production [General function prediction only].
Length = 182
Score = 28.8 bits (65), Expect = 7.9
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 369 MRGFV-ELAWAFSTVLGLF---LFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVV 424
RGF+ E+ S +L F LF +A L ++ Y A A I +L+I
Sbjct: 21 RRGFIREVLSLLSWILAAFVASLFYKPLAPL-LREYIPYPNIAIGIAIAIFFVILLIVGA 79
Query: 425 FVVHF 429
FV
Sbjct: 80 FVNSL 84
Score = 28.8 bits (65), Expect = 7.9
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 626 MRGFV-ELAWAFSTVLGLF---LFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVV 681
RGF+ E+ S +L F LF +A L ++ Y A A I +L+I
Sbjct: 21 RRGFIREVLSLLSWILAAFVASLFYKPLAPL-LREYIPYPNIAIGIAIAIFFVILLIVGA 79
Query: 682 FVVHF 686
FV
Sbjct: 80 FVNSL 84
>gnl|CDD|199217 cd10422, RNase_Ire1, RNase domain (also known as the kinase
extension nuclease domain) of Ire1. The model
represents the C-terminal endoribonuclease domain of the
multi-functional protein Ire1; Ire1 in addition contains
a type I transmembrane serine/threonine protein kinase
(STK) domain, and a Luminal dimerization domain. Ire1 is
essential for the endoplasmic reticulum (ER) unfolded
protein response (UPR), which acts as an ER stress
sensor and is the oldest and most conserved component of
the UPR in eukaryotes. During ER stress, IRE1 dimerizes
through its N-terminal luminal domain and forms
oligomers, promoting trans-autophosphorylation by its
cytosolic kinase domain. This leads to a conformational
change that stimulates its endoribonuclease (RNase)
activity and results in the cleavage of its mRNA
substrate, Hac1 in yeast and Xbp1 in metazoans, thus
promoting a splicing event that enables translation into
a transcription factor which activates the UPR. This
RNase domain is homologous to the RNase domain of RNase
L, and possesses a novel fold for a nuclease and appears
to be rigid irrespective of the activation state of
IRE1. Structural analysis and mutational studies have
revealed that an early stage 'phosphoryl-transfer'
competent conformation of IRE1 favors face-to-face
dimerization of the kinase domains which precedes and is
distinct from the RNase 'active' back-to-back
conformation. Furthermore, in yeast IRE1, the flavonol
quercetin activates the RNase and potentiates activation
of the protein kinase by ADP, hinting at the possible
existence of endogenous cytoplasmic ligands that may
function along with stress signals from ER lumen in
order to modulate IRE1 activity, thus identifying IRE1
as a target for development of ATP-competitive
inhibitors to modulate the UPR with specific relevance
for multiple myeloma.
Length = 129
Score = 27.9 bits (63), Expect = 8.0
Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 7/28 (25%)
Query: 251 RLYKNHSRRDL-------GHHYEELPED 271
R YK S RDL HHY ELP D
Sbjct: 57 RKYKGSSVRDLLRALRNKKHHYRELPPD 84
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 29.3 bits (65), Expect = 8.1
Identities = 19/72 (26%), Positives = 21/72 (29%), Gaps = 8/72 (11%)
Query: 147 GALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRT------- 199
S S P S P SG SSS S E R
Sbjct: 267 PPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPELRPAAGHLQQ 326
Query: 200 PT-VSKSPLGRT 210
P+ V+ S L RT
Sbjct: 327 PSPVNMSALERT 338
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.5 bits (66), Expect = 8.2
Identities = 30/196 (15%), Positives = 50/196 (25%), Gaps = 19/196 (9%)
Query: 39 ELEHNLNPIRLRTSSHSSVKRRLLTRTSSQASLTSRT--GRSDLRPRSGLERPILRVNTS 96
E H L+ S SS TS++ + + G L S + P L +
Sbjct: 204 EENHTLSVTDKEKSEASSKGL-----TSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEK 258
Query: 97 PYPATKGTPHFTKK------SFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALS 150
H AD + S + S
Sbjct: 259 GKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSS 318
Query: 151 HPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRT 210
+ S + ++ ++P S S S + E + SKS
Sbjct: 319 SAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSS---- 374
Query: 211 FSAGSHMSLPSLSPLP 226
+G+ S S +
Sbjct: 375 --SGARDSKKDASGMS 388
>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein. This family
includes an extracellular region from the envelope
glycoprotein of Ebola and Marburg viruses. This region
is also produced as a separate transcript that gives
rise to a non-structural, secreted glycoprotein, which
is produced in large amounts and has an unknown
function. Processing of this protein may be involved in
viral pathogenicity.
Length = 364
Score = 29.0 bits (65), Expect = 8.2
Identities = 15/79 (18%), Positives = 26/79 (32%), Gaps = 2/79 (2%)
Query: 105 PHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGS 164
TKK+ T S T S + + + T+ R +L TT +S +
Sbjct: 276 FWETKKNVTKQGQSSTCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTTIST--N 333
Query: 165 HPLTPSSRYPTTPTSGGHN 183
+ + + P N
Sbjct: 334 NTSKHNFSTQSIPLQNFTN 352
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 28.9 bits (65), Expect = 8.6
Identities = 23/113 (20%), Positives = 29/113 (25%), Gaps = 16/113 (14%)
Query: 168 TPSSRYPTTPTSGGHNPSYSSSGSEASIEP---RTPTVSKSPLGR--TFSAGSHMSLPSL 222
Y P SY SS S + + + S+S L + S S MS L
Sbjct: 27 DEERDYSYAPHP--PTTSYLSSKSVPTTPGILSHSRSPSRSRLHKRKKSSRRSPMSDTLL 84
Query: 223 SPLPCHQCQHSPPTWSPYILPMNTDHHPRLYKNHSRRDLGHHYEELPEDSEGL 275
S + H S R DSE L
Sbjct: 85 ---------KSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDSEWL 128
>gnl|CDD|233213 TIGR00966, 3a0501s07, protein-export membrane protein SecF. This
bacterial protein is always found with the homologous
protein-export membrane protein SecD. In numerous
lineages, this protein occurs as a SecDF fusion protein
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 246
Score = 28.8 bits (65), Expect = 8.8
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 374 ELAWAFSTVLGLFLFLVEVAILCWVKF-WDYSLRAAIAATIIVIPVLIIFVVF 425
ELA VL L LV + I V+F W ++L A +A VI + ++ +F
Sbjct: 95 ELAR--KAVLALLAALVLILIYITVRFEWRFALGAIVALVHDVIITVGVYSLF 145
Score = 28.8 bits (65), Expect = 8.8
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 631 ELAWAFSTVLGLFLFLVEVAILCWVKF-WDYSLRAAIAATIIVIPVLIIFVVF 682
ELA VL L LV + I V+F W ++L A +A VI + ++ +F
Sbjct: 95 ELAR--KAVLALLAALVLILIYITVRFEWRFALGAIVALVHDVIITVGVYSLF 145
>gnl|CDD|226007 COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial
benzodiazepine receptor homolog) [Signal transduction
mechanisms].
Length = 161
Score = 28.1 bits (63), Expect = 9.4
Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 372 FVELAWAFSTVLGLFLFLV-EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFY 430
+ + ++ L +FL+L+ + W + + LR+ AA II++ + I +V + +
Sbjct: 68 WEKGPGQGTSWLLMFLYLLQLILNFAWSPIF-FGLRSLGAALIIILLLWIAVLVLTIIVW 126
Query: 431 H 431
Sbjct: 127 P 127
Score = 28.1 bits (63), Expect = 9.4
Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 629 FVELAWAFSTVLGLFLFLV-EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFY 687
+ + ++ L +FL+L+ + W + + LR+ AA II++ + I +V + +
Sbjct: 68 WEKGPGQGTSWLLMFLYLLQLILNFAWSPIF-FGLRSLGAALIIILLLWIAVLVLTIIVW 126
Query: 688 H 688
Sbjct: 127 P 127
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.133 0.398
Gapped
Lambda K H
0.267 0.0778 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,040,269
Number of extensions: 3543308
Number of successful extensions: 5344
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5280
Number of HSP's successfully gapped: 215
Length of query: 718
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 614
Effective length of database: 6,324,786
Effective search space: 3883418604
Effective search space used: 3883418604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)