RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11431
(590 letters)
>gnl|CDD|217309 pfam02990, EMP70, Endomembrane protein 70.
Length = 518
Score = 674 bits (1742), Expect = 0.0
Identities = 268/558 (48%), Positives = 373/558 (66%), Gaps = 51/558 (9%)
Query: 1 FDFCTVEREEKESPAENLGQVVFGERIRPSPY--KMNEDQTCTELCTKTYTPGNSDSEMK 58
FC E+ + +E+LG+V+FG+RI SPY K E++ C LC +
Sbjct: 8 LPFCR--PEKIKHKSESLGEVLFGDRIYNSPYKLKFLEEKECEVLCAVK------LTSED 59
Query: 59 LALLKRGMDLNYYHHWIVDNMPVTWC-YLTNDGQKYCNTGFPMGCYKHAYQDFCSSMIGE 117
+ ++ ++ YY W++DN+PV ++ +GFP+G
Sbjct: 60 VKFFRKAIEEGYYVQWLIDNLPVAGFVGKVDNKGVGFESGFPLGFQTE------------ 107
Query: 118 GSPNDMYYVFNHVDIRITYHSGEDEEWGSSFQGNGGRIIAVKITPRSISHIEEPGKPPVC 177
+ YY+FNH+D I YH ++++ RI+ +++TPRS+
Sbjct: 108 ----EKYYLFNHLDFVIEYHDRDNDD---------YRIVGIEVTPRSVKP---------- 144
Query: 178 SSAQGELQPMAVPKGKLDKPLQIKYTYSIMFQKNNTVKWSSRWDYILESTSHTNIHWFSI 237
S P + +GK ++ +TYS+ +++++ VKW+SRWD L++ IHWFSI
Sbjct: 145 SGCSTTSSPQELDEGKE---NELTFTYSVKWKESD-VKWASRWDKYLDAMHDLQIHWFSI 200
Query: 238 LNSLMIVLFLSGMVAMIILRTLHKDIARYNQIDSGEDVQEEFGWKLVHGDVFRPPRKGML 297
+NSL+IVLFLSG+V+MI++RTL +DIARYN++D ED QEE GWKLVHGDVFRPPR ML
Sbjct: 201 INSLVIVLFLSGIVSMILMRTLRRDIARYNELDEDEDAQEESGWKLVHGDVFRPPRNPML 260
Query: 298 LSVLLGSGIQILMMTMVTLAFACLGFLSPANRGALMTCAMVMYVCLGTPAGYISARIYKS 357
LS L+GSG+Q+L+M + T+ FACLGFLSP+NRG+L+T A+V+Y G AGY+SAR+YK+
Sbjct: 261 LSALVGSGVQLLLMVIGTIVFACLGFLSPSNRGSLLTAAIVLYALTGFVAGYVSARLYKT 320
Query: 358 FGGEKWKSNIILTSTLVPGVVFCLFFVMNLIFWAKDSSAAVPFSTLLALVSLWVWVSMPL 417
F G+KWK N+ILT+ L PG+VF +FFV+N + WA SS A+PF T++AL+ LW VS+PL
Sbjct: 321 FKGKKWKRNLILTAFLFPGIVFVIFFVLNFVLWAYGSSGAIPFGTIVALLLLWFLVSVPL 380
Query: 418 TFIGSFFGFRKRAI-EHPVRTNQIPRQIPEQSVYTQAIPGVIMGGVLPFGCIFIQLYFIL 476
TF+G GFR RA +HPVRTNQIPRQIPEQ Y +PG++MGG+LPFG IFI+L+FI
Sbjct: 381 TFLGGIVGFRNRAGEQHPVRTNQIPRQIPEQPWYLSPLPGILMGGILPFGAIFIELFFIF 440
Query: 477 SSIWSSQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFITSGFTSIYL 536
+S+W + YYMFGFLFLVF+IL++ CSE TI+L YF LCAEDY WWWRSF+TSG T++Y+
Sbjct: 441 TSLWLHKIYYMFGFLFLVFIILVVVCSEVTIVLTYFQLCAEDYRWWWRSFLTSGSTAVYV 500
Query: 537 FIYCIHYFETRLNISDAS 554
F+Y I+YF T+L IS
Sbjct: 501 FLYSIYYFFTKLKISGFV 518
>gnl|CDD|226951 COG4585, COG4585, Signal transduction histidine kinase [Signal
transduction mechanisms].
Length = 365
Score = 34.4 bits (79), Expect = 0.17
Identities = 16/115 (13%), Positives = 34/115 (29%), Gaps = 10/115 (8%)
Query: 468 IFIQLYFILSSIWS-SQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSF 526
I L +L + S + L ++FL L+I F L + +
Sbjct: 13 FLILLLLLLLILLSLGLILTLISGLLILFLALVIA---------LFDLRLQVLGFLLLIL 63
Query: 527 ITSGFTSIYLFIYCIHYFETRLNISDASSVFLYFGYTLIMVFLFFIMTGMCEMLV 581
I I ++ + L + + LF ++ +++
Sbjct: 64 ILVLLLLILAWLGLPGLSLGFFILIFLLIYLLGVRGFALALILFLLLLLAIALIL 118
>gnl|CDD|220672 pfam10277, Frag1, Frag1/DRAM/Sfk1 family. This family includes
Frag1, DRAM and Sfk1 proteins. Frag1 (FGF receptor
activating protein 1) is a protein that is conserved
from fungi to humans. There are four potential
iso-prenylation sites throughout the peptide, viz CILW,
CIIW and CIGL. Frag1 is a membrane-spanning protein that
is ubiquitously expressed in adult tissues suggesting an
important cellular function. Dram is a family of
proteins conserved from nematodes to humans with six
hydrophobic transmembrane regions and an Endoplasmic
Reticulum signal peptide. It is a lysosomal protein that
induces macro-autophagy as an effector of p53-mediated
death, where p53 is the tumour-suppressor gene that is
frequently mutated in cancer. Expression of Dram is
stress-induced. This region is also part of a family of
small plasma membrane proteins, referred to as Sfk1,
that may act together with or upstream of Stt4p to
generate normal levels of the essential phospholipid
PI4P, thus allowing proper localisation of Stt4p to the
actin cytoskeleton.
Length = 216
Score = 33.8 bits (78), Expect = 0.19
Identities = 12/106 (11%), Positives = 28/106 (26%), Gaps = 9/106 (8%)
Query: 475 ILSSIWSSQTYY------MFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFIT 528
+ + S + F L +++ + + L L + + ++
Sbjct: 106 GGVANFQSTEDHDVHDIGAILFFVGGLLYMLLQTALSYRLGPTPVLRKS---FRLKLILS 162
Query: 529 SGFTSIYLFIYCIHYFETRLNISDASSVFLYFGYTLIMVFLFFIMT 574
+ A S+ F + L F+ F T
Sbjct: 163 IIAFISAIPFGVFFIQHNVHCGPGAYSLSAIFEWILAFSFILFFGT 208
>gnl|CDD|225451 COG2898, COG2898, Uncharacterized conserved protein [Function
unknown].
Length = 538
Score = 33.5 bits (77), Expect = 0.38
Identities = 35/222 (15%), Positives = 68/222 (30%), Gaps = 52/222 (23%)
Query: 290 RPPRKGMLLSVLLGSGIQILMMTMVTLAFACLGFLSPANRGALMTCAMVMYVCLGTPAGY 349
R +L ++ SG+ +L L L L+ A + L
Sbjct: 10 RLLVPQILAILVFLSGVILLFSGATPSLDERLRLLPRIVPLPLIE-ASHLTASLLGALLL 68
Query: 350 ISARIYKSFGGEKWKSNIILTSTLVPGVVFCLFFVMNLIFWAKDSSAAVPFSTLLALVSL 409
+ AR + W + +I+ L+ G+V L + I + +L L+++
Sbjct: 69 LLARGLRKRLKRAWIAALIV---LILGLVLSLAKGLAWI-----------EAAVLFLIAV 114
Query: 410 WVWVSMPLTFIGSFFGFRKRAIEHPVRTNQIPRQIPEQSVYTQAI-PGVIMG-GVLPFGC 467
+ R + + Q+I PG + V
Sbjct: 115 LLLAGR----------------------RAFYR---KSLLLVQSISPGWLAAFIVALAAA 149
Query: 468 IFIQLYFILSSIWSSQTYYMFGF----------LFLVFLILI 499
++ + +S+ ++ F F L VF+ LI
Sbjct: 150 NWLGGFAYADVEYSNDLWWEFPFTGSAPRFLRALLGVFIALI 191
>gnl|CDD|220299 pfam09586, YfhO, Bacterial membrane protein YfhO. This protein is
a conserved membrane protein. The yfhO gene is
transcribed in Difco sporulation medium and the
transcription is affected by the YvrGHb two-component
system. Some members of this family have been annotated
as glycosyl transferases of the PMT family.
Length = 835
Score = 32.6 bits (75), Expect = 0.63
Identities = 21/114 (18%), Positives = 44/114 (38%), Gaps = 14/114 (12%)
Query: 461 GVLPFGCIFIQLYFILSSIWSSQTYYMFGFLFLVFLILIITCSETTILLCY-FHLCAEDY 519
G+LP + + LYF I + +F L L ++ L+ + F
Sbjct: 284 GLLPL--LLLILYFFNKLI-KLKEKLLFALLLLFLILSF--YLPPLDLVWHGFQAPVWFP 338
Query: 520 HWWWRSFITSGFTSIYLFIYCIHYFETRLNISDASSVFLYFGYTLIMVFLFFIM 573
+ + SF+ S F + L + + + + S+ L L+++F+ +
Sbjct: 339 YRF--SFLFS-FLILLLAAEALLHLKQLKKLKILISILL-----LVLLFILVFL 384
>gnl|CDD|220185 pfam09334, tRNA-synt_1g, tRNA synthetases class I (M). This family
includes methionyl tRNA synthetases.
Length = 388
Score = 32.2 bits (74), Expect = 0.76
Identities = 14/28 (50%), Positives = 15/28 (53%)
Query: 338 VMYVCLGTPAGYISARIYKSFGGEKWKS 365
V+YV GYISA Y S EKWK
Sbjct: 244 VIYVWFDALIGYISATKYLSGNPEKWKK 271
>gnl|CDD|219005 pfam06387, Calcyon, D1 dopamine receptor-interacting protein
(calcyon). This family consists of several D1 dopamine
receptor-interacting (calcyon) proteins. D1/D5 dopamine
receptors in the basal ganglia, hippocampus, and
cerebral cortex modulate motor, reward, and cognitive
behaviour. D1-like dopamine receptors likely modulate
neocortical and hippocampal neuronal excitability and
synaptic function via Ca(2+) as well as cAMP-dependent
signaling. Defective calcyon proteins have been
implicated in both attention-deficit/hyperactivity
disorder (ADHD) and schizophrenia.
Length = 186
Score = 30.7 bits (69), Expect = 1.4
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 8/70 (11%)
Query: 145 GSSFQGNGGRIIAVK--------ITPRSISHIEEPGKPPVCSSAQGELQPMAVPKGKLDK 196
G +F GG+ + ITP ++ ++ V Q E QP + KGK
Sbjct: 5 GCNFSEKGGKEPGDEDGFDTVPLITPLDVNQLQPSFPDKVVVKTQTEYQPTSADKGKKRA 64
Query: 197 PLQIKYTYSI 206
PL ++T
Sbjct: 65 PLIAEFTVCS 74
>gnl|CDD|223911 COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms].
Length = 1009
Score = 31.0 bits (71), Expect = 2.0
Identities = 19/100 (19%), Positives = 41/100 (41%), Gaps = 7/100 (7%)
Query: 235 FSILNSLMIVLFLSGMVAMIILRTLHKDIARYNQIDSGEDVQEEFGWKLVHGDVFRPPRK 294
++L SL++ L L+ + +L+ + + + D E+ + + V R
Sbjct: 465 VAMLLSLLVALTLTPALCARLLKPVPPPKGFFGWFNRFFDRLEDRYQRGLRW-VLRRRAL 523
Query: 295 GMLLSVLLGSGIQILMMTMVTLAFACLGFLSPANRGALMT 334
+LL ++L G +L + + FL ++G M
Sbjct: 524 VLLLYLVLLGGSVLLFLKLPK------SFLPEEDQGEFMV 557
>gnl|CDD|224214 COG1295, Rbn, Ribonuclease BN family enzyme [Replication,
recombination, and repair].
Length = 303
Score = 30.7 bits (70), Expect = 2.2
Identities = 36/201 (17%), Positives = 59/201 (29%), Gaps = 46/201 (22%)
Query: 381 LFFVMNLIFWAKDSS---AAVPFSTLLALVSLWVWVSMPLTF---IGSFFGFRKRAIEHP 434
L F D S AA+ + TLL+L L L + FF +E
Sbjct: 19 LLRRALERFNEDDGSLLAAALAYYTLLSLFPL-------LLVVFSLLGFFLAHPILLE-- 69
Query: 435 VRTNQIPRQIPEQSV-----YTQAIPGVIMGGVLPFGCIFI---------QLYFILSSIW 480
I +PEQ+ Y + G +L G + L L+ IW
Sbjct: 70 DLLFIIANLLPEQAADVIEDYLKNFLSQSRGSLLSLGLVVALWTASNGMSALRDALNKIW 129
Query: 481 SSQT------------YYMFG-FLFLVFLILIITCSETT----ILLCYFHLCAEDYHWWW 523
+ ++ G L L+F +L+ +L
Sbjct: 130 RVKPRRSFIRRRLLSLFWTLGTLLPLLFALLLSVVGPIALQALLLRFGLPFFDVLVLILL 189
Query: 524 RSFITSGFTSIYLFIYCIHYF 544
R + + L + ++ F
Sbjct: 190 RLRLLVSLLLLTLGFFLLYRF 210
>gnl|CDD|219819 pfam08395, 7tm_7, 7tm Chemosensory receptor. This family includes
a number of gustatory and odorant receptors mainly from
insect species such as A. gambiae and D. melanogaster.
They are classified as G-protein-coupled receptors
(GPCRs), or seven-transmembrane receptors. They show
high sequence divergence, consistent with an ancient
origin for the family.
Length = 370
Score = 30.5 bits (69), Expect = 2.4
Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 3/76 (3%)
Query: 469 FIQLYFILSSIWSSQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFIT 528
++L L + S + + F L L+ L+LII ++ +L Y +
Sbjct: 102 LLRLDRRLLRLGSRRNFRRFNRLLLLLLVLIILLLLVSL---LGYLLLSQYFLLVLLLLL 158
Query: 529 SGFTSIYLFIYCIHYF 544
+ L + + YF
Sbjct: 159 YLLPYLVLNLLILQYF 174
>gnl|CDD|217431 pfam03219, TLC, TLC ATP/ADP transporter.
Length = 491
Score = 30.7 bits (70), Expect = 2.7
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 453 AIPGVIMGGVLPFGCIFIQLYFILSSIWSSQT-YYMFGFLFLVFLIL 498
IP + + GVLP +F +Y LS+ S + +Y+ FLVF L
Sbjct: 59 VIPFLKVWGVLPSAILFTIIYAKLSNRLSREKVFYIIISFFLVFFAL 105
>gnl|CDD|218842 pfam05987, DUF898, Bacterial protein of unknown function (DUF898).
This family consists of several bacterial proteins of
unknown function. Some of the family members are
described as putative membrane proteins.
Length = 339
Score = 30.2 bits (69), Expect = 2.9
Identities = 18/82 (21%), Positives = 36/82 (43%), Gaps = 2/82 (2%)
Query: 276 QEEFGWKLVHGDVFRPPRKGMLLSVLLGSGIQILMMTMVT--LAFACLGFLSPANRGALM 333
F ++ GD ++ +L+++L + + +L+ A LG A GA++
Sbjct: 170 TLRFSFEATAGDFYKAYLLALLVALLFLAVLGLLLALAAGALAGAAPLGGFGLAAVGAII 229
Query: 334 TCAMVMYVCLGTPAGYISARIY 355
+V++V A Y AR+
Sbjct: 230 LGYLVLFVGYLVAAAYYRARLR 251
>gnl|CDD|217608 pfam03547, Mem_trans, Membrane transport protein. This family
includes auxin efflux carrier proteins and other
transporter proteins from all domains of life.
Length = 321
Score = 30.0 bits (68), Expect = 3.5
Identities = 33/207 (15%), Positives = 61/207 (29%), Gaps = 38/207 (18%)
Query: 239 NSLMIVLFLSGMVAMIILRTLHKDIARYNQIDSGEDVQEEFGW-----------KLVHGD 287
L + + +II + I D EE G ++
Sbjct: 121 EGLSYAIISVVLGVIIIWTLGYFLIESRGA---KRDKSEESGDTSGSMTLLILIVVLLKL 177
Query: 288 VFRPPRKGMLLSVLLG---------------------SGIQILMMTMVTLAFACLGFLSP 326
+ PP LL ++LG I M LG L
Sbjct: 178 ILNPPTYASLLGLILGLVGFLLPLIFPEFIQDSISILGDAAIPMALFSLGLTLALGKLKS 237
Query: 327 ANRGALMTCAMVMYVCLGTPAGYISARIYKSFGGEKWKSNIILTSTLVPGVVFCLFFVMN 386
+ A T +++ + L + G + + +P +
Sbjct: 238 SLGAATATIHLILRLILMPLVMLGIVLL---LGLRGLTLLVAILEAALPPAIVLGVIAQL 294
Query: 387 LIFWAKDSSAAVPFSTLLALVSLWVWV 413
+++S V ++TLLAL++L +W+
Sbjct: 295 YNVDEEEASTVVFWTTLLALLTLPLWI 321
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase. Cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues, is the major component of wood and
thus paper, and is synthesised by plants, most algae,
some bacteria and fungi, and even some animals. The
genes that synthesise cellulose in higher plants differ
greatly from the well-characterized genes found in
Acetobacter and Agrobacterium sp. More correctly
designated as 'cellulose synthase catalytic subunits',
plant cellulose synthase (CesA) proteins are integral
membrane proteins, approximately 1,000 amino acids in
length. There are a number of highly conserved residues,
including several motifs shown to be necessary for
processive glycosyltransferase activity.
Length = 716
Score = 30.1 bits (68), Expect = 3.6
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 7/59 (11%)
Query: 531 FTSIYLFIYC----IHYFETRLNISDASSVFLYFGYTLIMVFLFFIMTGMCEMLVSLVG 585
FTSI L YC I + + S+ + + +FL I TG+ E+ S V
Sbjct: 500 FTSIPLIAYCFLPAICLLTGKFIVPTLSNYASIY---FLSLFLSIIATGILELRWSGVS 555
>gnl|CDD|225764 COG3223, COG3223, Predicted membrane protein [Function unknown].
Length = 138
Score = 28.8 bits (65), Expect = 3.6
Identities = 24/86 (27%), Positives = 33/86 (38%), Gaps = 26/86 (30%)
Query: 490 FLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFITSGFTSIYLFIYCIHYFETRLN 549
FL+ F+ LI+ YF +H+ R FI G T+I RL
Sbjct: 66 FLYFEFIALIVK---------YFK---SGFHFPLRYFIYIGITAI-----------IRLI 102
Query: 550 ISD---ASSVFLYFGYTLIMVFLFFI 572
I D LY G L++V F+
Sbjct: 103 IVDHESPIDTLLYSGAILLLVIALFL 128
>gnl|CDD|225105 COG2194, COG2194, Predicted membrane-associated, metal-dependent
hydrolase [General function prediction only].
Length = 555
Score = 30.0 bits (68), Expect = 3.8
Identities = 25/120 (20%), Positives = 44/120 (36%), Gaps = 32/120 (26%)
Query: 468 IFIQLYFILSSIWSSQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWR--- 524
F+Q++ I S S + F FL + ++ +LL +
Sbjct: 31 FFLQVFLINSLDLS----FSFSFLLALVFAFLL------LLLLLSFP------RLLKPLA 74
Query: 525 --SFITSGFTSIYLFIYCIHY--------FETRLNISDASSVF-LYFGYTLIMVFLFFIM 573
+ S S + + Y I FET N +++S + LYF L++V L +
Sbjct: 75 GVLSLVSAAASYFAYFYGIIIDKNMLLNVFET--NTAESSELLTLYFLLWLVLVGLLPAL 132
>gnl|CDD|237523 PRK13825, PRK13825, conjugal transfer protein TraB; Provisional.
Length = 388
Score = 30.0 bits (68), Expect = 3.9
Identities = 35/155 (22%), Positives = 50/155 (32%), Gaps = 38/155 (24%)
Query: 290 RPPRKGMLLSVLLGSGI------QILMMTMVTLAFACLGFLSPAN-RGALMTCAMVMYVC 342
R L +L G+ I +++ + LAF L SP+ AL++ +
Sbjct: 2 RRDVLQALALILAGAAIGFLGWSGHVLLLPLALAFPVLWANSPSRLAAALVSAGYFLAAS 61
Query: 343 LGTPAGYISARIYKSFGGEKWKSNIILTSTLVPGVVF-----CLFFVMNLIFWAKDSSAA 397
G P G A + S L PG+ F +++ W A
Sbjct: 62 RGLPQG--VAAFF--------------GSDLWPGLALWLAASLSFVLVHAALWTAPRGRA 105
Query: 398 VPFSTLLALVSLW--------VWVSMPLTFIGSFF 424
LLA V L W PLT G F
Sbjct: 106 RALRYLLAAV-LMAVPPFGITGWAH-PLTAAGVLF 138
>gnl|CDD|116021 pfam07399, DUF1504, Protein of unknown function (DUF1504). This
family consists of several hypothetical bacterial
proteins of around 440 residues in length. The function
of this family is unknown.
Length = 438
Score = 29.8 bits (67), Expect = 4.7
Identities = 15/38 (39%), Positives = 18/38 (47%)
Query: 459 MGGVLPFGCIFIQLYFILSSIWSSQTYYMFGFLFLVFL 496
MG +P IFI L + I S GFLFL +L
Sbjct: 259 MGERVPSWIIFIHLVLVGCVILSRAPPLFMGFLFLFYL 296
>gnl|CDD|233122 TIGR00769, AAA, ADP/ATP carrier protein family. These proteins are
members of the ATP:ADP Antiporter (AAA) Family (TC
2.A.12), which consists of nucleotide transporters that
have 12 GES predicted transmembrane regions. One protein
from Rickettsia prowazekii functions to take up ATP from
the eukaryotic cell cytoplasm into the bacterium in
exchange for ADP. Five AAA family paralogues are encoded
within the genome of R. prowazekii. This organism
transports UMP and GMP but not CMP, and it seems likely
that one or more of the AAA family paralogues are
responsible. The genome of Chlamydia trachomatis encodes
two AAA family members, Npt1 and Npt2, which catalyse
ATP/ADP exchange and GTP, CTP, ATP and UTP uptake
probably employing a proton symport mechanism. Two
homologous adenylate translocators of Arabidopsis
thaliana are postulated to be localized to the
intracellular plastid membrane where they function as
ATP importers [Transport and binding proteins,
Nucleosides, purines and pyrimidines].
Length = 472
Score = 29.7 bits (67), Expect = 4.7
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 452 QAIPGVIMGGVLPFGCIFIQLYFILSSIWSSQT-YYMFGFLFLVFLIL 498
+ IP + V+P IF+ +Y LS+I S + +Y FL F L
Sbjct: 43 EIIPFLKTWVVVPMAVIFMLIYTKLSNILSKEALFYTVISPFLGFFAL 90
>gnl|CDD|182422 PRK10381, PRK10381, LPS O-antigen length regulator; Provisional.
Length = 377
Score = 29.6 bits (67), Expect = 4.8
Identities = 12/28 (42%), Positives = 16/28 (57%), Gaps = 3/28 (10%)
Query: 297 LLSVLLGSGIQILMMTMVTLAFACLGFL 324
L+SVL + I+ +T AFAC G L
Sbjct: 32 LISVLWKAKKTIIA---ITFAFACAGLL 56
>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
Length = 552
Score = 29.7 bits (67), Expect = 5.2
Identities = 13/83 (15%), Positives = 26/83 (31%), Gaps = 3/83 (3%)
Query: 468 IFIQLYFILSSIWSSQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFI 527
I L ILS + S + ++L+ LL + + + ++
Sbjct: 356 IAPALPLILSDLVSVFITRNPYLIPSYLILLLGVA--YLNLLGTWLTIGDLSQRRYGQYL 413
Query: 528 TSGFTSIYLFIYCIHYFETRLNI 550
+ I Y + Y L +
Sbjct: 414 SL-LLVICSLAYSLVYLNADLLV 435
>gnl|CDD|205005 pfam12668, DUF3791, Protein of unknown function (DUF3791). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 71 and 125 amino acids
in length.
Length = 62
Score = 26.7 bits (60), Expect = 6.1
Identities = 6/27 (22%), Positives = 11/27 (40%), Gaps = 1/27 (3%)
Query: 534 IYLFIYCIHYFETRLNISDASSVFLYF 560
+ I+CI + +L IS +
Sbjct: 1 LEFVIFCIEAYARKLGIS-GKEAYNLL 26
>gnl|CDD|220973 pfam11084, DUF2621, Protein of unknown function (DUF2621). This
family is conserved in the Bacillaceae family. Several
members are named as YneK. The function is not known.
Length = 141
Score = 28.1 bits (63), Expect = 6.4
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 404 LALVSLWVWVSMPLTFIGSFFGFRK 428
L + LWV + + L IG FF FRK
Sbjct: 7 LWFILLWVVILISLMAIGGFFMFRK 31
>gnl|CDD|226385 COG3867, COG3867, Arabinogalactan endo-1,4-beta-galactosidase
[Carbohydrate transport and metabolism].
Length = 403
Score = 29.1 bits (65), Expect = 7.0
Identities = 29/130 (22%), Positives = 44/130 (33%), Gaps = 28/130 (21%)
Query: 26 RIRPSPYKMNEDQTCTELCTKTYTPGNSDSEMKLALLKR----GM----DLNYYHHWI-- 75
R+ PY N + Y GN+D + + + KR GM D +Y W
Sbjct: 83 RVWNDPYDSNGNG---------YGGGNNDLKKAIEIAKRAKNLGMKVLLDFHYSDFWADP 133
Query: 76 -VDNMPVTWCYLTNDGQKYCNTGFPMGCYKHAYQDFCSSMIGEGSPNDMYYVFNHVDIRI 134
P W L + K Y + + ++M EG DM V N +
Sbjct: 134 AKQKKPKAWENLNFEQLK-------KAVYSYT-KYVLTTMKKEGILPDMVQVGNETNGGF 185
Query: 135 TYHSGEDEEW 144
+ GE +
Sbjct: 186 LWPDGEGRNF 195
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 28.8 bits (65), Expect = 7.3
Identities = 38/278 (13%), Positives = 87/278 (31%), Gaps = 49/278 (17%)
Query: 296 MLLSVLLGSGIQILMMTMVTLAFACLGFLSPANRGALMTCAMVMYVCLGTPAGYISARIY 355
+L SG+ +++ A LS + G +++ + Y AGY+S R
Sbjct: 3 LLFLGFFLSGLDRGLLSPALPLLAEDLGLSASQAGLIVSAFSLGYALGSLLAGYLSDRF- 61
Query: 356 KSFGGEKWKSNIILTSTLVPGVVFCLFFVMNLIFWAKDSSAAVPFSTLLALVSLWVWVSM 415
G + ++L L+ + L + ++ + LL L ++ +
Sbjct: 62 ----GRRR---VLLLGLLLFALGSLLLAFASSLWL------LLVGRFLLGLGGGALYPAA 108
Query: 416 PLTFIGSFFGFRKRAIEHPVRTNQIPRQIPEQSVYTQAIPGVIMGGVLPFGCIFIQLYFI 475
I +F ++R + G G + +
Sbjct: 109 A-ALIAEWFPPKERG-------------------RALGLFSAGFGLGALLGPLLG--GLL 146
Query: 476 LSSIWSSQTYYMFGFLFLVFLILIITCSETTILLCYFHLCAEDYHWWWRSFITSGFTSIY 535
S+ + + L L+ +L++ +LL ++ G+ +
Sbjct: 147 AESLGWRWLFLILAILGLLLALLLLFLLRLLLLLALAFFL-----------LSFGYYGLL 195
Query: 536 LFIYCIHYFETRLNISDASSVFLYFGYTLIMVFLFFIM 573
++ Y + L +S A + L + L + +
Sbjct: 196 TYL--PLYLQEVLGLSAAEAGLLLSLFGLGGILGALLG 231
>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
Length = 499
Score = 29.3 bits (65), Expect = 7.6
Identities = 10/56 (17%), Positives = 21/56 (37%)
Query: 142 EEWGSSFQGNGGRIIAVKITPRSISHIEEPGKPPVCSSAQGELQPMAVPKGKLDKP 197
+ + + + G + P + PP+ S Q +QP+ V + + P
Sbjct: 382 HHFSNPYSFDPGHAPFFRYAPYGAPKNDHHLLPPLACSQQLPMQPLHVQQAPMQAP 437
>gnl|CDD|216411 pfam01284, MARVEL, Membrane-associating domain. MARVEL
domain-containing proteins are often found in
lipid-associating proteins - such as Occludin and MAL
family proteins. It may be part of the machinery of
membrane apposition events, such as transport vesicle
biogenesis.
Length = 132
Score = 28.0 bits (63), Expect = 7.7
Identities = 8/80 (10%), Positives = 22/80 (27%), Gaps = 7/80 (8%)
Query: 493 LVFLILIITCSETTILLCYFHLCAEDYHWWWRSFITSGFTSIYLFIYCIHYFETRLNISD 552
LVF I+++ + I + F + F+ + + + Y
Sbjct: 14 LVFAIIVLGLIASLIAGSNPSAVN------FAVF-VAVFSFLIALYFLLLYLFGYSYFPS 66
Query: 553 ASSVFLYFGYTLIMVFLFFI 572
+ + + +
Sbjct: 67 IAWPLVDLILDALAALFWLA 86
>gnl|CDD|223469 COG0392, COG0392, Predicted integral membrane protein [Function
unknown].
Length = 322
Score = 28.9 bits (65), Expect = 7.7
Identities = 20/136 (14%), Positives = 53/136 (38%), Gaps = 8/136 (5%)
Query: 365 SNIILTSTLVPGVVFCLFFVMNLIFWAKDSSAAVPFSTLLALVSLWVWVSMPLTFIGSFF 424
S I+ L G+ L F++ L+F + + TLL + L + L
Sbjct: 129 SRILAFLLLTFGLGLLLLFLLILLFVPLKLALFLLLLTLLLRILLILL----LLAAAILL 184
Query: 425 GFRKRAIEHPVRTNQIPRQIPEQSVYTQAIPGVIMGGVLPFGCIFIQLYFILSSIWSSQT 484
+ ++ ++P + +++ ++ + + LY +L ++ + +
Sbjct: 185 ILTAFLRRREFKIGRLRLRLP---NRRGLLAQLLLS-LVEWLADALILYLLLLALGVNVS 240
Query: 485 YYMFGFLFLVFLILII 500
+++ FLV ++ +
Sbjct: 241 FFLVLLAFLVAVLAGV 256
>gnl|CDD|221487 pfam12250, AftA_N, Arabinofuranosyltransferase N terminal. This
domain family is found in bacteria, and is typically
between 430 and 441 amino acids in length. This family
is the N terminal region of AftA. The enzyme catalyzes
the addition of the first key arabinofuranosyl residue
from the sugar donor
beta-D-arabinofuranosyl-1-monophosphoryldecaprenol to
the galactan domain of the cell wall, thus priming the
galactan for further elaboration by the
arabinofuranosyltransferases. The N terminal region has
been predicted to span 11 transmembrane regions.
Length = 431
Score = 29.0 bits (65), Expect = 8.8
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 1/51 (1%)
Query: 378 VFCLFFVMNLIFWAKDSSAAVPFSTLLALVSLWVWVSMPLTFIGS-FFGFR 427
CL + L+ A+ S+ A + + V LW +SM +T G+ FR
Sbjct: 332 ALCLLGTIWLVMRARTSTRAGALAVGVVAVYLWSLLSMLVTLAGTTLLSFR 382
>gnl|CDD|236830 PRK11049, PRK11049, D-alanine/D-serine/glycine permease;
Provisional.
Length = 469
Score = 28.5 bits (64), Expect = 9.9
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 560 FGYTLIMVFLFFIMTGMCEMLVS 582
F Y +I LFF+M M E+L+S
Sbjct: 57 FVYMIIGFMLFFVMRAMGELLLS 79
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.327 0.140 0.453
Gapped
Lambda K H
0.267 0.0721 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,261,355
Number of extensions: 2996970
Number of successful extensions: 5570
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5378
Number of HSP's successfully gapped: 305
Length of query: 590
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 488
Effective length of database: 6,413,494
Effective search space: 3129785072
Effective search space used: 3129785072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 62 (27.7 bits)