RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy929
(483 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 173 bits (441), Expect = 4e-51
Identities = 64/182 (35%), Positives = 91/182 (50%), Gaps = 5/182 (2%)
Query: 93 IAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVL 152
+ + +KLR P N L++LAV+DL + + P AL Y V G W FG +C L V+
Sbjct: 1 LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60
Query: 153 SCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVLGND 212
+ ASIL L IS+DRY AI PL Y RTP+R + ++ VW+ A +SLPPL+
Sbjct: 61 NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLR 120
Query: 213 YREDNGVSFCSVC-----QNYGYQIYATLGSFYIPLTVMITVYYKIFCAARKIVAEEKRA 267
E+ V+ C + Y + +TL F +PL V++ Y I RK
Sbjct: 121 TVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRARSGASQ 180
Query: 268 QS 269
Sbjct: 181 AR 182
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 74.0 bits (182), Expect = 2e-14
Identities = 51/197 (25%), Positives = 93/197 (47%), Gaps = 11/197 (5%)
Query: 71 LIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
I+ ++ I +VGNI+V I V K++ P + L++LAVSDL + ++ +P + Y
Sbjct: 41 TILIVVYSTIFFFGLVGNIIV-IYVLTKTKIKTPMDIYLLNLAVSDL-LFVMTLPFQIYY 98
Query: 131 EVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILC 190
+ +W+FG C + + S+ + ++SVDRY AI P++ T K +
Sbjct: 99 YILFQWSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIV 158
Query: 191 VVFVWLGAACISLPPLIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSF-------YIPL 243
+ +W+ + + P L V ++D+ C + N + +F IPL
Sbjct: 159 SLVIWIISIIETTPILFV--YTTKKDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPL 216
Query: 244 TVMITVYYKIFCAARKI 260
T+++ Y KI + I
Sbjct: 217 TILLYCYSKILITLKGI 233
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 50.4 bits (120), Expect = 1e-06
Identities = 45/185 (24%), Positives = 91/185 (49%), Gaps = 4/185 (2%)
Query: 20 PQTHSPEVCMDSFSPHNISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLI 79
P + SP +S S + S S+T D + Y + + + Y P + I + +I
Sbjct: 49 PTSESPSPNSNSSSSSSSSSSSITYDYEYENNITYELINIKNKCMY-PSISEYIKIFYII 107
Query: 80 IIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWNFG 139
I I+ + GN + I + +K++ + + +LA+SDL + ++ P ++Y +W FG
Sbjct: 108 IFILGLFGNAAI-IMILFCKKIKTITDIYIFNLAISDL-IFVIDFPF-IIYNEFDQWIFG 164
Query: 140 FMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAA 199
MC + + + +++ + ++S+DRY+AI P+ + RT I+ + W+ +
Sbjct: 165 DFMCKVISASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSL 224
Query: 200 CISLP 204
I+ P
Sbjct: 225 IITSP 229
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 49.5 bits (118), Expect = 2e-06
Identities = 46/187 (24%), Positives = 88/187 (47%), Gaps = 19/187 (10%)
Query: 78 LIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWN 137
+++ I ++GN+LV IAV IV++ + L ++A+SDL + + P +++ +W
Sbjct: 36 ILLFIFGLIGNVLV-IAVLIVKRFMFVVDVYLFNIAMSDLML-VFSFPF-IIHNDLNEWI 92
Query: 138 FGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLG 197
FG MC L + + +++ + +IS+DRY + + K ++L V W+
Sbjct: 93 FGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSISLSVLLSVA-AWVC 151
Query: 198 AACISLPPLIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSFY----------IPLTVMI 247
+ +S+P +++ Y DN C Y + +F+ IPL ++I
Sbjct: 152 SVILSMPAMVL----YYVDN-TDNLKQCIFNDYHENFSWSAFFNFEINIFGIVIPLIILI 206
Query: 248 TVYYKIF 254
Y KI
Sbjct: 207 YCYSKIL 213
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 45.7 bits (109), Expect = 2e-05
Identities = 32/152 (21%), Positives = 60/152 (39%), Gaps = 23/152 (15%)
Query: 81 IIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDL--CVAILVMPMALLYEVSGKWNF 138
++ I GN+++ I +KLR C+YL+ ++DL ++ + L N
Sbjct: 1 SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFTGTQLTRNE 60
Query: 139 GFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGA 198
F++ + + TA + MI +DR A+ P+ Y + L + +
Sbjct: 61 CFLI----IIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFY--RLLSSSKYLFIQLI---- 110
Query: 199 ACISLPPLI------VLGNDYREDNGVSFCSV 224
P+I V G R+D + C+
Sbjct: 111 -----FPVIYSSFILVYGFLERDDETIIVCAP 137
>gnl|CDD|223017 PHA03235, PHA03235, DNA packaging protein UL33; Provisional.
Length = 409
Score = 38.6 bits (90), Expect = 0.005
Identities = 26/129 (20%), Positives = 50/129 (38%), Gaps = 13/129 (10%)
Query: 153 SCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLI---VL 209
SCT + +I+ DRY I + + + W + S P + V+
Sbjct: 115 SCTVGFATVALIAADRYRVIHQR-TRARSSAYRSTYKILGLTWFASLICSGPAPVYTTVV 173
Query: 210 GNDYREDNGVSFCSVCQNYGY-QIYATLGSFYIPLT--------VMITVYYKIFCAARKI 260
+D + + + + Q+ L +F + LT VM+T +Y F K
Sbjct: 174 AHDDVDPEAPGYETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKR 233
Query: 261 VAEEKRAQS 269
+ +KR+++
Sbjct: 234 ASYKKRSRT 242
>gnl|CDD|217323 pfam03023, MVIN, MviN-like protein. Deletion of the mviN virulence
gene in Salmonella enterica serovar. Typhimurium greatly
reduces virulence in a mouse model of typhoid-like
disease. Open reading frames encoding homologues of MviN
have since been identified in a variety of bacteria,
including pathogens and non-pathogens and
plant-symbionts. In the nitrogen-fixing symbiont
Rhizobium tropici, mviN is required for motility. The
MviM protein is predicted to be membrane-associated.
Length = 452
Score = 36.1 bits (84), Expect = 0.040
Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 9/93 (9%)
Query: 72 IIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYE 131
+ L++ ++++VT++G + A ++R L + SL VS + + P LL
Sbjct: 59 VSTLLIGVLLLVTLLGILA---APWVIRLLAPGFDAETFSLTVS---LLRITFPYLLLV- 111
Query: 132 VSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
+ N F + + +LN+ +I
Sbjct: 112 --SLSAVFGAVLNARKKFFAPAFSPVLLNIGVI 142
>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 34.9 bits (81), Expect = 0.061
Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 24/208 (11%)
Query: 78 LIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSD----LCVAILVMPMALLYEVS 133
+I + I+ N L+ + L+ L + A+S+ V+PM LL
Sbjct: 1 FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60
Query: 134 GK-----WNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMI 188
G ++ L+ S L +IS++R+ A+ P +Y + K
Sbjct: 61 LPELLNSHIGGLILLGLY--------EISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTK 112
Query: 189 LCVVFVWLGAACISLPPLIVLGNDYR-EDNGVSF------CSVCQNYGYQIYATLGSFYI 241
+ + F+W+ A IS LG + +++ + L I
Sbjct: 113 IIITFIWIIAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVII 172
Query: 242 PLTVMITVYYKIFCAARKIVAEEKRAQS 269
+ + + K+ ++K + ++
Sbjct: 173 TNIINVITFIKLRKFSKKSSLSSEESKK 200
>gnl|CDD|177562 PHA03234, PHA03234, DNA packaging protein UL33; Provisional.
Length = 338
Score = 32.6 bits (74), Expect = 0.39
Identities = 45/205 (21%), Positives = 91/205 (44%), Gaps = 14/205 (6%)
Query: 68 LQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPC-NYLLVSLAVSDLCVAILVMPM 126
L++ I G++L +II + I I++C + + + ++ L++L SD + V +
Sbjct: 34 LESAINGIMLTLIIPMII---IVICTLIIYHKVAKHNATSFYLITLFASDFLHMLCVFFL 90
Query: 127 ALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKR 186
L E +N F C L++ SC+ SI L +I+ RY + + + K
Sbjct: 91 TLNREALFNFNQAFCQCVLFIYH--ASCSYSICMLAIIATIRYKTLHRRKKNDKKNNHIG 148
Query: 187 MILCVVFVWLGAACISLPP-LIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSFYIPL-- 243
+ ++F L +A ++P L V + + G + Y ++ + + +
Sbjct: 149 RNIGILF--LASAMCAIPAALFVKTEGKKGNYGKCNIHISSKKAYDLFIAIKIVFCFIWG 206
Query: 244 ---TVMITVYYKIFCAARKIVAEEK 265
T++ + +Y IFC A + E+K
Sbjct: 207 IFPTMIFSFFYVIFCKALHALTEKK 231
>gnl|CDD|223800 COG0728, MviN, Uncharacterized membrane protein, putative virulence
factor [General function prediction only].
Length = 518
Score = 32.6 bits (75), Expect = 0.43
Identities = 22/96 (22%), Positives = 43/96 (44%), Gaps = 10/96 (10%)
Query: 70 ALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRP-CNYLLVSLAVSDLCVAILVMPMAL 128
L+ GL+ L++++VT++G + +VR L P + LAV + ++ P L
Sbjct: 91 RLVTGLLTLVLLLVTLLGILFAPW---LVRLLLAPGFDETDKFLAVL---LTRILFPYLL 144
Query: 129 LYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
+S + N F + + +LN+ +I
Sbjct: 145 FISLSA---LFGAILNSRNRFFIPAFAPVLLNVSVI 177
>gnl|CDD|226122 COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases
[Carbohydrate transport and metabolism].
Length = 343
Score = 32.3 bits (74), Expect = 0.49
Identities = 15/54 (27%), Positives = 22/54 (40%)
Query: 51 FHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRP 104
F + + S L L I LI+ +III + L + V +LRR
Sbjct: 282 FKVLRLQFGILNLSTSYLAFLPILLIVSLIIIAVLSRVFLKTLLYLPVIELRRK 335
>gnl|CDD|216273 pfam01061, ABC2_membrane, ABC-2 type transporter.
Length = 210
Score = 31.5 bits (72), Expect = 0.61
Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 16/102 (15%)
Query: 63 SPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAIL 122
SP A ++ IL+ + I + I + I Y +V L VS + +L
Sbjct: 88 SPLYSPSAYVLAKILVELPISLLQAIIFLLIV------------YFMVGLPVSRFFLFLL 135
Query: 123 VMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
V+ + L G + L SF+ S +L L ++
Sbjct: 136 VLLLTALAASG----LGLFIAALAPSFEDASQIGPLLLLPLL 173
>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III.
This is the very C-terminal region of neural adhesion
molecule L1 proteins that are also known as Bravo or
NrCAM. It lies upstream of the IG and Fn3 domains and
has the highly conserved motif FIGEY. The function is
not known.
Length = 118
Score = 30.4 bits (69), Expect = 0.70
Identities = 5/24 (20%), Positives = 14/24 (58%)
Query: 79 IIIIVTIVGNILVCIAVCIVRKLR 102
I ++ I +L+ + VC +++ +
Sbjct: 9 IGLMCAIALLLLILLIVCFIKRNK 32
>gnl|CDD|216260 pfam01040, UbiA, UbiA prenyltransferase family.
Length = 259
Score = 31.3 bits (71), Expect = 0.80
Identities = 34/156 (21%), Positives = 64/156 (41%), Gaps = 18/156 (11%)
Query: 66 SPLQALIIGLILLII--IIVTIVG-NILVCIAVCIV--------RKLRRPCNYLLVSLAV 114
SP +AL++GL LL++ ++ ++G +L+ + ++ +L+R L+ V
Sbjct: 68 SPREALLLGLALLLLGLLLALLLGPLLLLLGLLGLLLGLLYSPPLRLKR---RPLLGELV 124
Query: 115 SDLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAI-T 173
L +L++ A V G ++ L V L+ ++ + DR I T
Sbjct: 125 VGLAFGLLILLGAYA--VGGDIPSPLLLLALPVFLLSLA-ILLTNDIRDVEGDRKAGIRT 181
Query: 174 KPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVL 209
P+ G KR L + L + L +L
Sbjct: 182 LPVRLGRKRALALYALLLAAALLLLLLLLLLLAPLL 217
>gnl|CDD|220495 pfam09971, DUF2206, Predicted membrane protein (DUF2206). This
domain, found in various hypothetical archaeal proteins,
has no known function.
Length = 367
Score = 31.6 bits (72), Expect = 0.91
Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 10/95 (10%)
Query: 36 NISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAV 95
I T +P ++ L + S L L+ LI L I + ++G +
Sbjct: 85 KILSSIFTESLNP----TNSQGLELLTTSSSFLH-LLAKLINLFIQGLIVIGLL-----Y 134
Query: 96 CIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
+ + +RR L LA S L +AILV + L +
Sbjct: 135 VLYKLIRRRSKKSLEYLAFSFLFLAILVASIVLPF 169
>gnl|CDD|225154 COG2245, COG2245, Predicted membrane protein [Function unknown].
Length = 182
Score = 30.4 bits (69), Expect = 1.4
Identities = 27/139 (19%), Positives = 57/139 (41%), Gaps = 14/139 (10%)
Query: 73 IGLILLII--------IIVTIVGNILVCIAVCIVRKL---RRPCNYLLVSLAVSDLCVAI 121
IG IL +I I+++VG +LV +A+ ++ K R N L+++ + + +
Sbjct: 12 IGAILQLIGFFAGWPLWILSLVGLVLVLVALYMISKQVGEDRIFNNYLIAVVLGLVGFVV 71
Query: 122 LVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVK 181
++ A V+ + + + S A + L ++ + Y K LE +
Sbjct: 72 VIFATAGFSLVALLLGTFMLPAHGL--SALGSFLAGFVILWILYIISAYFQKKALEALAQ 129
Query: 182 RTPKRMI-LCVVFVWLGAA 199
T + + ++GA
Sbjct: 130 YTGVDLFRTAGLLYFIGAI 148
>gnl|CDD|221239 pfam11812, DUF3333, Domain of unknown function (DUF3333). This
family of proteins are functionally uncharacterized.
This family is only found in bacteria. This presumed
domain is typically between 116 to 159 amino acids in
length.
Length = 155
Score = 29.9 bits (68), Expect = 1.7
Identities = 9/19 (47%), Positives = 15/19 (78%)
Query: 70 ALIIGLILLIIIIVTIVGN 88
A++IGL L+I++ +IV N
Sbjct: 22 AILIGLAFLVILLGSIVSN 40
>gnl|CDD|198415 cd10433, YccA_like, YccA-like proteins. A prokaryotic member of
the BAX inhibitor (BI)-1 like family of small
transmembrane proteins, Escherichia coli YccA, has been
shown to interact with ATP-dependent protease FtsH,
which degrades abnormal membrane proteins as part of a
quality control mechanism to keep the integrity of
biological membranes.
Length = 205
Score = 30.2 bits (69), Expect = 1.7
Identities = 25/100 (25%), Positives = 41/100 (41%), Gaps = 26/100 (26%)
Query: 71 LIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
L +G I ++++ V NI + I +SLA+S +L +LY
Sbjct: 125 LFVGAI---VLLLAAVANIFLQIPA--------------LSLAIS--AAFVLFSSGFILY 165
Query: 131 EVS-----GKWNFGFMMCNLWVSFDVLSCTASILNLCMIS 165
+ S G+ N+ +L+VS L S+LNL
Sbjct: 166 DTSRIIHGGETNYIMATVSLYVSIYNL--FVSLLNLLGAF 203
>gnl|CDD|226685 COG4232, COG4232, Thiol:disulfide interchange protein
[Posttranslational modification, protein turnover,
chaperones / Energy production and conversion].
Length = 569
Score = 30.8 bits (70), Expect = 1.8
Identities = 18/94 (19%), Positives = 31/94 (32%), Gaps = 15/94 (15%)
Query: 60 LSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCV 119
+ +S L A + GL+L V + +L + + R L+
Sbjct: 163 AATLKWSLLLAFLGGLLLNFTPCVLPMLPLLS--GIVLGSAKRASKARAFG-LSF----- 214
Query: 120 AILVMPMALLYEVSG------KWNFGFMMCNLWV 147
+ V MAL Y + G + + WV
Sbjct: 215 -VYVQGMALAYTLLGLVAAAAGLGWQAQLQQPWV 247
>gnl|CDD|204597 pfam11151, DUF2929, Protein of unknown function (DUF2929). This
family of proteins with unknown function appears to be
restricted to Firmicutes.
Length = 57
Score = 27.6 bits (62), Expect = 2.2
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 57 LGLLSHSPYSPLQALIIGLILLIIIIVTI 85
+ L+ Y+ L A+II +I I++ + I
Sbjct: 21 VSSLTGGTYNFLTAIIIAVIFGILVFLLI 49
>gnl|CDD|100079 cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2
protease (S2P), zinc metalloproteases (MEROPS family
M50) which cleave transmembrane domains of substrate
proteins, regulating intramembrane proteolysis (RIP) of
diverse signal transduction mechanisms. Members of the
S2P/M50 family of RIP proteases use proteolytic activity
within the membrane to transfer information across
membranes to integrate gene expression with physiologic
stresses occurring in another cellular compartment. In
eukaryotic cells they regulate such processes as sterol
and lipid metabolism, and endoplasmic reticulum stress
responses. In prokaryotes they regulate such processes
as sporulation, cell division, stress response, and cell
differentiation. This group includes bacterial,
eukaryotic, and Archaeal S2P/M50s homologs with a
minimal core protein and no PDZ domains.
Length = 181
Score = 29.1 bits (66), Expect = 3.2
Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 16/99 (16%)
Query: 72 IIGLILLIIIIVTIVGNILVCIA--VCIV-RKLRRPCN-YLLVSLA--VSDLCVAIL-VM 124
IG I+L +++ + G A V + R + P LLVSLA +S+L +A+L +
Sbjct: 47 PIGTIILPLLLPFLFG-----WAKPVPVNPRNFKNPRRGMLLVSLAGPLSNLLLALLFAL 101
Query: 125 PMALLYEVSGKWN-FGFMMCNLWVSFDVLSCTASILNLC 162
+ LL G F F+M V +++ ++ NL
Sbjct: 102 LLRLLPAFGGVVASFLFLMLAYGV---LINLVLAVFNLL 137
>gnl|CDD|215647 pfam00002, 7tm_2, 7 transmembrane receptor (Secretin family). This
family is known as Family B, the secretin-receptor
family or family 2 of the G-protein-coupled receptors
(GCPRs).They have been described in many animal species,
but not in plants, fungi or prokaryotes. Three distinct
sub-families are recognised. Subfamily B1 contains
classical hormone receptors, such as receptors for
secretin and glucagon, that are all involved in
cAMP-mediated signalling pathways. Subfamily B2 contains
receptors with long extracellular N-termini, such as the
leukocyte cell-surface antigen CD97 ;
calcium-independent receptors for latrotoxin, and
brain-specific angiogenesis inhibitors amongst others.
Subfamily B3 includes Methuselah and other Drosophila
proteins. Other than the typical seven-transmembrane
region, characteristic structural features include an
amino-terminal extracellular domain involved in ligand
binding, and an intracellular loop (IC3) required for
specific G-protein coupling.
Length = 239
Score = 29.1 bits (66), Expect = 3.9
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 82 IVTIVGNI--LVCIAVCIV-----RKLRRPCNYLLVSLAVSDLCVAILVM 124
++ VG LV + + I RKLR NY+ ++L S + AIL +
Sbjct: 6 VIYTVGYSLSLVALLLAIAIFLLFRKLRCTRNYIHLNLFASFILRAILFL 55
>gnl|CDD|219227 pfam06912, DUF1275, Protein of unknown function (DUF1275). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown although a few members are
thought to be membrane proteins.
Length = 206
Score = 28.3 bits (64), Expect = 6.4
Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 53 NYTRLGL-LSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVS 111
N LG+ L+ ++ L L+ ++ ++G L + + ++R+ R L+
Sbjct: 33 NTVLLGVALAGGNWALA------LHKLLALLAFVLGAALAGLLIRLLRRRRLRRWLALLL 86
Query: 112 LAVSDLCVAILVMP 125
L + L +A ++P
Sbjct: 87 LLEAVLLLAAALLP 100
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 28.8 bits (65), Expect = 6.4
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 73 IGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMA 127
GL+L + + I+G +L + + +L R LL+ L + + +L++ +A
Sbjct: 213 AGLLLSLFGLGGILGALLGGL---LSDRLGRRRLLLLIGLLL--AALGLLLLALA 262
>gnl|CDD|215733 pfam00122, E1-E2_ATPase, E1-E2 ATPase.
Length = 222
Score = 28.3 bits (64), Expect = 7.8
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 14/77 (18%)
Query: 61 SHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRR--PCNYLLVSLAVSDLC 118
+ S +PLQ L+ L +++ IV + + I V ++ R LL +LAV L
Sbjct: 133 AKSAKTPLQRLLDKLAKILVPIV-----LALAILVFLIWFFRGGDFLEALLRALAV--LV 185
Query: 119 VAI-----LVMPMALLY 130
A L +P+AL
Sbjct: 186 AACPEALPLAVPLALAV 202
>gnl|CDD|219559 pfam07760, DUF1616, Protein of unknown function (DUF1616). This is
a family of sequences from hypothetical archaeal
proteins. The region in question is approximately 330
amino acid residues long.
Length = 290
Score = 28.5 bits (64), Expect = 9.1
Identities = 14/59 (23%), Positives = 21/59 (35%)
Query: 37 ISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAV 95
+ R L ND L + + L+LLII I+ +VG + I
Sbjct: 106 LRRRRLPNDERFIVLFRALDSALKIGLYLPNSYSDNVLLVLLIISILALVGLLGYVILF 164
>gnl|CDD|217547 pfam03419, Peptidase_U4, Sporulation factor SpoIIGA.
Length = 291
Score = 28.3 bits (64), Expect = 9.3
Identities = 25/163 (15%), Positives = 52/163 (31%), Gaps = 36/163 (22%)
Query: 70 ALIIGLILLIIII------VTIVGNILV-CIAVCIVRKLRRPCNYL-------LVSLAVS 115
AL+ L +L++ I + +L + V I +R +L V+
Sbjct: 40 ALLGSLYVLLLFIPFLPIFFHPLFKLLFSVLMVLIAFGPKRFRFFLKTLLIFYFVTFLFG 99
Query: 116 DLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNL---CMISVDRYYAI 172
A+ G ++ N + S + I Y
Sbjct: 100 GGLFALHYF-----------TGTGSLVQNGIFGDPI-----SWKFVLIGFPIL---YIFS 140
Query: 173 TKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVLGNDYRE 215
+ EY +R + ++ V + + +++ L+ GN R+
Sbjct: 141 KRRWEYIKRRKIQYELIVEVEIVINGKKLNVKGLVDTGNQLRD 183
>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
hydrolase [General function prediction only].
Length = 700
Score = 28.5 bits (64), Expect = 9.3
Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 20/118 (16%)
Query: 56 RLGLLSHS-PYSPLQALIIGLILLIIIIV---------------TIVGNILVCIAVCIVR 99
LGLLS S PL L+I +I LI++ ++ + + I +
Sbjct: 264 LLGLLSLSVNILPLLGLLILVIFLILLFALYERRTKSPLKLRNSLLLLYLSLAILTLSLL 323
Query: 100 KLRRPCNYLLVSLAV-SDLCVAILVMPMALLYEVSGKWNF---GFMMCNLWVSFDVLS 153
++ NY L V L +L++ + L + + + ++
Sbjct: 324 RIVGYFNYSASGLLVPPALGPMLLILLVFLRIAIFSSSMIAIALLYLFGGSYNSEIAL 381
>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515). Family
of hypothetical Archaeal proteins.
Length = 416
Score = 28.4 bits (63), Expect = 9.7
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 69 QALIIGLILLIIIIVTIVGNI 89
+ LIIG ++L +II+ +V I
Sbjct: 30 KRLIIGALVLSVIILLLVFAI 50
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.325 0.137 0.431
Gapped
Lambda K H
0.267 0.0635 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,212,103
Number of extensions: 2320532
Number of successful extensions: 3766
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3734
Number of HSP's successfully gapped: 73
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.5 bits)