RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3590
(258 letters)
>gnl|CDD|215648 pfam00003, 7tm_3, 7 transmembrane sweet-taste receptor of 3 GCPR.
This is a domain of seven transmembrane regions that
forms the C-terminus of some subclass 3
G-coupled-protein receptors. It is often associated with
a downstream cysteine-rich linker domain, NCD3G
pfam07562, which is the human sweet-taste receptor, and
the N-terminal domain, ANF_receptor pfam01094. The seven
TM regions assemble in such a way as to produce a
docking pocket into which such molecules as cyclamate
and lactisole have been found to bind and consequently
confer the taste of sweetness.
Length = 238
Score = 58.8 bits (143), Expect = 2e-10
Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 3/76 (3%)
Query: 1 MFGAFLAWETRHVSIPALNDSKYVGMSVYNVVLMCIMGAAISFVISDKQDASFVIISLFI 60
+ FLA++ R + N++K++ S+Y ++ I + + K +++ I
Sbjct: 163 LLCFFLAFKARKLP-ENFNEAKFITFSMYLFCIIWIAFIPLYLSTNGK--YRDAVLAFAI 219
Query: 61 LFCTTATLCLVFVPKI 76
L T L +F+PK
Sbjct: 220 LASATGLLLCIFIPKC 235
>gnl|CDD|222300 pfam13664, DUF4149, Domain of unknown function (DUF4149).
Length = 98
Score = 31.4 bits (72), Expect = 0.13
Identities = 10/56 (17%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 27 SVYN-VVLMCIMGAAISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKR 81
VY + L + ++ ++ + + ++ L +L T P+I LK
Sbjct: 39 PVYFLLGLALAVLLLLTELLLGGKAERWQLLLLAVLLLLTLLNLFYLTPRITALKD 94
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 32.7 bits (74), Expect = 0.18
Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 3/46 (6%)
Query: 146 ALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNNH 191
P PH P G PPGK S P P T + +LL ++H
Sbjct: 660 GYPFQSPHLGAPSGSPPGKDRDS---PDLPRPTTSLHPKLLSAHHH 702
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 32.6 bits (74), Expect = 0.21
Identities = 27/108 (25%), Positives = 39/108 (36%), Gaps = 14/108 (12%)
Query: 146 ALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNNHGIL---SEDPFYYW 202
+LP P PP PPG PS PP P + V KR + G + S +P Y
Sbjct: 3 SLPPGNP---PPPPPPPGFEPPSQPPPPPPPGVNVKKRSRKQLSIVGDILGHSGNPIYSL 59
Query: 203 KALCHALKHLKRLLQSQHY--------EMMLKQTNPVMVRPATHYQVD 242
+ +K + E + K +P P+ +VD
Sbjct: 60 RVSDKPVKLGNKAKTLHVLTLKAPIPPEHLRKIQSPCSDMPSVLTKVD 107
>gnl|CDD|222477 pfam13965, SID-1_RNA_chan, dsRNA-gated channel SID-1. This is a
family of proteins that are transmembrane dsRNA-gated
channels. They passively transport dsRNA into cells and
do not act as ATP-dependent pumps. They are required for
systemic RNA interference.
Length = 567
Score = 32.4 bits (74), Expect = 0.26
Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 17/110 (15%)
Query: 41 ISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKRNPQGSIDKRIRATLRPMSK 100
I+F ++ + S+ + +L ++F + CL+F+ ++ + RN + + R + P
Sbjct: 103 ITFNVTILESISYFVPTLAMVF-FLLSFCLLFIVFVIHIFRNRR-DLRSRDIISFEPSPS 160
Query: 101 TRR-----------DSSELEEKLKDAKMSNMRHRKILAERENELQALLSR 139
+R DSSE E + A M ++ E+ + + LSR
Sbjct: 161 EQRAMDLQPDQSEEDSSERENDILMADQQIM----VIREKASLYVSDLSR 206
>gnl|CDD|187828 cd09697, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as Csx8 family.
Length = 441
Score = 30.3 bits (68), Expect = 0.94
Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 102 RRDSSELEEKLKDAKMSNMRHRKILAERENEL----QALLS 138
R E++E LK AK K + N+L Q LLS
Sbjct: 352 RNGGKEMDENLKKAKACAKAVVKKFIKELNKLASYKQKLLS 392
>gnl|CDD|233970 TIGR02670, cas_csx8, CRISPR-associated protein Cas8a1/Csx8, subtype
I. In three genomes so far, a member of this protein
appears in the midst of a CRISPR-associated (cas) gene
operon, immediately upstream of a member of family
TIGR01875 (CRISPR-associated autoregulator, DevR
family). The genomes so far are Nocardia farcinica
IFM10152, Clostridium perfringens SM101, and Clostridium
tetani E88 [Mobile and extrachromosomal element
functions, Other].
Length = 441
Score = 30.3 bits (68), Expect = 0.94
Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 102 RRDSSELEEKLKDAKMSNMRHRKILAERENEL----QALLS 138
R E++E LK AK K + N+L Q LLS
Sbjct: 352 RNGGKEMDENLKKAKACAKAVVKKFIKENNKLASYKQKLLS 392
>gnl|CDD|220985 pfam11109, RFamide_26RFa, Orexigenic neuropeptide Qrfp/P518.
Qrfp/P518 has a direct role in maintaining bone mineral
density. Qrfp has also found to be important in energy
homeostasis by regulating appetite and energy
expenditure in mice. The c-terminal 28 residues are the
functional 26RFa.
Length = 132
Score = 29.1 bits (65), Expect = 1.0
Identities = 12/27 (44%), Positives = 15/27 (55%)
Query: 159 GQPPGKSLPSALKPPQPEALTVVKREL 185
G+P P + PQP+AL V REL
Sbjct: 45 GRPFPWGSPGWPRAPQPQALLVTAREL 71
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 30.2 bits (68), Expect = 1.1
Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 8/58 (13%)
Query: 96 RPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENEL--------QALLSRLGSESL 145
+ K+ R+ ELEE L+ K + R +L + E E QALL L + +L
Sbjct: 135 KNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTL 192
>gnl|CDD|233165 TIGR00879, SP, MFS transporter, sugar porter (SP) family. This
model represent the sugar porter subfamily of the major
facilitator superfamily (pfam00083) [Transport and
binding proteins, Carbohydrates, organic alcohols, and
acids].
Length = 481
Score = 29.2 bits (66), Expect = 2.4
Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 6/65 (9%)
Query: 3 GAFLAWETRHVSIPALNDSKYV----GMSVYNVVLMCIMGAAISFVISDK--QDASFVII 56
G LA + N Y G+ V ++ +GA + +SD+ + S +II
Sbjct: 48 GGALALPAFEFKFTSANSDSYSSSLWGLVVSIFLVGGFIGALFAGWLSDRFGRKKSLLII 107
Query: 57 SLFIL 61
+L +
Sbjct: 108 ALLFV 112
>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
Length = 574
Score = 29.2 bits (65), Expect = 2.5
Identities = 28/104 (26%), Positives = 43/104 (41%), Gaps = 23/104 (22%)
Query: 81 RNPQGSIDKRIRATLRPMSKTRR---------DSSELEEKLKDAKMSNMRHRKILAEREN 131
R + S ++RIR L PM+ R D S +EE ++ N K+ +E+ +
Sbjct: 249 RRQESSYNQRIRELLEPMTGPGRVNPEVSVDMDFSVVEEA---RELYNGEPAKLRSEQVS 305
Query: 132 ELQALLSRLGSESLALPSSGPHGKT--PPGQPPGKSLPSALKPP 173
+ + + A GP G T PGQPP + A P
Sbjct: 306 D---------TSTSATGPQGPPGATSNSPGQPPAPAAAGAPGTP 340
>gnl|CDD|218632 pfam05549, Allexi_40kDa, Allexivirus 40kDa protein.
Length = 271
Score = 29.0 bits (65), Expect = 2.6
Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 84 QGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKI--LAERENELQALLSRLG 141
QG+ R+R LR +++T +E+K++ + + + + L E L+AL ++L
Sbjct: 76 QGTPLSRLRDALRTLTRTVDSIYSVEQKIELSTQTPSSSKLLKKLTSIETSLEALHAKLD 135
Query: 142 SESLAL------PSSGPHGKTPPGQPPGKSLP 167
+ +L PS P + +SLP
Sbjct: 136 ELTSSLTSTLAGPSVSPTPASSHPTASDRSLP 167
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 29.3 bits (66), Expect = 2.8
Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 1/70 (1%)
Query: 75 KIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
I E +R + + ++R+ + K + ELE ++++ + + + AE + EL+
Sbjct: 309 SIAEKERELEDA-EERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367
Query: 135 ALLSRLGSES 144
L + L
Sbjct: 368 DLRAELEEVD 377
Score = 27.3 bits (61), Expect = 9.9
Identities = 18/74 (24%), Positives = 23/74 (31%), Gaps = 18/74 (24%)
Query: 78 ELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALL 137
E R +DK T R + + EKL+ K NEL+ L
Sbjct: 367 EDLRAELEEVDKEFAET-------RDELKDYREKLEKLK-----------REINELKREL 408
Query: 138 SRLGSESLALPSSG 151
RL E L
Sbjct: 409 DRLQEELQRLSEEL 422
>gnl|CDD|240543 cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic
compound extrusion (MATE)-like proteins similar to
Bacillus subtilis yoeA. The integral membrane proteins
from the MATE family are involved in exporting
metabolites across the cell membrane and are responsible
for multidrug resistance (MDR) in many bacteria and
animals. A number of family members are involved in the
synthesis of peptidoglycan components in bacteria.
Length = 431
Score = 29.0 bits (66), Expect = 3.1
Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 12/77 (15%)
Query: 24 VGMSVYNVVLMCIMG--AAISFVIS------DKQDASFVIISLFILFCTTA----TLCLV 71
G + ++ MG + +IS D++ A + +L +LF L L+
Sbjct: 44 NGSPIIFLLTAIGMGLSMGGTILISQYIGAKDEEKAKKAVGTLLVLFLILGIVLTILGLL 103
Query: 72 FVPKIVELKRNPQGSID 88
F P I+ L P ++
Sbjct: 104 FAPPILRLMNTPAEILE 120
>gnl|CDD|112645 pfam03841, SelA, L-seryl-tRNA selenium transferase.
Length = 367
Score = 28.4 bits (64), Expect = 3.6
Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 4/104 (3%)
Query: 76 IVELKRNPQGS---IDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENE 132
I LK+NP +DK A L + + +L EK+ ++ + K L +
Sbjct: 230 IERLKKNPLKRALRVDKLTLAALEATLRLYLNPEKLYEKIPTLRLLT-QPLKELRAKAER 288
Query: 133 LQALLSRLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPE 176
LQ LL + + G P K LPSA +PE
Sbjct: 289 LQKLLKARLGSGAEVSVEPSFSQIGGGSLPVKELPSAAVTIRPE 332
>gnl|CDD|219262 pfam07001, BAT2_N, BAT2 N-terminus. This family represents the
N-terminus (approximately 200 residues) of the
proline-rich protein BAT2. BAT2 is similar to other
proteins with large proline-rich domains, such as some
nuclear proteins, collagens, elastin, and synapsin.
Length = 189
Score = 28.0 bits (62), Expect = 4.1
Identities = 13/50 (26%), Positives = 16/50 (32%), Gaps = 1/50 (2%)
Query: 144 SLALPSSGPHGKTPPG-QPPGKSLPSALKPPQPEALTVVKRELLVPNNHG 192
S ALP P + P Q S P +P P K + G
Sbjct: 99 SDALPPQTPESQPPAVSQTSASSPPPPPQPATPLVPGGAKSWAVASAKPG 148
>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
Length = 857
Score = 28.7 bits (64), Expect = 4.2
Identities = 24/79 (30%), Positives = 32/79 (40%)
Query: 79 LKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLS 138
L + + KR+ +S R SELEE+ K K S + I AE E A+
Sbjct: 429 LMKESDEASKKRLDMLNEELSDKERQYSELEEEWKAEKASLSGTQTIKAELEQAKIAIEQ 488
Query: 139 RLGSESLALPSSGPHGKTP 157
LA S +GK P
Sbjct: 489 ARRVGDLARMSELQYGKIP 507
>gnl|CDD|236242 PRK08326, PRK08326, ribonucleotide-diphosphate reductase subunit
beta; Validated.
Length = 311
Score = 28.0 bits (63), Expect = 4.2
Identities = 18/59 (30%), Positives = 25/59 (42%), Gaps = 11/59 (18%)
Query: 166 LPSAL----KPPQPEALTVVKRELLVPNNH---GILSEDPFYYWKALCHALKHLKRLLQ 217
LP+AL P PE V NH G+L+E +Y W+ +C L L +
Sbjct: 146 LPAALNRLSTDPSPENQV----RASVTYNHVVEGVLAETGYYAWRKICVTRGILPGLQE 200
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 28.5 bits (63), Expect = 4.3
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 75 KIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
KI EL R + I++ R + +R E E+ + A+ + + K RE +LQ
Sbjct: 282 KIEELARGLRADIERVAREN----ADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQ 337
Query: 135 ALLSR 139
A +R
Sbjct: 338 AECAR 342
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 28.4 bits (63), Expect = 4.4
Identities = 20/115 (17%), Positives = 43/115 (37%), Gaps = 8/115 (6%)
Query: 74 PKIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENEL 133
+I + I+ ++ + + R + +L +L+ ++ S M ++ L E E
Sbjct: 648 TQIGHAQAAAHNHIEHHVQKLESDLKQLRAEREQLVAQLEKSQQSLMEFQQELNAAEAER 707
Query: 134 QALLS---RLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKREL 185
+ L L E+L S L ++ K E + +++RE
Sbjct: 708 RKHLEENRELKQEALLAAIS-----EKDANIALLELSASKKKKTQEEVALLRREK 757
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 28.3 bits (63), Expect = 4.4
Identities = 14/86 (16%), Positives = 32/86 (37%), Gaps = 18/86 (20%)
Query: 105 SSELEEKLKD-----AKMSNMRHRKILAERENELQALLSRLGSESLALPSSG-------- 151
+++++++ ++ ++ + R R +L + L + AL ++
Sbjct: 78 AAQMQKQYEEIRRELDVLN--KQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPV 135
Query: 152 PHGKTPPGQPPGKSLP---SALKPPQ 174
P P P G+ P PPQ
Sbjct: 136 PQMPASPPGPEGEPQPGNTPVSFPPQ 161
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 28.0 bits (62), Expect = 4.6
Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 6/58 (10%)
Query: 91 IRATL------RPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLSRLGS 142
+RA L + D EL+EKL++ + K L E E E + + RL
Sbjct: 117 VRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKR 174
>gnl|CDD|178323 PLN02721, PLN02721, threonine aldolase.
Length = 353
Score = 28.1 bits (63), Expect = 4.7
Identities = 9/48 (18%), Positives = 21/48 (43%), Gaps = 9/48 (18%)
Query: 194 LSEDPFYYWKALCHALKHLKRLLQSQHYEMMLKQTNPVMVRPATHYQV 241
+++ + LC +L + + ++L N +R TH+Q+
Sbjct: 296 ITDGSRITAEKLCKSL---------EEHGVLLMPGNSSRIRVVTHHQI 334
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 27.6 bits (62), Expect = 4.8
Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 9/71 (12%)
Query: 75 KIVELKRNPQGS------IDKRIRATLRPMSKTRRDSSELEEKLKDA---KMSNMRHRKI 125
+I E+K+N + + I + + + P+ K + EL +KLKD K S +
Sbjct: 35 EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKAR 94
Query: 126 LAERENELQAL 136
L E E EL+ L
Sbjct: 95 LKELEKELKNL 105
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.1 bits (63), Expect = 4.8
Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 3/44 (6%)
Query: 104 DSSELE---EKLKDAKMSNMRHRKILAERENELQALLSRLGSES 144
D +EL+ EKLK M K L E E ELQ L S++ +
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLT 245
>gnl|CDD|178056 PLN02437, PLN02437, ribonucleoside--diphosphate reductase large
subunit.
Length = 813
Score = 28.2 bits (63), Expect = 4.9
Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 24/82 (29%)
Query: 63 CTTATLCLVFVPKIVELKRNPQGSIDKRIRATLRPMSKTRR-DSSELEE----------K 111
C A++ L P+ V K P S ++ +L SK R D +L E K
Sbjct: 444 CNLASIAL---PRFVREKGVPLESHPSKLVGSLG--SKNRYFDFEKLAEVTAVVTRNLNK 498
Query: 112 LKD--------AKMSNMRHRKI 125
+ D A+ SNMRHR I
Sbjct: 499 IIDVNYYPVETARRSNMRHRPI 520
>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor. Mcm10 is a eukaryotic
DNA replication factor that regulates the stability and
chromatin association of DNA polymerase alpha.
Length = 346
Score = 27.9 bits (62), Expect = 5.5
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 80 KRNPQGSIDKRIRATLRP-MSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQ 134
++N + ++ P M K R SE +K+ AK S H +L E E E Q
Sbjct: 175 EKNASSAKRQKSPEDEEPAMKKRRYLESEEFQKILAAKSS---HTGLLEEAEREAQ 227
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 26.1 bits (58), Expect = 5.5
Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 12/66 (18%)
Query: 78 ELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMS------NMRHRKILAEREN 131
ELK+N K R R +++ L+ +L K ++ L + E
Sbjct: 18 ELKKN------KAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQKLKKLEK 71
Query: 132 ELQALL 137
+L A+
Sbjct: 72 DLDAVK 77
>gnl|CDD|235893 PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase;
Provisional.
Length = 397
Score = 27.9 bits (63), Expect = 5.8
Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 6/33 (18%)
Query: 99 SKTRR------DSSELEEKLKDAKMSNMRHRKI 125
K +R D ++LE +LK+AK + RH+ I
Sbjct: 146 CKAKRYRYANNDMADLEAQLKEAKEAGARHKLI 178
>gnl|CDD|227331 COG4998, COG4998, Predicted endonuclease (RecB family) [DNA
replication, recombination, and repair].
Length = 209
Score = 27.7 bits (61), Expect = 6.1
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 5/81 (6%)
Query: 71 VFVPKIVELKRNPQGSIDKRIRATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERE 130
V V +L + ++D+ + + + +S E E LK M K AE+
Sbjct: 105 VVVLSTDDLYEAVEEALDRAVTSLVEVLSHCEELEIEALEVLKA-----MAKAKSFAEKA 159
Query: 131 NELQALLSRLGSESLALPSSG 151
++L + L + L +G
Sbjct: 160 SKLGTAVEELAARVDRLKRAG 180
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 27.8 bits (62), Expect = 6.7
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 11/60 (18%)
Query: 88 DKRIRATLRPMSKTRRDSSELE----------EKLKDAKMSNMRHRKILAERENELQALL 137
+R R + +T + LE EKL+ R++++ AE EL+ L
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR-ELELAL 229
>gnl|CDD|225833 COG3296, COG3296, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 143
Score = 26.8 bits (59), Expect = 6.8
Identities = 17/104 (16%), Positives = 40/104 (38%), Gaps = 9/104 (8%)
Query: 1 MFGAFLAWETRHVSIPALNDSK------YVGMSVYNVVLMCIMGAAISFVISDKQDASFV 54
+ G + W + S P ++ + ++Y++V ++ A + +D +
Sbjct: 38 LLGPLIVWLLKKDSSPFVDAQGKEALNFQISYTIYSIVSFVLILAGVFLAATDISFIIII 97
Query: 55 --IISLFILFCTTATLCLVFVPKIVELKRNPQGSIDKRIRATLR 96
++L + A L V I+ + +G + R T+R
Sbjct: 98 GFFLTLIGILSIFALSILSLVLTIIAAIKAYEG-QEYRYPLTIR 140
>gnl|CDD|233146 TIGR00837, araaP, aromatic amino acid transport protein. The
Hydroxy/Aromatic Amino Acid Permease (HAAAP) Family-
tyrosine/tryptophan subfamily (TC 2.A.42.1) The HAAAP
family includes well characterized aromatic amino
acid:H+ symport permeases and hydroxy amino acid
permeases. This subfamily is specific for aromatic amino
acid transporters and includes the tyrosine permease,
TyrP, of E. coli, and the tryptophan transporters TnaB
and Mtr of E. coli [Transport and binding proteins,
Amino acids, peptides and amines].
Length = 381
Score = 27.7 bits (62), Expect = 6.9
Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 12 HVSIPALNDSKYVGMSVYNVVLMCIMGAAISFVI 45
H ++P+L KY +V V ++G+AI+ V+
Sbjct: 194 HGNVPSLY--KYYDGNVKKVKKSILIGSAIALVL 225
>gnl|CDD|212662 cd07779, FGGY_ygcE_like, uncharacterized ygcE-like proteins. This
subfamily consists of uncharacterized hypothetical
bacterial proteins with similarity to Escherichia coli
sugar kinase ygcE , whose functional roles are not yet
clear. Escherichia coli ygcE is recognized by this
model, but is not present in the alignment as it
contains a deletion relative to other members of the
group. These proteins belong to the FGGY family of
carbohydrate kinases, the monomers of which contain two
large domains, which are separated by a deep cleft that
forms the active site. This model includes both the
N-terminal domain, which adopts a ribonuclease H-like
fold, and the structurally related C-terminal domain.
Length = 488
Score = 27.6 bits (62), Expect = 7.1
Identities = 10/18 (55%), Positives = 12/18 (66%)
Query: 197 DPFYYWKALCHALKHLKR 214
DP YYW+ALC A + L
Sbjct: 45 DPDYYWQALCEACRELWE 62
>gnl|CDD|153304 cd07620, BAR_SH3BP1, The Bin/Amphiphysin/Rvs (BAR) domain of
SH3-domain Binding Protein 1. BAR domains are
dimerization, lipid binding and curvature sensing
modules found in many different proteins with diverse
functions. SH3-domain binding protein 1 (SH3BP1 or
3BP-1) is a Rac GTPase activating protein that inhibits
Rac-mediated platelet-derived growth factor
(PDGF)-induced membrane ruffling. SH3BP1 contains an
N-terminal BAR domain followed by a GAP domain for Rho
and Rac GTPases and a C-terminal proline-rich domain.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions.
Length = 257
Score = 27.2 bits (60), Expect = 7.4
Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 12/99 (12%)
Query: 92 RATLRPMSKTRRDSSELEEKLKDAKMSNMRHRKILAERENELQALLSRLGSESLALPSSG 151
+ L+P++K + +L E LK ++K A+ + + SR + P SG
Sbjct: 106 KDVLQPLNKLSEE--DLPEILK--------NKKQFAKLTTDWNSAKSRSPQAAGRSPRSG 155
Query: 152 PHGKTPPGQPPGKSLPSALKPPQPEALTVVKRELLVPNN 190
+ G+ G + +P + E R+L +
Sbjct: 156 GRSEE-VGEHQGIRRANKGEPLKEE-EEECWRKLEQCKD 192
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 27.2 bits (61), Expect = 7.6
Identities = 10/68 (14%), Positives = 26/68 (38%), Gaps = 9/68 (13%)
Query: 29 YNVVLMCIMGAAISFVISDKQDASFVIISLFILFCTTATLCLVFVPKIVELKRNPQGSID 88
+++C A ++ +F I +L + L +V++ I+ K+ Q +
Sbjct: 129 DETIIVC----APPLALNGT---AFRIFTLSSVIINVIVL-IVYIILIIIFKKKGQTTNS 180
Query: 89 KRIRATLR 96
+ +
Sbjct: 181 -DSKKVFK 187
>gnl|CDD|215560 PLN03076, PLN03076, ARF guanine nucleotide exchange factor
(ARF-GEF); Provisional.
Length = 1780
Score = 27.9 bits (62), Expect = 8.0
Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 9/80 (11%)
Query: 108 LEEKLKDAKMSNMRHRKILAERENELQALLSRLGSESLALPSSGPHGKTPPGQPPGKSLP 167
LE+ +K+A S +H K+ +E +A++ RL S PS+ SLP
Sbjct: 18 LEKIIKNA--SWRKHSKLA----HECKAVIERLNSPEKNPPSTSSAAADSA---SASSLP 68
Query: 168 SALKPPQPEALTVVKRELLV 187
L ++ + EL++
Sbjct: 69 GPLHDGGSIEYSLAESELIL 88
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 26.9 bits (59), Expect = 8.4
Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 8/40 (20%)
Query: 137 LSRLGSESLALPSSGPHGKTPPGQPPGKSLPSALKPPQPE 176
LS GS P+SGP G PG P+ PP+P
Sbjct: 165 LSGPGSR----PTSGP----VAGGLPGGGAPAPPTPPRPG 196
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 27.2 bits (59), Expect = 9.4
Identities = 10/37 (27%), Positives = 17/37 (45%)
Query: 148 PSSGPHGKTPPGQPPGKSLPSALKPPQPEALTVVKRE 184
P+ K P PGK+ P+A + + EA + +
Sbjct: 61 PAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEEES 97
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 27.4 bits (61), Expect = 9.9
Identities = 22/78 (28%), Positives = 33/78 (42%), Gaps = 7/78 (8%)
Query: 75 KIVELKRNPQGSIDKRIR----ATLRPMSKTRRDS-SELEEKLKDAKMSNMRHRKILAER 129
++ ELK + R R LR K + ELEE+L + ++ ++ E
Sbjct: 567 RLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE-ELLQSLELS-EA 624
Query: 130 ENELQALLSRLGSESLAL 147
ENEL+ L SE L
Sbjct: 625 ENELEEAEEELESELEKL 642
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.134 0.406
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,309,305
Number of extensions: 1269325
Number of successful extensions: 1767
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1762
Number of HSP's successfully gapped: 76
Length of query: 258
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 163
Effective length of database: 6,723,972
Effective search space: 1096007436
Effective search space used: 1096007436
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.1 bits)