RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6704
(332 letters)
>gnl|CDD|218266 pfam04790, Sarcoglycan_1, Sarcoglycan complex subunit protein. The
dystrophin glycoprotein complex (DGC) is a
membrane-spanning complex that links the interior
cytoskeleton to the extracellular matrix in muscle. The
sarcoglycan complex is a subcomplex within the DGC and
is composed of several muscle-specific, transmembrane
proteins (alpha-, beta-, gamma-, delta- and
zeta-sarcoglycan). The sarcoglycans are
asparagine-linked glycosylated proteins with single
transmembrane domains. This family contains beta, gamma
and delta members.
Length = 264
Score = 152 bits (385), Expect = 4e-44
Identities = 78/269 (28%), Positives = 119/269 (44%), Gaps = 27/269 (10%)
Query: 54 TGLHDGKSFTFWALIGLLYLFALINLVLTLTLMTMLRIGW-GMETIEMLPLLNMVKLYGD 112
G++ + + + LL L A++NL LTL ++ ++R G GM +E+ + G+
Sbjct: 4 VGIYGWRKRCLYTFVLLLLLLAVVNLALTLWILKVMRFGPDGMGNLEITE-DGLRLFEGE 62
Query: 113 ID-LGKLYKDDGYFSSFKDSGLRITGRQGGSVHIDVNFGVNRTRRMLSIEPEGVRVTNVK 171
D L LY +D L I + +V+ G + LS+ +G+ K
Sbjct: 63 SDFLQPLYA--KEIHGRRDEPLLIQSNRNVTVNARNRNGNVTNK--LSVGKDGIVEAATK 118
Query: 172 EFNVYDPTDHLPIFSTGFRSADFG-------LPRGVKKLDVQQVRTSRIISPIEDDLTLR 224
F V DP D +FS G P G L V T RI SP DL L
Sbjct: 119 GFEVKDPVDGKLLFSADRDEVVVGAERLRVTGPEGA--LFEHSVETPRIRSPPNKDLRLE 176
Query: 225 SETYT-RLKGNEGISMEGRT--ITWTADKDIFLKSVNGSLTLAAENGIFLEVKKLPFVKK 281
S T + + EG++++ + I +TA DI L+S +GS+ L A + L +LP
Sbjct: 177 SPTRSLSMDAPEGVNIDAKAGNIEFTALTDINLRSKDGSIVLDAS-SVML--NRLPISSG 233
Query: 282 NLFLDSNLHNAAFKLCVCMPGGRIFRVKA 310
+S +KLCVC P G++F V A
Sbjct: 234 ----ESGSRQGQYKLCVC-PDGKLFLVAA 257
>gnl|CDD|220574 pfam10106, DUF2345, Uncharacterized protein conserved in bacteria
(DUF2345). Members of this family are found in various
bacterial hypothetical proteins, as well as Rhs element
Vgr proteins.
Length = 156
Score = 30.6 bits (70), Expect = 0.56
Identities = 8/31 (25%), Positives = 14/31 (45%)
Query: 243 TITWTADKDIFLKSVNGSLTLAAENGIFLEV 273
+ A KD+ + S G + + A+ I L
Sbjct: 92 DLELLAKKDVTITSTEGRIEITAKKEIVLNA 122
>gnl|CDD|220912 pfam10947, DUF2628, Protein of unknown function (DUF2628). Some
members in this family of proteins are annotated as yigF
however currently no function is known.
Length = 100
Score = 29.2 bits (66), Expect = 0.78
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 56 LHDGKSFT------FWALIGLLYLFALINLVLTLTL---MTMLRIGWGMETIEMLPL 103
+ DG S+ W L L+L AL L + + L + +L IG G+ + +L +
Sbjct: 26 VRDGFSWAAFFFGPLWLLYRRLWLLALAVLAVGVALAVILLLLGIGPGLALLLLLLI 82
>gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated.
Length = 159
Score = 29.3 bits (66), Expect = 1.8
Identities = 8/15 (53%), Positives = 11/15 (73%)
Query: 63 TFWALIGLLYLFALI 77
TFWA +GL+ AL+
Sbjct: 4 TFWAFVGLVLFLALV 18
>gnl|CDD|222634 pfam14262, DUF4353, Domain of unknown function (DUF4353). This
family is found in bacteria and archaea, and is
typically between 262 and 279 amino acids in length.
Length = 264
Score = 28.3 bits (64), Expect = 5.5
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 6/39 (15%)
Query: 219 DDLTLRSETYT------RLKGNEGISMEGRTITWTADKD 251
DDL + TYT L+G + + + G T+T TA D
Sbjct: 121 DDLVITGGTYTVTAVDDGLRGKDSVRIAGGTLTITAGGD 159
>gnl|CDD|182391 PRK10341, PRK10341, DNA-binding transcriptional activator TdcA;
Provisional.
Length = 312
Score = 28.3 bits (63), Expect = 5.7
Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 179 TDHLPIFSTGFRSADFGLPRGVKKLDVQQVRTSRIISPIED 219
T HL +F RS G K+L + Q S+II+ IED
Sbjct: 9 TQHLVVFQEVIRSGSIG--SAAKELGLTQPAVSKIINDIED 47
>gnl|CDD|234171 TIGR03340, phn_DUF6, phosphonate utilization associated putative
membrane protein. This family of hydrophobic proteins
has some homology to families of integral membrane
proteins such as (pfam00892) and may be a permease. It
occurs in the vicinity of various types of operons for
the catabolism of phosphonates in Vibrio, Pseudomonas,
Polaromonas and Thiomicrospira.
Length = 281
Score = 28.2 bits (63), Expect = 6.7
Identities = 13/69 (18%), Positives = 23/69 (33%), Gaps = 10/69 (14%)
Query: 59 GKSFTFWALIG--------LLYLFALINLVLTLTLMTMLRIGWGMETIEMLPLLNMVKLY 110
F +WAL+ L+ A + +L M+ L + + Y
Sbjct: 27 EPDFLWWALLAHSVLLTPYGLWYLAQVGWSRLPATFWLLLAISA--VANMVYFLGLAQAY 84
Query: 111 GDIDLGKLY 119
D+G +Y
Sbjct: 85 HHADVGLVY 93
>gnl|CDD|106271 PRK13311, PRK13311, N-acetyl-D-glucosamine kinase; Provisional.
Length = 256
Score = 28.1 bits (62), Expect = 6.8
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 9/63 (14%)
Query: 188 GFRSADFGLPRGVKKLDVQQVRTSRIISPIED---------DLTLRSETYTRLKGNEGIS 238
GF + GV ++Q++ R+ +P ED DLT ++TY ++G+ GI
Sbjct: 4 GFDMGGTKIELGVFDENLQRIWHKRVPTPREDYPQLLQILRDLTEEADTYCGVQGSVGIG 63
Query: 239 MEG 241
+ G
Sbjct: 64 IPG 66
>gnl|CDD|181283 PRK08197, PRK08197, threonine synthase; Validated.
Length = 394
Score = 28.0 bits (63), Expect = 7.8
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 100 MLPLLNMVKLYGDIDLGKLY-KDDGYF--SSFKDSGL 133
M PLL + +L + +G+L+ KD+G SFK GL
Sbjct: 79 MTPLLPLPRLGKALGIGRLWVKDEGLNPTGSFKARGL 115
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.140 0.418
Gapped
Lambda K H
0.267 0.0796 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,229,829
Number of extensions: 1698087
Number of successful extensions: 1424
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1416
Number of HSP's successfully gapped: 20
Length of query: 332
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 235
Effective length of database: 6,635,264
Effective search space: 1559287040
Effective search space used: 1559287040
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (26.4 bits)