RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8340
(173 letters)
>gnl|CDD|227522 COG5195, COG5195, Uncharacterized conserved protein [Function
unknown].
Length = 118
Score = 73.5 bits (180), Expect = 2e-17
Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 13/125 (10%)
Query: 44 EDINSQNQMQEKLAPKYIFKNPEYEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPE 103
+ N+ + KL + FK Y+ +++K+L+Q++
Sbjct: 2 NNNNNNDGGDMKLLKRPWFKKSTYKG------------KNRRFKTLRQLIPRLTIENESS 49
Query: 104 DTIHYSSIVAPPSFKPSLKYSDISRFECNYKDPQTKLYYYNIEEYKLIRKLPSDIVNGYL 163
Y SI APPS KP +KY D++ Y P T L Y+N E YKLI +LPS YL
Sbjct: 50 KH-RYLSIEAPPSVKPRMKYCDVTGLPAPYTCPNTGLRYHNSEIYKLICELPSGRDQEYL 108
Query: 164 ALRGA 168
LR
Sbjct: 109 KLREF 113
>gnl|CDD|191983 pfam08265, YL1_C, YL1 nuclear protein C-terminal domain. This
domain is found in proteins of the YL1 family. These
proteins have been shown to be DNA-binding and may be a
transcription factor. This domain is found in proteins
that are not YL1 proteins.
Length = 30
Score = 50.5 bits (122), Expect = 1e-09
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 122 KYSDISRFECNYKDPQTKLYYYNIEEYKLI 151
KY DI+ YKDP+T L Y N+E +K+I
Sbjct: 1 KYCDITGLPAKYKDPKTGLPYSNVEAFKVI 30
>gnl|CDD|198061 smart00993, YL1_C, YL1 nuclear protein C-terminal domain. This
domain is found in proteins of the YL1 family. These
proteins have been shown to be DNA-binding and may be a
transcription factor. This domain is found in proteins
that are not YL1 proteins.
Length = 30
Score = 48.3 bits (116), Expect = 9e-09
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 122 KYSDISRFECNYKDPQTKLYYYNIEEYKLI 151
KY DI+ Y+DP+T L Y N E YK+I
Sbjct: 1 KYCDITGLPARYRDPKTGLRYANAEAYKVI 30
>gnl|CDD|182071 PRK09777, fecD, iron-dicitrate transporter subunit FecD; Reviewed.
Length = 318
Score = 30.8 bits (70), Expect = 0.27
Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 6/38 (15%)
Query: 142 YYNIEEYKLIRKLPSDIVNGYLALRGAL------NPLA 173
YY + EY+L R L + V LA+ G L NPLA
Sbjct: 44 YYVLMEYRLPRLLLALFVGAALAVSGVLVQGIVRNPLA 81
>gnl|CDD|215862 pfam00331, Glyco_hydro_10, Glycosyl hydrolase family 10.
Length = 308
Score = 30.3 bits (69), Expect = 0.49
Identities = 13/30 (43%), Positives = 15/30 (50%), Gaps = 10/30 (33%)
Query: 135 DPQTKLYY--YNIEE--------YKLIRKL 154
DP KLYY YNIE Y L++ L
Sbjct: 160 DPNAKLYYNDYNIEGPNAKREAIYNLVKDL 189
>gnl|CDD|214750 smart00633, Glyco_10, Glycosyl hydrolase family 10.
Length = 263
Score = 29.9 bits (68), Expect = 0.58
Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 10/30 (33%)
Query: 135 DPQTKLYY--YNIEE--------YKLIRKL 154
DP KL+Y YN EE Y+L++KL
Sbjct: 120 DPDAKLFYNDYNTEEPNAKRQAIYELVKKL 149
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains
are found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of
helix C. Although the domain occurs in ultiple repeats
along sequences, the domains are actually stable on
their own - ie they act, biophysically, like domains
rather than repeats that along function when
aggregated.
Length = 105
Score = 26.1 bits (58), Expect = 5.2
Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 11/90 (12%)
Query: 5 INAW---KEEALYTLKDPTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAPKYI 61
+ +W KE L + +L ++Q K + +E ++ Q+ + + N++ EKL +
Sbjct: 13 LESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGH 72
Query: 62 FKNPEYEQRKLSRRSTSSNLPTKQWKSLKQ 91
+ + E ++R L ++W+ L +
Sbjct: 73 YASEEIQERL-------EEL-NERWEQLLE 94
>gnl|CDD|222499 pfam14022, DUF4238, Protein of unknown function (DUF4238). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 274 and 374 amino acids in length.
Length = 268
Score = 27.0 bits (60), Expect = 5.9
Identities = 9/49 (18%), Positives = 20/49 (40%)
Query: 18 DPTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAPKYIFKNPE 66
P LL + + + + ++ N +Q K A +Y++ N +
Sbjct: 214 SPKLLLVLSPPRDDASREILILILDEAEVREINSLQAKNAERYVYSNSK 262
>gnl|CDD|173023 PRK14558, pyrH, uridylate kinase; Provisional.
Length = 231
Score = 26.6 bits (59), Expect = 6.0
Identities = 11/30 (36%), Positives = 14/30 (46%), Gaps = 2/30 (6%)
Query: 105 TIHYSSIVAPPSFKPSLKYSDI-SRFECNY 133
+ S IV PS +P + Y DI F Y
Sbjct: 91 AVIVSQIVNLPSVEP-INYDDIELYFRAGY 119
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 26.3 bits (58), Expect = 7.7
Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 3/74 (4%)
Query: 2 VLRINAWKEEALYTLKD---PTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAP 58
+ W EE L +L ++++ K + +E + + + S N++ E+L
Sbjct: 115 ADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLE 174
Query: 59 KYIFKNPEYEQRKL 72
+ E + KL
Sbjct: 175 EGHPDADEEIEEKL 188
>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1. Members of this protein
family are the major adhesin of the Mycoplasma terminal
organelle. The protein is called adhesin P1, cytadhesin
P1, P140, attachment protein, and MgPa, with locus names
MG191 in Mycoplasma genitalium and MPN141 in M.
pneumoniae. A conserved C-terminal region is shared by
additional paralogs in M. pneumoniae and M.
gallisepticum, as well as by the member of this family
[Cell envelope, Surface structures, Cellular processes,
Pathogenesis].
Length = 1425
Score = 26.7 bits (58), Expect = 8.2
Identities = 12/65 (18%), Positives = 24/65 (36%)
Query: 46 INSQNQMQEKLAPKYIFKNPEYEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPEDT 105
S +Q P Y NP+ + S + P + K I Y + + +
Sbjct: 607 AWSSGALQSSGNPYYFHNNPDSQDMPNSNGVEAIIKPWEDKDDAKYIYPYRYSGSYAWQS 666
Query: 106 IHYSS 110
+++S+
Sbjct: 667 VNWSN 671
>gnl|CDD|165152 PHA02789, PHA02789, uncharacterized protein; Provisional.
Length = 173
Score = 26.1 bits (57), Expect = 9.0
Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 120 SLKYSDISRF-ECNYKDPQT---KLYYYNIEEYKLIRK 153
S Y D+ F EC+Y D + + Y+YN EEY I K
Sbjct: 90 SSPYEDLEYFYECDYTDNKPTFDQFYFYNGEEYTEIDK 127
>gnl|CDD|226956 COG4590, COG4590, ABC-type uncharacterized transport system,
permease component [General function prediction only].
Length = 733
Score = 26.4 bits (58), Expect = 9.0
Identities = 14/60 (23%), Positives = 22/60 (36%), Gaps = 3/60 (5%)
Query: 67 YEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPEDTIHYSSIVAPPSFKPSLKYSDI 126
+Q K+ + P W +L Q + YE +PE + S A F+ I
Sbjct: 375 EDQGKIRLAQLENRNPEISWSALWQKVWYES---YPEPAYVWQSTAATDDFEAKFSLVPI 431
>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
Length = 746
Score = 26.4 bits (58), Expect = 10.0
Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 8/41 (19%)
Query: 27 KFFS---KGEGIEITMSQSQEDINSQNQMQEKLAPKYIFKN 64
K+F+ +GEG E+ Q D+N + ++K A K I N
Sbjct: 24 KYFAQTRRGEGAEL-----QNDLNGTDSYRKKAAVKRIIAN 59
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.131 0.379
Gapped
Lambda K H
0.267 0.0829 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,843,077
Number of extensions: 778201
Number of successful extensions: 459
Number of sequences better than 10.0: 1
Number of HSP's gapped: 459
Number of HSP's successfully gapped: 29
Length of query: 173
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 83
Effective length of database: 6,945,742
Effective search space: 576496586
Effective search space used: 576496586
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.8 bits)