RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11815
(141 letters)
>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.
Cadherins are glycoproteins involved in Ca2+-mediated
cell-cell adhesion. The cadherin repeat domains occur
as tandem repeats in the extracellular regions, which
are thought to mediate cell-cell contact when bound to
calcium. They play numerous roles in cell fate,
signalling, proliferation, differentiation, and
migration; members include E-, N-, P-, T-, VE-, CNR-,
proto-, and FAT-family cadherin, desmocollin, and
desmoglein, a large variety of domain architectures
with varying repeat copy numbers. Cadherin-repeat
containing proteins exist as monomers, homodimers, or
heterodimers.
Length = 98
Score = 49.2 bits (118), Expect = 1e-08
Identities = 23/98 (23%), Positives = 37/98 (37%), Gaps = 24/98 (24%)
Query: 6 VYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNID------------ 53
Y + EN ++ ++A+D D + ++Y I +GN + F+ID
Sbjct: 1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPL 60
Query: 54 ------------IGSDLTGGPDQVYLIVYIQVQNVNDN 79
+D G P V I V +VNDN
Sbjct: 61 DREEQSSYTLTVTATDGGGPPLSSTATVTITVLDVNDN 98
Score = 46.9 bits (112), Expect = 9e-08
Identities = 14/54 (25%), Positives = 28/54 (51%)
Query: 87 VYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGSL 140
Y + EN ++ ++A+D D + ++Y I +GN + F+ID +G +
Sbjct: 1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEI 54
>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain.
Length = 92
Score = 42.3 bits (100), Expect = 3e-06
Identities = 20/53 (37%), Positives = 28/53 (52%)
Query: 88 YYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSGSL 140
Y A + EN ++ + A+D DL P+ RI Y I G P +F ID +G L
Sbjct: 1 YSASVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDTGDL 53
Score = 39.2 bits (92), Expect = 5e-05
Identities = 18/47 (38%), Positives = 25/47 (53%)
Query: 7 YYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNID 53
Y A + EN ++ + A+D DL P+ RI Y I G P +F ID
Sbjct: 1 YSASVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRID 47
>gnl|CDD|214520 smart00112, CA, Cadherin repeats. Cadherins are glycoproteins
involved in Ca2+-mediated cell-cell adhesion. Cadherin
domains occur as repeats in the extracellular regions
which are thought to mediate cell-cell contact when
bound to calcium.
Length = 81
Score = 39.6 bits (93), Expect = 3e-05
Identities = 21/80 (26%), Positives = 31/80 (38%), Gaps = 24/80 (30%)
Query: 26 ASDGDLDPDQRISYKISAGNPESYFNID------------------------IGSDLTGG 61
A+D D + +++Y I +GN + F+ID +D G
Sbjct: 2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGP 61
Query: 62 PDQVYLIVYIQVQNVNDNVP 81
P V I V +VNDN P
Sbjct: 62 PLSSTATVTITVLDVNDNAP 81
Score = 34.2 bits (79), Expect = 0.003
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 107 ASDGDLDPDQRISYKISAGNPESYFNIDIGSGSL 140
A+D D + +++Y I +GN + F+ID +G +
Sbjct: 2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEI 35
>gnl|CDD|227322 COG4989, COG4989, Predicted oxidoreductase [General function
prediction only].
Length = 298
Score = 27.7 bits (62), Expect = 2.2
Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 21/130 (16%)
Query: 11 IVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESYFNIDIGSDLTGGPDQVYLIVY 70
+V NQ + P+ DG LD Q++ + A +P + +G D +V
Sbjct: 175 LVTNQLELSPLHTPMLLDGTLDYCQQLRVRPMAWSPLGGGGLFLGDDKFQRLRKV----- 229
Query: 71 IQVQNVNDNV-PMTLDPVYYAQIVENQSGILPIV----------QLAASDGDLDPDQ--R 117
+ + + +++ V A ++ + + PI+ + A L Q
Sbjct: 230 --LDRIAEEYGAVSITAVAIAWLLRHPAKPQPIIGTGNLERIRAAIKALSLTLTRQQWFE 287
Query: 118 ISYKISAGNP 127
I Y + GN
Sbjct: 288 I-YTAAIGND 296
>gnl|CDD|216245 pfam01015, Ribosomal_S3Ae, Ribosomal S3Ae family.
Length = 195
Score = 27.1 bits (61), Expect = 2.5
Identities = 5/24 (20%), Positives = 13/24 (54%)
Query: 56 SDLTGGPDQVYLIVYIQVQNVNDN 79
+DLTG + + ++++V +
Sbjct: 53 ADLTGDYSKSNRKLKFKIEDVQGD 76
>gnl|CDD|237514 PRK13804, ileS, isoleucyl-tRNA synthetase; Provisional.
Length = 961
Score = 27.6 bits (62), Expect = 2.9
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 5/48 (10%)
Query: 2 TLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESY 49
T+ P Q V QSG I++L + D DQRI +I E+Y
Sbjct: 638 TVSP----QDVIKQSGA-DILRLWVASVDYSDDQRIGKEILKQVSETY 680
Score = 27.6 bits (62), Expect = 2.9
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 5/48 (10%)
Query: 83 TLDPVYYAQIVENQSGILPIVQLAASDGDLDPDQRISYKISAGNPESY 130
T+ P Q V QSG I++L + D DQRI +I E+Y
Sbjct: 638 TVSP----QDVIKQSGA-DILRLWVASVDYSDDQRIGKEILKQVSETY 680
>gnl|CDD|187849 cd09718, Cas1_I-F, CRISPR/Cas system-associated protein Cas1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas1 is the most universal CRISPR system protein thought
to be involved in spacer integration; Cas1 is
metal-dependent deoxyribonuclease, also binds RNA; Shown
to possess a unique fold consisting of a N-terminal
beta-strand domain and a C-terminal alpha-helical
domain.
Length = 306
Score = 26.7 bits (59), Expect = 4.3
Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 8/33 (24%)
Query: 115 DQRISYKISAGNPESYFNIDI--------GSGS 139
R+ Y GN Y+NI I G+G+
Sbjct: 21 GGRVEYVTDEGNESLYWNIPIANTTVLLLGTGT 53
Score = 26.7 bits (59), Expect = 4.8
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 34 DQRISYKISAGNPESYFNIDI 54
R+ Y GN Y+NI I
Sbjct: 21 GGRVEYVTDEGNESLYWNIPI 41
>gnl|CDD|132676 TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype
I-F/YPEST. The CRISPR-associated protein Cas1 is
virtually universal to CRISPR systems. CRISPR, an
acronym for Clustered Regularly Interspaced Short
Palindromic Repeats, is prokaryotic immunity system for
foreign DNA, mostly from phage. CRISPR systems belong to
different subtypes, distinguished by both nature of the
repeats, the makeup of the cohort of associated Cas
proteins, and by molecular phylogeny within the more
universal Cas proteins such as this one. This model is
of type EXCEPTION and provides more specific information
than the EQUIVALOG model TIGR00287. It describes the
Cas1 protein particular to the YPEST subtype of
CRISPR/Cas system.
Length = 307
Score = 26.7 bits (59), Expect = 4.6
Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 8/33 (24%)
Query: 115 DQRISYKISAGNPESYFNIDI--------GSGS 139
R+ Y GN Y+NI I G+G+
Sbjct: 21 GGRVEYVTDEGNESLYWNIPIANTTVLLLGTGT 53
Score = 26.7 bits (59), Expect = 5.0
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 34 DQRISYKISAGNPESYFNIDI 54
R+ Y GN Y+NI I
Sbjct: 21 GGRVEYVTDEGNESLYWNIPI 41
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.138 0.391
Gapped
Lambda K H
0.267 0.0710 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,501,478
Number of extensions: 683502
Number of successful extensions: 343
Number of sequences better than 10.0: 1
Number of HSP's gapped: 341
Number of HSP's successfully gapped: 26
Length of query: 141
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 54
Effective length of database: 7,078,804
Effective search space: 382255416
Effective search space used: 382255416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 54 (24.6 bits)
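
The length figures in the statistics footer above are consistent with one another, and the following minimal sketch reproduces that arithmetic. It assumes the usual BLAST convention that the length adjustment is subtracted once from the query and once per database sequence, and that raw scores convert to bits through the Karlin-Altschul relation S' = (Lambda * S - ln K) / ln 2. The function names are hypothetical and do not belong to any BLAST tool or library. The printed Lambda and K values are rounded, so the bit scores are only expected to agree to about one decimal place.

```python
import math


def effective_lengths(query_len, db_letters, db_seqs, length_adjustment):
    """Return (effective query length, effective db length, search space).

    Assumption: the adjustment is removed once from the query and once
    from every database sequence, as the footer figures suggest.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_letters - db_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report footer.
eff_query, eff_db, search_space = effective_lengths(
    query_len=141, db_letters=10_937_602, db_seqs=44_354, length_adjustment=87
)
print(eff_query, eff_db, search_space)

# S1 appears to use the ungapped parameters, S2 the gapped ones.
s1_bits = bit_score(42, lam=0.314, k=0.138)
s2_bits = bit_score(54, lam=0.267, k=0.0710)
print(round(s1_bits, 1), round(s2_bits, 1))
```

With the footer values this gives an effective query length of 54, an effective database length of 7,078,804, and a search space of 382,255,416, matching the report. The S1 and S2 conversions come out close to the reported 21.9 and 24.6 bits.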