RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10199
(218 letters)
>gnl|CDD|215886 pfam00379, Chitin_bind_4, Insect cuticle protein. Many insect
cuticular proteins include a 35-36 amino acid motif
known as the R&R consensus. The extensive conservation
of this region led to the suggestion that it functions
to bind chitin. Provocatively, it has no sequence
similarity to the well-known cysteine-containing
chitin-binding domain found in chitinases and some
peritrophic membrane proteins. Chitin binding has been
shown experimentally for this region. Thus arthropods
have two distinct classes of chitin binding proteins,
those with the chitin-binding domain found in lectins,
chitinases and peritrophic membranes (cysCBD) and those
with the cuticular protein chitin-binding domain
(non-cysCBD).
Length = 51
Score = 41.5 bits (98), Expect = 8e-06
Identities = 15/54 (27%), Positives = 23/54 (42%), Gaps = 6/54 (11%)
Query: 118 DFETGNGIAVQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGY 171
+ET +G +E ++ G +G SY PDG + ADE G+
Sbjct: 4 GYETSDGKTQEEGRGTEDDGG------VVKGSYSYVDPDGKLRTVTYVADENGF 51
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 34.8 bits (79), Expect = 0.031
Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 7 NVYNIDWNNVYNTDWNNEYNIDWNNEYNIDWNNDTTWNNVYNIDWNNEYNIGWNNVYNID 66
NV D NV + N E N++ N E NI+ ++ NV I+ N E NI N N++
Sbjct: 1033 NVEEYDEENVEEIEENAEENVEENIEENIEEYDE---ENVEEIEENIEENIEENVEENVE 1089
Query: 67 WNNVYNIDWNNEYNIGWNNEYNIDWNNVYNIDWNNVYNIDWNNE 110
NV I+ N E N+ N E N + N N + + N + +NE
Sbjct: 1090 -ENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNE 1132
Score = 32.5 bits (73), Expect = 0.17
Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 20/126 (15%)
Query: 10 NIDWNNVYNTDWNNEYNIDWNNEYNIDWN----------------NDTTWNNVYNIDWNN 53
N++ N N + N E N++ N E NI+ N + NV I+ N
Sbjct: 990 NVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENA 1049
Query: 54 EYNIGWN---NVYNIDWNNVYNIDWNNEYNIGWNNEYNIDWNNVYNIDWNNVYNIDWNNE 110
E N+ N N+ D NV I+ N E NI N E N++ NV I+ N N++ N E
Sbjct: 1050 EENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVE-ENVEEIEENVEENVEENAE 1108
Query: 111 YNIDWN 116
N + N
Sbjct: 1109 ENAEEN 1114
Score = 31.7 bits (71), Expect = 0.37
Identities = 29/124 (23%), Positives = 56/124 (45%), Gaps = 6/124 (4%)
Query: 6 NNVYNIDWNNVYNTDWNNEYNIDWNNEYNIDWNNDTTWNNVYNIDWNNEYNIGWNNVYNI 65
N N+ + + + N E + + N E + + N + N++ N E N+ N N+
Sbjct: 930 NQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEE------NVEENVEENVEENVEENV 983
Query: 66 DWNNVYNIDWNNEYNIGWNNEYNIDWNNVYNIDWNNVYNIDWNNEYNIDWNNDFETGNGI 125
+ N N++ N E N+ N E N++ N NI+ N + N E + +++ N
Sbjct: 984 EENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVE 1043
Query: 126 AVQE 129
++E
Sbjct: 1044 EIEE 1047
Score = 29.0 bits (64), Expect = 2.5
Identities = 30/113 (26%), Positives = 51/113 (45%), Gaps = 2/113 (1%)
Query: 18 NTDWNNEYNIDWNNEYNIDWNNDTTWNNVYNIDWNNEYNIGWNNVYNIDWNNVYNIDWNN 77
N + N ++ + E NI+ D N + + N E N+ N N++ N N++ N
Sbjct: 930 NQNENVPEHLKEHAEANIE--EDAEENVEEDAEENVEENVEENVEENVEENVEENVEENV 987
Query: 78 EYNIGWNNEYNIDWNNVYNIDWNNVYNIDWNNEYNIDWNNDFETGNGIAVQEQ 130
E N+ N E N++ N N++ N NI+ N E + N + N E+
Sbjct: 988 EENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEE 1040
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 34.1 bits (77), Expect = 0.059
Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 17/106 (16%)
Query: 120 ETGNGIAVQEQGALKNAGQKDLEA---QTAQGQS---SYTSPDGTP----IQTQWYADET 169
+ G + + +GA+ + + L A + G S S +SP G+ Q+ +ET
Sbjct: 121 DCGRDVGEEAEGAVAPSWRAPLPAPQWRPVHGPSWWLSSSSPSGSSRGSVTSPQYSGEET 180
Query: 170 GYHASGAHLPTPPPIPDE----IAKAIATLPKLVEENYAPNPQPAP 211
ASGA PTPPP +A+A L +++E AP P+ P
Sbjct: 181 SDAASGA--PTPPPQSSGSSYVYGQALADLMAMIQEIPAP-PRSVP 223
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 33.4 bits (77), Expect = 0.075
Identities = 19/117 (16%), Positives = 29/117 (24%), Gaps = 5/117 (4%)
Query: 17 YNTDWNNEY-NIDWNNEYNIDWNNDTTWNNVYNIDWNNEYNIGWNNVYNIDWNNVYNIDW 75
N W + + T +N N N N+ + W
Sbjct: 267 NNALWISTTTTSKGKTLLITLYKTVTYYNTFETPAQLKPKTKNKNKTTNKTPNDSTSKYW 326
Query: 76 NNEYNIGWNNEYNIDWNNVYNIDWNNV-YNIDWNNEYNIDWNNDFE---TGNGIAVQ 128
++ Y N I + N + Y N F GN I +Q
Sbjct: 327 DSPYGDNTYNSLEYHTGWFSPIFLSAGRSNPQFPGAYTDVRYNPFTDKGKGNKIWLQ 383
Score = 27.2 bits (61), Expect = 7.4
Identities = 13/82 (15%), Positives = 24/82 (29%), Gaps = 4/82 (4%)
Query: 39 NDTTWNNVYNIDWNNEYNIGWNNVYNIDWNNVYNIDWNNEYNIGWNNEYNIDWNNVYNID 98
N +NN I + + + V +N + N N
Sbjct: 262 NPGFYNNALWI--STTTTSKGKTLLITLYKTVTY--YNTFETPAQLKPKTKNKNKTTNKT 317
Query: 99 WNNVYNIDWNNEYNIDWNNDFE 120
N+ + W++ Y + N E
Sbjct: 318 PNDSTSKYWDSPYGDNTYNSLE 339
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 31.4 bits (71), Expect = 0.27
Identities = 17/88 (19%), Positives = 24/88 (27%), Gaps = 13/88 (14%)
Query: 124 GIA--VQEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGYHASGAHLPTP 181
G+A +Q+ G A + Q ++P G P+ Q P
Sbjct: 73 GVAQEIQDAGDAAAASV---DPQPVAQPPVESTPAGVPVAAQTPKP--------VKPPKQ 121
Query: 182 PPIPDEIAKAIATLPKLVEENYAPNPQP 209
PP AK A P
Sbjct: 122 PPAGAVPAKPTPKPEPKPVAEPAAAPTG 149
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 30.2 bits (68), Expect = 0.44
Identities = 19/81 (23%), Positives = 25/81 (30%), Gaps = 12/81 (14%)
Query: 133 LKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGYHASGAHLPTPPPIPDEIAKAI 192
++ +K L T +P Q+ A T S P PP P A
Sbjct: 71 MEQEARKILAPLTPPAPPEPVTPP--TAQSPAPAVPTPPPTST---PAVPPAPAAAVPAP 125
Query: 193 ATLPKLVEENYAPNPQPAPGR 213
A P P+ P P R
Sbjct: 126 AAAP-------PPSDPPQPPR 139
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 30.7 bits (69), Expect = 0.47
Identities = 17/57 (29%), Positives = 22/57 (38%), Gaps = 14/57 (24%)
Query: 157 GTPIQTQWYADETGYHASGAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPGR 213
G P+ ++ A Y + P PPP P A E P +PAPGR
Sbjct: 101 GLPLLAKFRAQ---YERAAVSPPPPPPPPPARA-----------EPAPPVARPAPGR 143
>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3. The
C.elegans Notch pathway, involved in the control of
growth, differentiation and patterning in animal
development, relies on either of the receptors GLP-1 or
LIN-12. Both these receptors promote signalling by the
recruitment of LAG-3 to target promoters, where it then
acts as a transcriptional activator. LAG-3 works as a
ternary complex together with the DNA binding protein,
LAG-1.
Length = 476
Score = 30.3 bits (67), Expect = 0.74
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 4/56 (7%)
Query: 128 QEQGALKNAGQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGYHASGAHLPTPPP 183
Q+Q Q+ + GQ Y P+G P +A G H H+P
Sbjct: 420 QQQAQHHQQAQQQHQQPAQHGQMGYGIPNGYPAHMHGHAPAYGAH----HMPHHTA 471
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 29.5 bits (65), Expect = 0.74
Identities = 19/96 (19%), Positives = 30/96 (31%), Gaps = 10/96 (10%)
Query: 122 GNGIAVQEQGALKNA------GQKDLEAQTAQGQSSYTSPDGTPIQTQWYADETGYHASG 175
G G A G+ + + ++ T PD P + + E G
Sbjct: 21 GVGFAAAADEPADKGITKGGKGKGEKGTKVGADDTNGTDPDDEPEEEE----EPEPEEEG 76
Query: 176 AHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAP 211
P P +E + T + E P P+P P
Sbjct: 77 EEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEP 112
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 29.5 bits (66), Expect = 0.94
Identities = 15/38 (39%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 175 GAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPG 212
G +P PPP+P A I P L P P P PG
Sbjct: 33 GTGIPPPPPLPGGAA--IPPPPPLPGVAGIPPPPPLPG 68
Score = 28.8 bits (64), Expect = 1.3
Identities = 15/38 (39%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 175 GAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPG 212
GA +P PPP+P I P L P P P PG
Sbjct: 45 GAAIPPPPPLPG--VAGIPPPPPLPGATAIPPPPPLPG 80
Score = 28.8 bits (64), Expect = 1.5
Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 2/38 (5%)
Query: 175 GAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPG 212
A +P PPP+P A AI P L P P P PG
Sbjct: 57 VAGIPPPPPLPG--ATAIPPPPPLPGAAGIPPPPPLPG 92
>gnl|CDD|204583 pfam11020, DUF2610, Domain of unknown function (DUF2610). This
family is conserved in Proteobacteria. One member is
annotated as being elongation factor P but this could
not be confirmed. This domain is related to the
Ribbon-helix-helix superfamily so may be a DNA-binding
protein.
Length = 82
Score = 27.7 bits (62), Expect = 1.2
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 13/56 (23%)
Query: 155 PDGTPIQTQ--WYADETGYHASGAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQ 208
PD P+Q Q W + E G G IP+++ +++ L +L ++N P
Sbjct: 26 PDTHPLQFQATWLSKERG----GQ-------IPEKVMDSLSKLHELAKKNGVPFED 70
>gnl|CDD|236191 PRK08224, ligC, ATP-dependent DNA ligase; Reviewed.
Length = 350
Score = 29.1 bits (66), Expect = 2.0
Identities = 7/19 (36%), Positives = 12/19 (63%)
Query: 178 LPTPPPIPDEIAKAIATLP 196
LP PP+ +AK++ +P
Sbjct: 3 LPVMPPVEPMLAKSVDAIP 21
>gnl|CDD|215526 PLN02974, PLN02974, adenosylmethionine-8-amino-7-oxononanoate
transaminase.
Length = 817
Score = 28.5 bits (64), Expect = 3.6
Identities = 12/35 (34%), Positives = 15/35 (42%), Gaps = 5/35 (14%)
Query: 132 ALKNAGQKD-LEAQTAQGQSSYTSPDGTPIQTQWY 165
AL + D L A AQ S +T +Q WY
Sbjct: 477 ALDGSYHGDTLGAMEAQAPSVFTG----FLQQPWY 507
>gnl|CDD|234162 TIGR03299, LGT_TIGR03299, phage/plasmid-like protein TIGR03299.
Members of this uncharacterized protein family are found
in various Mycobacterium phage genomes, in Streptomyces
coelicolor plasmid SCP1, and in bacterial genomes near
various markers that suggest lateral gene transfer. The
function is unknown [Mobile and extrachromosomal element
functions, Other].
Length = 309
Score = 27.8 bits (62), Expect = 4.3
Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 161 QTQWYADETGYHASGAHLPTPPPIPDEIAKA 191
+T Y E +H G HLP PI D+ A+A
Sbjct: 1 ETMAYVGEPAWHGLGNHLPGRQPI-DDWARA 30
>gnl|CDD|133469 cd01414, SAICAR_synt_Sc, non-metazoan
5-aminoimidazole-4-(N-succinylcarboxamide)
ribonucleotide (SAICAR) synthase. Eukaryotic,
bacterial, and archaeal group of SAICAR synthetases
represented by the Saccharomyces cerevisiae (Sc) enzyme,
mostly absent in metazoans. SAICAR synthetase catalyzes
the seventh step of the de novo biosynthesis of purine
nucleotides (also reported as eighth step). It converts
5-aminoimidazole-4-carboxyribonucleotide (CAIR), ATP,
and L-aspartate into
5-aminoimidazole-4-(N-succinylcarboxamide)
ribonucleotide (SAICAR), ADP, and phosphate.
Length = 279
Score = 27.5 bits (62), Expect = 4.6
Identities = 8/31 (25%), Positives = 14/31 (45%), Gaps = 4/31 (12%)
Query: 163 QWYADETGYHASGAHLPTPPPIPDEIAKAIA 193
W +G+ P PPP+P E+ + +
Sbjct: 241 DWLE-ASGWDKQD---PPPPPLPAEVIEKTS 267
>gnl|CDD|234450 TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein,
SusC/RagA family. This model describes a distinctive
clade among the TonB-linked outer membrane proteins
(OMP). Members of this family are restricted to the
Bacteriodetes lineage (except for Gemmatimonas
aurantiaca T-27 from the novel phylum Gemmatimonadetes)
and occur in high copy numbers, with over 100 members
from Bacteroides thetaiotaomicron VPI-5482 alone.
Published descriptions of members of this family are
available for RagA from Porphyromonas gingivalis, SusC
from Bacteroides thetaiotaomicron, and OmpW from
Bacteroides caccae. Members form pairs with members of
the SusD/RagB family (pfam07980). Transporter complexes
including these outer membrane proteins are likely to
import large degradation products of proteins (e.g.
RagA) or carbohydrates (e.g. SusC) as nutrients, rather
than siderophores [Transport and binding proteins,
Unknown substrate].
Length = 982
Score = 27.9 bits (63), Expect = 4.9
Identities = 15/83 (18%), Positives = 25/83 (30%), Gaps = 12/83 (14%)
Query: 6 NNVYNIDWNNVYNTDWNNEYNI------------DWNNEYNIDWNNDTTWNNVYNIDWNN 53
N YN + N N + + E I D++N N + + N N
Sbjct: 404 NRGYNENERNRLNANASLEQKITKGLKLKGTFSYDYSNYKNEKYYPKPYYYNATGGANNG 463
Query: 54 EYNIGWNNVYNIDWNNVYNIDWN 76
Y+ + N N +
Sbjct: 464 YYSESNSRSRNYYLEATLNYNRT 486
>gnl|CDD|133212 cd05081, PTKc_Jak2_Jak3_rpt2, Catalytic (repeat 2) domain of the
Protein Tyrosine Kinases, Janus kinases 2 and 3.
Protein Tyrosine Kinase (PTK) family; Janus kinase 2
(Jak2) and Jak3; catalytic (c) domain (repeat 2). The
PTKc family is part of a larger superfamily that
includes the catalytic domains of other kinases such as
protein serine/threonine kinases, RIO kinases, and
phosphoinositide 3-kinase (PI3K). PTKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
tyrosine (tyr) residues in protein substrates. Jak2 and
Jak3 are members of the Janus kinase (Jak) subfamily of
proteins, which are cytoplasmic (or nonreceptor) tyr
kinases containing an N-terminal FERM domain, followed
by a Src homology 2 (SH2) domain, a pseudokinase domain,
and a C-terminal catalytic tyr kinase domain. Jaks are
crucial for cytokine receptor signaling. They are
activated by autophosphorylation upon cytokine-induced
receptor aggregation, and subsequently trigger
downstream signaling events such as the phosphorylation
of signal transducers and activators of transcription
(STATs). Jak2 is widely expressed in many tissues while
Jak3 is expressed only in hematopoietic cells. Jak2 is
essential for the signaling of hormone-like cytokines
such as growth hormone, erythropoietin, thrombopoietin,
and prolactin, as well as some IFNs and cytokines that
signal through the IL-3 and gp130 receptors. Jak3 binds
the shared receptor subunit common gamma chain and thus,
is essential in the signaling of cytokines that use it
such as IL-2, IL-4, IL-7, IL-9, IL-15, and IL-21.
Disruption of Jak2 in mice results in an embryonic
lethal phenotype with multiple defects including
erythropoietic and cardiac abnormalities. It is the only
Jak gene that results in a lethal phenotype when
disrupted in mice. A mutation in the pseudokinase domain
of Jak2, V617F, is present in many myeloproliferative
diseases, including almost all patients with
polycythemia vera, and 50% of patients with essential
thrombocytosis and myelofibrosis. Jak3 is important in
lymphoid development and myeloid cell differentiation.
Inactivating mutations in Jak3 have been reported in
humans with severe combined immunodeficiency (SCID).
Length = 284
Score = 27.8 bits (62), Expect = 5.0
Identities = 10/38 (26%), Positives = 16/38 (42%), Gaps = 7/38 (18%)
Query: 174 SGAHLPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAP 211
+ LP PP P EI +++E + +P P
Sbjct: 241 NNGRLPAPPGCPAEIY-------AIMKECWNNDPSQRP 271
>gnl|CDD|148271 pfam06566, Chon_Sulph_att, Chondroitin sulphate attachment domain.
This family represents the chondroitin sulphate
attachment domain of vertebrate neural transmembrane
proteoglycans that contain EGF modules. Evidence has
been accumulated to support the idea that neural
proteoglycans are involved in various cellular events
including mitogenesis, differentiation, axonal outgrowth
and synaptogenesis. This domain contains several
potential sites of chondroitin sulphate attachment, as
well as potential sites of N-linked glycosylation.
Length = 253
Score = 26.5 bits (58), Expect = 10.0
Identities = 15/37 (40%), Positives = 16/37 (43%), Gaps = 1/37 (2%)
Query: 178 LPTPPPIPDEIAKAIATLPKLVEENYAPNPQPAPGRG 214
LP P PDE L E A NP P+PG G
Sbjct: 99 LPATLPTPDEALGNSNPSLALPEATEASNP-PSPGPG 134
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.132 0.450
Gapped
Lambda K H
0.267 0.0703 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,666,640
Number of extensions: 1094602
Number of successful extensions: 1347
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1195
Number of HSP's successfully gapped: 133
Length of query: 218
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 125
Effective length of database: 6,812,680
Effective search space: 851585000
Effective search space used: 851585000
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 57 (25.8 bits)