RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15052
(422 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 230 bits (588), Expect = 8e-74
Identities = 93/192 (48%), Positives = 119/192 (61%), Gaps = 7/192 (3%)
Query: 177 VNSLLTSQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVE 236
V S S +R+G +D S E Q V K +VHP YN TY+ D+AL++L+ PV
Sbjct: 44 VYSSAPSNYTVRLGSHDLSSNEGGG--QVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVT 101
Query: 237 FAPNIVPICLPGSDDLLI-GENATVTGWGRLSEGGSLPPVLQKVTVPIVSNEKCRSMFLR 295
+ N+ PICLP S L G TV+GWGR SEGG LP VLQ+V VPIVSN +C+ +
Sbjct: 102 LSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAY-- 159
Query: 296 AGRYEFISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLPG 355
I+D +CAG GG+D+CQGDSGGPL V +GR L GI+SWG GCA N PG
Sbjct: 160 -SYGGTITDNMLCAGGLEGGKDACQGDSGGPL-VCNDNGRGVLVGIVSWGSGCARPNYPG 217
Query: 356 VCTRISKFVPWV 367
V TR+S ++ W+
Sbjct: 218 VYTRVSSYLDWI 229
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 220 bits (562), Expect = 6e-70
Identities = 92/193 (47%), Positives = 119/193 (61%), Gaps = 10/193 (5%)
Query: 177 VNSLLTSQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVE 236
V S I++R+G +D S EE Q V K ++HP YN TY+ D+AL++L+ PV
Sbjct: 45 VRGSDPSNIRVRLGSHDLSSGEE---GQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVT 101
Query: 237 FAPNIVPICLPGSDDLLI-GENATVTGWGRLSEG-GSLPPVLQKVTVPIVSNEKCRSMFL 294
+ N+ PICLP S+ + G TV+GWGR SEG GSLP LQ+V VPIVSN CR
Sbjct: 102 LSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRR--- 158
Query: 295 RAGRYEFISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLP 354
I+D +CAG GG+D+CQGDSGGPL DGR+ L GI+SWG GCA P
Sbjct: 159 AYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWGSGCARPGKP 216
Query: 355 GVCTRISKFVPWV 367
GV TR+S ++ W+
Sbjct: 217 GVYTRVSSYLDWI 229
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 164 bits (417), Expect = 1e-48
Identities = 76/186 (40%), Positives = 106/186 (56%), Gaps = 16/186 (8%)
Query: 183 SQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVEFAPNIV 242
+++ +G ++ E Q+ V K +VHP YN T D+AL++L++PV +
Sbjct: 48 KSVRVVLGAHNIVLREGGE--QKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVR 104
Query: 243 PICLPGSDDLL-IGENATVTGWGRLSEGGSLPPVLQKVTVPIVSNEKCRSMFLRAGRYEF 301
PICLP + L +G TV+GWG G LP LQ+VTVP+VS E CRS +
Sbjct: 105 PICLPTASSDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPVVSRETCRSAYGGT----- 158
Query: 302 ISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLPGVCTRIS 361
++D +CAG GG+D+CQGDSGGPL L GI+SWG GCA N PGV T +S
Sbjct: 159 VTDNMICAGA--GGKDACQGDSGGPLVCSDG----ELIGIVSWGYGCASGNYPGVYTPVS 212
Query: 362 KFVPWV 367
++ W+
Sbjct: 213 SYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 67.6 bits (165), Expect = 3e-12
Identities = 56/202 (27%), Positives = 81/202 (40%), Gaps = 30/202 (14%)
Query: 203 YQERGVVKKM-VHPKYNFFTYEYDLALVRLETPVEFAPNIVPICLPGSDDLLIGENATVT 261
ERG V+ + VH Y+ D+A++ L +P S D +VT
Sbjct: 102 QAERGHVRTIYVHEFYSPGNLGNDIAVLELARAA-----SLPRVKITSFDASDTFLNSVT 156
Query: 262 GWGRLSEGG------SLPP-------VLQKVTVPIVSNEKCRSMFLRA----GRYEFISD 304
++ G S P +L +V V V C A G
Sbjct: 157 TVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCAQYKGCANASDGATGLTG- 215
Query: 305 IFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIG-CAEANLPGVCTRISKF 363
CAG +D+CQGDSGGP+ KG++GR G++SWG G C +PGV T +S +
Sbjct: 216 --FCAG--RPPKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNY 270
Query: 364 VPWVLDTGDSGGPLQVKGKDGR 385
W+ + LQ + R
Sbjct: 271 QDWIAAMTNGLSYLQFRPLGYR 292
Score = 51.8 bits (124), Expect = 4e-07
Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 371 GDSGGPLQVKGKDGRYFLAGIISWGIG-CAEANLPGVCTRISKFVPWVLD 419
GDSGGP+ KG++GR G++SWG G C +PGV T +S + W+
Sbjct: 228 GDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWIAA 276
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 33.2 bits (76), Expect = 0.17
Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 2/77 (2%)
Query: 74 ISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQ-TPRPMLSY 132
L +SP+ T ST KTSPTV T+ +P+ T T P
Sbjct: 77 PPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTT-TSPSETDTEEATTTVSTETPTEGG 135
Query: 133 YSHKDISSKNTTQRPYV 149
S SKN +R V
Sbjct: 136 SSAATDPSKNLLERKEV 152
>gnl|CDD|215315 PLN02578, PLN02578, hydrolase.
Length = 354
Score = 33.3 bits (76), Expect = 0.22
Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 19/61 (31%)
Query: 136 KDISSKNTTQRPYVKPLKESLGRPV-------------------NVYSNNEKVDDFSTES 176
+ + R VKPLKE R V +VY + VDD+ ES
Sbjct: 199 AIVVEETVLTRFVVKPLKEWFQRVVLGFLFWQAKQPSRIESVLKSVYKDKSNVDDYLVES 258
Query: 177 V 177
+
Sbjct: 259 I 259
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 32.4 bits (73), Expect = 0.44
Identities = 20/91 (21%), Positives = 30/91 (32%), Gaps = 3/91 (3%)
Query: 66 SHNVTKNEISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQT 125
+ ++I ++Q K +TP T S P T + P T
Sbjct: 84 EEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVT---TPPSTNT 140
Query: 126 PRPMLSYYSHKDISSKNTTQRPYVKPLKESL 156
P+PM S S S + + P E L
Sbjct: 141 PQPMQSTKSDTPQSPTIKQAQTDMTPKYEDL 171
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 32.1 bits (73), Expect = 0.64
Identities = 25/137 (18%), Positives = 36/137 (26%), Gaps = 10/137 (7%)
Query: 65 CSHNVTKNEISPLAQEKTSPTLLYTKPSTP--HKTSPTVHKHTTSMKPTPTTTILYSKPY 122
N T + P S PSTP + + +T S+K TP T
Sbjct: 448 KLANKTSTVMEPPYGSTESSV-----PSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQE 502
Query: 123 RQTPRPMLSYYS-HKDISSKNTTQRPYVKPLKESLGRPVNVYSNNEKVDDFSTESVNSL- 180
H T + + + N D S S+NS
Sbjct: 503 ASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKS 562
Query: 181 -LTSQIKIRVGEYDFSK 196
S +RV +
Sbjct: 563 PEHSVPLVRVFDIHLRA 579
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 30.2 bits (67), Expect = 1.1
Identities = 20/83 (24%), Positives = 35/83 (42%)
Query: 80 EKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQTPRPMLSYYSHKDIS 139
+ T+ T T P+T T+ T TT+ T TTT + T P + + + +
Sbjct: 44 QTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPT 103
Query: 140 SKNTTQRPYVKPLKESLGRPVNV 162
+ +T ++ + SLG V
Sbjct: 104 TTDTRAPSSIREIDGSLGSSAWV 126
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 31.1 bits (70), Expect = 1.6
Identities = 30/126 (23%), Positives = 41/126 (32%), Gaps = 22/126 (17%)
Query: 191 EYDFSKLEEPYPYQERGVVKKMVHPK-------YNFFTYEYDLALVRLETPVEFAPNIVP 243
E + LE Y+ RG FT+ + L P
Sbjct: 334 ESEAMSLESSLFYEFRGGEHLAGFYSAFGDIKRILLFTWSFKKLGTLL-----------P 382
Query: 244 ICLPG--SDDLLIGENATVTGWGRLSE-GGSLPPVLQKVTVPIVSNEKCRSMFLRAGRYE 300
LPG S + GE W SE G ++K + + K RS + E
Sbjct: 383 S-LPGYSSGGMDYGEEYRSLLWELGSEVGDPDSGPVRKWMRKDLFDAKVRSGVSFGKQEE 441
Query: 301 FISDIF 306
F+SDIF
Sbjct: 442 FVSDIF 447
>gnl|CDD|235846 PRK06654, fliL, flagellar basal body-associated protein FliL;
Reviewed.
Length = 181
Score = 28.6 bits (64), Expect = 3.9
Identities = 14/47 (29%), Positives = 18/47 (38%), Gaps = 1/47 (2%)
Query: 101 VHKHTTSMKPTPTTTILYSKPYRQTPRPMLSYYSHKDISSKNTTQRP 147
V K S P ++S Y + RPML +Y NT P
Sbjct: 54 VSKMVVSQSGAPNAFPVFSNEYLEK-RPMLQWYESIGEIRGNTADTP 99
>gnl|CDD|165411 PHA03139, PHA03139, helicase-primase primase subunit; Provisional.
Length = 860
Score = 29.2 bits (65), Expect = 4.6
Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 5/74 (6%)
Query: 95 HKTSPTVHKHTTSMKPTPTTTILYSKPYRQTPRPML---SYYSHKDISSKNTTQRPYVKP 151
HK+ + K+ T PT+ + Y QT +L YY ++ S TT + YV+
Sbjct: 122 HKS--LIIKYLTGKFLLPTSPFWFLSTYGQTEGMLLLTMYYYLFEEQKSTITTTKNYVQC 179
Query: 152 LKESLGRPVNVYSN 165
E+ G V YS+
Sbjct: 180 FTENTGDMVFTYSS 193
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 29.1 bits (65), Expect = 6.0
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 251 DLLIGENATVTGWGRLSEGGSLPPVLQK 278
DLL E A V G G+L+ GSL V+Q+
Sbjct: 609 DLLTIETACVPGKGKLTYTGSLGEVMQE 636
>gnl|CDD|234137 TIGR03176, AllC, allantoate amidohydrolase. This enzyme catalyzes
the breakdown of allantoate , first to ureidoglycine by
hydrolysis and then decarboxylation of one of the two
equivalent ureido groups. Ureidoglycine then
spontaneously exchanges ammonia for water resulting in
ureidoglycolate. This enzyme is an alternative to
allantoicase (3.5.3.4) which releases urea [Central
intermediary metabolism, Nitrogen metabolism].
Length = 406
Score = 28.7 bits (64), Expect = 6.8
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 23 KINPKPCSVNSVEGRCMFVWECINTD 48
K+ P P +VN V G F +C +TD
Sbjct: 261 KVEPVPNTVNVVPGETTFTIDCRHTD 286
>gnl|CDD|191916 pfam07982, Herpes_UL74, Herpes UL74 glycoproteins. Members of this
family are viral glycoproteins that form part of an
envelope complex.
Length = 417
Score = 28.5 bits (63), Expect = 8.6
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 73 EISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILY 118
+ S +Q T+P YT + + T+ + TT+ + PT+TI Y
Sbjct: 225 KKSKKSQSTTTPYSSYTTSTAFNVTTNVTYSATTTTRRVPTSTIGY 270
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.138 0.434
Gapped
Lambda K H
0.267 0.0659 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,184,010
Number of extensions: 1997952
Number of successful extensions: 1468
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1437
Number of HSP's successfully gapped: 28
Length of query: 422
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 323
Effective length of database: 6,546,556
Effective search space: 2114537588
Effective search space used: 2114537588
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.0 bits)