BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy2017
(255 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9W5U2|CHIT3_DROME Probable chitinase 3 OS=Drosophila melanogaster GN=Cht3 PE=2 SV=2
Length = 2286
Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 99/246 (40%), Gaps = 58/246 (23%)
Query: 35 YFAHPDEKVCNIFYNCI-EGDSTEIICPTGLHFDEYTGTCVWPESA-------GRIGCGE 86
+FA D CN +Y+C G+ + CP+GLH++ C WP SA +
Sbjct: 697 FFA--DSNNCNAYYHCFFAGELQQQFCPSGLHWNNEAKGCDWPSSAQCSLKLDQHLSTSY 754
Query: 87 PEGM-TLKDGFTCPKEQKASSS-------GQSVAHPVYAHPT-------------DCQKF 125
P + T K T K K S + + P Y PT +C+K+
Sbjct: 755 PNPIQTSKKPETTLKPNKKPSEISTHHQVNSTSSRPQYMRPTILECTEGDYYPHRNCRKY 814
Query: 126 YVCLN-GVTPREQGCQVGEVYNEESQK-CDAPENVPGCENWFADDPAAAPQAAKKPGKKI 183
Y+C+N + P E G G+++ + +K CD PENV C K KI
Sbjct: 815 YICVNKALVPSECG---GDLHWDGIKKLCDWPENVQ-C-------------VTSKKYLKI 857
Query: 184 RRRRNAAFKCP---QKNGQYEDPVQCDKYYECFDGQATEKLCPDGLVFDPLNRKINKCDQ 240
+ +A + P +K Y P C KY C + CP GL + N +I CD
Sbjct: 858 IKSSSANEEDPCKGEKRVPY--PGNCSKYLFCLWNRLQASDCPPGLHY---NERIGNCDW 912
Query: 241 PFNVEC 246
P +C
Sbjct: 913 PAAAKC 918
Score = 37.0 bits (84), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 196 KNGQY-EDPVQCDKYYECF-DGQATEKLCPDGLVFDPLNRKINKCDQPFNVECGDRLELH 253
K+G++ D C+ YY CF G+ ++ CP GL + N + CD P + +C +L+ H
Sbjct: 693 KSGEFFADSNNCNAYYHCFFAGELQQQFCPSGLHW---NNEAKGCDWPSSAQCSLKLDQH 749
Query: 254 RT 255
+
Sbjct: 750 LS 751
Score = 33.1 bits (74), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 117 AHPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPENV 158
AH DC K+Y+C +G T E+ C +G +N+ CD P NV
Sbjct: 1836 AHEWDCTKYYICEHG-TYVERSCPLGLQWNK--TYCDWPTNV 1874
Score = 31.2 bits (69), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 6/56 (10%)
Query: 190 AFKCPQKNGQYEDPVQCDKYYECFDGQATEKLCPDGLVFDPLNRKINKCDQPFNVE 245
+ KC KN + C KYY C G E+ CP GL ++ CD P NV+
Sbjct: 1826 SIKCHSKNYLAHE-WDCTKYYICEHGTYVERSCPLGLQWNK-----TYCDWPTNVQ 1875
>sp|P36362|CHIT_MANSE Endochitinase OS=Manduca sexta PE=2 SV=1
Length = 554
Score = 41.2 bits (95), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 25 GSYLCPRRNGYFAHPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTCVWPESAGRIGC 84
GS +C Y PD+K C+ ++ C+ G++ + C G F+ C WP +A R C
Sbjct: 494 GSEICNSDQDYI--PDKKHCDKYWRCVNGEAMQFSCQHGTVFNVELNVCDWPSNATRREC 551
Query: 85 GEP 87
+P
Sbjct: 552 QQP 554
Score = 37.7 bits (86), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 24/42 (57%), Gaps = 3/42 (7%)
Query: 202 DPVQCDKYYECFDGQATEKLCPDGLVFDPLNRKINKCDQPFN 243
D CDKY+ C +G+A + C G VF N ++N CD P N
Sbjct: 507 DKKHCDKYWRCVNGEAMQFSCQHGTVF---NVELNVCDWPSN 545
Score = 32.0 bits (71), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 122 CQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPEN 157
C K++ C+NG + CQ G V+N E CD P N
Sbjct: 511 CDKYWRCVNG-EAMQFSCQHGTVFNVELNVCDWPSN 545
>sp|O76217|PE1_ANOGA Peritrophin-1 OS=Anopheles gambiae GN=Aper1 PE=2 SV=2
Length = 153
Score = 40.4 bits (93), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 49/126 (38%), Gaps = 17/126 (13%)
Query: 36 FAHPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTCVWPESA----GRIGCGEPEGMT 91
AHP + C+ F C G CP GL +++ C +P A G EP
Sbjct: 34 LAHPTD--CDKFLICNHGTPVVSKCPPGLLWNDSQKQCDYPAQAQCAPGVTPNTEPAP-- 89
Query: 92 LKDGFTCPKEQKASSSGQSVAHPVYA-HPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQ 150
K CP E H VY H TDC K+Y+C EQ C G +N
Sbjct: 90 -KPSPNCPPEYDPD-------HMVYIPHETDCGKYYICDPYGVELEQTCPSGLHWNPVVN 141
Query: 151 KCDAPE 156
CD PE
Sbjct: 142 YCDFPE 147
>sp|P91745|PE48A_LUCCU Peritrophin-48 OS=Lucilia cuprina PE=1 SV=1
Length = 375
Score = 38.5 bits (88), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 53/132 (40%), Gaps = 34/132 (25%)
Query: 39 PDEKVCNIFYNCIEGDST---------EIICPTGLHFDEYTGTCVWPESAGR---IGCGE 86
PD+ C+++Y C E D+T + CP G +FD + C+ + A R GC
Sbjct: 236 PDKLTCSVYYIC-EQDTTSTPTTYKWIKTSCPNGQYFDVFGDGCL--DRAKRRVYTGCNR 292
Query: 87 PEGMTLKDGFTCPKEQKASSSGQSVAHPVYAHPTDCQKFYVCLNG--VTPREQGCQVGEV 144
E T S + V A DC KF C NG +T + C G
Sbjct: 293 CEYTT-----------------GSTTYWVNAVSNDCTKFSTCRNGRKITNEDGSCNSGYY 335
Query: 145 YNEESQKCDAPE 156
+NE Q C+ +
Sbjct: 336 FNEADQYCNMGD 347
>sp|Q25255|PE44_LUCCU Peritrophin-44 OS=Lucilia cuprina PE=1 SV=1
Length = 356
Score = 38.5 bits (88), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 40/113 (35%), Gaps = 23/113 (20%)
Query: 117 AHPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPENVPGCENWFADDPAAAPQAA 176
A P CQ + C N C G +YN + CD+P NV
Sbjct: 40 ADPNSCQSYGYCKNNQLVGTGKCPDGYLYNNKLGICDSPANV------------------ 81
Query: 177 KKPGKKIRRRRNAAFKCPQKNGQYEDPVQCDKYYECFDGQATEKLCPDGLVFD 229
K I +NA N DP C+ Y C + AT CP+ +FD
Sbjct: 82 ----KCISDSKNACLH-ATDNSFVADPTNCNGYCYCSNKTATCTTCPEFQLFD 129
Score = 32.0 bits (71), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 197 NGQYEDPVQCDKYYECFDGQAT-EKLCPDGLVFDPLNRKINKCDQPFNVEC 246
+G DP C Y C + Q CPDG ++ N K+ CD P NV+C
Sbjct: 36 DGFIADPNSCQSYGYCKNNQLVGTGKCPDGYLY---NNKLGICDSPANVKC 83
>sp|Q11174|CHIT_CAEEL Probable endochitinase OS=Caenorhabditis elegans GN=cht-1 PE=1 SV=1
Length = 617
Score = 36.6 bits (83), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/124 (22%), Positives = 52/124 (41%), Gaps = 6/124 (4%)
Query: 38 HPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTCVWPESAG-RIGCGEPEGMTLKDGF 96
+P+ C +F C+ S + CP+GL + C ++G + T K
Sbjct: 489 YPNSNNCGLFVLCLSSKSYSMSCPSGLQYSASLKYCTTSTASGCSVTTTRAPTTTTKSAP 548
Query: 97 TCPKEQKASSSGQS----VAHPVYAHPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQKC 152
T +A ++ + P+DC KF C+NG++ + C G ++ ++ C
Sbjct: 549 TVTTTTRAPTTTTPAFKCTKDGFFGVPSDCLKFIRCVNGISYNFE-CPNGLSFHADTMMC 607
Query: 153 DAPE 156
D P+
Sbjct: 608 DRPD 611
Score = 31.6 bits (70), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
Query: 191 FKCPQKNGQYEDPVQCDKYYECFDGQATEKLCPDGLVFDPLNRKINKCDQP 241
FKC K+G + P C K+ C +G + CP+GL F + CD+P
Sbjct: 564 FKC-TKDGFFGVPSDCLKFIRCVNGISYNFECPNGLSF---HADTMMCDRP 610
>sp|Q10752|CDC28_SCHPO Pre-mRNA-splicing factor ATP-dependent RNA helicase-like protein
cdc28 OS=Schizosaccharomyces pombe (strain 972 / ATCC
24843) GN=cdc28 PE=3 SV=2
Length = 1055
Score = 36.6 bits (83), Expect = 0.20, Method: Composition-based stats.
Identities = 37/150 (24%), Positives = 63/150 (42%), Gaps = 14/150 (9%)
Query: 101 EQKASSSGQSVAHPVYAHPTD-----CQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAP 155
E K SSS ++ + + D ++F L PRE G + E YN +K
Sbjct: 29 EAKGSSSSNNLYQKLVSFGMDGDDPAVKEFAHTLYARIPRE-GSRPKENYNARKKKEQGI 87
Query: 156 ENVPGCENWFADDPAAAPQAAKKPGKKIRRRRNAAFKCPQKN----GQYEDPVQCDKYYE 211
+ + + D PQ+ + PGK ++++ + K P++ + ED + D+Y E
Sbjct: 88 LQMERLNSSY--DLLIEPQSHETPGKPLKKKSRS--KTPKREIARRQRDEDEWESDEYEE 143
Query: 212 CFDGQATEKLCPDGLVFDPLNRKINKCDQP 241
DG A+ + D + D N K P
Sbjct: 144 VVDGSASHPIEEDSVSTDFQNHDYEKSSDP 173
>sp|A8XWX5|CPG2_CAEBR Chondroitin proteoglycan 2 OS=Caenorhabditis briggsae GN=cpg-2 PE=3
SV=2
Length = 491
Score = 36.2 bits (82), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 78/214 (36%), Gaps = 30/214 (14%)
Query: 33 NGYFAHPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTCVWP-------ESAGRIGCG 85
+GYF+ + + F C G + + CP GL F + C + E++G G
Sbjct: 285 DGYFSFG--QCSSSFTACTNGRAIVMFCPAGLKFSQANQRCDYDDLVNECQEASGEESSG 342
Query: 86 EPEGMTLKDG---FTCPKEQKASSSGQSVAHPVYAHPTDCQ-KFYVCLNGVTPREQGCQV 141
E G +G + +AS + V+ H C + C NG + C
Sbjct: 343 EASGEQSGEGSGEASGEASGEASGENECVSLDNGLHAIGCSPRVLSCQNGHVDIFE-CPS 401
Query: 142 GEVYNEESQKCDAPENVPGCENWFADDPAAAPQAAKKPGKKIRRRRNAAFKCPQKNGQYE 201
V+NE++ CD P+ C +D + P F C NG +
Sbjct: 402 SLVFNEQTLICDYPQTSLKC---LIEDTLLIDETPITP-----------FDC-STNGLFS 446
Query: 202 DPVQCDKYYECFDGQATEKLCPD-GLVFDPLNRK 234
D + Y++C GQ C + VF N +
Sbjct: 447 DGLCSATYHQCSAGQLINFTCAETNAVFSAANAE 480
Score = 34.7 bits (78), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 45/217 (20%), Positives = 72/217 (33%), Gaps = 50/217 (23%)
Query: 47 FYNCIEGDSTEIICPTGLHFDEYTGTCVWP----ESAGRI---GCGEPEGMTLKDGFTCP 99
++ C + + + CPT L +D T C W E G I G GE G +
Sbjct: 145 YFFCTDNTARFLSCPTPLFYDVATQKCAWKALVEECNGEIIIDGSGETSGEGSGEA---S 201
Query: 100 KEQKASSSGQSV----------AHPVYAHPTDCQKFYVCLNGVTPREQGCQVGEVYNEES 149
E +SG+ A +Y + F C G+ R C ++N +
Sbjct: 202 GENSGENSGEGSGEFEPTCDGKADGIYPNGVCVPNFLTCSGGI-ARVMNCPASLIFNPDI 260
Query: 150 QKCDAPENVPGCENWFADDPAAAPQAAKKPGKKIRRRRNAAFKCPQKNGQYEDPVQCDKY 209
CD P +V C P + +G + +
Sbjct: 261 LVCDWPRDVAECHGLSTPAPVC-----------------------EDDGYFSFGQCSSSF 297
Query: 210 YECFDGQATEKLCPDGLVFDPLNRK------INKCDQ 240
C +G+A CP GL F N++ +N+C +
Sbjct: 298 TACTNGRAIVMFCPAGLKFSQANQRCDYDDLVNECQE 334
>sp|Q13231|CHIT1_HUMAN Chitotriosidase-1 OS=Homo sapiens GN=CHIT1 PE=1 SV=1
Length = 466
Score = 33.5 bits (75), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 20 PEPPQGSYLCPRRNGYFAHPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTCVW 75
P P Q ++ + +G + +P E+ + FY+C G + CPTGL F C W
Sbjct: 412 PSPGQDTFCQGKADGLYPNPRER--SSFYSCAAGRLFQQSCPTGLVFSNSCKCCTW 465
>sp|Q14624|ITIH4_HUMAN Inter-alpha-trypsin inhibitor heavy chain H4 OS=Homo sapiens
GN=ITIH4 PE=1 SV=4
Length = 930
Score = 33.1 bits (74), Expect = 1.9, Method: Composition-based stats.
Identities = 17/56 (30%), Positives = 23/56 (41%)
Query: 169 PAAAPQAAKKPGKKIRRRRNAAFKCPQKNGQYEDPVQCDKYYECFDGQATEKLCPD 224
PA+AP A P + R N + Q P+Q GQ+ E+LC D
Sbjct: 694 PASAPPATSNPDPAVSRVMNMKIEETTMTTQTPAPIQAPSAILPLPGQSVERLCVD 749
>sp|Q66HW4|NCUG1_DANRE Lysosomal protein NCU-G1 OS=Danio rerio GN=si:dkey-263h23.3 PE=2
SV=1
Length = 407
Score = 32.7 bits (73), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/54 (29%), Positives = 25/54 (46%)
Query: 82 IGCGEPEGMTLKDGFTCPKEQKASSSGQSVAHPVYAHPTDCQKFYVCLNGVTPR 135
+ CG + + +G C + S G+ A P H + + V +NGVTPR
Sbjct: 171 VLCGGEKTQSFSNGSLCLQVSVFESQGRDEAWPSLLHNANSSQLRVWINGVTPR 224
>sp|P98088|MUC5A_HUMAN Mucin-5AC (Fragments) OS=Homo sapiens GN=MUC5AC PE=1 SV=3
Length = 5030
Score = 32.7 bits (73), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 72/183 (39%), Gaps = 30/183 (16%)
Query: 59 ICPTGLHFDEYTGTCVWPESAG--RIGCGEPEGMTLKDG---FTCPKEQKASSSGQSVAH 113
ICP G D+ TG CV + G P G ++ D TC + + GQ+ A
Sbjct: 742 ICPKGTFLDD-TGKCVQASNCPCYHRGSMIPNGESVHDSGAICTCTHGKLSCIGGQAPA- 799
Query: 114 PVYAHPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPENVPGC---ENWFADD-- 168
PV A P F+ C N TP + G + + C +P+ VPGC + AD
Sbjct: 800 PVCAAP---MVFFDCRNA-TPGDTGAGCQKSCHTLDMTCYSPQCVPGCVCPDGLVADGEG 855
Query: 169 --------PAAAPQAAKKPGKKIRRRRNA------AFKCPQKNGQYEDPVQCDKYYECFD 214
P +A+ + G+ IR N ++C V D +Y FD
Sbjct: 856 GCITAEDCPCVHNEASYRAGQTIRVGCNTCTCDSRMWRCTDDPCLATCAVYGDGHYLTFD 915
Query: 215 GQA 217
GQ+
Sbjct: 916 GQS 918
>sp|O17450|PE48_CHRBE Peritrophin-48 OS=Chrysomya bezziana PE=2 SV=1
Length = 379
Score = 32.3 bits (72), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 121 DCQKFYVCLNGVTPREQGCQVGEVYNEESQKC 152
CQ +Y C++ P C G V+N++SQ+C
Sbjct: 42 SCQNYYTCVSNGLPTLSSCSSGYVFNKDSQQC 73
Score = 31.2 bits (69), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 41/110 (37%), Gaps = 20/110 (18%)
Query: 122 CQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPENVPGCENWFADDPAAAPQAAKKPGK 181
C +++ CL G C+ G+++N Q C E C N G
Sbjct: 103 CNEWHYCLAGAIAGTGTCKEGQIFNFAKQSCVYGE----CSN---------------TGN 143
Query: 182 KIRRRRNAAFKCPQKNGQY-EDPVQCDKYYECFDGQATEKLCPDGLVFDP 230
I N + NG Y D C +++C + + CP+G FDP
Sbjct: 144 NILDSPNLSVCQIMPNGIYFGDNKNCSTWHKCSGMEEKKGTCPNGDNFDP 193
>sp|P29030|CHIT_BRUMA Endochitinase OS=Brugia malayi PE=1 SV=1
Length = 504
Score = 32.0 bits (71), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 115 VYAHPTDCQKFYVCLNGVTPREQGCQVGEVYNEESQKCDAPENVP 159
++ HPTDC F C N + Q C +N+ + CD N P
Sbjct: 457 LFPHPTDCHLFIQCANNIAYVMQ-CPATTFFNDAIKVCDHMTNAP 500
Score = 31.2 bits (69), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 29 CPRRNGYFAHPDEKVCNIFYNCIEGDSTEIICPTGLHFDEYTGTC 73
CP R+G F HP + C++F C + + CP F++ C
Sbjct: 451 CPERDGLFPHPTD--CHLFIQCANNIAYVMQCPATTFFNDAIKVC 493
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.138 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 112,482,639
Number of Sequences: 539616
Number of extensions: 5161509
Number of successful extensions: 8848
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 8687
Number of HSP's gapped (non-prelim): 167
length of query: 255
length of database: 191,569,459
effective HSP length: 115
effective length of query: 140
effective length of database: 129,513,619
effective search space: 18131906660
effective search space used: 18131906660
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)