BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011575
(482 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9M1S8|GCP2_ARATH Probable glutamate carboxypeptidase 2 OS=Arabidopsis thaliana
GN=AMP1 PE=1 SV=3
Length = 705
Score = 583 bits (1502), Expect = e-165, Method: Compositional matrix adjust.
Identities = 296/490 (60%), Positives = 364/490 (74%), Gaps = 17/490 (3%)
Query: 1 MTHPFT-KLTTTASS----KPPPPLMTFTFLLILCIIGFYTLHHPYPSPATPPL---RNP 52
M+ P T + T T S + PPPL +F F+++L + FYTLHHP TPPL RN
Sbjct: 1 MSQPLTTRPTVTGISIIPFRQPPPLCSFLFVIVLFVATFYTLHHP--DAVTPPLLFSRNA 58
Query: 53 QTALHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNTHTVE 112
AL ++ FLS +SN T+SSYLR+LT HPHLAGT+PSLDT+ YV +HF+ L TH E
Sbjct: 59 YNALRLRRLFLSSASNATISSYLRELTRHPHLAGTKPSLDTLHYVFNHFQSLGLETHVAE 118
Query: 113 YKALLSYPVHASVSAHFSNGTTVELSLTE-KGISQNAILDVVQPYHAYSPSGSAYGKVVF 171
Y+ALLSYP H SV+A FSN TT+E L + G S VV+PYHAYSPSGSA G VVF
Sbjct: 119 YEALLSYPTHISVTASFSNTTTLEFDLNDVPGDSP-----VVRPYHAYSPSGSAQGNVVF 173
Query: 172 VNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRG 231
VN+G E DY ALE+ GV+V GCVV+ARKG L R ++ +AEAKGA+GVL+YAE D
Sbjct: 174 VNHGEERDYHALESIGVSVKGCVVLARKGENLGRGAIVKIAEAKGALGVLIYAENDGGGF 233
Query: 232 GGVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSL 291
GG+ERGTVMRG+GDP+SPGW GV GGE L L+D V++RFPKIPSLPLS NA+IIL SL
Sbjct: 234 GGIERGTVMRGIGDPVSPGWPGVVGGEKLSLDDELVTRRFPKIPSLPLSLRNAEIILASL 293
Query: 292 WGGFVTQFLNDLGRVN-GGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLL 350
G + GRV G RVGPG ++N+TFQG+ K+ I+NV IRG EE +RYV+L
Sbjct: 294 GGARAPLEWRNSGRVGPGQRVGPGRMVINMTFQGEMKMKKINNVVVTIRGSEEADRYVIL 353
Query: 351 GNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEW 410
GNHRDAWTYGA+DPNSGT+ALLDI+RR+ALL++ GW PRRTI+ CSWDAEEFGMIGSTEW
Sbjct: 354 GNHRDAWTYGAVDPNSGTSALLDISRRFALLLKSGWRPRRTILLCSWDAEEFGMIGSTEW 413
Query: 411 VEENLVNLGAKAVAYLNVDCAVQGPGFFAGATPQLDDILIEVTKMVKDPESESGTLYDQW 470
+EEN++NLGA AVAYLNVDCAVQG GFFAGATPQLD +L++V K+V+DP++ T+ + +
Sbjct: 414 IEENVLNLGASAVAYLNVDCAVQGSGFFAGATPQLDGLLVDVLKLVQDPDAVGLTVEETF 473
Query: 471 SAPNRIFNGL 480
+ N I L
Sbjct: 474 KSQNNIIQRL 483
>sp|Q9Y3Q0|NALD2_HUMAN N-acetylated-alpha-linked acidic dipeptidase 2 OS=Homo sapiens
GN=NAALAD2 PE=1 SV=1
Length = 740
Score = 259 bits (661), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 249/462 (53%), Gaps = 46/462 (9%)
Query: 48 PLRNPQTALHFQKT----FLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQ 103
PL+ T++ + ++ +S + S+LR T PHLAGTE + + +Q+ +++
Sbjct: 33 PLKETTTSVRYHQSIRWKLVSEMKAENIKSFLRSFTKLPHLAGTEQNFLLAKKIQTQWKK 92
Query: 104 LKFNT-HTVEYKALLSYPVHASVSAHFSNGTTVELSLTEKGISQ------NAILDVVQPY 156
++ V Y LLSYP + + + S E + + + + ++V PY
Sbjct: 93 FGLDSAKLVHYDVLLSYPNETNAN-YISIVDEHETEIFKTSYLEPPPDGYENVTNIVPPY 151
Query: 157 HAYSPSGSAYGKVVFVNYGREEDYRALE-AAGVNVSGCVVMARKGSVLSRSGVIFLAEAK 215
+A+S G G +V+VNY R ED+ LE G+N +G +V+AR G + R + A
Sbjct: 152 NAFSAQGMPEGDLVYVNYARTEDFFKLEREMGINCTGKIVIARYGKIF-RGNKVKNAMLA 210
Query: 216 GAIGVLLYAE---------------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGE 258
GAIG++LY++ W+ L G +RG V+ G GDPL+PG+ E
Sbjct: 211 GAIGIILYSDPADYFAPEVQPYPKGWN-LPGTAAQRGNVLNLNGAGDPLTPGYPAKEYTF 269
Query: 259 SLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVN-GGRVGPGPT- 316
LD+E+ P+IP P+ + +A+I+L L GG + G +N +GPG T
Sbjct: 270 RLDVEEG---VGIPRIPVHPIGYNDAEILLRYL-GGIAPPDKSWKGALNVSYSIGPGFTG 325
Query: 317 -----MVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAAL 371
V + K+ I+NV IRG EP+RYV+LG HRD+W +GAIDP SG A L
Sbjct: 326 SDSFRKVRMHVYNINKITRIYNVVGTIRGSVEPDRYVILGGHRDSWVFGAIDPTSGVAVL 385
Query: 372 LDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCA 431
+IAR + LM GW PRRTIIF SWDAEEFG++GSTEW EEN+ L +++AY+N D +
Sbjct: 386 QEIARSFGKLMSKGWRPRRTIIFASWDAEEFGLLGSTEWAEENVKILQERSIAYINSDSS 445
Query: 432 VQGP-GFFAGATPQLDDILIEVTKMVKDPES--ESGTLYDQW 470
++G TP L ++ ++TK + P+ ES +LY+ W
Sbjct: 446 IEGNYTLRVDCTPLLYQLVYKLTKEIPSPDDGFESKSLYESW 487
>sp|Q9CZR2|NALD2_MOUSE N-acetylated-alpha-linked acidic dipeptidase 2 OS=Mus musculus
GN=Naalad2 PE=1 SV=2
Length = 740
Score = 253 bits (647), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 162/461 (35%), Positives = 249/461 (54%), Gaps = 44/461 (9%)
Query: 48 PLRNPQTALHF----QKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQ 103
PL+ T+ + Q+ LS + S+LR T PHLAGTE +L + +Q+ +++
Sbjct: 33 PLKETTTSAGYHQSIQQKLLSEMKAENIRSFLRSFTKLPHLAGTEQNLLLAKKIQTQWKK 92
Query: 104 LKFNT-HTVEYKALLSYPVHAS---VSAHFSNGTTVELS--LTEKGISQNAILDVVQPYH 157
++ + V Y LLSYP + VS +G + + L + +++ PY+
Sbjct: 93 FGLDSANLVHYDVLLSYPNETNANYVSIVDEHGVEIFKTSYLEPPPDGYENVTNIIPPYN 152
Query: 158 AYSPSGSAYGKVVFVNYGREEDYRALE-AAGVNVSGCVVMARKGSVLSRSGVIFLAEAKG 216
A+S SG G++V+VNY R ED+ LE +N +G +V+AR G + R + A G
Sbjct: 153 AFSASGMPEGELVYVNYARTEDFFKLEREMNINCTGKIVIARYGKIF-RGNKVKNAMLAG 211
Query: 217 AIGVLLYAE---------------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGES 259
A+G++LY++ W+ L G +RG V+ G GDPL+PG+ E
Sbjct: 212 AMGIILYSDPADYFAPDVQPYPKGWN-LPGAAAQRGNVLNLNGAGDPLTPGYPAKEYTFR 270
Query: 260 LDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVN-GGRVGPGPT-- 316
L +E++ P IP P+ + +A+ +L +L GG + G +N +GPG T
Sbjct: 271 LPVEEA---VGIPNIPVHPIGYNDAERLLRNL-GGAAPPDKSWKGSLNVSYNIGPGFTGS 326
Query: 317 ----MVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALL 372
+ + K+ I+NV IRG EP+RYV+LG HRD+W +G IDP +GTA L
Sbjct: 327 EYSRNIRMHVNNINKITRIYNVIGTIRGSTEPDRYVILGGHRDSWVFGGIDPTTGTAVLQ 386
Query: 373 DIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAV 432
+IAR + L+ GW PRRTIIF SWDAEEFG++GSTEW EEN L +++AY+N D A+
Sbjct: 387 EIARSFGKLVNGGWRPRRTIIFASWDAEEFGLLGSTEWAEENAKLLQERSIAYINSDSAI 446
Query: 433 QGP-GFFAGATPQLDDILIEVTKMVKDPES--ESGTLYDQW 470
+G TP L+ ++ +V + + P+ ES +LY+ W
Sbjct: 447 EGNYTLRVDCTPLLNQLVYKVAREISSPDDGFESKSLYESW 487
>sp|P70627|FOLH1_RAT Glutamate carboxypeptidase 2 OS=Rattus norvegicus GN=Folh1 PE=2
SV=1
Length = 752
Score = 248 bits (633), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 161/457 (35%), Positives = 238/457 (52%), Gaps = 40/457 (8%)
Query: 59 QKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFN-THTVEYKALL 117
+K FL + +L + T PHLAGT+ + + + + + +++ + +Y LL
Sbjct: 60 KKAFLQELKAENIKKFLYNFTRTPHLAGTQHNFELAKQIHAQWKEFGLDLVELSDYDVLL 119
Query: 118 SYP--VHAS-VSAHFSNGTTV-ELSLTEKGI-SQNAILDVVQPYHAYSPSGSAYGKVVFV 172
SYP H + +S +G + + SL E I DVV PY A+SP G+ G +V+V
Sbjct: 120 SYPNKTHPNYISIINEDGNEIFKTSLAELSPPGYENISDVVPPYSAFSPQGTPEGDLVYV 179
Query: 173 NYGREEDYRALE-AAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE------ 225
NY R ED+ LE +N SG +V+AR G V R + A+ GA G++LY++
Sbjct: 180 NYARTEDFFKLERVMKINCSGKIVIARYGQVF-RGNKVKNAQLAGAKGIILYSDPADYFV 238
Query: 226 ---------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKI 274
W+ L GGGV+RG V+ G GDPL+PG+ E + ++ P I
Sbjct: 239 PGVKSYPDGWN-LPGGGVQRGNVLNLNGAGDPLTPGYPANEYAYRHEFTEA---VGLPSI 294
Query: 275 PSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGGRVGPG------PTMVNLTFQGKKKV 328
P P+ +++AQ +L + G G VGPG V L KV
Sbjct: 295 PVHPIGYDDAQKLLEHMGGSAPPDSSWKGGLKVPYNVGPGFAGNFSKQKVKLHIHSYNKV 354
Query: 329 ATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSP 388
I+NV ++G EP+RYV+LG HRDAW +G IDP SG A + +I R + L + GW P
Sbjct: 355 TRIYNVIGTLKGAVEPDRYVILGGHRDAWVFGGIDPQSGAAVVHEIVRTFGTLKKKGWRP 414
Query: 389 RRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGP-GFFAGATPQLDD 447
RRTI+F SWDAEEFG++GSTEW EE+ L + VAY+N D +++G TP +
Sbjct: 415 RRTILFASWDAEEFGLLGSTEWAEEHSRLLQERGVAYINADSSIEGNYTLRVDCTPLMHS 474
Query: 448 ILIEVTKMVKDPES--ESGTLYDQW--SAPNRIFNGL 480
++ +TK + P+ E +LYD W +P+ F G+
Sbjct: 475 LVYNLTKELPSPDEGFEGKSLYDSWKEKSPSTEFIGM 511
>sp|Q04609|FOLH1_HUMAN Glutamate carboxypeptidase 2 OS=Homo sapiens GN=FOLH1 PE=1 SV=1
Length = 750
Score = 247 bits (631), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 164/471 (34%), Positives = 242/471 (51%), Gaps = 58/471 (12%)
Query: 54 TALHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNT-HTVE 112
T H K FL + +L + T PHLAGTE + + +QS +++ ++
Sbjct: 53 TPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAH 112
Query: 113 YKALLSYP--VHAS-VSAHFSNGTTV-ELSLTEKGI-SQNAILDVVQPYHAYSPSGSAYG 167
Y LLSYP H + +S +G + SL E + D+V P+ A+SP G G
Sbjct: 113 YDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEG 172
Query: 168 KVVFVNYGREEDYRALEA-AGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE- 225
+V+VNY R ED+ LE +N SG +V+AR G V R + A+ GA GV+LY++
Sbjct: 173 DLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVF-RGNKVKNAQLAGAKGVILYSDP 231
Query: 226 --------------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGESLDLEDSEVSK 269
W+ L GGGV+RG ++ G GDPL+PG+ E + ++
Sbjct: 232 ADYFAPGVKSYPDGWN-LPGGGVQRGNILNLNGAGDPLTPGYPANEYAYRRGIAEA---V 287
Query: 270 RFPKIPSLPLSFENAQIILGSL---------WGGFVTQFLNDLGRVNGGRVGPGPT---- 316
P IP P+ + +AQ +L + W G + N VGPG T
Sbjct: 288 GLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKVPYN---------VGPGFTGNFS 338
Query: 317 --MVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDI 374
V + +V I+NV +RG EP+RYV+LG HRD+W +G IDP SG A + +I
Sbjct: 339 TQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGAAVVHEI 398
Query: 375 ARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQG 434
R + L + GW PRRTI+F SWDAEEFG++GSTEW EEN L + VAY+N D +++G
Sbjct: 399 VRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYINADSSIEG 458
Query: 435 P-GFFAGATPQLDDILIEVTKMVKDPES--ESGTLYDQWS--APNRIFNGL 480
TP + ++ +TK +K P+ E +LY+ W+ +P+ F+G+
Sbjct: 459 NYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSGM 509
>sp|O77564|FOLH1_PIG Glutamate carboxypeptidase 2 OS=Sus scrofa GN=FOLH1 PE=1 SV=1
Length = 751
Score = 246 bits (627), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 241/466 (51%), Gaps = 58/466 (12%)
Query: 59 QKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNT-HTVEYKALL 117
+K FL + ++L + T PHLAGTE + + +QS +++ ++ Y LL
Sbjct: 59 KKAFLDELKAENIKTFLYNFTRIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLL 118
Query: 118 SYPVHAS---VSAHFSNGTTV-ELSLTEKGI-SQNAILDVVQPYHAYSPSGSAYGKVVFV 172
SYP +S +G + SL E + DVV P+ A+SP G G +V+V
Sbjct: 119 SYPNKTRPNYISIIDEDGNEIFNTSLFEPPPPGYENVSDVVPPFSAFSPQGMPEGDLVYV 178
Query: 173 NYGREEDYRALEA-AGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE------ 225
NY R ED+ LE +N SG +++AR G + R + A+ GA G++LY++
Sbjct: 179 NYARTEDFFKLERDMKINCSGKILIARYGKIF-RGNKVKNAQLAGAKGIILYSDPADYFA 237
Query: 226 ---------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKI 274
W+ L GGGV+RG ++ G GDPL+PG+ E L + ++ P+I
Sbjct: 238 PGVQSYPDGWN-LPGGGVQRGNILNLNGAGDPLTPGYPANEYAYRLQIAEA---VGLPRI 293
Query: 275 PSLPLSFENAQIILGSL---------WGGFVTQFLNDLGRVNGGRVGPGPT------MVN 319
P P+ + +AQ +L + W G + N VGPG T V
Sbjct: 294 PVHPIGYSDAQKLLEKMGGSAPPDDSWKGSLHVPYN---------VGPGFTGNFSTQKVK 344
Query: 320 LTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYA 379
+ KV I+NV +RG EP+RYV+LG HRD+W +G IDP SG A + +I R +
Sbjct: 345 MHIHSDNKVKRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGAAVVHEIVRSFG 404
Query: 380 LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGP-GFF 438
L + GW PRRT++F SWDAEE+G+ GSTEW EEN L + VAY+N D +++G
Sbjct: 405 KLKKEGWRPRRTVLFASWDAEEYGLFGSTEWAEENSRILQERGVAYINADSSIEGNYTLR 464
Query: 439 AGATPQLDDILIEVTKMVKDPES--ESGTLYDQWS--APNRIFNGL 480
TP + ++ +TK ++ P+ E +L++ W+ +P+ F+GL
Sbjct: 465 VDCTPLMYSLVYNLTKELQSPDEGFEGKSLFESWNEKSPSPEFSGL 510
>sp|O35409|FOLH1_MOUSE Glutamate carboxypeptidase 2 OS=Mus musculus GN=Folh1 PE=2 SV=2
Length = 752
Score = 240 bits (613), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 159/457 (34%), Positives = 240/457 (52%), Gaps = 40/457 (8%)
Query: 59 QKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFN-THTVEYKALL 117
+K FL + +L + T PHLAGT+ + + + + +++ + Y LL
Sbjct: 60 KKEFLHELKAENIKKFLYNFTRTPHLAGTQNNFELAKQIHDQWKEFGLDLVELSHYDVLL 119
Query: 118 SYP--VHAS-VSAHFSNGTTV-ELSLTEKGI-SQNAILDVVQPYHAYSPSGSAYGKVVFV 172
SYP H + +S +G + + SL+E+ I DVV PY A+SP G+ G +V+V
Sbjct: 120 SYPNKTHPNYISIINEDGNEIFKTSLSEQPPPGYENISDVVPPYSAFSPQGTPEGDLVYV 179
Query: 173 NYGREEDYRALE-AAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE------ 225
NY R ED+ LE ++ SG +V+AR G V R ++ A+ GA G++LY++
Sbjct: 180 NYARTEDFFKLEREMKISCSGKIVIARYGKVF-RGNMVKNAQLAGAKGMILYSDPADYFV 238
Query: 226 ---------WDRLRGGGVERGTVMR--GVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKI 274
W+ L GGGV+RG V+ G GDPL+PG+ E +L ++ P I
Sbjct: 239 PAVKSYPDGWN-LPGGGVQRGNVLNLNGAGDPLTPGYPANEHAYRHELTNA---VGLPSI 294
Query: 275 PSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGGRVGPG------PTMVNLTFQGKKKV 328
P P+ +++AQ +L + G G VGPG V + KV
Sbjct: 295 PVHPIGYDDAQKLLEHMGGPAPPDSSWKGGLKVPYNVGPGFAGNFSTQKVKMHIHSYTKV 354
Query: 329 ATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSP 388
I+NV ++G EP+RYV+LG HRDAW +G IDP SG A + +I R + L + G P
Sbjct: 355 TRIYNVIGTLKGALEPDRYVILGGHRDAWVFGGIDPQSGAAVVHEIVRSFGTLKKKGRRP 414
Query: 389 RRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGP-GFFAGATPQLDD 447
RRTI+F SWDAEEFG++GSTEW EE+ L + VAY+N D +++G TP +
Sbjct: 415 RRTILFASWDAEEFGLLGSTEWAEEHSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYS 474
Query: 448 ILIEVTKMVKDPES--ESGTLYDQW--SAPNRIFNGL 480
++ +TK ++ P+ E +LYD W +P+ F G+
Sbjct: 475 LVYNLTKELQSPDEGFEGKSLYDSWKEKSPSPEFIGM 511
>sp|Q7M758|NALDL_MOUSE N-acetylated-alpha-linked acidic dipeptidase-like protein OS=Mus
musculus GN=Naaladl1 PE=2 SV=1
Length = 745
Score = 235 bits (600), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 156/434 (35%), Positives = 227/434 (52%), Gaps = 45/434 (10%)
Query: 75 LRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF---NTHTVEYKALLSYPVHA---SVSAH 128
LR+L+ PH+A + V+ + ++ + T EY+ LLS+P SV
Sbjct: 66 LRELSKEPHVATSPRDEALVQLLLGRWKDTATGLDSAKTYEYRVLLSFPNAEQPNSVEVV 125
Query: 129 FSNGTTVE-LSLTEKGIS-QNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAA 186
NGTT EK ++ + A +V+QPY AY+P G+ G +V+ N G EED++ LE
Sbjct: 126 GPNGTTFHSFQPFEKNLTGEQAGPNVLQPYAAYAPPGTPKGLLVYANQGSEEDFKELETQ 185
Query: 187 GVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE------------------WDR 228
G+N+ G + + R G V R A G +GVL+Y + W R
Sbjct: 186 GINLEGTIALTRYGGV-GRGAKAINAAKHGVVGVLVYTDPGDINDGKSLPNETFPNSW-R 243
Query: 229 LRGGGVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIIL 288
L GVERG+ GDPL+P LD ++ FP IP+ P+ FE+A+ +L
Sbjct: 244 LPPSGVERGSYYEYFGDPLTPYLPAHPSSFRLDPHNT---SGFPPIPTQPIGFEDARDLL 300
Query: 289 GSLWGGFVTQFLND-LGRVNGGRVGPG---------PTMVNLTFQGKKKVATIHNVFAVI 338
+L G F LG ++GPG + V ++ + ++ T NV +I
Sbjct: 301 CNLTGTSAPAFWQGALGCEY--KLGPGFEPNGSFPAGSEVKVSVHNRLELRTSSNVLGII 358
Query: 339 RGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLG-WSPRRTIIFCSW 397
+G EP+RYV+ GNHRD+W +GA+DP+SGTA LL+I+R L++ G W PRR+IIF SW
Sbjct: 359 QGAVEPDRYVIYGNHRDSWVHGAVDPSSGTAVLLEISRVLGTLLKKGTWRPRRSIIFASW 418
Query: 398 DAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAV-QGPGFFAGATPQLDDILIEVTKMV 456
AEEFG+IGSTE+ EE L L + VAY+NVD +V A TP + ++ TK +
Sbjct: 419 GAEEFGLIGSTEFTEEFLSKLQERTVAYINVDISVFSNATLRAQGTPPVQSVIFSATKEI 478
Query: 457 KDPESESGTLYDQW 470
P S ++YD W
Sbjct: 479 SAPGSSGLSIYDNW 492
>sp|Q5WN23|GCP2_CAEBR Glutamate carboxypeptidase 2 homolog OS=Caenorhabditis briggsae
GN=CBG08178 PE=3 SV=1
Length = 770
Score = 231 bits (588), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 152/455 (33%), Positives = 227/455 (49%), Gaps = 54/455 (11%)
Query: 60 KTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-NTHTVEYKALLS 118
KT + + YLR T PH+AGT+ + + S + + + HT+ Y+ LLS
Sbjct: 66 KTIQDNIKSENIKKYLRIFTQEPHIAGTDANKKVAYAIASAWTEAGLEDVHTLPYEVLLS 125
Query: 119 YPVHASVSAHFSNGTTVELSLTEKGISQNAILD------VVQPYHAYSPSGSAYGKVVFV 172
YP + ++ + + KG+S I D + AY +G+ VV++
Sbjct: 126 YPDFENPNSVVIQNSAGKEIFRSKGVSPVIIPDEQSGKYAGHQWLAYGGNGTVSADVVYI 185
Query: 173 NYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEW-DRLRG 231
N G D++ L+ GV+V G + + R G R ++ A+ GAIG +L+++ D +
Sbjct: 186 NRGNANDFKNLKLMGVDVKGKIALMRYGHGF-RGDKVYKAQQAGAIGAILFSDTSDVAQD 244
Query: 232 G----------------GVERGTVMRGVGDPLSPGW-AGVEGGESLDLEDSEVSKRFPKI 274
G GV+RG++M G GDPLSP + + E + +E+++ P I
Sbjct: 245 GVDSEHVYPKTIWMPNEGVQRGSLMHGDGDPLSPFYPSKKELFKGRTIEEAKDDGTLPSI 304
Query: 275 PSLPLSFENAQIILGSL--------WGGFVTQFLNDLGRVNGGRVGPG---PTMVNLTFQ 323
P LP+S+ A +L + W GFV L ++GPG + +
Sbjct: 305 PVLPVSYTTALQLLKRMSGRAVPSDWQGFVGGNLT-------YKLGPGFVNGEKLTINVH 357
Query: 324 GKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMR 383
+ K I NV IRG EEP+RY++LGNH DAW YG+IDPNSGTA L ++AR A++
Sbjct: 358 SELKTKRIRNVIGYIRGAEEPDRYIMLGNHFDAWVYGSIDPNSGTAVLAEVAR--AMMQT 415
Query: 384 LG---WSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPGFFAG 440
+ W P RTI+F +WDAEEFG+IGSTE+VEE + L +AV Y+N+DC
Sbjct: 416 INETSWRPARTIVFNAWDAEEFGLIGSTEFVEEFVDVLQKRAVVYINMDCIQGNASLHVD 475
Query: 441 ATPQLDDILIEVTKMVKDPESES-----GTLYDQW 470
P L+ I IE K V +P T+YD W
Sbjct: 476 TVPTLEHIAIEAAKHVPNPSKRERSRGRNTVYDTW 510
>sp|Q9UQQ1|NALDL_HUMAN N-acetylated-alpha-linked acidic dipeptidase-like protein OS=Homo
sapiens GN=NAALADL1 PE=2 SV=2
Length = 740
Score = 231 bits (588), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 157/465 (33%), Positives = 236/465 (50%), Gaps = 56/465 (12%)
Query: 56 LHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNTHTVE--- 112
L +T + + + LR+L+ PHLA + D V+ + ++ + + E
Sbjct: 42 LEILETVMGQLDAHRIRENLRELSREPHLASSPRDEDLVQLLLQRWKDPESGLDSAEAST 101
Query: 113 YKALLSYPVHASVSA----HFSNGTTVELSLTEKGIS-QNAILDVVQPYHAYSPSGSAYG 167
Y+ LLS+P + + G TE+ ++ + DVVQPY AY+PSG+ G
Sbjct: 102 YEVLLSFPSQEQPNVVDIVGPTGGIIHSCHRTEENVTGEQGGPDVVQPYAAYAPSGTPQG 161
Query: 168 KVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWD 227
+V+ N G EED++ L+ G+ + G + + R G V R A G GVL+Y +
Sbjct: 162 LLVYANRGAEEDFKELQTQGIKLEGTIALTRYGGV-GRGAKAVNAAKHGVAGVLVYTDPA 220
Query: 228 RLRGG-----------------GVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKR 270
+ G GVERG+ GDPL+P V +DL + VS
Sbjct: 221 DINDGLSSPDETFPNSWYLPPSGVERGSYYEYFGDPLTPYLPAVPSSFRVDL--ANVSG- 277
Query: 271 FPKIPSLPLSFENAQIILGSLWGGFV-TQFLNDLGRVNGGRVGPG---------PTMVNL 320
FP IP+ P+ F++A+ +L +L G + LG R+GPG + VN+
Sbjct: 278 FPPIPTQPIGFQDARDLLCNLNGTLAPATWQGALG--CHYRLGPGFRPDGDFPADSQVNV 335
Query: 321 TFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYAL 380
+ + ++ NV +IRG EP+RYVL GNHRD+W +GA+DP+SGTA LL+++R
Sbjct: 336 SVYNRLELRNSSNVLGIIRGAVEPDRYVLYGNHRDSWVHGAVDPSSGTAVLLELSRVLGT 395
Query: 381 LMRLG-WSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPGFFA 439
L++ G W PRR+I+F SW AEEFG+IGSTE+ EE L + VAY+NVD +V FA
Sbjct: 396 LLKKGTWRPRRSIVFASWGAEEFGLIGSTEFTEEFFNKLQERTVAYINVDISV-----FA 450
Query: 440 GA------TPQLDDILIEVTKMVKDPESESGTLYDQWSAPNRIFN 478
A TP + ++ TK ++ P ++YD W R FN
Sbjct: 451 NATLRVQGTPPVQSVVFSATKEIRSPGPGDLSIYDNWI---RYFN 492
>sp|P91406|GCP2_CAEEL Glutamate carboxypeptidase 2 homolog OS=Caenorhabditis elegans
GN=R57.1 PE=1 SV=2
Length = 770
Score = 227 bits (579), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 148/443 (33%), Positives = 221/443 (49%), Gaps = 50/443 (11%)
Query: 70 TVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-NTHTVEYKALLSYPVHASVSAH 128
+ YLR T PH+AGTE + + + + + + HT+ Y+ LLSYP + ++
Sbjct: 76 NIKKYLRIFTKDPHVAGTEANKKVAYEIANAWSEAGLEDVHTLPYEVLLSYPDFENPNSV 135
Query: 129 FSNGTTVELSLTEKGISQNAILD------VVQPYHAYSPSGSAYGKVVFVNYGREEDYRA 182
+ + KG+S I D + AY+ +GSA VV++N+G D++
Sbjct: 136 IIKSSAGKEVFKSKGVSPVIIPDEQSGKYAGHQWLAYAGNGSASADVVYINHGTANDFKN 195
Query: 183 LEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVE------- 235
L+ GV++ G + + R G R I A+ GAIG +L+++ + GVE
Sbjct: 196 LKLMGVDIKGKIALMRYGHGF-RGDKIHKAQQAGAIGAILFSDTQDVAQDGVESENVYPK 254
Query: 236 ----------RGTVMRGVGDPLSPGW-AGVEGGESLDLEDSEVSKRFPKIPSLPLSFENA 284
RG++M G GD LSP + + E + +E+++ P IP LP+S+
Sbjct: 255 KIWMPNEGVQRGSLMHGDGDALSPYYPSKKELFKGRTIEEAKEDGVLPSIPVLPVSYTTG 314
Query: 285 QIILGSL--------WGGFVTQFLNDLGRVNGGRVGPG---PTMVNLTFQGKKKVATIHN 333
IL L W GFV L ++GPG +++ + + I N
Sbjct: 315 YEILKRLSGRPAPSDWQGFVGGNLT-------YKLGPGFVNGEKLSINVHSELRTKRIRN 367
Query: 334 VFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIAR-RYALLMRLGWSPRRTI 392
V IRG EEP+ Y++LGNH DAW YG+IDPNSGTA L ++AR + W P RTI
Sbjct: 368 VIGYIRGSEEPDSYIMLGNHFDAWVYGSIDPNSGTAVLAEVARAMMQTINETSWKPARTI 427
Query: 393 IFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPGFFAGATPQLDDILIEV 452
+F +WDAEEFG+IGSTE+VEE + L +AV Y+N+DC P L+ +IE
Sbjct: 428 VFNAWDAEEFGLIGSTEFVEEFVNILQKRAVVYINMDCIQGNISLHVDTVPILEHAVIEA 487
Query: 453 TKMVKDPESES-----GTLYDQW 470
+K V++P TLYD W
Sbjct: 488 SKQVENPSKRERSRGRKTLYDTW 510
>sp|O54697|NALDL_RAT N-acetylated-alpha-linked acidic dipeptidase-like protein OS=Rattus
norvegicus GN=Naaladl1 PE=1 SV=1
Length = 745
Score = 225 bits (573), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 151/432 (34%), Positives = 222/432 (51%), Gaps = 41/432 (9%)
Query: 75 LRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF---NTHTVEYKALLSYPVHA---SVSAH 128
LR+L+ PH+A + V+ + ++ T EY LLS+P SV
Sbjct: 66 LRELSKEPHVATSARDEALVQLLLGRWKDSASGLDTAKTYEYTVLLSFPSTEQPNSVEVV 125
Query: 129 FSNGTTVE-LSLTEKGIS-QNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAA 186
NGT EK ++ + A +V+QPY AY+P G+ G +V+ N G E+D++ LEA
Sbjct: 126 GPNGTVFHSFQPFEKNLTGEQAEPNVLQPYAAYAPPGTPKGPLVYANRGSEDDFKKLEAE 185
Query: 187 GVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGG-------------- 232
G+N+ G + + R GSV R A G +GVL+Y + + G
Sbjct: 186 GINLKGTIALTRYGSV-GRGAKAINAARHGVVGVLVYTDPGDINDGKSLPNETFPNSWGL 244
Query: 233 ---GVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILG 289
GVERG+ GDPL+P S L+ +S FP IP+ P+ FE+A+ +L
Sbjct: 245 PPSGVERGSYYEYFGDPLTPYLPAHP--VSFRLDPHNISG-FPPIPTQPIGFEDAKNLLC 301
Query: 290 SL--------W-GGFVTQFLNDLGRVNGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRG 340
+L W G ++ G G P + V ++ + ++ NV +I+G
Sbjct: 302 NLNGTSAPDSWQGALGCEYKLGPGFEPNGNF-PAGSEVKVSVYNRLELRNSSNVLGIIQG 360
Query: 341 LEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLG-WSPRRTIIFCSWDA 399
EP+RYV+ GNHRD+W +GA+DP+SGTA LL+I+R L++ G W PRR+IIF SW A
Sbjct: 361 AVEPDRYVIYGNHRDSWVHGAVDPSSGTAVLLEISRVLGTLLKKGTWRPRRSIIFASWGA 420
Query: 400 EEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAV-QGPGFFAGATPQLDDILIEVTKMVKD 458
EEFG+IGSTE+ EE L L + V Y+NVD +V A TP + ++ TK +
Sbjct: 421 EEFGLIGSTEFTEEFLSKLQERTVTYINVDISVFSNATLRAQGTPPVQSVIFSATKEISA 480
Query: 459 PESESGTLYDQW 470
P S ++YD W
Sbjct: 481 PGSSGLSIYDNW 492
>sp|B2GUY2|TFR2_RAT Transferrin receptor protein 2 OS=Rattus norvegicus GN=Tfr2 PE=2
SV=1
Length = 798
Score = 181 bits (460), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 183/338 (54%), Gaps = 20/338 (5%)
Query: 144 ISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVL 203
+ + L+ + Y YS +G+A GK+V+ +YGR ED + L+A V ++G +++ R G +
Sbjct: 215 VQEQLPLEDPEVYCPYSATGNATGKLVYAHYGRREDLQDLKAKDVELAGSLLLVRAG-IT 273
Query: 204 SRSGVIFLAEAKGAIGVLLY---AEWDR------LRGGGVERGTVMRGVGDPLSPGWAGV 254
S + + +A+ GA GVL+Y A++ + L G V G GDP +PG+
Sbjct: 274 SFAQKVAIAQDFGAHGVLIYPDPADFSQDPHKPGLSSDRAVYGHVHLGTGDPYTPGFPSF 333
Query: 255 EGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGG--RVG 312
+ +E S + P IP+ P+S + A +L L G Q GR++ R+G
Sbjct: 334 NQTQFPPVESSGL----PNIPAQPISADVADRLLRKLTGPVAPQEWK--GRLSDSPYRLG 387
Query: 313 PGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALL 372
PGP + L + I N+FA I G EP+ YV++G RDAW GA GTA LL
Sbjct: 388 PGPGL-RLVVNNHRTSTPISNIFACIEGFAEPDHYVVIGAQRDAWGPGAAKSAVGTAILL 446
Query: 373 DIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAV 432
++ R ++ ++ G+ PRR+++F SWD +FG +G+TEW+E L L KAV Y+++D +V
Sbjct: 447 ELVRTFSSMVSSGFRPRRSLLFISWDGGDFGSVGATEWLEGYLSVLHLKAVVYVSLDNSV 506
Query: 433 QGPG-FFAGATPQLDDILIEVTKMVKDPESESGTLYDQ 469
G G F A +P L ++ + K V P TLYDQ
Sbjct: 507 LGDGKFHAKTSPLLVSLIENILKQVDSPNHSGQTLYDQ 544
>sp|Q9JKX3|TFR2_MOUSE Transferrin receptor protein 2 OS=Mus musculus GN=Tfr2 PE=2 SV=2
Length = 798
Score = 181 bits (458), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 186/352 (52%), Gaps = 20/352 (5%)
Query: 130 SNGTTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVN 189
++ T+ + + + L+ + Y YS +G+A GK+V+ +YGR ED + L+A GV
Sbjct: 201 AHANTLHWVDADGSVQEQLPLEDPEVYCPYSATGNATGKLVYAHYGRSEDLQDLKAKGVE 260
Query: 190 VSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDR---------LRGGGVERGTVM 240
++G +++ R G + S + + +A+ GA GVL+Y + L G V
Sbjct: 261 LAGSLLLVRVG-ITSFAQKVAVAQDFGAQGVLIYPDPSDFSQDPHKPGLSSHQAVYGHVH 319
Query: 241 RGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFL 300
G GDP +PG+ + +E S + P IP+ P+S + A +L L G Q
Sbjct: 320 LGTGDPYTPGFPSFNQTQFPPVESSGL----PSIPAQPISADIADQLLRKLTGPVAPQEW 375
Query: 301 NDLGRVNGG--RVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWT 358
G ++G R+GPGP + L + I N+FA I G EP+ YV++G RDAW
Sbjct: 376 K--GHLSGSPYRLGPGPDL-RLVVNNHRVSTPISNIFACIEGFAEPDHYVVIGAQRDAWG 432
Query: 359 YGAIDPNSGTAALLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNL 418
GA GTA LL++ R ++ ++ G+ PRR+++F SWD +FG +G+TEW+E L L
Sbjct: 433 PGAAKSAVGTAILLELVRTFSSMVSNGFRPRRSLLFISWDGGDFGSVGATEWLEGYLSVL 492
Query: 419 GAKAVAYLNVDCAVQGPG-FFAGATPQLDDILIEVTKMVKDPESESGTLYDQ 469
KAV Y+++D +V G G F A +P L ++ + K V P TLY+Q
Sbjct: 493 HLKAVVYVSLDNSVLGDGKFHAKTSPLLVSLIENILKQVDSPNHSGQTLYEQ 544
>sp|Q9UP52|TFR2_HUMAN Transferrin receptor protein 2 OS=Homo sapiens GN=TFR2 PE=1 SV=1
Length = 801
Score = 167 bits (423), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 167/326 (51%), Gaps = 20/326 (6%)
Query: 156 YHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAK 215
Y YS G+ G++V+ +YGR ED + L A GV+ G +++ R G V+S + + A+
Sbjct: 232 YCPYSAIGNVTGELVYAHYGRPEDLQDLRARGVDPVGRLLLVRVG-VISFAQKVTNAQDF 290
Query: 216 GAIGVLLYAE---------WDRLRGGGVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSE 266
GA GVL+Y E L G V G GDP +PG+ + +
Sbjct: 291 GAQGVLIYPEPADFSQDPPKPSLSSQQAVYGHVHLGTGDPYTPGFPSFNQTQFPPV---- 346
Query: 267 VSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGG--RVGPGPTMVNLTFQG 324
S P IP+ P+S + A +L L G Q G + G +GPGP + L
Sbjct: 347 ASSGLPSIPAQPISADIASRLLRKLKGPVAPQEWQ--GSLLGSPYHLGPGPRL-RLVVNN 403
Query: 325 KKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRL 384
+ I+N+F I G EP+ YV++G RDAW GA GTA LL++ R ++ ++
Sbjct: 404 HRTSTPINNIFGCIEGRSEPDHYVVIGAQRDAWGPGAAKSAVGTAILLELVRTFSSMVSN 463
Query: 385 GWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPG-FFAGATP 443
G+ PRR+++F SWD +FG +GSTEW+E L L KAV Y+++D AV G F A +P
Sbjct: 464 GFRPRRSLLFISWDGGDFGSVGSTEWLEGYLSVLHLKAVVYVSLDNAVLGDDKFHAKTSP 523
Query: 444 QLDDILIEVTKMVKDPESESGTLYDQ 469
L ++ V K V P TLY+Q
Sbjct: 524 LLTSLIESVLKQVDSPNHSGQTLYEQ 549
>sp|Q2V905|TFR1_HORSE Transferrin receptor protein 1 OS=Equus caballus GN=TFRC PE=2 SV=1
Length = 767
Score = 164 bits (416), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 134/417 (32%), Positives = 204/417 (48%), Gaps = 46/417 (11%)
Query: 54 TALHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNT----- 108
TA F T L+ N SY+ P AG++ ++++ F + K N
Sbjct: 142 TATEFTNTIKRLNGN----SYV------PREAGSQKDESLAFFIENQFREFKLNKVWRDE 191
Query: 109 HTVEYKALLSYPVHASVSAHFSNGTTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGK 168
H V+ + S +A S NG+ +SL E Y AYS + + GK
Sbjct: 192 HFVKIQVKGS---NAQSSVTVVNGSGDMISLVENPTG----------YVAYSKATTVTGK 238
Query: 169 VVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLY---AE 225
+V N+G +EDY AL V+G +V+ R G + V AE+ A+GVL+Y A+
Sbjct: 239 LVHANFGTKEDYEALSYP---VNGSLVIVRAGEITFAQKVA-NAESLNAVGVLIYMDQAK 294
Query: 226 WDRLRGGGVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQ 285
+ + G G GDP +PG+ + + S + P IP +S A+
Sbjct: 295 FPIVNANLPVFGHAHLGTGDPYTPGFPSFNHTQFPPSQSSGL----PNIPVQTISRAAAE 350
Query: 286 IILGSLWGGFVTQFLNDLGRVNGGRVG-PGPTMVNLTFQGKKKVATIHNVFAVIRGLEEP 344
+ ++ G + + D + R+ PG V LT + K I NVF VI+G EEP
Sbjct: 351 ALFANMKGDCPSSWKTD----SSCRLEFPGDKNVKLTVNNELKEIRIFNVFGVIKGFEEP 406
Query: 345 NRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFG 403
+RYV++G RDAW GA + GTA LL++AR ++ ++ + G+ P R+I+F SW A +FG
Sbjct: 407 DRYVVIGAQRDAWGPGAAKSSVGTALLLELARIFSDMVSKGGFKPSRSIVFASWGAGDFG 466
Query: 404 MIGSTEWVEENLVNLGAKAVAYLNVDCAVQG-PGFFAGATPQLDDILIEVTKMVKDP 459
IG+TEW+E L +L KA Y+N+D AV G F A+P L ++ + + VK P
Sbjct: 467 AIGATEWLEGYLSSLHLKAFTYINLDKAVLGAKNFKVSASPLLYSLIEKTMQEVKHP 523
>sp|Q9MYZ3|TFR1_FELCA Transferrin receptor protein 1 OS=Felis catus GN=TFRC PE=1 SV=1
Length = 769
Score = 160 bits (404), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 130/414 (31%), Positives = 204/414 (49%), Gaps = 39/414 (9%)
Query: 60 KTFLSLS-SNYTVSSYLRDL---THHPHLAGTEPSLDTVRYVQSHFEQLKFNT--HTVEY 113
KT LS SN +S +R L ++ P AG++ ++++ F +L+ + H +
Sbjct: 137 KTMLSEKLSNTEFTSTIRQLNENSYFPREAGSQKDESLAFFIENRFRELQLSKAWHDEHF 196
Query: 114 KALLSYPVHASVSAHFSNGTTVELSLTEKGISQNAILDVVQP--YHAYSPSGSAYGKVVF 171
V V SN T+ G + + V P Y AYS + + G++V
Sbjct: 197 -------VKVQVKGSASNSVTI------VGTNSGMVYLVESPEGYVAYSKAATVTGRLVH 243
Query: 172 VNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRG 231
N+G ++D+ L + V+G +V+ R G + V AE+ AIGVL+Y + +
Sbjct: 244 ANFGTKKDFENLNSP---VNGSLVIVRAGKITFAEKVA-NAESFNAIGVLIYMDQAKFPI 299
Query: 232 GGVE---RGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIIL 288
E G G GDP +PG+ + + S + P IP +S NA+ +
Sbjct: 300 TNAEIPFFGHAHLGTGDPYTPGFPSFNHTQFPPSQSSGL----PNIPVQTISRANAEKLF 355
Query: 289 GSLWGGFVTQFLNDLG-RVNGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRY 347
G++ G + + D R+ R V L+ K I NVF VI+G EEP+ Y
Sbjct: 356 GNMEGDCPSAWETDSSCRLETSR----NWNVKLSVNNVLKEIRIFNVFGVIKGFEEPDHY 411
Query: 348 VLLGNHRDAWTYGAIDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIG 406
V++G RDAW GA + GTA LL++AR + ++++ G+ P R+I+F SW A +FG +G
Sbjct: 412 VVVGAQRDAWGPGAAKSSVGTALLLELARILSDMVLKGGFKPSRSIVFASWSAGDFGAVG 471
Query: 407 STEWVEENLVNLGAKAVAYLNVDCAVQG-PGFFAGATPQLDDILIEVTKMVKDP 459
+TEW+E L +L KA Y+N+D AV G F A+P L ++ +V K VK P
Sbjct: 472 ATEWLEGYLSSLHLKAFTYINLDKAVLGTSNFKVSASPLLYSLIEKVMKDVKHP 525
>sp|Q90997|TFR1_CHICK Transferrin receptor protein 1 OS=Gallus gallus GN=TFRC PE=2 SV=2
Length = 776
Score = 154 bits (390), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 174/352 (49%), Gaps = 32/352 (9%)
Query: 133 TTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSG 192
T ++S++ G + IL+ AYS SGS GK V+VNYG ++D+ ++ +++G
Sbjct: 205 TKNQVSISING--KEEILETPDAIVAYSESGSVSGKPVYVNYGLKKDFEIIQKVVASLNG 262
Query: 193 CVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVER----GTVMRGVGDPLS 248
+V+ R G + V EA GA GVL+Y D L G + G G GDP +
Sbjct: 263 TIVIVRAGKITLAEKVANAKEA-GAAGVLMYV--DSLMYGITDTLIPFGHAHLGTGDPYT 319
Query: 249 PGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDL----- 303
PG+ +FP + S L Q I S ++ D
Sbjct: 320 PGFPSFN------------HTQFPPVESSGLPHIAVQTISSSAAARLFSKMDGDTCSEGW 367
Query: 304 -GRVNGGRV---GPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTY 359
G ++ +V MV L K I N+F I+G EEP+RYV++G RD+W
Sbjct: 368 KGAIHSCKVTTKQESQIMVKLDVNNSMKDRKILNIFGAIQGFEEPDRYVVIGAQRDSWGP 427
Query: 360 GAIDPNSGTAALLDIARRYALLMR-LGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNL 418
G +GTA LL++AR + +++ G+ PRR+IIF SW A ++G +G+TEW+E L
Sbjct: 428 GVAKAGTGTAILLELARVISDIVKNEGYKPRRSIIFASWSAGDYGAVGATEWLEGYSAML 487
Query: 419 GAKAVAYLNVDCAVQGPGFFA-GATPQLDDILIEVTKMVKDPESESGTLYDQ 469
AKA Y+++D V G A+P L +L + K VK+P + S +LY++
Sbjct: 488 HAKAFTYISLDAPVLGASHVKISASPLLYMLLGSIMKGVKNPAAVSESLYNR 539
>sp|Q9GLD3|TFR1_CANFA Transferrin receptor protein 1 OS=Canis familiaris GN=TFRC PE=1
SV=1
Length = 770
Score = 151 bits (381), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 163/316 (51%), Gaps = 30/316 (9%)
Query: 156 YHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAK 215
Y AYS + + G++V VN+G ++D+ L++ V+G +V+AR G + V A++
Sbjct: 229 YVAYSKATTVTGRLVHVNFGTKKDFENLKSP---VNGSLVIARAGKITFAEKVAN-AQSY 284
Query: 216 GAIGVLLYAEWDRLRGGGVE---RGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFP 272
A+GVL+Y + R G G GDP +PG+ + + S + P
Sbjct: 285 NALGVLIYMDQARFPIVNARIPFFGHAHLGTGDPYTPGFPSFNHTQFPPSQSSGL----P 340
Query: 273 KIPSLPLS-------FENAQIILGSLWGGFVTQFLNDLGRVNGGRVGPGPTMVNLTFQGK 325
IP +S FEN + S W + L N VNLT
Sbjct: 341 SIPVQTISRAAAEKLFENMEGDCPSAWEIDPSCRLETSSNKN----------VNLTVNNV 390
Query: 326 KKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYA-LLMRL 384
K I NVF VI+G EEP+RYV++G RDAW GA + GTA LL++AR ++ ++++
Sbjct: 391 LKEIRIFNVFGVIKGFEEPDRYVVIGAQRDAWGPGAAKSSVGTALLLELARIFSDMVLKG 450
Query: 385 GWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQG-PGFFAGATP 443
G+ P R+I+F SW A +FG IG+TEW+E L +L KA Y+N+D A+ G F A+P
Sbjct: 451 GFKPSRSIVFASWSAGDFGAIGATEWLEGYLSSLHLKAFTYINLDKAILGTSNFKVSASP 510
Query: 444 QLDDILIEVTKMVKDP 459
L +L + K VK P
Sbjct: 511 LLYSLLEKTMKDVKHP 526
>sp|Q9HBA9|FOH1B_HUMAN Putative N-acetylated-alpha-linked acidic dipeptidase OS=Homo
sapiens GN=FOLH1B PE=2 SV=1
Length = 442
Score = 150 bits (379), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 109/181 (60%), Gaps = 11/181 (6%)
Query: 311 VGPGPT------MVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDP 364
VGPG T V + +V I+NV +RG EP+RYV+LG HRD+W +G IDP
Sbjct: 21 VGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDP 80
Query: 365 NSGTAALLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVA 424
SG A + + R + L + GW PRRTI+F SWDAEEFG++GSTEW E+N L + VA
Sbjct: 81 QSGAAVVHETVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEDNSRLLQERGVA 140
Query: 425 YLNVDCAVQGP-GFFAGATPQLDDILIEVTKMVKDPES--ESGTLYDQWS--APNRIFNG 479
Y+N D +++G TP + ++ +TK +K P+ E +LY+ W+ +P+ F+G
Sbjct: 141 YINADSSIEGNYTLRVDCTPLMYSLVYNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSG 200
Query: 480 L 480
+
Sbjct: 201 M 201
>sp|Q5RDH6|TFR1_PONAB Transferrin receptor protein 1 OS=Pongo abelii GN=TFRC PE=2 SV=1
Length = 760
Score = 149 bits (376), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 185/395 (46%), Gaps = 35/395 (8%)
Query: 75 LRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-----NTHTVEYKALLSYPVHASVSAHF 129
L + ++ P AG++ + YV++ F + K + H V+ + S SV
Sbjct: 147 LNENSYVPREAGSQKDENLALYVENQFREFKLSKVWRDQHFVKIQVKDS--AQNSVIIVD 204
Query: 130 SNGTTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVN 189
NG V L G Y AYS + + GK+V N+G ++D+ L+
Sbjct: 205 KNGRLVYLVENPGG------------YVAYSKAATVTGKLVHANFGTKKDFEDLDTP--- 249
Query: 190 VSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVER---GTVMRGVGDP 246
V+G +V+ R G + V AE+ AIGVL+Y + + E G G GDP
Sbjct: 250 VNGSIVIVRAGKITFAEKVAN-AESLNAIGVLIYMDQTKFPIVNAELSFFGHAHLGTGDP 308
Query: 247 LSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRV 306
+PG+ + S P IP +S A+ + G++ G + + D
Sbjct: 309 YTPGFPSFNHTQF----PPSRSSGLPNIPVQTISRAAAEKLFGNMEGDCPSDWKTD---S 361
Query: 307 NGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNS 366
V V LT K I N+F VI+G EP+ YV++G RDAW GA
Sbjct: 362 TCRMVTSESKNVKLTVSNVLKEIKILNIFGVIKGFVEPDHYVVVGAQRDAWGPGAAKSGV 421
Query: 367 GTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAY 425
GTA LL +A ++ ++++ G+ P R+IIF SW A +FG +G+TEW+E L +L KA Y
Sbjct: 422 GTALLLKLAEMFSDMVLKDGFQPSRSIIFASWSAGDFGSVGATEWLEGYLSSLHLKAFTY 481
Query: 426 LNVDCAVQG-PGFFAGATPQLDDILIEVTKMVKDP 459
+N+D AV G F A+P L ++ + + VK P
Sbjct: 482 INLDKAVLGTSNFKVSASPLLYTLIEKTMQNVKHP 516
>sp|P02786|TFR1_HUMAN Transferrin receptor protein 1 OS=Homo sapiens GN=TFRC PE=1 SV=2
Length = 760
Score = 148 bits (373), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 185/395 (46%), Gaps = 35/395 (8%)
Query: 75 LRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-----NTHTVEYKALLSYPVHASVSAHF 129
L + ++ P AG++ + YV++ F + K + H V+ + S SV
Sbjct: 147 LNENSYVPREAGSQKDENLALYVENQFREFKLSKVWRDQHFVKIQVKDS--AQNSVIIVD 204
Query: 130 SNGTTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVN 189
NG V L G Y AYS + + GK+V N+G ++D+ L
Sbjct: 205 KNGRLVYLVENPGG------------YVAYSKAATVTGKLVHANFGTKKDFEDLYTP--- 249
Query: 190 VSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVER---GTVMRGVGDP 246
V+G +V+ R G + V AE+ AIGVL+Y + + E G G GDP
Sbjct: 250 VNGSIVIVRAGKITFAEKVA-NAESLNAIGVLIYMDQTKFPIVNAELSFFGHAHLGTGDP 308
Query: 247 LSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRV 306
+PG+ + S P IP +S A+ + G++ G + + D
Sbjct: 309 YTPGFPSFNHTQF----PPSRSSGLPNIPVQTISRAAAEKLFGNMEGDCPSDWKTD---S 361
Query: 307 NGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNS 366
V V LT K I N+F VI+G EP+ YV++G RDAW GA
Sbjct: 362 TCRMVTSESKNVKLTVSNVLKEIKILNIFGVIKGFVEPDHYVVVGAQRDAWGPGAAKSGV 421
Query: 367 GTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAY 425
GTA LL +A+ ++ ++++ G+ P R+IIF SW A +FG +G+TEW+E L +L KA Y
Sbjct: 422 GTALLLKLAQMFSDMVLKDGFQPSRSIIFASWSAGDFGSVGATEWLEGYLSSLHLKAFTY 481
Query: 426 LNVDCAVQG-PGFFAGATPQLDDILIEVTKMVKDP 459
+N+D AV G F A+P L ++ + + VK P
Sbjct: 482 INLDKAVLGTSNFKVSASPLLYTLIEKTMQNVKHP 516
>sp|Q8HZV3|TFR1_PIG Transferrin receptor protein 1 OS=Sus scrofa GN=TFRC PE=2 SV=1
Length = 768
Score = 147 bits (371), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/408 (28%), Positives = 203/408 (49%), Gaps = 41/408 (10%)
Query: 66 SSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-----NTHTVEYKALLSYP 120
++++T + + + + P AG++ ++++ F + K + H V+ +
Sbjct: 144 TTDFTRTIKMLNEDYAPREAGSQKDESLGFFIENQFREFKLSKVWHDEHFVKIQV----- 198
Query: 121 VHASVSAHFSNGTTVELSLTEKGISQNAILDVVQP---YHAYSPSGSAYGKVVFVNYGRE 177
G+ E S+T N+++ V+ Y AYS + + GK++F N+G +
Sbjct: 199 ----------KGSNAENSVTLVNTDSNSLVYPVESPEGYVAYSKATTVTGKLIFANFGTK 248
Query: 178 EDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLY---AEWDRLRGGGV 234
+D+ L+ + V+G +V+ R G + V A++ AIGVL+Y A + +
Sbjct: 249 KDFEDLK---MPVNGSLVIVRAGKITFAEKVA-NAQSLDAIGVLIYMDRANFPIINADVP 304
Query: 235 ERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGG 294
G G GDP +PG+ + + S + P IP +S A+ + G++
Sbjct: 305 VFGHAHLGTGDPYTPGFPSFNHTQFPPSQSSGL----PNIPVQTISRAGAEKLFGNMEQD 360
Query: 295 FVTQFLNDLGRVNGGRVGPGPTM-VNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNH 353
+ D ++ P+ V LT K I N+F VI+G EEP+RYV++G
Sbjct: 361 CPLTWRTDFP----CKLVSSPSKNVKLTVNNVLKEIKILNIFGVIKGFEEPDRYVIVGAQ 416
Query: 354 RDAWTYGAIDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVE 412
RDAW GA + GT+ LL++A+ + ++++ + P R+I+F SW A +FG IG+TEW+E
Sbjct: 417 RDAWGPGAAKSSVGTSLLLNLAQILSDMVIKGQFKPSRSIVFASWSAGDFGAIGATEWLE 476
Query: 413 ENLVNLGAKAVAYLNVDCAVQG-PGFFAGATPQLDDILIEVTKMVKDP 459
L +L KA Y+N+D AV G F A+P L ++ ++ + VK+P
Sbjct: 477 GYLSSLHLKAFTYINLDKAVLGTSNFKVSASPLLYSLIEKMMQDVKNP 524
>sp|P47161|VPS70_YEAST Vacuolar protein sorting-associated protein 70 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=VPS70 PE=1
SV=1
Length = 811
Score = 145 bits (367), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 124/412 (30%), Positives = 187/412 (45%), Gaps = 47/412 (11%)
Query: 93 TVRYVQSHFEQLKFNTHTVEYKALLSYPVHASVSAHFSNGTTV-ELSLTE---KGISQNA 148
T +Y F + + +Y + PV +V A NG V E S+ E KG +
Sbjct: 165 TFKYTLDEFLDMGYKPKVEKYYPWIGEPVDTNV-APLENGKVVYEASMIEDRVKGDPASH 223
Query: 149 ILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGV 208
+ +H YS +GS + VF NYG DY+ L +++ + + R G +L V
Sbjct: 224 ARKRQKGFHQYSKNGSVTARYVFCNYGSISDYKLLLKKNIDIEDKIHIVRSGKILPGLKV 283
Query: 209 IFLAEAKGAIGVLLYAEWDRLRGGGV--ERG-------------TVMRGV--------GD 245
AE GA V++Y D G V E G + R GD
Sbjct: 284 KN-AELYGASSVIIYT--DPFDDGKVTEENGFLHYPYGPARNPSYIRRDSVNYFSDTPGD 340
Query: 246 PLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWG-GFVT---QFLN 301
P +PG+ + D E R P+IPS+P+S + Q IL L G GF +
Sbjct: 341 PTTPGYPSKDS----DTEHMSPVGRVPRIPSVPMSARDVQPILERLNGRGFQIGPGSNIK 396
Query: 302 DLGRVNGGRVGPGPTM--VNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAW-T 358
D G GP ++ V+L + + + +V I G+ +++G HRD+ +
Sbjct: 397 DFGSF----TGPSSSIDKVHLHNELTYNIKEMSSVEVSIPGIFTEGE-IIIGAHRDSLAS 451
Query: 359 YGAIDPNSGTAALLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNL 418
A D NSG+A LL+IAR + L++ GW P R I SWD E G++GST++ E + L
Sbjct: 452 SSAGDANSGSAILLEIARGMSKLLKHGWKPLRPIKLISWDGERSGLLGSTDYAEAHAAIL 511
Query: 419 GAKAVAYLNVDCAVQGPGFFAGATPQLDDILIEVTKMVKDPESESGTLYDQW 470
+A+ YLN+D A+ G F A P L D++ E K+ + E +L+D W
Sbjct: 512 RRRALVYLNLDNAISGTNFHCKANPLLQDVIYEAAKLTEFNGHEDWSLFDHW 563
>sp|O43023|YGV9_SCHPO Uncharacterized zinc metalloprotease C354.09c
OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=SPBC354.09c PE=3 SV=1
Length = 794
Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/407 (30%), Positives = 191/407 (46%), Gaps = 47/407 (11%)
Query: 78 LTHHPHLAGTEPSLDTVRYVQSHFEQLKFNTHTV----EYKALLSYPVHASVSAHFSNGT 133
L+ PHLAGT + +Q +L + T+ E+ A L++P S+S +
Sbjct: 150 LSSIPHLAGT---VGDSSLLQMIMNRLYYEKGTIVDFREFYAYLNFPQLVSLSIDGDDSF 206
Query: 134 TVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGC 193
L + + + + SPSG +V+ N GR ED+ L +G+ V
Sbjct: 207 HPSLIESYQVGGYDGVSIPTPATFGGSPSGFVNAPLVYANRGRIEDFEWLVNSGIYVESS 266
Query: 194 VVMARKGS---VLSRSGVIFLAEAKGAIGVLLYAE-----WDRLR----------GGGVE 235
+V+ R L+ + AE A +L++ + D L +
Sbjct: 267 IVLVRANQSDFALATAN----AEKYNASAILIFEDTYLTSLDNLNQVYPAGPYPSANSLY 322
Query: 236 RGTVMRG---VGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLW 292
RG+V VGDPL+PGW+ E + +D+ V P I S+P++F + +L L
Sbjct: 323 RGSVANHYYYVGDPLTPGWSAHEETNRISPKDANV---LPSIVSIPITFNDGIELLKRLQ 379
Query: 293 G-GFVTQ---FLNDLGRV-----NGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEE 343
G G + + + DL V G ++ VN+ Q + I N+ A I G E
Sbjct: 380 GHGHLVKDSNWCQDLAPVLSEVWTGSKISSPGLEVNV-LQDIEDKQKIINIMAQIDGYES 438
Query: 344 PNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEF 402
++ +++G RD+W GA D + GT+ L+D+ +A + L W PRRTI+F SWDA +F
Sbjct: 439 -DQILVVGAPRDSWCTGASDSSVGTSLLIDVISTFANMAQDLSWKPRRTIVFASWDARQF 497
Query: 403 GMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPGFFAGATPQLDDIL 449
IGSTE++E +L AKAVAY+NVD AV G F A P L ++
Sbjct: 498 NAIGSTEFLEYWKESLEAKAVAYINVDVAVSGDTFTARTVPGLKKVI 544
>sp|Q07891|TFR1_CRIGR Transferrin receptor protein 1 OS=Cricetulus griseus GN=TFRC PE=2
SV=1
Length = 757
Score = 140 bits (352), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/400 (28%), Positives = 190/400 (47%), Gaps = 45/400 (11%)
Query: 75 LRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFNTHTVEYKALLSYPVHASVSAHFSNGTT 134
++ L+ AG++ + Y+++ F K + + + V S + N T
Sbjct: 144 IKQLSQTSREAGSQKDENLAYYIENQFRDFKL-SKVWRDEHYVKIQVKGSAA---QNAVT 199
Query: 135 VELSLTEKGISQNAILDVVQP---YHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVS 191
+ I+ N D+V+ Y AYS + + GK++ N+G ++D+ L+ V+
Sbjct: 200 I--------INVNGDSDLVENPGGYVAYSKATTVSGKLIHANFGTKKDFEDLKYP---VN 248
Query: 192 GCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVER---GTVMRGVGDPLS 248
G +V+ R G + V A++ AIGVL+Y + + E G G GDP +
Sbjct: 249 GSLVIVRAGKITFAEKVAN-AQSFNAIGVLIYMDQTKFPVVEAELSLFGHAHLGTGDPYT 307
Query: 249 PGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSL-------WGGFVTQFLN 301
PG+ + + S + P IP +S + A+ + ++ W T L
Sbjct: 308 PGFPSFNHTQFPPSQSSGL----PSIPVQTISRKAAEKLFQNMETNCPPSWN---TDSLC 360
Query: 302 DLGRVNGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGA 361
L G VNL+ K I N+F VI+G EEP+RY+++G RDAW GA
Sbjct: 361 KLESSQG-------INVNLSVNNVLKETRILNIFGVIKGFEEPDRYIVVGAQRDAWGPGA 413
Query: 362 IDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGA 420
+ GT LL +A+ ++ ++ R G+ P R+IIF SW A +FG +G+TEW+E L +L
Sbjct: 414 AKSSVGTGLLLKLAQAFSDMVSRGGFKPSRSIIFASWSAGDFGAVGATEWLEGYLSSLHL 473
Query: 421 KAVAYLNVDCAVQGP-GFFAGATPQLDDILIEVTKMVKDP 459
KA Y+N+D V G F A+P L ++ + + V+ P
Sbjct: 474 KAFTYINLDKVVLGTRNFKVSASPLLYTLIEKTMQDVRHP 513
>sp|Q99376|TFR1_RAT Transferrin receptor protein 1 (Fragment) OS=Rattus norvegicus
GN=Tfrc PE=2 SV=1
Length = 622
Score = 137 bits (345), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 185/389 (47%), Gaps = 32/389 (8%)
Query: 79 THHPHLAGTEPSLDTVRYVQSHFEQLKFNTHTVEYKALLSYPVHASVSAHFSNGTTVELS 138
T+ P AG++ + Y+++ F KF+ + V N + L
Sbjct: 14 TYTPREAGSQKDENLAYYIENLFHDFKFSK---------VWRDEHYVKIQVKNSVSQNLV 64
Query: 139 LTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMAR 198
G + + + + + Y A+S +G GK+V N+G ++D+ L + V+G +V+ R
Sbjct: 65 TINSGSNIDPV-EAPEGYVAFSKAGEVTGKLVHANFGTKKDFEELNYS---VNGSLVIVR 120
Query: 199 KGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVER-----GTVMRGVGDPLSPGWAG 253
G + V A++ AIGVL+Y DR VE G G GDP +PG+
Sbjct: 121 AGKITFAEKVA-NAQSFNAIGVLIYM--DRNTFPVVEADLQFFGHAHLGTGDPYTPGFPS 177
Query: 254 VEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGGRVGP 313
+ + S + P IP +S A+ + ++ G + D + ++
Sbjct: 178 FNHTQFPPSQSSGL----PSIPVQTISRAPAEKLFKNMEGNCPPSWNID----SSCKLEL 229
Query: 314 GPTM-VNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALL 372
V LT K I N+F VI+G EEP+RY+++G RDAW G + GT LL
Sbjct: 230 SQNQNVKLTVNNVLKETRILNIFGVIKGYEEPDRYIVVGAQRDAWGPGVAKSSVGTGLLL 289
Query: 373 DIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCA 431
+A+ ++ ++ + G+ P R+IIF SW A ++G +G TEW+E L +L KA Y+N+D
Sbjct: 290 KLAQVFSDMISKDGFRPSRSIIFASWTAGDYGAVGPTEWLEGYLSSLHLKAFTYINLDKV 349
Query: 432 VQG-PGFFAGATPQLDDILIEVTKMVKDP 459
V G F A+P L ++ ++ + VK P
Sbjct: 350 VLGTSNFKVSASPLLYTLMGKIMQDVKHP 378
>sp|Q62351|TFR1_MOUSE Transferrin receptor protein 1 OS=Mus musculus GN=Tfrc PE=1 SV=1
Length = 763
Score = 135 bits (341), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 121/426 (28%), Positives = 199/426 (46%), Gaps = 54/426 (12%)
Query: 49 LRNPQTALHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKF-- 106
L ++ F T LS N T+ P AG++ Y+++ F + KF
Sbjct: 133 LSEKLNSIEFADTIKQLSQN----------TYTPREAGSQKDESLAYYIENQFHEFKFSK 182
Query: 107 ---NTHTVEYKALLSYPVHASVSAHFSNGTTVELSLTEKGISQNAILDVVQP---YHAYS 160
+ H V+ + V +S+ N T+ + N LD V+ Y A+S
Sbjct: 183 VWRDEHYVKIQ------VKSSIG---QNMVTI--------VQSNGNLDPVESPEGYVAFS 225
Query: 161 PSGSAYGKVVFVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGV 220
GK+V N+G ++D+ L +V+G +V+ R G + V A++ AIGV
Sbjct: 226 KPTEVSGKLVHANFGTKKDFEEL---SYSVNGSLVIVRAGEITFAEKVAN-AQSFNAIGV 281
Query: 221 LLYAEWDR---LRGGGVERGTVMRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSL 277
L+Y + ++ + G G GDP +PG+ + + S + P IP
Sbjct: 282 LIYMDKNKFPVVEADLALFGHAHLGTGDPYTPGFPSFNHTQFPPSQSSGL----PNIPVQ 337
Query: 278 PLSFENAQIILGSLWGGFVTQFLNDLGRVNGGRVGPGPTM-VNLTFQGKKKVATIHNVFA 336
+S A+ + G + G ++ D + ++ V L + K I N+F
Sbjct: 338 TISRAAAEKLFGKMEGSCPARWNID----SSCKLELSQNQNVKLIVKNVLKERRILNIFG 393
Query: 337 VIRGLEEPNRYVLLGNHRDAWTYGAIDPNS-GTAALLDIARRYA-LLMRLGWSPRRTIIF 394
VI+G EEP+RYV++G RDA G +S GT LL +A+ ++ ++ + G+ P R+IIF
Sbjct: 394 VIKGYEEPDRYVVVGAQRDALGAGVAAKSSVGTGLLLKLAQVFSDMISKDGFRPSRSIIF 453
Query: 395 CSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQG-PGFFAGATPQLDDILIEVT 453
SW A +FG +G+TEW+E L +L KA Y+N+D V G F A+P L ++ ++
Sbjct: 454 ASWTAGDFGAVGATEWLEGYLSSLHLKAFTYINLDKVVLGTSNFKVSASPLLYTLMGKIM 513
Query: 454 KMVKDP 459
+ VK P
Sbjct: 514 QDVKHP 519
>sp|Q58DX5|NADL2_HUMAN Inactive N-acetylated-alpha-linked acidic dipeptidase-like protein
2 OS=Homo sapiens GN=NAALADL2 PE=1 SV=3
Length = 795
Score = 106 bits (265), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 124/475 (26%), Positives = 203/475 (42%), Gaps = 48/475 (10%)
Query: 20 LMTFTFLLILCIIGFYTLHHPYPSPA-TPPLRNPQTALHFQKTFLSLSSNYTVSSYLRDL 78
L T T L I I+ Y +H PS A + +PQ KT + + R+L
Sbjct: 129 LCTATILFIFGILIGYYVHTNCPSDAPSSGTVDPQLYQEILKTIQA----EDIKKSFRNL 184
Query: 79 THHPHLAGTEPSLDTVRYVQSHFEQLKF-NTHTVEYKALLSYP--------VHASVSAHF 129
L E ++ + +++ + L + V Y LL P + +S
Sbjct: 185 VQ---LYKNEDDMEISKKIKTQWTSLGLEDVQFVNYSVLLDLPGPSPSTVTLSSSGQCFH 241
Query: 130 SNGTTVELSLTEKGISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREEDYRALEAAGVN 189
NG K SQ D++ Y AYS G+ +V+ V+YG +D + + N
Sbjct: 242 PNGQPCSEE-ARKDSSQ----DLLYSYAAYSAKGTLKAEVIDVSYGMADDLKRIRKIK-N 295
Query: 190 VSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAE-WDRLRGGGVERGTVMRGV---GD 245
V+ + + + G + + L +A G GVLLY + D + T M + GD
Sbjct: 296 VTNQIALLKLGKLPLLYKLSSLEKA-GFGGVLLYIDPCDLPKTVNPSHDTFMVSLNPGGD 354
Query: 246 PLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGR 305
P +PG+ V+ ES S ++ + S PL + ++ S + + L
Sbjct: 355 PSTPGYPSVD--ESFRQSRSNLTSLLVQPISAPLVAK----LISSPKARTKNEACSSLEL 408
Query: 306 VNGG-RVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRD-AWTYGAID 363
N RV V++ Q K+ T+ NV + GL P+RY+++G+H A +Y +
Sbjct: 409 PNNEIRV------VSMQVQTVTKLKTVTNVVGFVMGLTSPDRYIIVGSHHHTAHSYNGQE 462
Query: 364 PNSGTAALLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAV 423
S TA + R ++ GW P RTI+FCSW FG IGS EW E+ L V
Sbjct: 463 WASSTAIITAFIRALMSKVKRGWRPDRTIVFCSWGGTAFGNIGSYEWGEDFKKVLQKNVV 522
Query: 424 AYLNVDCAVQG-PGFFAGATPQLDDILIE-----VTKMVKDPESESGTLYDQWSA 472
AY+++ ++G + A+P L +++E T+ + PE+ ++ Q A
Sbjct: 523 AYISLHSPIRGNSSLYPVASPSLQQLVVEKNNFNCTRRAQCPETNISSIQIQGDA 577
>sp|Q08693|TRE2_YEAST Putative zinc metalloprotease TRE2 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=TRE2 PE=1 SV=1
Length = 809
Score = 93.6 bits (231), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/426 (25%), Positives = 176/426 (41%), Gaps = 38/426 (8%)
Query: 17 PPPLMTFTFLLILCIIGFYTLHHPYPSPATPPLRNPQTALHFQKTFLSLSSNYTVSSYLR 76
P L F +++ + I +Y L Y + H F + ++ + R
Sbjct: 120 PFILRRFFYIIFMSFIAYYVLSSGYLFNEKASGSKGMFSQH-DILFEYAKKSVDLAKFER 178
Query: 77 DLTH---HPHLAGTEPSLDTVRYVQSHFEQ--LKFNTHTVEYKALLSYPVHASVSAHFSN 131
DL + PH +GT+ RY+Q F+ LK + Y +YP + S+S + +
Sbjct: 179 DLEYISSMPHGSGTKGDAAIYRYIQESFDNNGLKL-VKEMGYSVYSNYPGNVSISYYDNK 237
Query: 132 GTTVELSLTEKG---ISQNAILDVVQPYHAYSPSGSAYGKVVFVNYGREED---YRALEA 185
+L L+++ +S N L V Y G+ Y + ED Y L
Sbjct: 238 NEKHDLELSKENFNPLSSNGKLSKVSL--IYGGKGTTYDLQHLKDSKTIEDGKDYVLLLQ 295
Query: 186 AGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLRGGGVERGTVMRGVGD 245
VS V++A K VIF++E G D ++ V G GD
Sbjct: 296 YDKLVSQQVLIAEK---FGAKAVIFISEPYGE-------NIDVVQSKPV--GLPQYSTGD 343
Query: 246 PLSPGWAG--VEGGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDL 303
W G VE E R IP++P+S + +L L G VT +
Sbjct: 344 ASGLNWDGSPVE-------EKDHKFWRQTHIPTIPISTRQGKELLSRLSSGGVTVDDGNS 396
Query: 304 GRVNGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAID 363
R N G++G V+L ++K I N+ I G E+ ++ +++ R++ +G
Sbjct: 397 DRSNSGKMGDVLIDVDLQTNVREK-HFIPNIVGKIEGREQSDKAIIIAASRNSINFGTTY 455
Query: 364 PNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKA 422
PN GTAALL I + + + + GW P R I F S+ EF GS+E VE+ L L +
Sbjct: 456 PNFGTAALLSIVQLFQEVKYKFGWKPLRNIYFISFGGTEFNYAGSSELVEQRLTPLKDEI 515
Query: 423 VAYLNV 428
+ +++
Sbjct: 516 YSLIDI 521
>sp|Q08919|TRE1_YEAST Uncharacterized protein TRE1 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=TRE1 PE=1 SV=1
Length = 783
Score = 89.7 bits (221), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 169/384 (44%), Gaps = 41/384 (10%)
Query: 56 LHFQKTFLSLSSNYTVSSYLRDLTHHPHLAGTEPSLDTVRYVQSHFEQLKFN-THTVEYK 114
L + + + LS + L ++ PH++GT Y++ F++ E+
Sbjct: 149 LQYARKSIDLSK---IERDLEYISSMPHMSGTSGDAAIRHYIKESFDKNGIRLAGEEEFM 205
Query: 115 ALLSYPVHASVSAHFSNGTT-VELSLTEKG---ISQNAILDVVQPYHAYSPSGSAYGKVV 170
A +YP + S+ + + T ++ L E+ +S N L+ + V+
Sbjct: 206 AYSNYPGNVSLRVYSKDDTEGFDIPLNEENFNPMSHNGQLNNI--------------PVI 251
Query: 171 FVNYGREEDYRALEAAGVNVSGCVVMARKGSVLSRSGVIFLAEAKGAIGVLLYAEWDRLR 230
+ N +D +++ G+ +++ G + + + A+ GA ++ +E +
Sbjct: 252 YANKASLDDMASMQDQGLLNGDFILLVHYGDYVFQQ--MLTAQEYGAKAIIFISEPYQDN 309
Query: 231 GGGVERGTV---MRGVGDPLSPGWAGVEGGESLDLEDSEVSKRFPKIPSLPLSFENAQII 287
++ +V G GD L+P W G D D+ +K PKIPS+P+S I
Sbjct: 310 KDVIQMKSVALPQYGTGDALTPEWEG----SIRDPIDATEAKCLPKIPSIPISANQGDKI 365
Query: 288 LGSLWGGFVTQFLNDL--GRVNGGRVGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPN 345
L L V +F N+L G +N R+ +V + + V H++ I G E+
Sbjct: 366 LAILSDTGV-KFSNNLFSGSLNDCRLD---LLVQTAIRERHPV---HDIVGKIEGSEQAG 418
Query: 346 RYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYA-LLMRLGWSPRRTIIFCSWDAEEFGM 404
R +++ R++ +YG + P+ GT LL + + Y ++ + W P R I F S+ EF
Sbjct: 419 RAIVIAAPRNSASYGTMYPSFGTVVLLSLIQLYQEMVYKFDWKPLRNIYFISFGGSEFNE 478
Query: 405 IGSTEWVEENLVNLGAKAVAYLNV 428
G+TE +E+ L ++ ++V
Sbjct: 479 AGATELMEKRTEALKSEIYTIIDV 502
>sp|P80561|APX_STRGR Aminopeptidase S OS=Streptomyces griseus PE=1 SV=1
Length = 284
Score = 55.5 bits (132), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 65/128 (50%), Gaps = 11/128 (8%)
Query: 329 ATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGA--IDPNSGTAALLDIARRYALLMRLGW 386
AT +N+ A G +PN+ ++ G H D+ + GA D SG+AA+L+ A + R G+
Sbjct: 62 ATGYNLIANWPG-GDPNKVLMAGAHLDSVSSGAGINDNGSGSAAVLETAL---AVSRAGY 117
Query: 387 SPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD-CAVQGPGFFAGATPQL 445
P + + F W AEE G+IGS +V +K YLN D PG+F
Sbjct: 118 QPDKHLRFAWWGAEELGLIGSKFYVNNLPSADRSKLAGYLNFDMIGSPNPGYFV----YD 173
Query: 446 DDILIEVT 453
DD +IE T
Sbjct: 174 DDPVIEKT 181
>sp|P81715|LIE1_STREX Leupeptin-inactivating enzyme 1 OS=Streptomyces exfoliatus GN=lieA
PE=1 SV=2
Length = 438
Score = 54.7 bits (130), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 63/128 (49%), Gaps = 11/128 (8%)
Query: 329 ATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGA--IDPNSGTAALLDIARRYALLMRLGW 386
AT +N+ A G +PN ++ G+H D+ T GA D SG+AA+L+ A + R G
Sbjct: 102 ATGYNLIADWPG-GDPNSVLMAGSHLDSVTSGAGINDNGSGSAAVLETAL---AVSRAGL 157
Query: 387 SPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD-CAVQGPGFFAGATPQL 445
P + + F W AEE G+IGS +V AK YLN D PG+F
Sbjct: 158 QPTKHLRFGWWGAEELGLIGSKYYVNNLPAAEKAKISGYLNFDMIGSPNPGYFV----YD 213
Query: 446 DDILIEVT 453
DD IE T
Sbjct: 214 DDPTIEQT 221
>sp|Q4WFX9|LAP2_ASPFU Probable leucine aminopeptidase 2 OS=Neosartorya fumigata (strain
ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=lap2
PE=3 SV=2
Length = 501
Score = 53.9 bits (128), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 4/119 (3%)
Query: 311 VGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAA 370
+ G NL + + T +NV A +G +PN V LG H D+ G + G+
Sbjct: 216 IQAGTVTANLWIDSQVENRTTYNVIAQTKG-GDPNNVVALGGHTDSVEAGPGINDDGSGI 274
Query: 371 LLDIARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
+ ++ AL +S + + FC W AEEFG++GS +V AK YLN D
Sbjct: 275 ISNLVVAKALTR---FSVKNAVRFCFWTAEEFGLLGSNYYVNSLNATEQAKIRLYLNFD 330
>sp|E3S5D4|M28P3_PYRTT Probable zinc metalloprotease PTT_17836 OS=Pyrenophora teres f.
teres (strain 0-1) GN=PTT_17836 PE=3 SV=1
Length = 465
Score = 53.5 bits (127), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 16/109 (14%)
Query: 331 IHNVFAVIRGLEEPNRYVLLGNHRDAWTY----------GAIDPNSGTAALLDIARRYAL 380
I V A ++G E+P+RY L G H D+ GA+D SG A L++AR +A
Sbjct: 141 ITTVVATLKGSEDPDRYYLTGGHYDSRNSNPIDYQGDAPGAVDDASGVAVSLELARIFA- 199
Query: 381 LMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
+ P+ TI+F ++ EE G++G+ + + N A +N+D
Sbjct: 200 ----HYKPKATIVFTAFAGEEQGLLGAQN-LAQTYKNASVNLAAMINLD 243
>sp|B2W572|M28P3_PYRTR Probable zinc metalloprotease PTRG_04772 OS=Pyrenophora
tritici-repentis (strain Pt-1C-BFP) GN=PTRG_04772 PE=3
SV=1
Length = 434
Score = 53.1 bits (126), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 55/109 (50%), Gaps = 16/109 (14%)
Query: 331 IHNVFAVIRGLEEPNRYVLLGNHRDAWTY----------GAIDPNSGTAALLDIARRYAL 380
I + A ++G E+P+RY L G H D+ GA+D SG A L++AR +A
Sbjct: 89 ITTIVATLKGSEDPDRYYLTGGHYDSRNSNPIDYQGDAPGAVDDASGVAVSLELARIFA- 147
Query: 381 LMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
+ P+ TI+F ++ EE G++G+ + + N A +N+D
Sbjct: 148 ----HYKPKSTIVFTAFAGEEQGLLGAQN-LAQTYKNASVNLAAMINLD 191
>sp|Q17QK3|CBPQ_BOVIN Carboxypeptidase Q OS=Bos taurus GN=CPQ PE=2 SV=1
Length = 472
Score = 50.4 bits (119), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 25/155 (16%)
Query: 266 EVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRV--NGGRVGPGPTMVNLTFQ 323
E K PKIP+ ++ E+A++ + R+ G R+ +V L
Sbjct: 221 EYQKGVPKIPTACITVEDAEM----------------MSRMASRGNRI-----VVQLKMG 259
Query: 324 GKK-KVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLM 382
K A N A I G + P + VL+ H D+W G + G A + +L+
Sbjct: 260 AKSYPDADSFNTVAEITGSKYPEQVVLVSGHLDSWDVGQGAMDDGGGAFISW-EALSLIK 318
Query: 383 RLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVN 417
LG P+RT+ W AEE G +GS+++ + + N
Sbjct: 319 DLGLRPKRTLRLVLWTAEEQGGVGSSQYYQLHKAN 353
>sp|Q6GQ29|CBPQ_XENLA Carboxypeptidase Q OS=Xenopus laevis GN=cpq PE=2 SV=1
Length = 469
Score = 49.7 bits (117), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 104/269 (38%), Gaps = 64/269 (23%)
Query: 213 EAKGAIGV-----LLYAEWDRLRG-GGVERGTV------MRGVGDPLS-----PGWAGVE 255
EAKG I V + Y E R RG G VE V +R V PLS GW E
Sbjct: 163 EAKGKIVVYNEPFVNYGETVRYRGSGAVEAAKVGAVASLIRSV-TPLSVYSPHTGWQWYE 221
Query: 256 GGESLDLEDSEVSKRFPKIPSLPLSFENAQIILGSLWGGFVTQFLNDLGRVNGGRVGPGP 315
PKIP+ ++ E+A+++ G +G VN
Sbjct: 222 ND-------------VPKIPTASITVEDAEMLSRMASRGLKIVIQLKMGAVN-------- 260
Query: 316 TMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIA 375
A +N A I G + P + V++ H D+W G + G A +
Sbjct: 261 ----------HPDADSYNTVAEIVGSKYPEQVVIVSGHLDSWDVGQGAMDDGGGAFIS-W 309
Query: 376 RRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGP 435
+L+ LG P+RT+ W EE G +G++++ E + N+ N+D ++
Sbjct: 310 EALSLIKDLGLRPKRTLRLVLWTGEEQGGVGASQYYELHKKNIS-------NIDLVMESD 362
Query: 436 -------GFFAGATPQLDDILIEVTKMVK 457
G P+ I+ EV ++++
Sbjct: 363 IGTFMPLGMQFTGKPEARAIMTEVMQLLQ 391
>sp|Q2ULM2|LAP2_ASPOR probable leucine aminopeptidase 2 OS=Aspergillus oryzae (strain
ATCC 42149 / RIB 40) GN=lap2 PE=1 SV=1
Length = 496
Score = 49.3 bits (116), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 6/117 (5%)
Query: 314 GPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLD 373
G V+L K++ T +NV A +G +PN V LG H D+ G + G+ + +
Sbjct: 210 GSVSVDLWVDSKQENRTTYNVVAQTKG-GDPNNVVALGGHTDSVEAGPGINDDGSGIISN 268
Query: 374 IARRYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEE-NLVNLGAKAVAYLNVD 429
+ AL +S + + F W AEEFG++GS +V N L K YLN D
Sbjct: 269 LVIAKALTQ---YSVKNAVRFLFWTAEEFGLLGSNYYVSHLNATELN-KIRLYLNFD 321
>sp|Q9Y646|CBPQ_HUMAN Carboxypeptidase Q OS=Homo sapiens GN=CPQ PE=1 SV=1
Length = 472
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 333 NVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSPRRTI 392
N A I G + P + VL+ H D+W G + G A + +L+ LG P+RT+
Sbjct: 270 NTVAEITGSKYPEQVVLVSGHLDSWDVGQGAMDDGGGAFIS-WEALSLIKDLGLRPKRTL 328
Query: 393 IFCSWDAEEFGMIGSTEWVEENLVNL 418
W AEE G +G+ ++ + + VN+
Sbjct: 329 RLVLWTAEEQGGVGAFQYYQLHKVNI 354
>sp|Q6IRK9|CBPQ_RAT Carboxypeptidase Q OS=Rattus norvegicus GN=Cpq PE=1 SV=1
Length = 472
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 1/86 (1%)
Query: 333 NVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSPRRTI 392
N A I G + P VL+ H D+W G + G A + +L+ LG P+RT+
Sbjct: 270 NTVAEITGSKYPEEVVLVSGHLDSWDVGQGALDDGGGAFISW-EALSLVKDLGLRPKRTL 328
Query: 393 IFCSWDAEEFGMIGSTEWVEENLVNL 418
W AEE G +G++++ E + N+
Sbjct: 329 RLVLWTAEEQGGVGASQYYELHKANI 354
>sp|Q5RDN7|CBPQ_PONAB Carboxypeptidase Q OS=Pongo abelii GN=CPQ PE=2 SV=1
Length = 472
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 333 NVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSPRRTI 392
N A I G + P + VL+ H D+W G + G A + +L+ LG P+RT+
Sbjct: 270 NTVAEITGSKYPEQVVLVSGHLDSWDVGQGAMDDGGGAFIS-WEALSLIKDLGLRPKRTL 328
Query: 393 IFCSWDAEEFGMIGSTEWVEENLVNL 418
W AEE G +G+ ++ + + VN+
Sbjct: 329 RLVLWTAEEQGGVGAFQYYQLHKVNI 354
>sp|E3RJ99|M28P2_PYRTT Probable zinc metalloprotease PTT_08196 OS=Pyrenophora teres f.
teres (strain 0-1) GN=PTT_08196 PE=3 SV=1
Length = 483
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 53/110 (48%), Gaps = 16/110 (14%)
Query: 330 TIHNVFAVIRGLEEPNR-YVLLGNHR----DAWTY-----GAIDPNSGTAALLDIARRYA 379
TI NV A G E+PNR YV+ G++ D Y GA D SGTA +++AR A
Sbjct: 144 TISNVLAKATGSEDPNRVYVMTGHYDSRVTDVLNYESDAPGANDDASGTAIAMELARVLA 203
Query: 380 LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
P+ TII + EE G+ GST ++ + L N LN D
Sbjct: 204 -----KHQPKSTIILGAVAGEEQGLYGST-YLAQTLKNTSTNVEGMLNCD 247
>sp|Q9WVJ3|CBPQ_MOUSE Carboxypeptidase Q OS=Mus musculus GN=Cpq PE=2 SV=1
Length = 470
Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 333 NVFAVIRGLEEPNRYVLLGNHRDAWTYGAIDPNSGTAALLDIARRYALLMRLGWSPRRTI 392
N A I G P VL+ H D+W G + G A + +L+ LG P+RT+
Sbjct: 268 NTVAEITGSMYPEEVVLVSGHLDSWDVGQGALDDGGGAFISW-EALSLVKDLGLRPKRTL 326
Query: 393 IFCSWDAEEFGMIGSTEWVEENLVNL 418
W AEE G IG++++ E + N+
Sbjct: 327 RLVLWTAEEQGGIGASQYYELHKANI 352
>sp|E5R501|M28P3_LEPMJ Probable zinc metalloprotease Lema_P049800 OS=Leptosphaeria
maculans (strain JN3 / isolate v23.1.3 / race
Av1-4-5-6-7-8) GN=Lema_P049800 PE=3 SV=1
Length = 502
Score = 47.4 bits (111), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 62/141 (43%), Gaps = 16/141 (11%)
Query: 331 IHNVFAVIRGLEEPNRYVLLGNHRDAWTY----------GAIDPNSGTAALLDIARRYAL 380
I V A ++G E+P R + G H D+ GA+D SG A L++AR +A
Sbjct: 157 ITTVVATLKGSEDPERLYVTGGHYDSRNSNPIDYQGDAPGAVDDASGVAVSLELARIFAR 216
Query: 381 LMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDCAVQGPGFFAG 440
+ P+ +I F ++ EE G++G+ E + + N A +N+D
Sbjct: 217 -----YRPKASIAFTAFAGEEQGLLGA-EHLAQTYKNNSANVAGMINLDMVGNSKAEDGT 270
Query: 441 ATPQLDDILIEVTKMVKDPES 461
+ P + + T + +D S
Sbjct: 271 SDPNNIRLFCQGTPLTEDDTS 291
>sp|B2W3C7|M28P2_PYRTR Probable zinc metalloprotease PTRG_04977 OS=Pyrenophora
tritici-repentis (strain Pt-1C-BFP) GN=PTRG_04977 PE=3
SV=1
Length = 483
Score = 46.6 bits (109), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 53/110 (48%), Gaps = 16/110 (14%)
Query: 330 TIHNVFAVIRGLEEPNR-YVLLGNH----RDAWTY-----GAIDPNSGTAALLDIARRYA 379
TI NV A G E+P+R YV+ G++ D Y GA D SGTA +++AR A
Sbjct: 144 TISNVLAKATGSEDPSRVYVMTGHYDSRVTDVLNYESDAPGANDDASGTAIAMELARVLA 203
Query: 380 LLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
P+ TII + EE G+ GST ++ + L N LN D
Sbjct: 204 -----KHQPKSTIILGAVSGEEQGLYGST-YLAQTLKNTSTNVEGMLNCD 247
>sp|E5A6Z0|M28P2_LEPMJ Probable zinc metalloprotease Lema_P086240 OS=Leptosphaeria
maculans (strain JN3 / isolate v23.1.3 / race
Av1-4-5-6-7-8) GN=Lema_P086240 PE=3 SV=1
Length = 431
Score = 45.8 bits (107), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 51/110 (46%), Gaps = 16/110 (14%)
Query: 331 IHNVFAVIRGLEEPNR-YVLLGNH----RDAWTY-----GAIDPNSGTAALLDIARRYAL 380
I NV A +RG EEP R YV+ G++ D Y GA D SGTA +++AR A
Sbjct: 95 ISNVVATVRGSEEPERVYVMTGHYDSRVTDVMDYTSDAPGANDDASGTAIAMELARILA- 153
Query: 381 LMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVDC 430
PR TII + EE G+ G+ ++ L N LN D
Sbjct: 154 ----KRRPRSTIILAAVAGEEQGLYGAG-YLAGTLKNSSTNVEGMLNCDI 198
>sp|Q01693|AMPX_VIBPR Bacterial leucyl aminopeptidase OS=Vibrio proteolyticus PE=1 SV=1
Length = 504
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 53/107 (49%), Gaps = 14/107 (13%)
Query: 333 NVFAVIRGLEEPNRYVLLGNHRDAW----------TYGAIDPNSGTAALLDIARRYALLM 382
+V I G E P+ ++++G H D+ GA D SG AA+ ++ R +L
Sbjct: 183 SVVMTITGSEAPDEWIVIGGHLDSTIGSHTNEQSVAPGADDDASGIAAVTEVIR---VLS 239
Query: 383 RLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAVAYLNVD 429
+ P+R+I F ++ AEE G+ GS + + + G V+ L +D
Sbjct: 240 ENNFQPKRSIAFMAYAAEEVGLRGSQDLANQ-YKSEGKNVVSALQLD 285
>sp|C5FTZ6|LAP2_ARTOC Leucine aminopeptidase 2 OS=Arthroderma otae (strain ATCC MYA-4605
/ CBS 113480) GN=LAP2 PE=3 SV=1
Length = 495
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 61/134 (45%), Gaps = 19/134 (14%)
Query: 311 VGPGPTMVNLTFQGKKKVATIHNVFAVIRGLEEPNRYVLLGNHRDAWTYG-AIDPN-SGT 368
V G V + K + T NV A +G + N ++LG+H D+ G I+ N SGT
Sbjct: 218 VASGKVTVTMNVISKFENRTTWNVIAETKGGDH-NNVIMLGSHSDSVDAGPGINDNGSGT 276
Query: 369 AALLDIAR-----RYALLMRLGWSPRRTIIFCSWDAEEFGMIGSTEWVEENLVNLGAKAV 423
++ +A+ + +R GW W AEEFG++GST +V+ K
Sbjct: 277 IGIMTVAKALTNFKVNNAVRFGW----------WTAEEFGLLGSTFYVDSLDDRELHKVK 326
Query: 424 AYLNVDCAVQGPGF 437
YLN D + P F
Sbjct: 327 LYLNFDM-IGSPNF 339
>sp|C5FP82|M28P2_ARTOC Probable zinc metalloprotease MCYG_04217 OS=Arthroderma otae
(strain ATCC MYA-4605 / CBS 113480) GN=MCYG_04217 PE=3
SV=1
Length = 508
Score = 42.4 bits (98), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 12/88 (13%)
Query: 331 IHNVFAVIRGLEEPNRYVLLGNHRDAWTY----------GAIDPNSGTAALLDIARRYAL 380
I +V A ++G E+P R ++ H D+ GA D SG A L+IAR +
Sbjct: 160 ISDVVATLKGSEDPERVYVVSGHYDSRASDPLDYKTDAPGANDDASGVAVSLEIAR--VM 217
Query: 381 LMRLGWSPRRTIIFCSWDAEEFGMIGST 408
R P+ TI+F + EE G++GS
Sbjct: 218 SQRNLPRPKATIVFAAVAGEEQGLLGSN 245
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.136 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 191,480,866
Number of Sequences: 539616
Number of extensions: 8516076
Number of successful extensions: 20952
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 54
Number of HSP's that attempted gapping in prelim test: 20779
Number of HSP's gapped (non-prelim): 103
length of query: 482
length of database: 191,569,459
effective HSP length: 121
effective length of query: 361
effective length of database: 126,275,923
effective search space: 45585608203
effective search space used: 45585608203
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)